Loading TOC...
Ops Director Guide (PDF)

Ops Director Guide — Chapter 4

Monitor View

The Monitor view is an aggregate view that shows the summary of the state of resources in your enterprise.

Use the Filter by pulldown menu to specify the Resource Groups to monitor relevant information in this view. Select either All Resources, or choose from the available Resource Groups. Select Omit ancillary databases to omit system databases, such as App-Services, Extensions, Fab, Last-Login, Meters, Modules, Schemas, Security, and Triggers, from this view.

This chapter covers the following topics:

Monitor View Graphs

The default Monitor view displays the following graphs:

Some graphs, such as Busiest Servers, Slowest Servers, enable adjusting the number of resources displayed from the menu. For example, instead of monitoring all of the slowest servers, you may select the top 10, 20, or 50 servers.

All graphs enable switching to a full-size window by clicking the maximise icon , thereby allowing for more information to be displayed. You may click on the minimise icon or click anywhere on the screen to get back to the previous state.

Cluster Problem Distribution

The Cluster Problem Distribution graph provides a visualization of the time and date of cluster events over the last seven days. Hover over the event icon to view details.

Top Problematic Hosts

The Top Problematic Hosts graph provides a visualization of the hosts in your enterprise that have experienced the greatest number of events over the last 24 hours. Hover over the host bar to view details.

The Top Problematic Hosts graph can be sorted by a simple count of Total Alerts, or weighted based on the severity of the messages.

Busiest Servers

The Busiest Servers graph provides a visualization of the App Servers in your enterprise that have experienced the greatest request rate activity over the last 24 hours. Hover over the App Server bar to view details.

Slowest Servers

The Slowest Servers graph provides a visualization of the App Servers in your enterprise that have the greatest latency the most activity over the last 24 hours. Hover over the App Server bar to view details.

Overview Pages

The key performance indicators section is a set of active tabs for each resource type: clusters, hosts, databases, and application servers. Selecting a resource type tab leads to a heat map view for that resource type. These tabs provide at-a-glance statistics, including overall system health and resource health for clusters, hosts, databases, and application servers.

Overall system health is indicated with a color-coded vertical bar at the top left side of the Monitor view, where red indicates a Critical condition, yellow indicates At Risk, and green indicates Healthy.

The overall health of the resources is calculated from a score generated for each alert type (only critical, at-risk and unknown alerts are considered). The alert type that scores highest represents overall health of the system. The color of the vertical bar is the same as the alert type with highest score. Since there are no alerts for healthy, its score is calculated by getting a count of all healthy resources.

When only a subset of alerts are displayed, the overall health is calculated from the visible alerts. If there are no alerts, then the overall health is represented by green.

The overall health for the individual resources (clusters, hosts, databases, and application servers) is indicated by a combination of graphical and textual elements. For each resource type, the number of resources is presented; a horizontal bar depicts the distribution of health status as Critical (red), At Risk (yellow), Offline (light gray), Maintenance (dark gray), and Healthy (green); and the number of resources that are Critical and At Risk for a selected resource group.

To get to an overview page of a resource group (clusters, hosts, databases, or application servers), click on the corresponding resource group tab. In each overview page, you can sort the resources either by Health Status or Name.

Select Group by Clusters to sort your view of the resources by their clusters.

To close an overview page for a resource group and return to the top-level Monitor view, click the X to the right of the 'View by' dropdown menu.

The following sections describe each overview page in detail:

Clusters Overview

The Clusters overview page displays colored rectangles, each representing a cluster. You can quickly grasp the health status of large number of clusters and comprehend the overall impact at the enterprise/group level. Clicking on a cluster displays the key performance metrics and any alerts for that cluster. The gauges represent resource consumption with percentage values.

Hosts Overview

The Hosts overview page displays colored rectangles, each representing a host. You can quickly grasp the health status of large number of hosts and comprehend the overall impact at the enterprise/group level. Clicking on a host displays the key performance metrics and any alerts for that host. The gauges represent resource consumption with percentage values.

Databases Overview

The Databases overview page displays colored rectangles, each representing a database. You can quickly grasp the health status of large number of databases and comprehend the overall impact at the enterprise/group level. Clicking on a database displays the key performance metrics and any alerts for that database. The gauges represent resource consumption with percentage values.

App Servers Overview

The App Servers overview page displays colored rectangles, each representing an application server. Administrators can quickly grasp the health status of large number of App Servers and comprehend the overall impact at the enterprise/group level. Clicking on an App Server displays the key performance metrics and any alerts for that App Server. The gauges represent resource consumption with percentage values.

Alerts Panel

The Alerts panel on the right-hand side lists the most recent system alerts for your enterprise. The Alerts panel lists up to 50 latest unacknowledged, open, and closed alerts. Every 30 sec, the ticker is refreshed with latest 50 alerts.

.

The Alerts panel contains tabs for Critical, At Risk, and Info alert types, as well as the All tab that shows all the above alert types. There are no tabs for other alert types, such as Offline and Maintenance. You can navigate the tabs to list events by their level of severity.

,
Tab Description
All Display all alerts.
Critical Display only Critical alerts.
At Risk Display only At Risk alerts.
Info Display only Info alerts.

Click on View Alert Details to navigate to the System Alerts page in the Support view. This page is described in System Alerts.

Click on the alert to navigate to the Metrics page for the resource in the Manage view. These pages are described in the following sections: Cluster Metrics, Host Metrics, Database Metrics, and App Server Metrics.

Next to each alert in the list is a dropdown menu that allows you to mark the alert as acknowledged, which suppresses the alert from further notifications, as described in System Alerts. Selecting View Logs displays the list of log files for the resource associated with the alert, as described in Event Logs.

« Previous chapter
Next chapter »