Monitoring service health
BMC Helix AIOps displays services from the following components:
- Service models created and managed in BMC Helix AIOps
- Groups published as services in BMC Helix Operations Management
- All business services from BMC Helix Discovery
- Service models or topologies ingested from third-party application through BMC Helix Intelligent Integrations connectors
To get started with service monitoring
In the BMC Helix AIOps console, all services are displayed on the Services page.
- Click Services to view the following information:
- All services color-coded by severity in a heatmap or tile view
- Child services associated with the parent services
- Search, basic filters, and advanced filters for services
Option to create new services (available for the Service Designer role only)
- Hover over a service to view a quick summary of the impact.
The impact score, situations, events, and incidents associated with a service are displayed.
- (Optional) If there is a child service , click the service box to view the next level of child services associated with the service.
If a child service is impacted, the impact and the health score are propagated to all the services that the impacted service depends upon. If the parent service is not impacted and the health score is propagated from a dependent service, a label (Propagated) is displayed next to the health score.
- (Optional) Choose how to view services on the heatmap view:
- Basic search : Enter a service name in the search box and click search
.
- Basic filters: Select or clear the severity filter checkboxes to view services based on the selected severity. If the parent service matches the selected filter, the child services are also displayed. You can also click Select all to view all services. Filter selection is retained even if you access other pages and navigate back to the Services page.
- Advanced filter: Click to view services by Service Kind (Business Service, Technical Service, Business Application) or services with specific label-value pairs. By default, business services and technical services are displayed. Search and filter options are retained even if you switch between the heatmap and tile views.
- Number of services per page: Click to select the number of services to be displayed on the page.
The service count displayed on the heatmap view shows only the parent services filtered by Severity and Service Kind. - Refresh page: Click Refresh
to refresh the page.
By default, the Services page is automatically refreshed after every five minutes. The automatic refresh interval is applicable for active browser sessions only. If you navigate to any other browser tab, the session no longer remains active and the UI pages do not get automatically refreshed. To change the refresh interval duration, see Configuring-general-settings.
- Basic search : Enter a service name in the search box and click search
(Optional) Click Tile View
to view services in a tile view.
Each tile represents a service and displays the service name, service impact score, and the count of situations, events, and incidents, and the total impacted CI count associated with the service. Search results or filters are retained across both the tile view and the heatmap view.- (Optional) Save a set of preferences (severity filters, advanced filters, and the page view (Heatmap view or Tile view)) in a preset. These preferences are saved across sessions:
- Click Add Preset.
- Enter a name for the preset.
- (Optional) Set the preset as the default preset.
- Click Create.
The presets you create on the Services page are applied to the Situations page as well. f you select a preset on the Services page, the situations are displayed according to the services filtered by the preset. Also, if you select a preset on the Situations page, the Services page displays services according to that preset.
- (Optional) Click Save Preferences to save your page preferences.
Your selected severity filters, advanced filters, and the page view (Heatmap view or Tile view) are saved until you change your page preferences again.
To monitor service health
- Click Services and click a service name to view the following details:
- Service name and severity level
Health score
If a dependent service is impacted, the health score is propagated to all services that the impacted service depends upon.- Incidents: Click to view incident details.
The incident message cross-launch link opens the incident details in BMC Helix ITSM (if you have permission to access the application). - Total Events: Number of events generated for the service.
- Impacting Events: Number of events used to compute health score for a service.
- Impacting Child Services: Number of direct and propagated child services impacting the parent service.
Click the Information iconto see the child services that are impacting the parent service in the following order:
- Total Impacted Child Services
- Propagated Child Services
- Direct Impacting Child Services
Top 10 Direct Impacting Child Services
A maximum of the top ten directly impacting child services are listed.
- Refresh page: Click Refresh to refresh the page.
By default, the services details page is not automatically refreshed, and the Auto Refresh Interval option is set to Off. To change the refresh interval duration, see Configuring-general-settings.
- Health timeline
- CI topology
- Service hierarchy
- Health indicators (in the View Health Indicators section)
- Situations (in the Analyze Situations section)
- Root cause (in the Analyze Root Cause section)
- Service insights (in the Analyze Service Insights section)
- Click the Impacting Events link to view the impacting events, situations, incidents, and changes for a service, and perform the following optional steps:
Click any event, situation, incident, or change to view details, related events (for situations only), logs, and notes, and perform additional operations.
You can also use the More Details cross-launch link to view the selected event, situation, incident, or change in BMC Helix Operations Management.(If integrated with
ServiceNow
Change Management): Change requests from ServiceNow are displayed.
- Select a time range to view events, incidents, or changes that occurred in the selected period.
By default, data is displayed for the last three hours. You can select a time range of 6 hours, 12 hours, 24 hours, or 7 days. - Hover over a time slot to view the exact health score.
- Hover over the event, incident, or change on the health timeline to view details.
To learn more about the health timeline, see Service-health-score-and-health-timeline. (Optional) Hover over the move icon
for a section to rearrange the section on the service details page. After the icon changes to a hand pointer, drag and drop the pointer as needed.
Where to go from here
Based on the health of and impact on a service, you can perform any of the following tasks:
- View CI topology for impacted services, see Identifying-the-impacted-CI-nodes-from-CI-topology-view .
- Investigate impacting events, incidents, and changes for nodes in service hierarchy, see Investigating-the-service-nodes-from-service-hierarchy-view.
- View health indicators for an impacted service, see Monitoring-service-health-indicators .
- View the causal analysis for the impact, see Performing-causal-analysis-of-impacted-services .
- Get an insight into the service behavior and its severity pattern over a pre-defined period, see Monitoring-service-insights.