This documentation supports an earlier version of BMC Helix Operations Management.

To view the documentation for the latest version, select 23.2 from the Product version picker.

Monitoring and investigating services and situations

Monitor system health, reduce event noise, perform probable cause analysis for impacted services, and boost the remediation opportunities of your integrated environment. 

The following table identifies the tasks that help you monitor and investigate services and situations:

ActionReference

Set up user accounts and grant access

Configure your identity provider to set up more users and user groups and assign specific permissions to them. User management functions are performed in BMC Helix Portal .

Roles and permissions

Configure situation monitoring

Create event correlation policies to aggregate events, which are identified as situations.

Event correlation for aggregating related events

Access the BMC Helix AIOps console

Log on to BMC Helix Portal , and click BMC Helix AIOps to launch the BMC Helix AIOps console to monitor services and situations as explained in the following sections.

Monitor health summary

As an operator, view key performance indicators and entities from all the integrated products to get a quick-peek summary of the overall system health status through the following widgets:

Monitoring key performance indicators and entities

Monitor situations

As an operator, monitor and investigate policy-based situations. Monitoring situations provide the ability to:

  • Dynamically aggregate events based on event correlation policy to derive actionable insights.
  • Investigate the aggregated events.
  • Reduce the event noise.
  • Improve the mean-time-to-resolve (MTTR) issues based on the situation-driven workflow.
  • Lower the mean-time-to-detect or discover (MTTD) and the time required for investigating tickets.
Monitoring and investigating policy-based situations

Monitor services

As an operator, monitor services to assess the service health and perform probable cause analysis using the following options:

  • Comprehensive health timeline for predefined time ranges.
  • Probable cause analysis impact of causal entities
  • Impactful events and change requests.
  • Topology maps showing the relationship between services and nodes.
  • Metrics data graphs.
Monitoring service health and performing causal analysis


Was this page helpful? Yes No Submitting... Thank you

Comments