Monitoring and investigating services and situations

Monitor system health, reduce event noise, perform probable cause analysis for impacted services, and boost the remediation opportunities of your integrated environment.

The following table identifies the tasks that help you monitor and investigate services and situations:

Action	Reference
Set up user accounts and grant access Configure your identity provider to set up more users and user groups and assign specific permissions to them. User management functions are performed in BMC Helix Portal .	Roles and permissions
Configure situation monitoring Create event correlation policies to aggregate events, which are identified as situations.	Correlating events
Access the BMC Helix AIOps console Log on to BMC Helix Portal , and click BMC Helix AIOps to launch the BMC Helix AIOps console to monitor services and situations as explained in the following sections.
Monitor health summary As an operator, view key performance indicators and entities from all the integrated products to get a quick-peek summary of the overall system health status through the following widgets: Total events, anomalies, and incidents from the integrated monitoring systems Average mean time to resolve (MTTR) the incidents Overall event noise reduction score Impact severity and availability of top services Event count and status of top situations	Monitoring key performance indicators and entities
Monitor situations As an operator, monitor and investigate policy-based situations. Monitoring situations provide the ability to: Dynamically aggregate events based on event correlation policy to derive actionable insights. Investigate the aggregated events. Reduce the event noise. Improve the mean-time-to-resolve (MTTR) issues based on the situation-driven workflow. Lower the mean-time-to-detect or discover (MTTD) and the time required for investigating tickets.	Monitoring and investigating policy-based situations
Monitor services As an operator, monitor services to assess the service health and perform probable cause analysis using the following options: Comprehensive health timeline for predefined time ranges. Probable cause analysis impact of causal entities Impactful events and change requests. Topology maps showing the relationship between services and nodes. Metrics data graphs.	Monitoring service health and performing causal analysis

Monitoring and investigating services and situations

Comments