As an operator or a site reliability engineer (SRE), you need to observe the business services in your organization and monitor their overall health. When a service is impacted, you need to view the events generated by the impact, analyze its causes, and quickly remediate those events to restore the health of the service.
BMC Helix AIOps provides a set of comprehensive, service-centric monitoring capabilities. A service is a logical group of applications, middleware, security, storage, networks, and other child services that work together to achieve a business goal.
Additionally, BMC Helix AIOps provides advanced monitoring and analytical capabilities to:
BMC Helix AIOps offers a single view of all the business services used by your organization. This single view shows all the relevant information in one place so that users can respond faster. Operators or SREs can view the following information:
- Number of impacted services by severity - Critical, Major, Minor, or OK
- Number of situations (correlated events), events, incidents, and CIs for each impacted service
- Association between child services of each service
- Details of each service, including its health and the events, situations, incidents, or changes that impact it
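The summary information listed above can be pictured with a small data model. The following is a minimal sketch in Python, not a BMC Helix AIOps API: the `Service` class, its fields, and the helper functions are hypothetical names chosen for illustration, and the sample services and counts are made up.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Dict, List

# Severity levels named in the list above, from worst to best.
SEVERITIES = ["Critical", "Major", "Minor", "OK"]


@dataclass
class Service:
    """Hypothetical in-memory model of a business service (illustration only)."""
    name: str
    severity: str = "OK"
    situations: int = 0   # correlated events
    events: int = 0
    incidents: int = 0
    cis: int = 0          # configuration items
    children: List["Service"] = field(default_factory=list)


def count_by_severity(services: List[Service]) -> Dict[str, int]:
    """Return the number of services at each severity level."""
    counts = Counter(s.severity for s in services)
    return {level: counts.get(level, 0) for level in SEVERITIES}


def impacted(services: List[Service]) -> List[Service]:
    """Return the services whose severity is anything other than OK."""
    return [s for s in services if s.severity != "OK"]


# Made-up sample data: one parent service with one child, plus a healthy service.
database = Service("Orders DB", severity="Major", events=4, cis=2)
checkout = Service("Checkout", severity="Critical", situations=1,
                   events=7, incidents=2, cis=5, children=[database])
search = Service("Search")
services = [checkout, database, search]

summary = count_by_severity(services)
print(summary)
for s in impacted(services):
    print(s.name, s.situations, s.events, s.incidents, s.cis,
          [child.name for child in s.children])
```

The sketch keeps the child services as a plain list on each service, which is enough to show the parent-child association; how the product stores or computes these values is not described here.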
Learn about the advanced monitoring and analytical capabilities by using the topics listed in the following table:
| Learn about the service health, health score, and metrics. | Service health score and health timeline |
| View the services in a heatmap view or tile view and monitor the overall health of each service on the Services page. | Getting started with service monitoring |
On the service details page: