Diagnosing and fixing problems faster


diagnose_fix_problems.png

Take advantage of the centralized data lake to observe the data ingested from multiple sources and the power of AI/ML event correlation to streamline root cause diagnostics and isolation. Enable automated remediation to fix recurring problems faster. 

Multisource data ingestion for observability

To observe the data and to make accurate and informed data-driven decisions, you must collect event, metrics, logs, and topology data from various third-party products into a central data lake.

  • Set up BMC Helix Discovery  to automatically discover IT assets and their topological relationships.
    For more information, see Setting up and going live.

  • Collect performance and health data of your system by Deploying Knowledge Modules (KMs) and configuring BMC PATROL Agent to help your system understand what data to collect and configuring monitoring policies in BMC Helix Operations Management.
    For more information, see Collecting data by configuring monitoring policies.

  • Collect log data by using BMC Helix Log Analytics for analyzing and understanding the root cause of an issue.
    For more information, see Collecting logs.

  • Configure BMC Helix Intelligent Integrations to ingest data from third-party products.
    For more information, see Integrating-by-using-BMC-Helix-Intelligent-Integrations.




ingest_data_diagram.png

Root cause diagnosis of correlated events and situations


event_correlation_diagram.png


For faster root cause diagnosis of the problem, the AI/ML techniques are applied to correlate events into situations.

The service-centric approach to monitoring and observability in BMC Helix AIOps provides a direct link from situations to their services and shows events, incidents, and changes associated to various nodes or CIs of those services. With visibility into the entire service and CI topologies in real-time and clear insight into the overall system health, identification of the problem root cause is faster and efficient and saves you time and resources.

Automated remediation of recurring issues

After identifying the root cause, the logical next step is to enable automated remediation, which helps you create automation policies to resolve the recurring issues swiftly.

remediate_actions_diagram.png

 

Tip: For faster searching, add an asterisk to the end of your partial query. Example: cert*