Diagnosing and fixing problems


Multisource data ingestion for observability, root cause diagnosis of correlated events and situations, automated remediation of recurring issues.

Take advantage of the centralized data lake to observe the data ingested from multiple sources and the power of AI/ML event correlation to streamline root cause diagnostics and isolation. Enable automated remediation to fix recurring problems faster. 

Multisource data ingestion for observability

To observe the data and to make accurate and informed data-driven decisions, you must collect event, metrics, logs, and topology data from various third-party products into a central data lake.

  • Set up BMC Helix Discovery to automatically discover IT assets and their topological relationships.
    For more information, see Setting up and going live.

  • Collect the performance and health data of your system by deploying Knowledge Modules (KMs) and configuring BMC PATROL Agent to help your system understand what data to collect and configuring monitoring policies in BMC Helix Operations Management.
    For more information, see Collecting data.

  • Collect log data by using BMC Helix Log Analytics for analyzing and understanding the root cause of an issue. For more information, see Collecting logs.

  • Ingest data from third-party products.
    For more information, see Integrating by using BMC Helix Intelligent Integrations.

ingest_data_diagram.png

Root cause diagnosis of correlated events and situations

event_correlation_diagram.png

For faster root cause diagnosis of the problem, the AI/ML techniques are applied to correlate events into situations.

The service-centric approach to monitoring and observability in BMC Helix AIOps provides a direct link from situations to their services and shows events, incidents, and changes associated with various nodes or CIs of those services. With visibility into the entire service and CI topologies in real time and clear insight into the overall system health, identification of the problem root cause is faster, and efficient, and saves you time and resources.

Automated remediation of recurring issues

After identifying the root cause, the next step is to enable automated remediation, which helps you create automation policies to resolve the recurring issues swiftly.

remediate_actions_diagram.png

 

Tip: For faster searching, add an asterisk to the end of your partial query. Example: cert*