This documentation supports the releases of BMC Helix Service Monitoring till September 2021 (21.3.03). Documentation for later versions is available in the BMC Helix AIOps documentation space. To view the documentation, select a version from the Product version menu.

Performing the ML-based root cause isolation of an impacted service


As an operator user or a user with required permissions, you can view the service details and perform ML-based root cause isolation of the impacted services.


As an operator user or a user with required permissions, you can:


To view the details of a service

  1. In the BMC Helix Service Monitoring console, do one of the following actions to view the service details page:
    • Click the Overview tab, and from the Services widget, click any of the impacted services whose details you want to view.
    • Click Entities and on the Services page, click an individual service tile.
      entities_services_page.png

      rca_health_timeline_221.png
      • Service name, severity, incident ID associated with the service, service impact score in percentage, and the date and time when the service was last updated.
      • Top 3 impacted entities (business services) that are associated with the service. In this example, Banking Zone Service the impacted business service associated with the Banking Core Servers service.
      • Pie chart displaying the count of open events impacting the service. The events are categorized by event status. The pie chart does not consider the INFO and OK events while displaying the event count.
      • Health score for the selected time range. Hover over a time slot to view the health score. For more information, see Service-health-score-impact-score-and-metrics.
      • Legends to indicate incidents, events, and change requests on the health timeline. Hover over an event, incident, or change request to view the details. For more information, see Total-incident-count-and-mean-time-to-resolve-MTTR-indicators-for-a-reliable-incidence-response-process. The health timeline does not display the INFO and OK events.

To view the ML-based root causes of an impacted service

  1. In the BMC Helix Service Monitoring, do one of the following to view the root causes of an impacted service:
    • Click the Overview tab, and from the Services widget, click any of the impacted services whose details you want to view.
    • Click Entities and on the Services page, click an individual service tile.
  1. On the individual service page, click Root Cause Isolation.
  2. To view the causal event details by causal nodes or situations, click View By and select one of the following options:
    • Causal Nodes: Displays the top 3 causal nodes impacting the service. Click a causal node and perform the following actions to view the event and change request details:
      RCI_causal.png

      • To view the event details:

        • Click Events to view top causal events. 
        • Hover over the score to view the score calculation details for the event.
        • Click on an event to view event details.

          Performance View tab for event details

          Performance View tab displays only for an alarm class event. It contains the time-series data collected from key attributes of the causal events.

      • To view the change details:
        • Select Changes to view top three change requests.
        • Hover-over the score to view the score calculation details for the change.
        • Click on a change to view change details.
      View all events or all changes
      • Click Show all events or Show all changes link to view all events or all changes for a particular causal node.
      • You can switch back to view only the top events or top changes, by clicking the Show top causal events or Show top causal changes link.
    • Situations: Displays the top 3 situations impacting the service. Click a situation to view the associated events. Click an event to view its details.

      RCI_situations.png

      Launch the situation details page on the Situations tab

      Optionally, you can click thelaunch_situations.pngicon to launch the situation details page on the Situations tab. For more information, see Investigating-ML-based-situations.

To view the service topology

  1. In the BMC Helix Service Monitoring console, do one of the following to view the service details page:
    • Click the Overview tab, and from the Services widget, click any of the impacted services whose details you want to view.
    • Click Entities and on the Services page, click an individual service tile.
  1. Click Topology. To know how to view and understand the service topology and its components, see:
    Visualizations toolset
    Information, zoom and undo/redo controls
    Note: The impacted node and the downstream path is marked in red color. Any non-impacted node is greyed out as shown in this image.

    view_serv_topology.png

  2. Click a node to view the node details as shown in this image.
    Node details with incident.png
  3. (Optional) Click on an event, incident, or change to view event details, incident details, or change details.

To view the metrics of the causal node

  1. In the BMC Helix Service Monitoring console, do one of the following to view the service details page:
    • Click the Overview tab, and from the Services widget, click any of the impacted services whose details you want to view.
    • Click Entities and on the Services page, click an individual service tile.
  1. Click Metrics to view the metrics chart for the top attributes of the causal node. If there are more than three metrics, only the top three trending metrics are displayed.
    Based on the metric data and its trend, you can take action to resolve the issue. For more information, see Service-health-score-impact-score-and-metrics.

Visualizations toolset

The following table lists the options you can use to manipulate a visualization and perform modeling tasks.

Some content is unavailable due to permissions.


 

Tip: For faster searching, add an asterisk to the end of your partial query. Example: cert*