This documentation supports the releases of BMC Helix Operations Management up to December 31, 2021.To view the documentation for the latest version, select 23.1 from the Product version picker.

Total incident count and mean time to resolve (MTTR) indicators for a reliable incidence-response process


Incidents

An incident is any event that is not part of the standard operation of a service and that causes an interruption or a reduction in the quality of that service. 

What are the incident sources?

The Total Incidents widget displays INCIDENT_INFO events from BMC Helix Operations Management.

The Overview page in the BMC Helix AIOps console displays the total incidents for a selected time range as shown in the following example. In the example, there are 292 incidents in the last 24 hours:

total_incidents_concept.png

Mean time to resolve (MTTR)

MTTR represents the average time taken to resolve a set of incidents. This metric includes the time spent during the alert and diagnostic processes before repair activities are initiated. In other words, MTTR describes both the reliability and availability of a system. Reliability refers to the probability that the service will remain operational over its life cycle. Availability refers to the probability that a system will be operational at any point in time. The shorter the MTTR, the higher the reliability and availability of the system.

What is the source of incidents for MTTR computation?

To compute MTTR value, the INCIDENT_INFO events from BMC Helix Operations Management are considered.


The Overview page displays the MTTR and its trend for a selected time range as shown in the following example. In the example, the average time taken to close 4 incidents in the last 24 hours is 4 hours and 33 minutes:

mttr_concept.png

MTTR computation

The MTTR value is computed as

MTTR = The time taken to close the incidents for a selected time range/Total incidents closed for a selected time range

Example

Total incidents closed in the last 24 hours = 4

Time range selected is Last 24 hours

Time taken to close these 4 incidents:

  • Incident 1 was closed in 5h 45minutes (345 minutes)
  • Incident 2 was closed in 3h 50minutes (230 minutes)
  • Incident 3 was closed in 6h 15minutes (375 minutes)
  • Incident 4 was closed in 2h 22minutes (142 minutes)

MTTR = (345 + 230 + 375 + 142)/4 = 273 minutes = 4h 33minutes


 

Tip: For faster searching, add an asterisk to the end of your partial query. Example: cert*