Learning about Problem Management
The mission of the Problem Management process is to minimize the number of incidents. Problem Management can proactively prevent the occurrence of incidents, errors, and additional problems. The problem lifecycle includes request review, root cause analysis, analysis review, and closure. BMC Helix ITSM: Service Desk uses automated Problem Management processes to ensure reactive resolution of customer questions and issues, and proactive work to prevent recurring issues.
You use the following consoles to access the features in Problem Management:
The Overview Console
Using the Overview Console, specialists can view problem investigations that were assigned to them through the Problem Management application.
The Problem Management Console
TheProblem Management Console provides problem coordinators and specialists with a single point from which they can create problem investigations and known errors.
Process overview
The Problem Management process consists of the following stages:
Stage | Description |
---|---|
Review incident requests | The Problem Management process usually begins with an incident request review, which the problem coordinator performs at regularly scheduled intervals. During the review, the problem coordinator analyzes the incident request information to identify the problems with the services they are responsible for. |
Create the problem investigation | If a problem investigation already exists for an incident request, the problem coordinator relates that incident request to the existing problem investigation. When a new problem is identified, the problem coordinator creates a problem investigation and relates it to the corresponding incident requests. The problem coordinator then assigns the problem investigation to a specialist who has the appropriate combination of skills, availability, and access rights to perform a root cause analysis. Problem investigations can also be triggered by other processes. For example, a service desk specialist can create a problem investigation if they think that an incident request was caused by an underlying problem. Emergency changes as well as periodic availability management and capacity management reviews can also trigger problem investigations. |
Perform a root-cause analysis | When a problem investigation is assigned to a specialist for root cause analysis, the specialist looks for a temporary workaround to restore the affected service. If the specialist implements a workaround, they add information about it to the problem investigation record. This helps other specialists resolve future incidents caused by the problem until a permanent or structural solution is implemented. Next, the specialist looks for the root cause of the problem. The specialist can also create a known error related to the problem investigation. If the specialist finds a root cause, they update the problem investigation with information about the root cause, workaround, and resolution. |
Identify changes | If the specialist thinks that a change is required to remove the root cause or to enable a workaround, they inform the problem coordinator that the change coordinator must be included in the analysis. Otherwise, the specialist implements the preferred structural solution. If the specialist cannot find the root cause or cannot propose a structural solution, the specialist adds the reason to the problem investigation record. Regardless of whether a structural solution is proposed or implemented, the specialist informs the problem coordinator when the root cause analysis is completed. |
Close or reassign problem investigations | After the root cause analysis is completed, the problem coordinator reviews the analysis. The following scenarios could take place:
|
Watch the following video (8:25) to learn about the basic flow of the Problem Management process.
Testing your knowledge
Check your knowledge by answering each question.
Do you want to learn more?
For information about managing problem investigations and known errors, see Managing-problem-investigations.
For information about managing incident tickets, see Managing-incident-requests.
To learn about the roles and permissions required for Incident Management, see Problem-Management-roles and Problem-Management-permissions.