ServiceOps overview
ServiceOps is the combination of IT Service Management (ITSM) and IT Operations Management (ITOM). It unifies people, processes, and technology platforms that are highly interdependent. These entities must operate in parallel to make sure that service delivery is effective and infrastructure operations are incident-free. By unifying the entities, ServiceOps improves user experience and service delivery through real-time visibility, automation, and intelligent collaboration across siloed ITSM and ITOM teams.
ServiceOps uses the following key components for collaborative operations:
- AI Agents: Enable autonomous detection, resolution, and prevention of issues.
- Open integrations: Connect ITOM and ITSM seamlessly with various SaaS or on-premises monitoring tools, such as Dynatrace and Prometheus to collect data from them.
- Common data stores: Store data collected from ITSM and ITOM and make it available for analysis and decision making.
- Common services: Deliver reusable functions and components for ITOM and ITSM teams.
- Open APIs: Offer standardized interfaces for system extension and integration.
- Integration services: Link workflows, processes, and systems across ITSM and ITOM.
Benefits of ServiceOps
ServiceOps uses AI, observability, and automation to offer many benefits, such as:
|
---|
Analyzes and predicts the contextual risk of vulnerabilities and proposed changes. It uses AI-powered root cause analysis and contextual topology to diagnose issues faster. |
|
Understands the service health and service dependencies, and the change impact across applications, infrastructure, and network. |
|
Continuously monitors service health and automatically triggers remediation workflows to prevent incidents or reduce their impact. |
|
Correlates changes and incidents to situations. It also enables collaborative incident response through tools such as Microsoft Teams with AI-generated summaries. |
|
Combines observability, automation, and AI into a unified platform to facilitate seamless collaboration between ITOM and ITSM teams. |
|
Uses AI agents, such as Service Collaborator and Change Risk Advisor, to streamline investigations, suggest fixes, and enable proactive operations. |
Key capabilities of ServiceOps
With ServiceOps, teams work together to proactively identify and mitigate incidents before they impact end users.
The following table describes the key capabilities of ServiceOps:
Capability | Description |
---|---|
BMC HelixGPT driven situation management | Recommends the most effective actions to resolve a situation by using the Best Action Planner AI agent. |
Collaboration by using Microsoft Teams | Helps investigate and resolve situations faster within the Microsoft Teams interface by using the AI agents, such as BMC Helix Ops Swarmer and AIOps Chat Agent. |
Discovery of IT assets and dynamic service modeling | Discovers IT assets and their relationships and helps you to create service models that depict how these assets deliver a business service. It combines observability data from monitoring tools with service context to determine the cause of issues and identify the services dependent on the affected assets. |
Embedded dashboards and service health timeline | Provides dashboards with metrics, logs, and topology that help teams monitor and analyze services without switching applications. In addition, a visual timeline shows how a service’s health has changed over time, and correlates those changes with incidents, changes, and anomalies. |
Probable cause analysis | Identifies root causes faster by correlating changes to incidents and monitoring their impact through the service health timeline and reconciled topology, which maps IT assets, service dependencies, and relationships. |
Change risk analysis | Simulates the impact of proposed changes in a service context by using an AI agent, for example, Change Risk Advisor. The agent analyzes risks in advance to avoid making changes that are likely to cause an incident. |
Incident to situation correlation | Reduces incident noise by creating a single incident in BMC Helix ITSM for a situation (correlated events) through automation. |
Automated remediation | Triggers automated remediation workflows |
ServiceOps implementation overview
The following table provides an overview of the tasks that you need to perform to implement ServiceOps:
Task | Action | Description | |
---|---|---|---|
1 | Define the objectives | Set clear, result-oriented objectives for ServiceOps that both ITSM and ITOM teams can align with. The objectives might include reduced MTTR, faster incident resolution time, or reduced event or incident noise. | |
2 | Assess the current maturity level | a. Evaluate the existing ITSM processes, ITOM capabilities, tools, and integrations to understand your baseline. b. Identify the gaps in visibility, collaboration, and automation to determine improvement areas. | |
3 | Identify the use cases and tools | a. Identify the use cases and the ITOM and ITSM tools or products, such as change management and service desk management, required to achieve the use cases. b. Define the workflows for incident management, change management, and other processes, and make sure that the data flows seamlessly into the incident management system. c. Integrate monitoring tools, a real-time CMDB, and service desk management systems to provide end-to-end visibility into the workflows between ITOM and ITSM. | |
4 | Build cross-functional teams | a. Build a team with members from both ITOM and ITSM teams, such as incident managers, automation engineers, and change managers. b. Share common KPIs, such as MTTR, service health, and incident resolution speed, to keep the teams aligned with the objectives. c. Provide the required training to the teams. | |
5 | Facilitate collaboration and share dashboards | a. Provide shared dashboards that display real-time service health, incidents, and root cause analysis in a single view. b. Integrate with collaboration tools (for example, Microsoft Teams and Slack) so that ITOM and ITSM teams can collaborate and coordinate with each other. | |
6 | Monitor and improve | a. Continuously track KPIs, analyze patterns, and adjust rules, automation logic, and processes over time. This iterative approach ensures that ServiceOps evolves with business needs. b. Implement monitoring and automation features to streamline workflows and improve incident management. |
Use cases
The following use cases help you learn more about achieving value with ServiceOps offered by BMC Helix:
Use case | Business value | User | Products used |
---|---|---|---|
Automate incident creation with Proactive Service Resolution | With the automated approach, you no longer have to manually create incidents. As a result, you spend less time on repetitive tasks, resolve issues faster, and keep services running smoothly. For more information, see Creating incidents in BMC Helix IT Service Management for events. | Tenant Administrator Operator Automation Engineer Service desk agent | BMC Helix Intelligent Automation BMC Helix AIOps BMC Helix ITSM |
Reduce incident noise in BMC Helix ITSM by using Proactive Service Resolution | With a reduced volume of incidents, you can concentrate on identifying and addressing the root cause. The single incident created in BMC Helix ITSM contains details such as the causal configuration item (CI) and a direct URL to the situation in BMC Helix AIOps. The details help you perform a faster root cause analysis to restore the infrastructure's health quickly. For more information, see Reducing incident noise in BMC Helix ITSM by using Proactive Service Resolution. | Tenant administrator Operator Automation engineer Service desk agent | BMC Helix Intelligent Automation BMC Helix AIOps BMC Helix ITSM |
Reduce event noise during blackout windows | The solution automatically creates blackout policies based on a scheduled change request created in BMC Helix ITSM. If a change request is updated, the blackout policy is updated automatically. By using the automated solution, you can save the time and effort required to manually create and maintain the blackout policies. For more information, see Overview of Proactive Service Resolution. | Tenant administrator Operator Automation engineer Service desk agent Change manager | BMC Helix Intelligent Automation BMC Helix Operations Management BMC Helix ITSM |
Make informed decisions with ML-based change risk assessment | The cognitive risk assessment predicts the risk factors associated with a change request based on similar change requests implemented in the past. By using AI-driven insights, historical data, and contextual analysis, the change risk assessment helps you accurately predict the likelihood of change failure or service disruption. The assessment results in more informed decision-making, reduced change-related incidents, and fewer service outages. For more information, see Assessing the risk of a change request. | Tenant administrator Operator Service desk agent Change manager | BMC Helix AIOps BMC Helix ITSM |
Accelerate incident resolution with probable cause analysis | The system analyzes alerts, logs, and metrics to identify the most probable root cause of service disruptions, eliminating the need for manual data review. Identification of the root cause reduces MTTR, minimizes business impact, and improves service availability. For more information, see Probable cause analysis in Incident Management by using BMC Helix Service Monitoring. | Tenant administrator Operator Service desk agent Change manager | BMC Helix AIOps BMC Helix ITSM |