Managing the capacity of your vSphere infrastructure


What can you do with BMC Helix Capacity Optimization?

HOCO_goals.png

As a Capacity Planner or vSphere Technology Specialist, you can use BMC Helix Capacity Optimization to configure, administer, and manage the capacity of your vSphere infrastructure. You can collect and manage data for a vSphere environment that has one or multiple vCenters and contains vSphere infrastructure elements such as Providers (clusters, hosts, datastores, resource pools) and Consumers (virtual machines). 

manage_vsphere.png

As the flow diagram illustrates, the data source (VMware vSphere or Gateway Server) collects data from the vSphere infrastructure elements. The collected data is transferred to the Capacity Optimization data warehouse where it is processed, and then displayed on the user interface. You can use the product features to review, analyze, and manage the capacity of your vSphere infrastructure providers and consumers. 

The following sections describe how you can achieve these goals:

Managing the capacity of vSphere infrastructure providers

You can analyze and manage the capacity of your vSphere infrastructure providers by using the vSphere Infrastructure view. For the infrastructure data to be available in the view, the Administrator must first set up the data source to collect data. 

Step 1. Collect data and install the views

As an Administrator, configure and use the out-of-the-box VMware-vCenter-Extractor-Service ETL for data collection. 

In the ETL configuration, you can choose to collect data only from the providers or from all the vSphere elements, including consumers.

To collect data only from the providers, select the Data type option - metrics at Cluster, Resource Pool, Host, Datastore level.

To collect data from all the vSphere elements, including consumers, retain the default Data type option (metrics at Cluster, Resource Pool, Host, Datastore, and Virtual Machine level).

After data collection starts, data is loaded in the data warehouse daily and Indicators are available in the Workspace.

As an Administrator, you must install the vSphere views and Capacity Pools view and grant the necessary permissions to Capacity Planners and vSphere Technology Specialists to access these views.

Step 2. Analyze the collected data

To get a high-level view of the infrastructure usage and health, use the out-of-the-box capacity pools in the Capacity-Pools-view.

For detailed analysis, use the vSphere Infrastructure view

The following common use cases are described here:

Understand the Usage and Health of your vSphere infrastructure providers

Review and analyze the out-of-the-box capacity pools for vSphere clusters and hosts for a high-level understanding of their usage and health. For more information, see Capacity-Pools-view

You can drill down into a specific cluster or host for detailed analysis.

An Administrator can create capacity pools as per your requirement. You can then view and analyze them in the Capacity Pools view.

See the following how-to video:

For detailed analysis, click the cluster name (element). The cluster details page in the vSphere Infrastructure view opens. See the following screenshot:

pool2clusterdetails.png

Understand resource availability, utilization, and utilization trend of your vSphere infrastructure providers

Depending on the provider, review and analyze the resource metrics on the relevant page in the vSphere Infrastructure view. 

For more information, see Clusters-page-in-the-vSphere-Infrastructure-view and Hosts-page-in-the-vSphere-Infrastructure-view.

See the following how-to video for vSphere clusters:

See the following how-to video for vSphere hosts:

Evaluate the residual or spare capacity of vSphere clusters

Review the estimate of additional VMs that can be added to a cluster in the Clusters > Capacity page of the vSphere Infrastructure view. This estimate is based on a typical VM configuration, which is automatically computed. Using this estimate, you can identify clusters that have residual or spare capacity.

You can modify the typical VM configuration based on your requirement, and re-estimate the spare capacity.

Identify vSphere infrastructure providers that have exhausted or exhausting resources

Use the Recommendations-page-in-the-vSphere-Infrastructure-view to get a quick view of the vsphere infrastructure providers that have exhausted or exhausting resources (CPU, memory, storage). The page also provides out-of-the-box recommendations to help you take appropriate action on these elements to resolve the issue.

The Future-Saturation-page-in-the-vSphere-Infrastructure-view displays a list of all such VMs along with some of their key metric details.

Identify vSphere datastores that have overprovisioned capacity or resource contention

Use the Watchlist-page-in-the-vSphere-Infrastructure-view to see details of the vSphere datastores whose provisioned storage is greater than its actual capacity. Also, you can identify the datastores that have read or write latency issues.  You can drill down into a specific cluster or host for detailed analysis. 

Identify vSphere infrastructure providers that have efficiency issues

Use the Recommendations-page-in-the-vSphere-Infrastructure-view to get a quick view of the vSphere infrastructure providers that have a capacity risk or an efficiency issue. The page also provides actionable recommendations to help you resolve the issue.



Managing the capacity of vSphere infrastructure consumers

You can analyze and manage the capacity of your vSphere infrastructure consumers (virtual machines) by using the capacity views. For the infrastructure data to be available in the view, the Administrator must first configure data collection.

Step 1. Collect data and install the views

As an Administrator, you can use the following data sources to collect data for your vSphere virtual machines:

  • VMware-vCenter-Extractor-Service ETL: Configure and use this ETL to collect the required configuration and performance metrics for the VMs. 
  • BMC - TrueSight Capacity Optimization Gateway VIS files parser: Configure and use this ETL to collect additional metrics, more accurate memory utilization metrics, and performance metrics at a higher granularity. For example, use this ETL to collect the OS-level real memory usage values that are used to generate overallocated VM recommendations. Before you use this ETL, you must instrument the VMs. 

    Instrumenting VMs

    The memory utilization value that is collected from an instrumented VM is based on the actual memory of the VM. The Agent collects resource consumption breakdown at process or workload level and helps you to detect specific in-guest OS level resource constraints (for example, in-guest paging due to the physical memory configuration of the VM being too low).
    The memory utilization value from a non-instrumented VM is based on the data collected by the hypervisor. This value might be over-reported and the VM might not be detected as overallocated. BMC recommends that you instrument your business-critical VMs to collect OS-level memory usage values.

    To instrument a VM:
     1. Install a Capacity Agent inside the VM from which you want to collect metrics
     2. Configure the Gateway Server and Capacity Agent to initiate data collection. For more information, see Collecting-data-via-Capacity-Agents.
     3. Configure and use the out-of-the-box BMC-TrueSight-Capacity-Optimization-Gateway-VIS-files-parser to collect the required metrics from the VM.

After data collection starts, data is loaded in the data warehouse daily and Indicators are available in the Workspace.

In the Workspace:

  • Virtual machines whose data is collected by the VMware vCenter Extractor Service ETL are represented by the Virtual Machines - VMware system type
  • Instrumented VMs are represented by the Virtual Node - VMware system type
  • If you have configured the ETLs to share the same entity catalog, then these two system types, Virtual Machine - VMware and Virtual Node - VMware, are internally linked and you can view their relationship for each VM.


VM data extracted using the VMware vCenter Extractor Service ETL:

VM data extracted using the BMC - TrueSight Capacity Optimization Gateway VIS files parser


etl_vm.png

ca_vn.png

vm_inhierarchy.png

vn_inhierarchy.png


As an Administrator, you must install the vSphere views and Capacity Pools view and grant the necessary permissions to Capacity Planners and vSphere Technology Specialists to access these views.

Step 2. Analyze the collected data

Use the capacity views to analyze the imported vSphere virtual machine data. Depending on the data source, you can use the vSphere Virtual Machines view or the Servers Overview view for analysis.

To view and analyze the key capacity metrics and charts for vSphere VMs, use the vSphere-Virtual-Machines-view. It displays metrics collected by the VMware-vCenter-Extractor-Service ETL.

To view and analyze the granular metrics (actual memory utilization value, workload data, and process level data) that are collected from an instrumented VM, use the Servers-Overview-view. The vSphere Virtual Machines view does not display these metrics. 

The following common use cases are described here. 

Identify overallocated VMs

Use the Recommendations-page-in-the-vSphere-Virtual-Machines-view to identify vSphere virtual machines that are overallocated. The page also provides actionable recommendations to help you resolve the issue.

The Overallocated-VMs-page-in-the-vSphere-Virtual-Machines-view displays a list of all such VMs along with some of their key metric details.


Identify idle or unused VMs

Use the Recommendations-page-in-the-vSphere-Virtual-Machines-view to identify vSphere virtual machines that are idle. The page also provides actionable recommendations to help you resolve the issue.

The Idle-VMs-page-in-the-vSphere-Virtual-Machines-view displays a list of all such VMs along with some of their key metric details.

Determine and analyze the available resources and their utilization per VM

Review and analyze the relevant metrics on the All-VMs-page-in-the-vSphere-Virtual-Machines-view to determine the available resources and their utilization per VM. For example, utilization metrics for CPU, Memory, and Datastore.

Identify and analyze VMs with the most and the least utilized resources

Review the details on the Top-Bottom-VMs-page-in-the-vSphere-Virtual-Machines-view to identify and analyze the virtual machines with the most and the least utilized resources.

Identify VMs with snapshots

Review the Recommendations-page-in-the-vSphere-Virtual-Machines-view to identify vSphere virtual machines that have old snapshots. The page also provides actionable recommendations to help you handle the issue.

To view all VMs that have snapshots, review the VM-Snapshots-page-in-the-vSphere-Virtual-Machines-view.

Identify vSphere VMs that have exhausting resources

Review the Recommendations-page-in-the-vSphere-Virtual-Machines-view to identify vSphere virtual machines that have exhausting resources (CPU, memory, storage). The page also provides actionable recommendations to help you handle them.

The Future-Saturation-page-in-the-vSphere-Virtual-Machines-view displays a list of all such VMs along with some of their key metric details.

Determine the VMs that have resource contention

Use the Watchlist-page-in-the-vSphere-Virtual-Machines-view to determine the vSphere virtual machines that have resource contention. You can analyze them further by reviewing their metrics. For example, CPU starved VMs, memory starved VMs, read or write latency issues. 

You can further drill down into a specific virtual machine for detailed analysis.

Analyze the trend and behavior of the actual (OS-level) memory utilization of vSphere VMs

Use the Server-views to review and analyze the trend and behavior of the actual memory utilization of the vSphere virtual machines.

These views display data that is collected from instrumented VMs.

Performing advanced analysis

The earlier sections explained how you can use the out-of-the-box capacity views to manage your environment. These capacity views help you analyze your vSphere infrastructure using a predefined set of metrics.

To perform advanced analysis on the imported vSphere data, such as identifying specific performance issues, trends, and bottlenecks, you can use Analysis.

About Analysis

An analysis is a visual tool that you can use to identify the behavior of a set of metrics and the relationships among them. Each analysis can focus on the business driver metrics of an application, on the performance of an application's systems, and on the events related to an application. Analyses can also be used to compare performance and business driver metrics to determine a system's behavior under load. 

Here are some use cases for which you can create and use Analyses:

For more examples, see Creating-an-analysis.

Analyze the CPU utilization of a cluster over time and understand the trend

Identify the growth trend of virtual machines in a vSphere cluster over the years

Analyze the resource utilization pattern of vSphere clusters

  1. Create an analysis. 
    See sample configuration values: 
    vsphere_res_util_analysis_1.png
    vsphere_analysis_resutil_2.png

  2. Review the analysis results.
    The analysis results are shown in a tabular format. The summary table shows the resource utilization metric values for a cluster. The page shows a table for each cluster.

    vsphere_analysis_resutil_result.png

Managing future demand

By using the capacity views and analysis charts, you can analyze the data of your existing capacity. To predict and plan your IT resource needs, use Models, Reservations, and the Virtual Planner.

This topic describes the following use cases:

For more use cases of Virtual Planner, see Creating-virtualization-studies-to-assess-VM-capacity-requirements.

Predicting the behavior of your resources

Use Models to predict service performance and obtain forecasts of historical series of metrics, including deep details on the modeling techniques used in forecasts and how to interpret the results of model runs.

About models

A model is a simplified mathematical description of service components that evaluates historical data, predicts future behavior, and simulates what-if scenarios. Models are always built on existing data and analysis. After you create a model, define scenarios to perform multiple predictions under different conditions. With Models you can forecast and model changes in service demands.

How to determine when a resource (datastore) of a vSphere cluster completely saturates

How to determine the impact on the resource (datastore) of a vSphere cluster when new VMs are added to the cluster

For more information, see Modeling-capacity-usage.

Reserving capacity for future needs

Use Reservations to allocate and schedule the required IT infrastructure resources. You can reserve a vSphere VM on providers, such as Clusters, Hosts, and Datastores.

About Reservations

Reservations enable you to do reservation-aware capacity management that provides more detailed future saturation information with respect to capacity management based on time forecasting models and trends. 

How to reserve capacity on a vSphere cluster

For more information about other tasks that you can perform with Reservations, see Reserving-capacity-for-upcoming-services.

Assessing resource capacity against your requirements

Use Virtual Planner to easily simulate consolidation/virtualization scenarios to carefully design your virtual Infrastructure.

About virtual planner

With Virtual Planner, you can make the right plan to ensure your IT infrastructure is able to sustain new expected workloads and is optimized with respect to the placement of the current workloads. You can easily improve your virtual environment by planning the addition of new virtual resources and/or re-balancing current workloads across existing virtual hosts in order to support the forecasted service demands.

How to deploy new VMs in a vSphere cluster to use resources optimally

Further Reading

Best Practices for using the Capacity Pools view

 

Tip: For faster searching, add an asterisk to the end of your partial query. Example: cert*