Moviri Integrator for BMC Helix Continuous Optimization - Cloudera

“Moviri Integrator for BMC Helix Continuous Optimization – Cloudera” is an additional component of the BMC Helix Continuous Optimization product. It extracts data from Cloudera Enterprise, Cloudera's Hadoop distribution, which is composed of CDH (Cloudera Data Hub) and Cloudera Manager. Relevant capacity metrics are loaded into BMC Helix Continuous Optimization, which provides advanced analytics over the extracted data in the form of an interactive dashboard, the Hadoop View.

The integration supports the extraction of both performance and configuration data across the different components of CDH and can be configured via parameters that allow entity filtering and many other settings. Furthermore, the connector is able to replicate relationships and logical dependencies among entities such as clusters, resource pools, services and nodes.

This documentation is targeted at BMC Helix Continuous Optimization administrators in charge of configuring and monitoring the integration between BMC Helix Continuous Optimization and Cloudera.

Requirements
Datasource Check and Configuration
Supported entities
Hierarchy
Troubleshooting
Configuration and performance metrics mapping

Requirements

Supported versions of data source software

Supported Cloudera Data Hub and Cloudera Manager versions: 5.1 to 5.16.1
The integration supports Cloudera Manager as bundled in both the Cloudera Enterprise and Cloudera Express products.
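
As a quick pre-check, the supported range stated above (5.1 to 5.16.1) can be expressed as a small comparison. The following is a minimal Python sketch, not part of the integrator: the function names are hypothetical, and it assumes plain dotted numeric version strings such as "5.16.1" with no suffixes.

```python
# Supported range as stated in this document: 5.1 to 5.16.1 (inclusive).
SUPPORTED_MIN = (5, 1, 0)
SUPPORTED_MAX = (5, 16, 1)


def parse_version(text):
    """Turn a dotted string such as '5.16.1' into a tuple of integers.

    Assumption: the string contains only numeric components; suffixes
    such as '-SNAPSHOT' are not handled by this sketch.
    """
    return tuple(int(part) for part in text.strip().split("."))


def is_supported(version_text):
    """Return True if the version falls inside the documented range."""
    version = parse_version(version_text)
    # Pad short versions so that '5.1' compares as (5, 1, 0).
    padded = (version + (0,) * (3 - len(version)))[:3]
    return SUPPORTED_MIN <= padded <= SUPPORTED_MAX


for candidate in ("5.1", "5.12.0", "5.16.1", "5.16.2", "6.0.0"):
    print(candidate, is_supported(candidate))
```

Tuples compare element by element, which avoids the usual pitfall of comparing version strings lexically (where "5.9" would sort after "5.16").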

Supported configurations of data source software

Moviri – Cloudera Extractor requires that Cloudera Manager continuously and correctly monitors the entities supported by the integration (the full list is available below). Any gap in meeting this requirement will result in gaps in data coverage.

Datasource Check and Configuration

Preparing to connect to the data source software

The connector included in "Moviri Integrator for BMC Helix Continuous Optimization – Cloudera" uses the Cloudera Java API v6 to communicate with Cloudera Manager. This API is always enabled and no additional configuration is required.
Please note that only SELECT statements are used by the connector, preventing any accidental change to the environments.
The connector requires a read-only user with permissions on all the clusters that should be accessed.
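
Before configuring the ETL, it can be useful to confirm that the read-only user is able to reach the Cloudera Manager API. The sketch below is not the connector's own code (the connector uses the Java API client); it only builds equivalent read-only GET requests against the REST API with HTTP Basic authentication. The host name, port 7180, user name, password and the sample query are placeholders chosen for illustration.

```python
import base64
import urllib.parse
import urllib.request

API_VERSION = "v6"  # API version named in this document


def build_request(base_url, path, user, password, params=None):
    """Build a read-only GET request for the Cloudera Manager REST API."""
    url = f"{base_url.rstrip('/')}/api/{API_VERSION}/{path.lstrip('/')}"
    if params:
        url += "?" + urllib.parse.urlencode(params)
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    request = urllib.request.Request(url, method="GET")
    request.add_header("Authorization", f"Basic {token}")
    request.add_header("Accept", "application/json")
    return request


# Placeholder connection details (assumptions, replace with real values).
BASE_URL = "http://cm.example.com:7180"
USER, PASSWORD = "capacity_ro", "secret"

# Lists the clusters visible to the user.
clusters_request = build_request(BASE_URL, "clusters", USER, PASSWORD)

# A time-series query; the SELECT statement is only an illustrative example.
timeseries_request = build_request(
    BASE_URL, "timeseries", USER, PASSWORD,
    params={"query": "SELECT cpu_user_rate WHERE category = HOST"},
)

print(clusters_request.full_url)
print(timeseries_request.full_url)

# To actually send a request (requires network access to Cloudera Manager):
# with urllib.request.urlopen(clusters_request, timeout=30) as response:
#     print(response.status, response.read()[:200])
```

If the clusters request returns an authentication or authorization error, the user's credentials or cluster permissions are the first thing to check.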

Connector configuration attributes

The following table shows the specific properties of the connector; all other generic properties are documented here.

The following image shows the list of options in the ETL configuration menu, with advanced properties.

cloudera UI.png

Supported entities

The following entities are supported:

Hadoop Cluster
Hadoop Resource Pool
Hadoop Node

In addition to standard system performance metrics, data related to the following Hadoop specific services is gathered:

HDFS
SPARK
YARN
HBASE
MAP REDUCE

Hierarchy

The connector is able to replicate relationships and logical dependencies among these entities. In particular, all the available Clusters are attached to the root of the hierarchy, and each Cluster contains its own Nodes and Pools.
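
The shape of this hierarchy can be illustrated with a small tree structure. This is only a sketch of the layout described above, not the connector's internal model; the class, function and entity names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    name: str
    kind: str  # "Root", "Hadoop Cluster", "Hadoop Node" or "Hadoop Resource Pool"
    children: list = field(default_factory=list)

    def add(self, child):
        """Attach a child entity and return it for further chaining."""
        self.children.append(child)
        return child


def build_hierarchy(clusters):
    """Build root -> clusters -> (nodes, pools).

    clusters maps a cluster name to {"nodes": [...], "pools": [...]}.
    """
    root = Entity("root", "Root")
    for cluster_name, members in clusters.items():
        cluster = root.add(Entity(cluster_name, "Hadoop Cluster"))
        for node_name in members.get("nodes", []):
            cluster.add(Entity(node_name, "Hadoop Node"))
        for pool_name in members.get("pools", []):
            cluster.add(Entity(pool_name, "Hadoop Resource Pool"))
    return root


def render(entity, depth=0):
    """Return the tree as a list of indented text lines."""
    lines = ["  " * depth + f"{entity.kind}: {entity.name}"]
    for child in entity.children:
        lines.extend(render(child, depth + 1))
    return lines


# Hypothetical sample data.
example = build_hierarchy(
    {"cluster1": {"nodes": ["node1", "node2"], "pools": ["root.default"]}}
)
print("\n".join(render(example)))
```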

Service data is available among the metrics of the above entities, according to the following table.

	HDFS	YARN	HBASE	MAP REDUCE	SPARK
Cluster	X	X	X	X	X
Pool		X
Node	X
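
The table above can also be read as a simple lookup from entity type to the services whose data it carries. The following sketch encodes the table as-is; the constant and function names are hypothetical and serve only to illustrate how the matrix might be queried, for example when checking which entity type to inspect for a missing service metric.

```python
# Service coverage per entity type, transcribed from the table above.
SERVICE_COVERAGE = {
    "Cluster": {"HDFS", "YARN", "HBASE", "MAP REDUCE", "SPARK"},
    "Pool": {"YARN"},
    "Node": {"HDFS"},
}


def services_for(entity_type):
    """Return the sorted list of services reported for an entity type."""
    return sorted(SERVICE_COVERAGE.get(entity_type, set()))


def entity_types_for(service):
    """Return the entity types (in table order) that carry a service's data."""
    return [
        entity_type
        for entity_type, services in SERVICE_COVERAGE.items()
        if service in services
    ]


print(services_for("Pool"))
print(entity_types_for("HDFS"))
```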

Troubleshooting

For ETL troubleshooting, refer to the official BMC documentation available here.