Moviri Integrator for BMC Helix Capacity Optimization - Cloudera
“Moviri Integrator for BMC Helix Continuous Optimization – Cloudera” is an additional component of BMC Helix Continuous Optimization product. It allows extracting data from Cloudera Enterprise, which is Cloudera Hadoop distribution composed of CDH (Cloudera Data Hub) and Cloudera Manager. Relevant capacity metrics are loaded into BMC Helix Continuous Optimization, which provides advanced analytics over the extracted data in the form of an interactive dashboard, the Hadoop View.
The integration supports the extraction of both performance and configuration data across different component of CDH and can be configured via parameters that allow entity filtering and many other settings. Furthermore the connector is able to replicate relationships and logical dependencies among entities such as clusters, resource pools, services and nodes.
The documentation is targeted at BMC Helix Continuous Optimization administrators, in charge of configuring and monitoring the integration between BMC Helix Continuous Optimization and Cloudera.
- Requirements
- Datasource Check and Configuration
- Supported entities
- Hierarchy
- Troubleshooting
- Configuration and performance metrics mapping
Requirements
Supported versions of data source software
- Supported Cloudera Data Hub and Cloudera Manager versions: 5.1 to 5.16.1
- The integration supports both Cloudera Manager bundled in Cloudera Enterprise and Cloudera Express products
Supported configurations of data source software
Moviri – Cloudera Extractor requires Cloudera Manager is continuously and correctly monitoring the various entities supported by the integration, full list available below. Any lack in meeting this requirement will cause lack in data coverage.
Datasource Check and Configuration
Preparing to connect to the data source software
The connector included in "Moviri Integrator for BMC Helix Continuous Optimization – Cloudera" use the Cloudera Java API v6 to communicate with Cloudera Manager. This is always enabled and no additional configuration is required.
Please note that only SELECT statements are used by the connector, preventing any accidental change to the environments.
The connector requires a read-only user with permissions on all the clusters that should be accessed.
Connector configuration attributes
The following table shows specific properties of the connector, all the other generic properties are documented here.
The following image shows the list of options in the ETL configuration menu, with advanced properties.
Supported entities
The following entities are supported:
- Hadoop Cluster
- Hadoop Resource Pool
- Hadoop Node
In addition to standard system performance metrics, data related to the following Hadoop specific services is gathered:
- HDFS
- SPARK
- YARN
- HBASE
- MAP REDUCE
Hierarchy
The connector is able to replicate relationships and logical dependencies among these entities. In particular all the available Clusters are attached to the root of the hierarchy and each Cluster contains its own Nodes and Pools.
Services' data is available among the above entities' metrics, according to the following table.
HDFS | YARN | HBASE | MAP REDUCE | SPARK | |
Cluster | X | X | X | X | X |
Pool |
| X |
|
|
|
Node | X |
|
|
|
|
Troubleshooting
For ETL troubleshooting, refer to official BMC documentation available here.
Configuration and performance metrics mapping