Control-M for Google Dataproc

Google Cloud Platform (GCP) Dataproc is a managed service that enables you to perform cloud-based big data processing and machine learning. 

Control-M for Google Dataproc enables you to do the following:

  • Connect to the Google Cloud Platform from a single computer with secure login, which eliminates the need to provide authentication.
  • Trigger jobs based on any workflow template created on Google Dataproc.
  • Integrate Dataproc jobs with other Control-M jobs into a single scheduling environment.
  • Monitor the Dataproc status and view the results in the Monitoring domain.
  • Attach an SLA job to your entire Google Dataproc service.
  • Introduce all Control-M capabilities to Google Dataproc, including advanced scheduling criteria, complex dependencies, quantitative and control resources, and variables.
  • Run 50 Google Dataproc jobs simultaneously per Control-M/Agent.

Setting up Control-M for Google Dataproc

This procedure describes how to install the Google Dataproc plug-in, create a connection profile, and define a Google Dataproc job in Helix Control-M and Automation API.

Before you Begin

  • Verify that Automation API is installed, as described in Setting up the API.
  • Verify that Agent version or later is installed.


  1. On the Agent host, set the Java environment variable by running one of the following commands through a command line:
    • Linux:
      • Bourne shell/bash: export BMC_INST_JAVA_HOME=<java_11_directory>
      • csh/tcsh: setenv BMC_INST_JAVA_HOME <java_11_directory>
    • Windows:  set BMC_INST_JAVA_HOME="<java_11_directory>"
  2. Run one of the following API commands:
    • For a fresh installation, use the provision image command:
      • Linux:  ctm provision image GDP_plugin.Linux
      • Windows: ctm provision image GDP_plugin.Windows
    • For an upgrade, use the following command:
      ctm provision agent::update
  3. Create a GCP Dataproc connection profile in Helix Control-M or Automation API, as follows:
  4. Define a GCP Dataproc job in Helix Control-M or Automation API, as follows:


To remove this plug-in from an Agent, follow the instructions in Removing a Plug-in. The plug-in ID is GDP042022.

Was this page helpful? Yes No Submitting... Thank you