Control-M for GCP Dataflow


The Google Cloud Platform (GCP) Dataflow job enables you to perform cloud-based data processing for batch and real-time data streaming applications.

Control-M for GCP Dataflow enables you to do the following:

  • Execute GCP Dataflow jobs based on a Classic or Flex template.
  • Manage GCP Dataflow credentials in a secure connection profile.
  • Connect to any GCP Dataflow endpoint.
  • Integrate GCP Dataflow jobs with other Control-M jobs into a single scheduling environment.
  • Monitor the status, results, and output of GCP Dataflow jobs in the Monitoring domain.
  • Attach an SLA job to your GCP Dataflow jobs.
  • Introduce all Control-M capabilities to Control-M for GCP Dataflow including advanced scheduling criteria, complex dependencies, Resource Pools, Lock Resources, and variables.
  • Run 50 GCP Dataflow jobs simultaneously per Agent.

Control-M for GCP Dataflow Compatibility

The following table lists the prerequisites that are required to use the GCP Dataflow plug-in, each with its minimum required version.

Component

Version

Control-M/EM

9.0.20.200

Control-M/Agent

9.0.20.201

Control-M Application Integrator

9.0.20.201

Control-M Automation API

9.0.20.240

Control-M for GCP Dataflow is supported on Control-M Web and Control-M Automation API, but not on Control-M client.

To download the required installation files for each prerequisite, see Obtaining-Control-M-Installation-Files-via-EPD.

Setting up Control-M for GCP Dataflow

This procedure describes how to deploy the GCP Dataflow plug-in, create a connection profile, and define a GCP Dataflow job in Control-M Web and Automation API.

Warning

Note

Integration plug-ins released by BMC require an Application Integrator installation. However, these plug-ins are not editable and you cannot import them into Application Integrator. To deploy these integrations to your Control-M environment, import them directly into Control-M using Control-M Automation API.

Before You Begin

Verify that Automation API is installed, as described in Automation API Installation.

Begin

  1. Create a temporary directory to save the downloaded files.
  2. Download the GCP Dataflow plug-in from the Control-M for GCP Dataflow download page in the EPD site.
  3. Install the GCP Dataflow plug-in via one of the following methods:
    • (9.0.21 or higher) Use the Automation API Provision service:
      1. Log in to the Control-M/EM Server machine as an Administrator and store the downloaded zip file in the one of the following locations (within several minutes, the job type appears in Control-M Web):
        • Linux: $HOME/ctm_em/AUTO_DEPLOY
        • Windows: <EM_HOME>\AUTO_DEPLOY
      2. Log in to the Control-M/Agent machine and run the provision image command, as follows:
        • Linux: ctm provision image GDF_plugin.Linux
        • Windows: ctm provision image GDF_plugin.Windows
    • (9.0.20.200 or lower) Use the Automation API Deploy service, as described in deploy jobtype.
  4. Create a GCP Dataflow connection profile in Control-M Web or Automation API, as follows:
  5. Define a GCP Dataflow job in Control-M Web or Automation API, as follows:
Warning

Note

To remove this plug-in from an Agent, see Removing a Plug-in. The plug-in ID is GDF032022.

 

Tip: For faster searching, add an asterisk to the end of your partial query. Example: cert*

Control-M