Control-M for Google Dataflow
Google Cloud Platform (GCP) Dataflow is a managed service that enables you to perform cloud-based data processing for batch and real-time data streaming applications.
Control-M for Google Dataflow enables you to do the following:
- Connect to the Google Cloud Platform from a single computer with secure login, which eliminates the need to provide authentication separately for each job.
- Trigger jobs based on any template (Classic or Flex) created on Google Dataflow.
- Integrate Dataflow jobs with other Control-M jobs into a single scheduling environment.
- Monitor the Dataflow status and view the results in the Monitoring domain.
- Attach an SLA job to your entire Google Dataflow service.
- Introduce all Control-M capabilities to Google Dataflow, including advanced scheduling criteria, complex dependencies, quantitative and control resources, and variables.
- Run up to 50 Google Dataflow jobs simultaneously per Control-M/Agent.
Setting up Control-M for Google Dataflow
This procedure describes how to install the Google Dataflow plug-in, create a connection profile, and define a Google Dataflow job in Helix Control-M and Automation API.
Before You Begin
- Verify that Automation API is installed, as described in Setting up the API.
- Verify that Agent version 9.0.21.080 or later is installed.
Begin
- On the Agent host, set the Java environment variable by running one of the following commands through a command line:
  - Linux:
    - Bourne shell/bash: export BMC_INST_JAVA_HOME=<java_11_directory>
    - csh/tcsh: setenv BMC_INST_JAVA_HOME <java_11_directory>
  - Windows: set BMC_INST_JAVA_HOME="<java_11_directory>"
- Run one of the following API commands:
  - For a fresh installation, use the provision image command:
    - Linux: ctm provision image GDF_plugin.Linux
    - Windows: ctm provision image GDF_plugin.Windows
  - For an upgrade, use the following command:
    ctm provision agent::update
- Create a GCP Dataflow connection profile in Helix Control-M or Automation API, as follows:
  - Helix Control-M: See Creating a Centralized Connection Profile and GCP Dataflow Connection Profile Parameters.
  - Automation API: See ConnectionProfile:GCP Dataflow.
- Define a GCP Dataflow job in Helix Control-M or Automation API, as follows:
  - Helix Control-M: Create a job, and then define the job-specific settings described in GCP Dataflow Job parameters.
  - Automation API: See Job:GCP Dataflow.
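The Automation API path through this procedure can be sketched as a small shell script for a Linux Agent host. This is a minimal illustration, not an official installer: the helper names install_gdf_plugin and deploy_gdf_definitions are hypothetical, the script assumes that the ctm CLI is installed and already configured for your environment, and it assumes that the connection profile and job are kept in a JSON definitions file that you validate with ctm build and deploy with ctm deploy. The contents of that file are not shown here; see ConnectionProfile:GCP Dataflow and Job:GCP Dataflow for the parameters.

```shell
#!/bin/sh
# Sketch of the Automation API path through the procedure on a Linux Agent host.
# Assumption: the ctm CLI is installed and already configured for your environment.

# Command used to invoke the Automation API CLI. Set CTM=echo to preview
# the commands without running them.
CTM="${CTM:-ctm}"

# Hypothetical helper: set the Java variable, then install or upgrade the plug-in.
#   $1 = Java 11 directory, $2 = "fresh" (default) or "upgrade"
install_gdf_plugin() {
    java_dir="$1"
    mode="${2:-fresh}"
    if [ ! -d "$java_dir" ]; then
        echo "Java directory not found: $java_dir" >&2
        return 1
    fi
    export BMC_INST_JAVA_HOME="$java_dir"
    case "$mode" in
        fresh)   $CTM provision image GDF_plugin.Linux ;;
        upgrade) $CTM provision agent::update ;;
        *)       echo "Unknown mode: $mode" >&2; return 1 ;;
    esac
}

# Hypothetical helper: validate and then deploy a definitions file that holds
# a ConnectionProfile:GCP Dataflow or Job:GCP Dataflow object.
#   $1 = path to the JSON definitions file
deploy_gdf_definitions() {
    defs_file="$1"
    if [ ! -f "$defs_file" ]; then
        echo "Definitions file not found: $defs_file" >&2
        return 1
    fi
    $CTM build "$defs_file" && $CTM deploy "$defs_file"
}
```

For example, after sourcing the script you might run install_gdf_plugin /opt/java/jdk-11 fresh and then deploy_gdf_definitions gcp_dataflow.json. Both paths are placeholders. Running with CTM=echo prints each ctm command instead of executing it, which is a convenient way to review the steps first.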