Control-M for Databricks
Databricks is a cloud-based data analytics platform that enables you to process large workloads of data.
Control-M for Databricks enables you to do the following:
- Connect to any Databricks workspace using a Databricks Personal Access Token (PAT) for authentication. You can generate the PAT in your Databricks workspace without dependence on the cloud vendor.
- Integrate Databricks jobs with other Control-M jobs into a single scheduling environment.
- Monitor the Databricks workspace status and view the results in the Monitoring domain.
- Attach an SLA job to your entire Databricks data service.
- Introduce all Control-M capabilities to Databricks, including advanced scheduling criteria, complex dependencies, quantitative and control resources, and variables.
- Run 50 Databricks jobs simultaneously per Control-M/Agent.
Control-M for Databricks Compatibility
The following table lists the prerequisites that are required to use the Databricks plug-in, each with its minimum required version.
Component | Version |
---|---|
Control-M/EM | 9.0.20.200 |
Control-M/Agent | 9.0.20.200 |
Control-M Application Integrator | 9.0.20.201 |
Control-M Web | 9.0.20.200 |
Control-M Automation API | 9.0.20.245 |
Control-M for Databricks is supported on Control-M Web and Control-M Automation API, but not on Control-M client.
To download the required installation files for each prerequisite, see Obtaining Control-M Installation Files.
Setting up Control-M for Databricks
This procedure describes how to deploy the Databricks plug-in, create a connection profile, and define a Databricks job in Control-M Web and Automation API.
NOTE: Integration plug-ins released by BMC require an Application Integrator installation at your site. However, these plug-ins are not editable and you cannot import them into Application Integrator. To deploy these integrations to your Control-M environment, you import them directly into Control-M using Control-M Automation API.
Before you Begin
Verify that Automation API is installed, as described in Automation API Installation.
Begin
Create a temporary directory to save the downloaded files.
Click http://www.bmc.com/available/epd and follow the instructions on the EPD site to download the Databricks plug-in, or go directly to the Control-M for Databricks download page.
- Deploy the Databricks job via Automation API, as described in deploy jobtype.
- Create a Databricks connection profile in Control-M Web or Automation API, as follows:
- Define a Databricks job in Control-M Web or Automation API, as follows:
- Web: Create a Job and then define specific Databricks parameters in Databricks Job parameters.
- Automation API: Job:Databricks
Comments
Log in or register to comment.