Generic - Database extractor (Java)
Step I. Complete the preconfiguration tasks
- Make sure that you can reach your database server from the ETL engine machine by running the ping <your_db_server_ip> command.
- Make sure that you can remotely access your database server from the ETL engine machine by running the telnet <your_db_server_ip> <your_db_server_port> command.
- Connect to your database using a software client.
- If the username is part of a Windows domain, verify that the username is specified in the YOURDOMAIN\username format. Verify that your settings match the settings that are used to connect to the same database with a software client.
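The telnet check in the preceding list confirms that a TCP connection to the database port can be opened. On machines where telnet is not installed, the same check can be sketched in a few lines of Python. This is only an illustration and not part of the product: the function name can_reach and the default timeout are assumptions.

```python
import socket


def can_reach(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port opens within the timeout.

    Hypothetical helper, equivalent in spirit to the telnet check.
    """
    try:
        # create_connection resolves the host name and completes the TCP
        # handshake, which is what a successful telnet session confirms.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

For example, can_reach("<your_db_server_ip>", <your_db_server_port>) returns True only when the port accepts connections from the ETL engine machine. A True result does not validate credentials, so connecting with a software client is still required.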
Step II. Configure the ETL
You must configure the ETL to connect to the generic database for data collection. ETL configuration includes specifying the basic and optional advanced properties. While configuring the basic properties is sufficient, you can optionally configure the advanced properties for additional customization.
A. Configuring the basic properties
Some of the basic properties display default values. You can modify these values if required.
To configure the basic properties:
- In the Helix Capacity Optimization Console, navigate to Administration > ETL & System Tasks, and select ETL tasks.
- On the ETL tasks page, click Add > Add ETL under the Last run tab. The Add ETL page displays the configuration properties. You must configure properties in the following tabs: Run configuration, Entity catalog, SQL query, and Connection parameters.
- On the Run configuration tab, complete these steps:
- From the ETL module list, select Generic - Database Extractor. The name of the ETL is displayed in the ETL task name field. You can edit this field to customize the name.
- Specify the datasets that you want to add to the ETL run configuration. The ETL collects data of metrics that are associated with these datasets.
- Click Edit.
- Select one (click) or more (shift+click) datasets from the Available datasets list and click >> to move them to the Selected datasets list.
- Click Apply.
Important: The ETL collects data on metrics associated with the datasets that are available in the Selected datasets list. Make sure that you select the CST dataset to import the buffer table data. For details, see Overview-of-datasets-in-an-ETL-task and Dataset-reference-for-ETL-tasks.
- On the Custom structure tables tab, complete the following steps. For details on custom structure tables, see Collecting-data-for-custom-structure-tables.
- In Table names, specify the buffer table identifier that you received while creating the buffer table by using the data mart API. The identifier starts with the BUF_ prefix. For details, see Datamart-API-endpoints.
Important: The BUF_ prefix is automatically added to the name if it is missing from the identifier.
- In Behavior, select one of the following operations for the custom structure tables:
- Load data into a buffer table in append or update mode.
- Remove data from a buffer table in delete mode.
- Truncate (empty) a buffer table.
- Click the Entity catalog tab and select one of the following options:
- Shared Entity Catalog: Select if the other ETLs access the same entities that are used by this ETL.
- From the Sharing with Entity Catalog list, select an entity catalog name that is shared between ETLs.
- Private Entity Catalog: Select if you want to use this ETL independently.
- On the SQL query tab, build the SQL extraction query.
- SQL query hints: Provides hints for including the mandatory and optional parameters in the SQL extraction query.
- SQL query: Specify the SQL extraction query. You can select the required parameters and operators from the list and add them to the text area. This tab is available only when a custom database query must be built to get the data from a generic source. The query must use the format that is supported by the database extractor. Do not add comments to the SQL query. Commenting is based on line formatting, which is not maintained when the query is stored in and run from the database.
- SQL parameters number: Select the number of custom parameters that you need to add to the query. You can add up to 10 custom parameters to the query.
- Click the Connection parameters tab, and configure the following properties:
The following image shows the basic properties configuration.
- (Optional) Override the default values of properties in the following tabs:
- Click Save.
The ETL tasks page shows the details of the newly configured Generic - Database (Java) ETL.
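Several of the basic properties have simple rules that can be checked before you type the values into the console: the buffer table identifier starts with the BUF_ prefix, the SQL query must not contain comments, and at most 10 custom parameters can be added. The following Python sketch illustrates such a pre-check. It is not part of the product. All function names are hypothetical, the prefix comparison is assumed to be case-sensitive, and the comment check assumes that comments use the common -- and /* */ markers.

```python
import re

BUFFER_PREFIX = "BUF_"
MAX_CUSTOM_PARAMETERS = 10  # limit stated for the SQL parameters number field


def normalize_buffer_table_name(name: str) -> str:
    """Add the BUF_ prefix when it is missing (assumed case-sensitive)."""
    name = name.strip()
    if not name:
        raise ValueError("buffer table identifier is empty")
    return name if name.startswith(BUFFER_PREFIX) else BUFFER_PREFIX + name


def contains_sql_comment(sql: str) -> bool:
    """Report whether -- or /* appears outside single-quoted literals."""
    # Blank out quoted literals first ('' is an escaped quote inside one),
    # so that a value such as '--' is not mistaken for a comment marker.
    without_literals = re.sub(r"'(?:[^']|'')*'", "''", sql)
    return "--" in without_literals or "/*" in without_literals


def check_parameter_count(count: int) -> None:
    """Raise ValueError when the custom parameter count is out of range."""
    if not 0 <= count <= MAX_CUSTOM_PARAMETERS:
        raise ValueError(
            f"expected 0 to {MAX_CUSTOM_PARAMETERS} custom parameters, got {count}"
        )
```

The comment check only detects markers; it does not rewrite the query. Removing comments by hand keeps the query text that you review identical to the text that is stored.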
(Optional) B. Configuring the advanced properties
You can configure the advanced properties to change the way the ETL works.
To configure the advanced properties:
- On the Add ETL page, click Advanced.
Configure the following properties:
- Click Save.
The ETL tasks page shows the details of the newly configured Generic - Database (Java) ETL.
Step III. Run the ETL
After you configure the ETL, you can run it to collect data. You can run the ETL in the following modes:
A. Simulation mode: Validates only the connection to the data source and does not collect data. Use this mode when you run the ETL for the first time or after you change the ETL configuration.
B. Production mode: Collects data from the data source.
A. To run the ETL in the simulation mode
To run the ETL in the simulation mode:
- Navigate to Administration > ETL & System Tasks, and select ETL tasks.
- On the ETL tasks page, click the ETL. The ETL details are displayed.
- In the Run configurations table, click Edit to modify the ETL configuration settings.
- On the Run configuration tab, ensure that the Execute in simulation mode option is set to Yes, and click Save.
- Click Run active configuration. A confirmation message about the ETL run job submission is displayed.
- On the ETL tasks page, check the ETL run status in the Last exit column.
- OK: Indicates that the ETL ran without any error. You are ready to run the ETL in the production mode.
- If the ETL run status is Warning, Error, or Failed:
- On the ETL tasks page, click the icon in the last column of the ETL name row.
- Check the log and reconfigure the ETL if required.
- Run the ETL again.
- Repeat these steps until the ETL run status changes to OK.
B. To run the ETL in the production mode
You can run the ETL manually when required or schedule it to run at a specified time.
To run the ETL manually
- On the ETL tasks page, click the ETL. The ETL details are displayed.
- In the Run configurations table, click Edit to modify the ETL configuration settings. The Edit run configuration page is displayed.
- On the Run configuration tab, select No for the Execute in simulation mode option, and click Save.
- To run the ETL immediately, click Run active configuration. A confirmation message about the ETL run job submission is displayed.
When the ETL runs, it collects data from the source and transfers it to the BMC Helix Continuous Optimization database.
To schedule the ETL run in the production mode
By default, the ETL is scheduled to run daily. You can customize this schedule by changing the frequency and period of running the ETL.
To configure the ETL run schedule:
- On the ETL tasks page, click the ETL, and click Edit task. The ETL details are displayed.
- On the Edit task page, do the following, and click Save:
- Specify a unique name and description for the ETL task.
- In the Maximum execution time before warning field, specify the duration for which the ETL must run before generating warnings or alerts, if any.
- Select a predefined or custom frequency for starting the ETL run. The default selection is Predefined.
- Select the task group to which you want to assign the ETL task.
- Click Schedule. A message confirming the scheduling job submission is displayed.
When the ETL runs as scheduled, it collects data from the source and transfers it to the BMC Helix Continuous Optimization database.
Step IV. Verify data collection
Verify that the ETL ran successfully and check whether the Generic Database (Java) data is refreshed in the Workspace.
To verify whether the ETL ran successfully
- Click Administration > ETL & System Tasks > ETL tasks.
- In the Last exec time column corresponding to the ETL name, verify that the current date and time are displayed.
- In the Last exit column corresponding to the ETL name, verify that the status is OK.
In case of WARNING or ERROR, click the icon in the last column of the ETL name row to review the log files.
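The two checks above (a current Last exec time and a Last exit status of OK) can be expressed as a small predicate, for example when you record the values from the ETL tasks page in a checklist or a script. The following Python sketch is illustrative only. The function name, the way the values are obtained, and the one-hour tolerance for "current" are assumptions and are not defined by the product.

```python
from datetime import datetime, timedelta


def etl_run_verified(
    last_exit: str,
    last_exec_time: datetime,
    now: datetime,
    max_age: timedelta = timedelta(hours=1),  # assumed tolerance for "current"
) -> bool:
    """Return True when the run ended with OK and finished recently."""
    status_ok = last_exit.strip().upper() == "OK"
    age = now - last_exec_time
    # Reject timestamps in the future as well as runs older than max_age.
    is_recent = timedelta(0) <= age <= max_age
    return status_ok and is_recent
```

A result of False only tells you that one of the two conditions failed; the log files remain the place to find the cause.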
To verify that the Generic Database data is refreshed
- Click Workspace.
- Expand Generic - Database extractor (Java) > Systems.
- In the left pane, verify that the hierarchy displays the database instances.
- Click an instance, and click the Metrics tab in the right pane.
- Check if the Last Activity column in the Configuration metrics and Performance metrics tables displays the current date.
The following image shows sample metrics data.
(For Custom structure tables only) Verify that the data is imported or updated in this buffer table:
- Click Administration > Data marts.
The Data mart page lists the out-of-the-box and custom data marts that are available in the system. The buffer tables are listed with the Type as Custom structure table.
- Click the name of the buffer table to view its contents.
For details, see Managing-data-marts.