Monitoring a reconciliation job
You can monitor a reconciliation job by using the fail-safe mechanism or from the Reconciliation component.
Overview of the fail-safe mechanism
The fail-safe mechanism is available only when you configure a value for the Job Idle Time (minutes) parameter. You can set this parameter and all other reconciliation-related fields in the new Dashboard, select Configurations > Core Configurations, and click the Reconciliation component in the left pane.
The fail-safe mechanism in the reconciliation engine monitors the progress of all the running jobs (scheduled, continuous, and non-continuous). When it encounters an unresponsive job, the mechanism automatically restarts the job. For noncontinuous jobs, the mechanism stops the current job and starts a new job. For continuous jobs, the mechanism stops the current run and lets the next run start after the continuous interval of that job has elapsed.
For example, assume that you have configured the job idle time for 500 minutes. The fail-safe mechanism monitors all the jobs after every 500 minutes. If the fail-safe mechanism detects a job that is not responding (not processing a CI) for more than 500 minutes, it restarts the unresponsive job.
To use the fail-safe mechanism to monitor reconciliation jobs
You can monitor a reconciliation job from the Reconciliation component and interpret the various values displayed in the Job Details area. This mechanism logs all the traces in the arrecond.log file. Locate the log file if you have configured it to reside in a particular directory. Usually, this log file resides in the installation directory; for example: \Program Files\BMC Software\AtriumCore\Logs.
When working with the fail-safe mechanism, consider the following information:
- The allowable time interval for which a job can remain idle is configurable.
- The configured time interval is in minutes.
- The default job idle time is 0 minutes.
- From the Data Flow section of the CMDB Dashboard, click Reconciliation. (Alternatively, on the CMDB Dashboard, select Jobs > Manage Reconciliation.)
The following page is displayed.
- From the Dataset list, select a data set name for which you want to view the jobs, or select All to view jobs for all the data sets in CMDB.
From the upper-right corner, select the time period for which you want to view the jobs.
For example, you can select Today, Last 7 days, and so on.
For the selected data set and time-period filters, the ribbon component displays the following fields:
Count of all jobs that have run at least once plus jobs that have not run even once.
|Executed||Count of jobs that have run at least once for the applied data set and time-period filters.|
Count of jobs that have not run even once for the applied data set and time-period filters.
|CIs||Total||Count of all CIs processed in all the executed jobs.|
|Good||Count of CIs that are successfully reconciled for the executed jobs.|
|Errors||Count of CIs that failed reconciliation for the executed jobs.|
|Relationships||Total||Count of all relationships processed in all the executed jobs.|
|Good||Count of relationships that are successfully reconciled for the executed jobs.|
|Errors||Count of relationships that failed reconciliation for the executed jobs.|
- The values displayed in the Jobs ribbon component are run-time values.
- The values in the CIs and Relationships ribbon components are generated by the Datasources Utility.
This utility runs at scheduled times, as set in the Configurations menu.
A message above the ribbon provides the date and time when the server last ran the utility.
- The time-period values are dependent on the the date of the dashboard utility run and not related to the actual calendar date.
- The information displayed for the Today option is the information populated in the last dashboard utility run. For example, you select Today in the time-period filter on 20 July 2017. If the dashboard utility was last run on 16 July 2017, the CI details that are displayed are for 16 July 2017. Additionally, if you select Last 7 days in the time-period filter, the CI details that are displayed are from 9 July 2017 to 15 July 2017.
Below the ribbon component, the All Job Run Activities table lists the details of each job in a row. Each row has the following fields:
Displays the job name that you provided when creating the job. Click a job name to view the job settings page. This page displays the following sections:
You can also edit, delete, and start the job from this page.
Displays one of the following job statuses:
Displays a count of CIs that succeeded and failed to reconcile.
Values in the green CIs column represent the number of CIs reconciled successfully, while values in the red CIs column represent the number of CIs that failed reconciliation. You can drill-down into the failed CIs by clicking the link for the number of failed CIs. This drill down page provides a Recommended Actions column that lists the solutions for resolving the failed CIs.
Displays a count of relationships that succeeded and failed to reconcile.
Values in the green Relationships column represent the number of relationships reconciled successfully, while values in the red Relationships column represent the number of relationships that failed reconciliation. You can drill down into the failed relationships by clicking the link for the number of failed relationships. This drill-down page provides a Recommended Actions column that lists the solutions for resolving the failed relationships.
Displays a link for the number of times the job has run so far. Click the link to view a page that displays the job history such as the job status, start time, and end time for each run of the job. Click the Status of a job to expand the job row and view the Activities and Events tabs.
Currently, a reconciliation job with multiple activities of the same type displays multiple job entries in the last-run job.
|Activities||Displays a link with the number of activities configured for the reconciliation job. Click the link to expand the job row and view the Activities and Events tabs. The Activities tab displays the details of the Identify, Merge, and Purge activities configured for the job, the current status of each activity, and so on. The Events tab provides description for each of the activities such as, the number of records found, the number of failures for identification, and so on.|
- If you stop a continuous job, it continues to process data for the the current run until it is complete. The job does not stop immediately.
For example, if you have set a continuous job to run every 1800 seconds and you click Stop Job at 1600 seconds after it started, the job status displays the In Progress status till the duration of 1800 seconds is complete. You can edit a job only after it stops.
- If you make changes to a continuous job, the changes will reflect only once you stop and restart the job manually.
For more information about creating and monitoring a reconciliation job, go to the Walkthroughs widget on the Dashboard, or go to the Reconciliation component, and from the walk-through options, click CMDB Reconciliation Video.