Creating training data sets for BMC Helix Business Workflows

To use BMC Helix Cognitive Automation to predict case templates and categories for cases, you, as an administrator, train the cognitive service on your data so that the application can use it.

Important

To leverage the cognitive capabilities, you must have a license to use BMC Helix Cognitive Automation.

As an administrator, you create a data set to train the cognitive service from sample data or from your application data. The following table describes the types of training data sets that you can create:

Training data set type	Data set contents
CSV Data Set	Uses data stored in a CSV file. Note: A CSV training data set file must have at least 5 rows, and it must not contain more than 25,000 intent examples associated with 2,000 intents.
BMC Helix Innovation Suite Data Set	Uses your application record definitions and record fields.
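
The limits in the CSV Data Set note can be checked before you upload a file. The following is a minimal sketch, not a BMC tool. It assumes a hypothetical file layout in which each non-empty row is one example and the last column holds the intent label, and it counts every non-empty row toward the 5-row minimum. The function name and the sample rows are made up for illustration.

```python
import csv
import io

# Limits taken from the CSV Data Set note in the table above.
MIN_ROWS = 5
MAX_EXAMPLES = 25_000
MAX_INTENTS = 2_000


def check_training_csv(handle):
    """Return a list of limit violations; an empty list means none were found.

    Assumption (not from the product documentation): each non-empty row is
    one example, and its last column holds the intent label.
    """
    rows = [row for row in csv.reader(handle) if any(cell.strip() for cell in row)]
    intents = {row[-1].strip() for row in rows}

    problems = []
    if len(rows) < MIN_ROWS:
        problems.append(f"{len(rows)} rows; at least {MIN_ROWS} required")
    if len(rows) > MAX_EXAMPLES:
        problems.append(f"{len(rows)} examples; at most {MAX_EXAMPLES} allowed")
    if len(intents) > MAX_INTENTS:
        problems.append(f"{len(intents)} intents; at most {MAX_INTENTS} allowed")
    return problems


# In-memory sample with made-up rows; a real file would be opened with
# open(path, newline="") and passed in the same way.
sample = io.StringIO("reset my password,Password Reset\nnew laptop request,Hardware\n")
print(check_training_csv(sample))  # ['2 rows; at least 5 required']
```

If your CSV file has a header row or a different column order, adjust the row filter and the intent column accordingly.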

For applications that do not require continuous cognitive service training, train the cognitive service by using data from CSV files.

For applications that are used in a business environment that changes often, you need to train the cognitive service continuously. For such applications, provide the training data from the applications themselves. This approach keeps the cognitive service supplied with current data so that its suggestions reflect the business changes.

To create a training data set

As an administrator, log in to BMC Helix Innovation Studio and navigate to the Administration tab.
Select Case Management > Cognitive Training Data Set.

On the Auto-classification Training and Evaluation tab, in the Data Sets section, click New, and perform one of the following steps:

To create a data set that uses a CSV file, select CSV Data Set, and in the New CSV Data Set pane, complete the fields as described in the following table:

Description of fields in the New CSV Data Set pane

Field	Description
Data Set Name	Enter a name for your training data set.
Description	Enter a description that provides details about your training data set.
Training Type	The IBM Watson Conversation for natural language classification option is populated automatically. You cannot change this option.
CSV File	Click Choose File and upload your training data set.
Locale	Select the locale in which you want to display your training data set.
Training Data	Select the percentage of the CSV data that you want to use as training data.
Test Data	The percentage of CSV data that you want to use as test data is automatically calculated according to the Training Data percentage.

To create a data set that uses the application data, select Innovation Suite Data Set, and in the New Training Data Set pane, complete the fields as described in the following table:

Description of fields in the New Innovation Suite Data Set pane

Field	Description	Values to add for template prediction	Values to add for category prediction
Data Set Name	Enter a name for your training data set.	Not Applicable	Not Applicable
Description	Enter a description that provides details about your training data set.	Not Applicable	Not Applicable
Training Type	The IBM Watson Conversation for natural language classification option is populated automatically. You cannot change this option.	Not Applicable	Not Applicable
Record Definition Name	Select a record to provide data to the cognitive service.	Case	Case
Filter	Select the Company ID for which you want to create the data sets. The filter enables the cognitive service to create data sets specific to a company, for example, Calbro Services. To map the correct trained data set to the correct company, you can use the following naming convention when you create your data set: Category datasetName for companyName (for example, Category Data Set 1 for Calbro Services); Template datasetName for companyName (for example, Template Data Set 1 for Calbro Services).	<Company ID>	<Company ID>
Text Fields	Click Add/Remove Text Fields and select the field from the list of available fields. The text fields enable the cognitive service to use the data from the selected field to predict the case templates.	Summary	Summary
Category Fields	Click Add/Remove Category Fields and select the field from the list of available fields. The category fields enable the cognitive service to use the data from the Summary field, and to return the Template IDs for the predicted case templates and Category IDs for predicted categories. Notes: For predicting templates, if you use a field other than Template Name in the Category Fields, the cognitive service fails to predict the case templates. For predicting categories, if you use a field other than Category Tier 1, Category Tier 2, and Category Tier 3 in the Category Fields, the cognitive service fails to predict the categories. Ensure that you add the fields in the correct order: Category Tier 1, Category Tier 2, and then Category Tier 3.	Template Name	Category Tier 1, Category Tier 2, Category Tier 3
Training Data	Select the percentage of the data that you want to use as training data.	Not Applicable	Not Applicable
Test Data	The percentage of the data that you want to use as test data is automatically calculated according to the Training Data percentage.	Not Applicable	Not Applicable

Click Save.
The new training data set is displayed in the Training Data Sets section. You can delete the training data set or create a copy of the existing training data set.
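
Two conventions from the procedure can be expressed as a short sketch: the Test Data percentage that is derived from the Training Data percentage, and the naming convention suggested for the Filter field. The sketch assumes that the two percentages add up to 100, which the field descriptions imply but do not state. Both function names are hypothetical and are not part of any BMC API.

```python
def derive_test_percentage(training_percentage):
    """Return the test share, assuming training and test shares sum to 100."""
    if not 0 <= training_percentage <= 100:
        raise ValueError("training percentage must be between 0 and 100")
    return 100 - training_percentage


def suggested_data_set_name(kind, data_set_name, company_name):
    """Build a name of the form '<kind> <datasetName> for <companyName>'."""
    if kind not in ("Category", "Template"):
        raise ValueError("kind must be 'Category' or 'Template'")
    return f"{kind} {data_set_name} for {company_name}"


print(derive_test_percentage(80))  # 20
print(suggested_data_set_name("Template", "Data Set 1", "Calbro Services"))
# Template Data Set 1 for Calbro Services
```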

After you create the training data set, you can train and test the cognitive service. For more information, see Training and testing the cognitive service for a custom application.

Where to go from here

Enabling template and category prediction

BMC Helix Business Workflows 21.3