Data analysis criteria
The Data Analysis function analyzes the data in a particular field by comparing the data to its record layout field definition and reports whether the data is valid or invalid. The Data Analysis report is divided into the following informational sections:
Statistics
Reports the number of unique values, number of valid values, and the range of values found in the selected field.
Occurrence
Reports each unique value and the number of times each value occurs within the selected field.
Invalid Data
Reports whether invalid data was found in the selected field. When invalid data is found, Data Solutions lists the number of invalid values and the highest, lowest, and most common invalid value found.
On the Data Analysis Specification screen, you can specify how the values are analyzed. You may choose to not produce the Statistics or Occurrence section of the report. You can set a maximum number of unique occurrences to report.
You can use a “filter dataset” to limit the values that Data Solutions lists in the occurrence report. This is useful when there are numerous records and a wide range of data values. Include the values that you want analysis information on in the filter dataset. Data Solutions limits the occurrence report to the “filtered” values found within the field.
The following figure allows you to enter criteria to analyze data fields. It is the same as the Data Analysis field exit provided by Data Solutions. See Field Exits (6) for more information.
The following figure is displayed when you enter:
- Option 3.3 on the Primary Option Menu
- Option 3.3 on the Criteria Menu
- Option 3 on the Data Analysis Tools Menu.
Data Analysis Criteria Screen
-------------------------------------------------------------------------------
File-AID/Data Solutions ------- Data Analysis Criteria ------------------------
COMMAND ===> SCROLL ===> PAGE
SET 1 OF 1 EMPLOYEE-MASTER-FILE GBL = N SEL = N
---- FIELD LEVEL/NAME ------- -FORMAT- RO --------- PARAMETERS -------------- +
******************************* TOP OF DATA ***********************************
5 EMP-NUMBER 5/AN
5 EMP-LAST-NAME 15/AN
5 EMP-FIRST-NAME 10/AN
5 EMP-MID-INIT 1/AN
5 FILLER 2/AN
5 EMP-TITLE 30/AN
5 EMP-PERSONAL-INFO SYNC 23/GRP
10 EMP-NATL-ID-NUMBER 9/NUM
10 FILLER 1/AN
10 EMP-DATE-OF-BIRTH 6/AN
10 EMP-DOB-REDEF REDEFINES EMP-DATE-OF-BIRTH
10 EMP-DOB-REDEF SYNC 6/GRP
15 EMP-DOB-MM 2/NUM
15 EMP-DOB-DD 2/NUM
Valid RO: EX, FM
Commands: SELect, View
Field descriptions of data analysis criteria
Set
Identifies the set of criteria displayed.
01 Level Name
Name of the record file structure from layout.
GBL
Specifies whether one or more global fields have been attached to this set of criteria (N or Y).
SEL
Specifies whether selection criteria has been attached to this set of criteria (N or Y).
FIELD LEVEL/NAME
Name of the field from layout.
FORMAT
Length and type of field.
RO
Specify a relational operator. Possible values are:
EX | Execute (default). Displays the Data Analysis Specification screen. See Data Analysis Specification. |
= | Replicates last entered criteria. |
FM | Field mask. For more information, see Field Mask. |
Parameters
Displays the parameters that were specified on the Data Analysis Specification screen.
This section provides information about the following topics: