Data analysis criteria


The Data Analysis function analyzes the data in a particular field by comparing the data to its record layout field definition and reports whether the data is valid or invalid. The Data Analysis report is divided into the following informational sections:

Statistics

Reports the number of unique values, number of valid values, and the range of values found in the selected field.

Occurrence

Reports each unique value and the number of times each value occurs within the selected field.

Invalid Data

Reports whether invalid data was found in the selected field. When invalid data is found, Data Solutions lists the number of invalid values and the highest, lowest, and most common invalid value found.

On the Data Analysis Specification screen, you can specify how the values are analyzed. You may choose to not produce the Statistics or Occurrence section of the report. You can set a maximum number of unique occurrences to report.

You can use a “filter dataset” to limit the values that Data Solutions lists in the occurrence report. This is useful when there are numerous records and a wide range of data values. Include the values that you want analysis information on in the filter dataset. Data Solutions limits the occurrence report to the “filtered” values found within the field.

The following figure allows you to enter criteria to analyze data fields. It is the same as the Data Analysis field exit provided by Data Solutions. See Field Exits (6) for more information.

The following figure is displayed when you enter:

  • Option 3.3 on the Primary Option Menu
  • Option 3.3 on the Criteria Menu
  • Option 3 on the Data Analysis Tools Menu.

Data Analysis Criteria Screen

  Action  Sets  Display  Show  Options  Menu  Help
 -------------------------------------------------------------------------------
 File-AID/Data Solutions ------- Data Analysis Criteria ------------------------
 COMMAND ===>                                                  SCROLL ===> PAGE

 SET  1 OF  1           EMPLOYEE-MASTER-FILE                     GBL = N SEL = N
 ---- FIELD LEVEL/NAME ------- -FORMAT- RO --------- PARAMETERS -------------- +
 ******************************* TOP OF DATA ***********************************
 5 EMP-NUMBER                    5/AN
 5 EMP-LAST-NAME                15/AN
 5 EMP-FIRST-NAME               10/AN
 5 EMP-MID-INIT                  1/AN
 5 FILLER                        2/AN
 5 EMP-TITLE                    30/AN
 5 EMP-PERSONAL-INFO SYNC       23/GRP
   10 EMP-NATL-ID-NUMBER         9/NUM
   10 FILLER                     1/AN
   10 EMP-DATE-OF-BIRTH          6/AN
   10 EMP-DOB-REDEF REDEFINES EMP-DATE-OF-BIRTH
   10 EMP-DOB-REDEF SYNC         6/GRP
     15 EMP-DOB-MM               2/NUM
     15 EMP-DOB-DD               2/NUM
 Valid RO: EX, FM
 Commands: SELect, View 

Field descriptions of data analysis criteria

Set

Identifies the set of criteria displayed.

01 Level Name

Name of the record file structure from layout.

GBL

Specifies whether one or more global fields have been attached to this set of criteria (N or Y).

SEL

Specifies whether selection criteria has been attached to this set of criteria (N or Y).

FIELD LEVEL/NAME

Name of the field from layout.

FORMAT

Length and type of field.

RO

Specify a relational operator. Possible values are:

EX

Execute (default). Displays the Data Analysis Specification screen. See Data Analysis Specification.

=

Replicates last entered criteria.

FM

Field mask. For more information, see Field Mask.

Parameters

Displays the parameters that were specified on the Data Analysis Specification screen.

This section provides information about the following topics:

 

Tip: For faster searching, add an asterisk to the end of your partial query. Example: cert*