Analyzing data
Data Solutions’s data analysis function enables you to analysis data in a particular field by comparing the data to its record layout field definition and produce a report detailing whether the data is valid or invalid.
This section gives you practice using Data Solutions to analyze the EMP-HOME-AREA-CODE field in the sample employee file, DASAMP.EMPLOYEE, using a filter to limit the analysis to the area codes specified in the Filter Dataset.
Accessing data analyzer menu (Option 3)
The Data Analyzer function is located on Data Solutions’ Primary Option Menu as function number 3, as shown in the following figure.
File-AID/Data Solutions Primary Option Menu
OPTION ===> 3
0 Parameters - Data Solutions Parameters USERID - USERID1
1 Date Aging - Date Aging Solutions DATE - 20/12/09
2 Euro Tools - Currency Conversions TIME - 08:39
3 Data Analyzer - Analysis Solutions TERMINAL - 3290A
4 Translator - Data Translator CCSID - 00037
5 Generation - Data Generation Solutions
6 Field Exits - Formatted Field Exits
7 Encryption - Data Encryption
8 Replace - Data Replace
D Data Privacy - Dynamic Privacy Rules
U Utilities - Print and File Override Utilities
S Criteria Sets - Create or modify Criteria sets
R XREF - Create or change record layout cross reference
T Tutorial - Display information about Data Solutions
X Exit - Terminate Data Solutions and return to ISPF
Product Update History: https://support.compuware.com/s/product-updates
Online Technical Support available at: https://support.compuware.com
(c) Copyright 1997-2021 BMC Software, Inc.
From the Data Solutions Primary Option Menu, select Data Solutions option 3 to access the Data Analysis Tools Menu (see the following figure).
Data Analysis Tools Menu
OPTION ===> 3
1 Date Pattern Analysis - Analyze Date Patterns
2 Validation - Validate Data Fields
3 Data Analysis - Analyze Data Fields
Use END to return to Primary Options Menu
From the Data Analysis Tools Menu, select option 3 to access the Input Dataset Specification screen (see the following figure).
Defining your request
The Input Dataset Specification screen, shown in the following figure, captures:
- Input dataset
- Record layout and XREF dataset information
- Change Criteria usage (and optionally, Change criteria dataset and member name).
Input Dataset Specification screen
COMMAND ===>
Specify Input Dataset Information:
Dataset name ===> DASAMP.EMPLOYEE
Member name ===> (Blank or pattern for member lis
Disposition ===> OLD (OLD or SHR)
I/O exit name ===>
Specify Record Layout Information:
Record layout usage ===> S (S = Single; X = XREF
Record layout dataset ===> DASAMP.LAYOUTS
Member name ===> EMPLOYEE (Blank or pattern for
XREF dataset name ===>
Member name ===> (Blank or pattern for member lis
Specify Criteria Information:
Use existing criteria ===> N (Y = Yes; N = No)
Criteria dataset name ===>
Member name ===> (Blank or pattern for member lis
Specify Business Rules or Field Exits Dataset Information:
Dataset name ===>
- Type DASAMP.EMPLOYEE in the Input Dataset name field.
- Specify S in the Record layout usage field.
- Type DASAMP.LAYOUTS in the Record layout dataset field.
- Type EMPLOYEE in the Member name field.
- Press <Enter>.
Specifying data analysis criteria
File-AID/Data Solutions displays the Data Analysis Criteria screen, as shown in the following figure.
Data Analysis Criteria Screen
-------------------------------------------------------------------------------
File-AID/Data Solutions ------- Data Analysis Criteria ------------------------
COMMAND ===> SCROLL ===> PAGE
SET 1 OF 1 EMPLOYEE-MASTER-FILE GBL = N SEL = N
---- FIELD LEVEL/NAME ------- -FORMAT- RO --------- PARAMETERS -------------- +
******************************* TOP OF DATA ***********************************
5 EMP-NUMBER 5/AN
5 EMP-LAST-NAME 15/AN
5 EMP-FIRST-NAME 10/AN
5 EMP-MID-INIT 1/AN
5 FILLER 2/AN
5 EMP-TITLE 30/AN
5 EMP-PERSONAL-INFO SYNC 23/GRP
10 EMP-NATL-ID-NUMBER 9/NUM
10 FILLER 1/AN
10 EMP-DATE-OF-BIRTH 6/AN
10 EMP-DOB-REDEF REDEFINES EMP-DATE-OF-BIRTH
10 EMP-DOB-REDEF SYNC 6/GRP
15 EMP-DOB-MM 2/NUM
15 EMP-DOB-DD 2/NUM
Valid RO: EX, =, FM
Commands: SELect, View
Press <PF8> to scroll forward to the 15 EMP-HOME-AREA-CODE field to select it for analysis.
Data Analysis Criteria Screen
-------------------------------------------------------------------------------
File-AID/Data Solutions ------- Data Analysis Criteria ------------------------
COMMAND ===> SCROLL ===> PAGE
SET 1 OF 1 EMPLOYEE-MASTER-FILE GBL = N SEL = N
---- FIELD LEVEL/NAME ------- -FORMAT- RO --------- PARAMETERS -------------- +
15 EMP-HOME-AREA-CODE 3/NUM EX
15 EMP-HOME-EXCHANGE 3/NUM
15 EMP-HOME-SUFFIX 4/NUM
***************************** BOTTOM OF DATA **********************************
Valid RO: EX, =, FM
Commands: SELect, View
- Type EX in the RO field to select the EMP-HOME-AREA-CODE field for data analysis.
- Press <Enter>.
Entering data analysis specifications
File-AID/Data Solutions displays the Data Analysis Specification screen, as shown in the following figure.
Data Analysis Specifications Screen
- Specify F in the Occurrences Report field.
- Type DASAMP.ANALYSIS.TABLE in the Dataset field (for Option F).
- Type 19 in the Location field.
- Type PS in the Data Type field.
- Type 00005 in the Length field.
- Press <PF3>.
File-AID/Data Solutions displays the Data Analysis Criteria screen, as shown in the following figure with the PARAMETERS field displaying the entered data analysis specifications.
Data Analysis Criteria Screen
-------------------------------------------------------------------------------
File-AID/Data Solutions ------- Data Analysis Criteria ------------------------
COMMAND ===> SCROLL ===> PAGE
SET 1 OF 1 EMPLOYEE-MASTER-FILE GBL = N SEL = N
---- FIELD LEVEL/NAME ------- -FORMAT- RO --------- PARAMETERS -------------- +
15 EMP-HOME-AREA-CODE 3/NUM EX FLEN(00003);STATS(Y);OCCUR(Y);MAXOC +
15 EMP-HOME-EXCHANGE 3/NUM
15 EMP-HOME-SUFFIX 4/NUM
**************************** BOTTOM OF DATA **********************************
Valid RO: EX, =, FM
Commands: SELect, View
Press <PF3>.
File-AID/Data Solutions displays the Criteria Menu, as shown in the following figure, allowing you to specify further change criteria. For this example, press <Enter> to proceed to the Output Specification screen.
Criteria Menu
OPTION ===>
- Status -
0 OPTIONS - Selection criteria options default
G GLOBAL - GLOBAL Fields 0 fields
1 DATE AGING - Aging criteria 0 sets
2.1 EURO CONVERSION - Euro Conversion criteria 0 sets
2.2 TRIANGULATION - Currency Triangulation 0 sets
3.1 DATE PATTERN ANALYSIS - Date Pattern criteria 0 sets
3.2 VALIDATION - Validate criteria 0 sets
3.3 DATA ANALYSIS - Data Analysis criteria 1 sets
4 TRANSLATOR - Data Translator criteria 0 sets
5.1 GENERATOR - Data Generator criteria 0 sets
5.2 PATTERN - Pattern Data Generation 0 sets
6 FIELD EXITS - Utilities criteria 0 sets
7 DATA ENCRYPTION - Data Encryption Criteria 0 sets
8 REPLACE - Data Replace Criteria 0 sets
Member list description ===> ______________________________
Long ===> ______________________________________________________________
Description ===> ______________________________________________________________
Commands: VIEW, SAVE, END, CANCEL, ENTER to continue
Press <Enter>.
Specifying output options
File-AID/Data Solutions displays the Output Specification screen, as shown in the following figure. Press <Enter> to proceed with the default Output Processing Options as shown.
Output Specification Screen
COMMAND ===>
Input dataset: DFHLLM0.DASAMP.EMPLOYEE
Specify Output Processing Option:
Option ===> A (A=ANALYZE; C=COPY; U=UPDATE/PREVIEW)
Create audit trail ===> N (Y=Chgd record only; N=No; A=All records in Batch)
Audit trail prefix ===>
Specify Output Dataset Information (COPY ONLY):
Dataset name ===>
Disposition ===> (OLD or NEW)
New VSAM Type ===> (K=KSDS,E=ESDS,R=RRDS,L=LDS,Z=ZFS)
(Blank = Input Dataset Type)
Specify PDS(E) COPY options:
Copy empty members from input ===> NO (Y = Yes; N = No)
Replace like-named members ===> YES (Y = Yes; N = No)
Rename copied members mask ===>
Use ENTER to continue or END to return to Criteria Menu
Use VIEW command to display criteria summary
Use SAVE command to write criteria request
Use CANCEL to restart at Input Specification Panel
Press <Enter>.
File-AID/Data Solutions displays the ANALYZE Execution Options screen, as shown in the following figure. Press <Enter> to proceed with the defaults.
ANALYZE Execution Options Screen
COMMAND ===>
Specify Execution Options:
Process online or batch ===> O (O = Online; B = Batch)
Maximum invalid fields ===> ALL (All or number of invalid fields)
Display invalid fields ===> Y (Y = Yes; N = No)
Maximum changes ===> ALL (All or number of changes)
Specify Change Criteria Information:
Save change criteria ===> N (Y = Yes; N = No)
Change criteria dataset ===> DASAMP.CHGCRIT
Member name ===> (Blank or pattern for member list)
Member description ===> ______________________________
Long ===> ______________________________________________________________
Description ===> ______________________________________________________________
Use ENTER to execute ANALYZE
Use VIEW command to display criteria summary
Use END to return to Output Specification Panel
Use CANCEL command to restart at Input Specification Panel
Press <Enter>.
File-AID/Data Solutions displays the Data Analysis Report, as shown in the following figure. You may scroll to display the entire report or you may elect to print the report.
Data Analysis Report Screen
-------------------------------------------------------------------------------
BROWSE DFHLLM0.DASOLUTN.WF.D00049.T164028 Line 00000000 Col 001 080
Command ===> Scroll ===> PAGE
********************************* Top of Data **********************************
SET 1 DATA-====================================================================
SET 1 DATA-DATA ANALYSIS FOR FIELD EMP-HOME-AREA-CODE RECORDS ANALY
SET 1 DATA-RECORDS ANALYZED __________50
SET 1 DATA-
SET 1 DATA-----STATISTICS REPORT-----------------------------------------------
SET 1 DATA-SIGN = N DATE PATTERN =
SET 1 DATA- ----COUNT--- ----DESCRIPTION---- ---------------F I E L D V A L U
SET 1 DATA- __________14 UNIQUE VALUES
SET 1 DATA- __________43 -VALID- VALUES
SET 1 DATA- ___________1 HIGHEST ______________________909
SET 1 DATA- ___________1 LOWEST ______________________111
SET 1 DATA- __________10 MOST COMMON ______________________408
SET 1 DATA- AVERAGE ______________________417
SET 1 DATA- __________10 MEDIAN ______________________408
SET 1 DATA-
SET 1 DATA-----OCCURRENCE REPORT-----------------------------------------------
SET 1 DATA-SIGN = N DATE PATTERN =
SET 1 DATA- ----COUNT--- ----DESCRIPTION---- ---------------F I E L D V A L U
SET 1 DATA- __________14 UNIQUE VALUES
Press <PF3>.
Printing the report
File-AID/Data Solutions displays the ANALYZE Print Report screen, as shown in the following figure. Press <Enter> to print the report.
ANALYZE Print Report Screen
COMMAND ===>
Print execution report ===> Y (Y = Yes; N = No)
Instructions:
Use ENTER to perform above action and return to the initial screen
Use END to exit without PRINT
Press <Enter>.
File-AID/Data Solutions displays the Print Parameters screen, as shown in the following figure. Press <Enter> to print the report.
Print Parameters Screen
COMMAND ===>
Number of lines/page ===> 55 (0 = Suppress page headings)
Sysout class ===> A
Number of copies ===> 1
Enter One of the Following Optional Destinations:
Destination printer ===> (Local or remote printer)
- - - OR - - -
External JES Node ID ===> (Predefined JES Node and symbolic ID
Target VM/TSO ident ===> of intended receiver of output)
- - - OR - - -
Sysout writer name ===> (Installation assigned output writer)
- - - OR - - -
(DSORG=PS; RECFM=V; LRECL=125)
Print dataset name ===>
Disposition ===> OLD (NEW; SHR; MOD; OLD)
Volume serial ===>
Use ENTER to continue, END to cancel
Press <Enter>.
File-AID/Data Solutions displays the Input dataset specification screen, as shown in the following figure. Your process is complete.
Input Dataset Specification screen
COMMAND ===>
Specify Input Dataset Information:
Dataset name ===> DASAMP.EMPLOYEE
Member name ===> (Blank or pattern for member list)
Disposition ===> OLD (OLD or SHR)
Specify Record Layout and XREF Information:
Record layout usage ===> S (S = Single; X = XREF)
Record layout dataset ===> DASAMP.LAYOUTS
Member name ===> EMPLOYEE (Blank or pattern for member list)
XREF dataset name ===>
Member name ===> (Blank or pattern for member list)
Specify Criteria Information:
Use existing criteria ===> N (Y = Yes; N = No)
Criteria dataset name ===> DASAMP.CHGCRIT
Member name ===> (Blank or pattern for member list)
Specify Business Rules or Field Exits Dataset Information:
Dataset name ===> DASAMP.EURORATE.FACTORS
Viewing the report
The Data Analysis report, as shown in the following figure, is divided into the following sections:
- Statistic report that lists the number of unique values, number of valid values, and the range of values found in the selected field
- Occurrence report that lists each unique value and the number of times each value occurs within the selected field.
- Invalid data report that reports whether invalid data was found in the selected field. When invalid data is found, Data Solutions lists the number of invalid values and the highest, lowest, and most common invalid value found.
Data Analysis Report
SET 1 DATA-DATA ANALYSIS FOR FIELD EMP-HOME-AREA-CODE RECORDS ANALYZED __________50
SET 1 DATA-
SET 1 DATA-----STATISTICS REPORT---------------------------------------------------------------
SET 1 DATA-SIGN = N DATE PATTERN =
SET 1 DATA- ----COUNT--- ----DESCRIPTION---- ---------------F I E L D V A L U E---------------
SET 1 DATA- __________14 UNIQUE VALUES
SET 1 DATA- __________43 -VALID- VALUES
SET 1 DATA- ___________1 HIGHEST ______________________909
SET 1 DATA- ___________1 LOWEST ______________________111
SET 1 DATA- __________10 MOST COMMON ______________________408
SET 1 DATA- AVERAGE ______________________417
SET 1 DATA- __________10 MEDIAN ______________________408
SET 1 DATA-
SET 1 DATA-----OCCURRENCE REPORT---------------------------------------------------------------
SET 1 DATA-SIGN = N DATE PATTERN =
SET 1 DATA- ----COUNT--- ----DESCRIPTION---- ---------------F I E L D V A L U E---------------
SET 1 DATA- __________14 UNIQUE VALUES
SET 1 DATA- __________43 -VALID- VALUES
SET 1 DATA- __________10 TIME(S) ______________________408
SET 1 DATA- ___________6 TIME(S) ______________________312
SET 1 DATA- ___________4 TIME(S) ______________________215
SET 1 DATA- ___________4 TIME(S) ______________________404
SET 1 DATA- ___________3 TIME(S) ______________________212
SET 1 DATA- ___________2 TIME(S) ______________________415
SET 1 DATA- ___________7 OCCURRENCES BYPASSED, NOT IN FILTER DATASET
SET 1 DATA-
SET 1 DATA-----INVALID DATA REPORT-------------------------------------------------------------
SET 1 DATA- ___________7 INVALID VALUES
SET 1 DATA- ___________7 HIGHEST
SET 1 DATA- POS. 1-003
SET 1 DATA- ZONE 444
SET 1 DATA- NUMR 000
SET 1 DATA- ___________7 LOWEST
SET 1 DATA- POS. 1-003
SET 1 DATA- ZONE 444
SET 1 DATA- NUMR 000
SET 1 DATA- ___________7 MOST COMMON
SET 1 DATA- POS. 1-003
SET 1 DATA- ZONE 444
SET 1 DATA- NUMR 000
SET 1 DATA-
SET 1 DATA-===================================================================================
SET 1 DATA-Records Read 50, Selected 50, Fields Processed 43, Invalid 7
Related topics