In this section: |
Data Profiling provides data characteristics for the columns in a synonym. You can display the characteristics for all the columns in a synonym or segment, or for an individual column.
For alphanumeric columns, Data Profiling provides the segment, format, count of distinct values, total count, patterns count, maximum, minimum, and average length, minimum and maximum values, and number of nulls. Patterns count shows the number of patterns found in each alphanumeric column.
For numeric columns, Data Profiling provides the segment, format, count of distinct values, total count, maximum, minimum, and average values, and number of nulls.
Data Profiling for an individual column provides access to Statistics, Patterns, Values, and Outliers reports.
How to: |
Data Profiling provides information on all the columns in a synonym or segment. You can also drill down to the Values or Patterns reports for an individual column from a synonym or segment Data Profiling report.
Note: Data Profiling is also available from the navigation pane by right-clicking a synonym, selecting Data Profiling, and then clicking Statistics.
To view the Data Profiling information for a synonym or segment:
The Synonym Editor opens to the Field View tab.
The Data Profiling information displays in the workspace. The last four columns are shown below the rest of the information for illustrative purposes only. The actual report runs across the workspace.
You may use the Data Profiling Results toolbar to view server messages, print the report, copy data as text, and export the report.
For pattern analysis, a 9 represents a digit, an A represents any uppercase letter, and an a represents any lowercase letter. All printable special characters are represented by themselves, and unprintable characters are represented by an X.
Note: Data Profiling is also available from the navigation pane by right-clicking a synonym.
Key Analysis provides a report that shows which columns in a data source can be used individually, or in combination, to uniquely identify a row. The columns identified in this report are candidates for key columns.
Note: Key Analysis is also available from the navigation pane by right-clicking a synonym, selecting Data Profiling, and then clicking Key Analysis.
To view key analysis for a synonym or segment:
If you selected a segment name, skip to step 4. All columns in the segment will be selected.
The selected segment.
Name of the segment.
The format of each column.
The number of elements (columns) shown.
The number of rows.
The number of distinct rows.
The percentage of rows that are distinct. This value must be 100% for a combination of columns to be used as key.
The number of duplicate values.
The percentage of duplicate values. This value must be 0% for a combination of columns to be used as key.
By default, the report is sorted by the number of elements so the first rows in the report show one element each. This enables you to determine if any single column could be used by itself as a key. The report then shows all combinations of two columns, three columns, and so on.
To see the values in the report, right-click on any row.
The duplicate rows option shows all duplicate values, which prevent the desired column combination from being used as a key.
How to: |
Data Profiling for an individual column provides access to four reports:
For alphanumeric columns, the Statistics report provides the segment, format, count of distinct values, total count, patterns count, maximum, minimum, and average length, minimum and maximum values, and number of nulls.
For numeric columns, the Statistics report provides the segment, format, count of distinct values, total count, maximum, minimum, and average values, and number of nulls.
These reports are available by right-clicking a column in the Synonym Editor and selecting Data Profiling.
To view the Statistical Data Profiling information for a single column:
The Synonym Editor opens to the Field View tab.
The Statistical Data Profiling information displays in the workspace.
Data Profile Patterns show patterns of letters, digits, and special characters, as well as counts. This is only available for alphanumeric columns.
To view the Patterns Data Profiling information for a single column:
The Synonym Editor opens to the Field View tab.
The Patterns Data Profiling information displays.
For pattern analysis, a 9 represents a digit, an A represents any uppercase letter, and an a represents any lowercase letter. All printable special characters are represented by themselves, and unprintable characters are represented by an X.
Data Profile Values show unique values.
To view the Values Data Profiling information for a single column:
The Synonym Editor opens to the Field View tab.
The Values Data Profiling information displays.
Data Profile Outliers show the 10 highest and lowest distinct values.
To view the Outliers Data Profiling information for a single column:
The Synonym Editor opens to the Field View tab.
The Outliers Data Profiling information displays.
Note: Outliers produce a maximum of 10 highest and lowest distinct values, if they exist.
iWay Software |