Data quality profiles include the following modules
to analyze data:
-
Summary. The Summary Report includes the following
tables to analyze data:
-
Data Statistics. The data
statistics table shows the statistical analysis and pattern information
about the data. Each column in the input data is listed as a row
in the table, which presents information such as data type, value counts,
minimum and maximum values, and shows a chart of duplicate and distinct data
as a percentage of the whole.
-
Data Overview. The data overview table shows the overview
of analysis and provides information about average, maximum, and
minimum lengths of data.
-
Data Extremes. The data extremes table shows simple extremes about
the first and last values of the data.
-
Frequency. The Frequency table shows the number of times
each value in the data occurs (both as an absolute count and as
a percentage of the whole).
-
Mask. The Mask tab shows the syntactic patterns of the
data (for example, the structure of the data rather than the content
of the data). Codes (masks) are used to describe these patterns.
-
Domain. The Domains table shows the domain analysis of
data. The available domains include Numeric, Datetime, Enum, Specval,
and Pattern.
-
Data Rules. The Data Rules report shows how much data
satisfies the data rules.
-
Primary Keys. The Primary Keys report analyzes the uniqueness
of designated keys.
-
Dependency. The Dependency report shows how much a particular
data is dependent on other data. The Determinants and Dependents
are specified by the user.
-
Foreign Key. The Foreign Keys report analyzes the column
or combination of columns that is used to establish and enforce
a link between the data in two tables of designated keys.
Note: All the demo files for data quality profiles, data
quality metrics, and data quality plans are available in the samples
folder inside the iDP application. The directory is found in the
following location: <idp_home>\idpweb\samples.