DQI-1004

Definition

The observed set of available data elements does not match the expected set.

Explanation

Prior to most data quality assessments, the set of targeted data elements is clearly defined. This indicator targets discrepancies between the encountered and the expected data elements. Such discrepancies may indicate an erroneous data upload, but they may also point to errors in the provided files or tables.

Example

A study data frame with 47 study variables is expected according to the provided metadata file. The variable names in the metadata file serve as the gold standard. Yet the 47 variable names in the study data frame do not all match the names in the metadata. This discrepancy is targeted by the indicator “Unexpected data elements” within the dimension “Structural data set error”.
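
The comparison described in this example can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the indicator's reference implementation: the function name `compare_data_elements` and the short variable lists are hypothetical, and matching is assumed to be exact and case-sensitive on variable names.

```python
def compare_data_elements(observed, expected):
    """Compare observed variable names against the expected (gold standard) names.

    Hypothetical helper for illustration; names are matched exactly.
    """
    observed_set = set(observed)
    expected_set = set(expected)
    return {
        # Expected according to the metadata, but absent from the study data
        "missing": sorted(expected_set - observed_set),
        # Present in the study data, but not listed in the metadata
        "unexpected": sorted(observed_set - expected_set),
    }


# Made-up names: the counts agree (4 vs. 4), yet the sets differ.
expected_names = ["id", "age", "sex", "sbp"]
observed_names = ["id", "age", "gender", "sbp"]

result = compare_data_elements(observed_names, expected_names)
print(result)  # {'missing': ['sex'], 'unexpected': ['gender']}
```

As in the example above, an equal number of variables on both sides does not guarantee a match, which is why the sketch compares the name sets rather than their sizes.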

Guidance

If required data elements are absent, data quality assessments will be incomplete with regard to these data elements, but findings for other data elements commonly remain valid.

Any deficit encountered here should be remedied by updating the affected data sets so that they contain the correct selection of data elements. Afterwards, the data quality reporting processes should be repeated.

Interpretation

The higher the number or percentage of mismatched data elements, the lower the data quality.
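
One possible way to turn the mismatch into a number and a percentage is sketched below. The source does not define the denominator, so the choice made here is an assumption: mismatches are counted as names that appear on only one side, and the percentage is taken relative to all distinct names seen on either side. The function name `mismatch_summary` is hypothetical.

```python
def mismatch_summary(observed, expected):
    """Return (count, percentage) of mismatched data elements.

    Assumption: the percentage is relative to the union of observed and
    expected names, so it always lies between 0 and 100.
    """
    observed_set = set(observed)
    expected_set = set(expected)
    # Names present on exactly one side (missing or unexpected)
    mismatched = observed_set ^ expected_set
    all_names = observed_set | expected_set
    if not all_names:
        return 0, 0.0
    return len(mismatched), 100.0 * len(mismatched) / len(all_names)


count, percent = mismatch_summary(
    observed=["id", "age", "gender", "sbp"],
    expected=["id", "age", "sex", "sbp"],
)
print(count, percent)  # 2 40.0
```

Other denominators, such as the number of expected data elements alone, are equally plausible; whichever is used should be reported together with the result.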

Implementations

Literature