A Data Quality Ontology for the Secondary Use of EHR Data.

AMIA Annu Symp Proc

University of Minnesota, Institute for Health Informatics; University of Minnesota, School of Nursing.

Published: February 2018

The secondary use of EHR data for research is expected to improve health outcomes for patients, but the benefits will only be realized if the data in the EHR is of sufficient quality to support these uses. A data quality (DQ) ontology was developed to rigorously define concepts and enable automated computation of data quality measures. The healthcare data quality literature was mined for the important terms used to describe data quality concepts and harmonized into an ontology. Four high-level data quality dimensions ("correctness", "consistency", "completeness" and "currency") categorize 19 lower level measures. The ontology serves as an unambiguous vocabulary, which defines concepts more precisely than natural language; it provides a mechanism to automatically compute data quality measures; and is reusable across domains and use cases. A detailed example is presented to demonstrate its utility. The DQ ontology can make data validation more common and reproducible.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4765682PMC

Publication Analysis

Top Keywords

data quality
28
data
11
quality ontology
8
secondary ehr
8
ehr data
8
quality measures
8
quality
7
ontology
5
ontology secondary
4
data secondary
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!