Introduction: Secondary use of electronic health record (EHR) data for research requires that the data are . Data quality (DQ) frameworks have traditionally focused on structural conformance and completeness of clinical data extracted from source systems. In this paper, we propose a framework for evaluating DQ that will allow researchers to evaluate fitness for use prior to analyses.

Methods: We reviewed current DQ literature, as well as experience from recent multisite network studies, and identified gaps in the literature and current practice. Derived principles were used to construct the conceptual framework with attention to both analytic fitness and informatics practice.

Results: We developed a systematic framework that guides researchers in assessing whether a data source is for their intended study or project. It combines tools for evaluating clinical context with DQ principles, as well as factoring in the characteristics of the data source, in order to develop DQ checks.

Conclusions: Our framework provides a systematic process for DQ development. Further work is needed to codify practices and metadata around both structural and semantic data quality.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8753309PMC
http://dx.doi.org/10.1002/lrh2.10264DOI Listing

Publication Analysis

Top Keywords

data quality
12
data
9
assessing data
8
clinical data
8
data source
8
developing systematic
4
systematic approach
4
approach assessing
4
quality secondary
4
secondary clinical
4

Similar Publications

Protocol for a feasibility study evaluating a supported self-management intervention for stroke survivors with aphasia (StarStep study).

Pilot Feasibility Stud

January 2025

Academic Unit for Ageing and Stroke Research, Leeds Institute of Health Sciences, University of Leeds, Leeds, LS2 9JT, UK.

Background: There is a growing evidence base to support the use of self-management interventions for improving quality of life after stroke. However, stroke survivors with aphasia have been underrepresented in research to date. It is therefore unclear if self-management is an appropriate or effective approach for this group.

View Article and Find Full Text PDF

An updated systematic review with meta-analysis and meta-regression of the factors associated with human visceral leishmaniasis in the Americas.

Infect Dis Poverty

January 2025

Universidade Federal de São João del Rei (UFSJ), Campus Centro-Oeste Dona Lindu, Avenida Sebastião Gonçalves Coelho 400, Chanadour, Divinópolis, MG, Brazil.

Background: Human visceral leishmaniasis (VL) is a systemic disease with high case-fatality rates and a widespread distribution. Continuous evaluation of the risk factors for VL is essential to ensure the effective implementation of prevention and control measures. The present study reviews the factors associated with VL in the Americas.

View Article and Find Full Text PDF

The global HIV epidemic remains a major public health challenge, with DTG playing a key role in ART regimens due to its efficacy and tolerability. This study evaluated virological outcomes and resistance mutations in patients on DTG in Mozambique through a retrospective cohort study in seven DREAM centers. Data from 29,601 patients (98.

View Article and Find Full Text PDF

A large set of antimalarial molecules (N ~ 15k) was employed from ChEMBL to build a robust random forest (RF) model for the prediction of antiplasmodial activity. Rather than depending on high throughput screening (HTS) data, molecules tested at multiple doses against blood stages of Plasmodium falciparum were used for model development. The open-access and code-free KNIME platform was used to develop a workflow to train the model on 80% of data (N ~ 12k).

View Article and Find Full Text PDF

Background: This study employs a LSTM-FC neural networks to address the critical public health issue of child undernutrition in Ethiopia. By employing this method, the study aims classify children's nutritional status and predict transitions between different undernutrition states over time. This analysis is based on longitudinal data extracted from the Young Lives cohort study, which tracked 1,997 Ethiopian children across five survey rounds conducted from 2002 to 2016.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!