Background:: Electronic medical records are increasingly used for research with limited external validation of their data.

Objective:: This study investigates the validity of electronic medical data (EMD) for estimating diabetes prevalence in general practitioner (GP) patients by comparing EMD with national Bettering the Evaluation and Care of Health (BEACH) data.

Method:: A "decision tree" was created using inclusion/exclusion of pre-agreed variables to determine the probability of diabetes in absence of diagnostic label, including diagnoses (coded/free-text diabetes, polycystic ovarian syndrome, impaired glucose tolerance, impaired fasting glucose), diabetic annual cycle of care (DACC), glycated haemoglobin (HbA1c) > 6.5%, and prescription (metformin, other diabetes medications). Via SQL query, cases were identified in EMD of five Illawarra and Southern Practice Network practices (30,007 active patients; from 2 years to January 2015). Patient-based Supplementary Analysis of Nominated Data (SAND) sub-studies from BEACH investigating diabetes prevalence (1172 GPs; 35,162 patients; November 2012 to February 2015) were comparison data. SAND results were adjusted for number of GP encounters per year, per patient, and then age-sex standardised to match age-sex distribution of EMD patients. Cluster-adjusted 95% confidence intervals (CIs) were calculated for both datasets.

Results:: EMD diabetes prevalence (T1 and/or T2) was 6.5% (95% CI: 4.1-8.9). Following age-sex standardisation, SAND prevalence, not significantly different, was 6.7% (95% CI: 6.3-7.1). Extracting only coded diagnosis missed 13.0% of probable cases, subsequently identified through the presence of metformin/other diabetes medications (*without other indicator variables) (6.1%), free-text diabetes label (3.8%), HbA1c result* (1.6%), DACC* (1.3%), and diabetes medications* (0.2%).

Discussion:: While complex, proxy variables can improve usefulness of EMD for research. Without their consideration, EMD results should be interpreted with caution.

Conclusion:: Enforceable, transparent data linkages in EMRs would resolve many problems with identification of diagnoses. Ongoing data quality improvement remains essential.

Download full-text PDF

Source
http://dx.doi.org/10.1177/1833358318798123DOI Listing

Publication Analysis

Top Keywords

diabetes prevalence
16
electronic medical
12
diabetes
10
medical data
8
prevalence general
8
diabetes medications
8
data sand
8
emd
7
data
6
prevalence
5

Similar Publications

Bias in machine learning applications to address non-communicable diseases at a population-level: a scoping review.

BMC Public Health

December 2024

Upstream Lab, MAP Centre for Urban Health Solutions, Li Ka Shing Knowledge Institute, Unity Health Toronto, 30 Bond Street, Toronto, ON, M5B 1W8, Canada.

Background: Machine learning (ML) is increasingly used in population and public health to support epidemiological studies, surveillance, and evaluation. Our objective was to conduct a scoping review to identify studies that use ML in population health, with a focus on its use in non-communicable diseases (NCDs). We also examine potential algorithmic biases in model design, training, and implementation, as well as efforts to mitigate these biases.

View Article and Find Full Text PDF

Background: Tuberculosis (TB) remains a significant global health issue. Drug-resistant TB and comorbidities exacerbate its burden, influencing treatment outcomes and healthcare utilization. Despite the growing prevalence of TB comorbidities, research often focuses on single comorbidities rather than comorbidity patterns.

View Article and Find Full Text PDF

Background And Aims: This study evaluated the predictive value of the APF risk score in East Asian patients undergoing open nephrectomy and its correlation with hypertension and NAFLD.

Methods And Results: A retrospective study used the clinical data of 82 patients who underwent ON between January 2010 and December 2022. Per their APF score, patients were categorized into groups A (0-2 points) and B (3-4 points).

View Article and Find Full Text PDF

Epigenetic inheritance of PCOS by developmental programming and germline transmission.

Trends Endocrinol Metab

December 2024

Department of Physiology and Pharmacology, Karolinska Institutet, 171 77 Stockholm, Sweden. Electronic address:

Polycystic ovary syndrome (PCOS) is a prevalent endocrine and metabolic disorder, affecting approximately 11-13% of women of reproductive age. Women with PCOS experience a higher prevalence of infertility, pregnancy complications, and cardiometabolic disorders such as obesity, insulin resistance, and type 2 diabetes mellitus. Furthermore, psychiatric comorbidities, including depression and anxiety, significantly impact the quality of life in this population.

View Article and Find Full Text PDF

Introduction: Type 2 diabetes increases the risk of Alzheimer's disease (AD) dementia. Insulin signaling dysfunction exacerbates tau protein phosphorylation, a hallmark of AD pathology. However, the comprehensive impact of diabetes on patterns of AD-related phosphoprotein in the human brain remains underexplored.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!