Objectives: Electronic health records (EHR) can allow for the generation of large cohorts of individuals with given diseases for clinical and genomic research. A rate-limiting step is the development of electronic phenotype selection algorithms to find such cohorts. This study evaluated the portability of a published phenotype algorithm to identify rheumatoid arthritis (RA) patients from EHR records at three institutions with different EHR systems.

Materials And Methods: Physicians reviewed charts from three institutions to identify patients with RA. Each institution compiled attributes from various sources in the EHR, including codified data and clinical narratives, which were searched using one of two natural language processing (NLP) systems. The performance of the published model was compared with locally retrained models.

Results: Applying the previously published model from Partners Healthcare to datasets from Northwestern and Vanderbilt Universities, the area under the receiver operating characteristic curve was found to be 92% for Northwestern and 95% for Vanderbilt, compared with 97% at Partners. Retraining the model improved the average sensitivity at a specificity of 97% to 72% from the original 65%. Both the original logistic regression models and locally retrained models were superior to simple billing code count thresholds.

Discussion: These results show that a previously published algorithm for RA is portable to two external hospitals using different EHR systems, different NLP systems, and different target NLP vocabularies. Retraining the algorithm primarily increased the sensitivity at each site.

Conclusion: Electronic phenotype algorithms allow rapid identification of case populations in multiple sites with little retraining.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3392871PMC
http://dx.doi.org/10.1136/amiajnl-2011-000583DOI Listing

Publication Analysis

Top Keywords

algorithm identify
8
identify rheumatoid
8
rheumatoid arthritis
8
electronic health
8
health records
8
electronic phenotype
8
three institutions
8
nlp systems
8
published model
8
locally retrained
8

Similar Publications

This study aimed to identify shared gene expression related to circadian rhythm disruption in polycystic ovary syndrome (PCOS) and non-alcoholic fatty liver disease (NAFLD) to discover common diagnostic biomarkers. Visceral fat RNA samples were collected from 12 PCOS and 14 non-PCOS patients, a sample size representing the clinical situation and sufficient to capture PCOS gene expression profiles. Along with liver transcriptome profiles from NAFLD patients, these data were analyzed to identify crosstalk circadian rhythm-related genes (CRRGs) between the diseases.

View Article and Find Full Text PDF

Role of immune cell homeostasis in research and treatment response in hepatocellular carcinoma.

Clin Exp Med

January 2025

Department of Thoracic Surgery, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200127, China.

Introduction Recently, immune cells within the tumor microenvironment (TME) have become crucial in regulating cancer progression and treatment responses. The dynamic interactions between tumors and immune cells are emerging as a promising strategy to activate the host's immune system against various cancers. The development and progression of hepatocellular carcinoma (HCC) involve complex biological processes, with the role of the TME and tumor phenotypes still not fully understood.

View Article and Find Full Text PDF

Background: Complex regional pain syndrome (CRPS) is a debilitating condition characterised by significant heterogeneity. Early diagnosis is critical, but limited data exists on the condition's early stages. This study aimed to characterise (very) early CRPS patients and explore potential subgroups to enhance understanding of its mechanisms.

View Article and Find Full Text PDF

Background: Most older patients with atrial fibrillation (AF) have comorbidities. However, it is unclear whether specific comorbidity patterns are associated with adverse outcomes. We identified comorbidity patterns and their association with mortality in multimorbid older AF patients with different multidimensional frailty.

View Article and Find Full Text PDF

Background: Atherosclerosis (AS) is increasingly recognized as a chronic inflammatory disease that significantly compromises vascular health and acts as a major contributor to cardiovascular diseases. Advancements in lipidomics and metabolomics have unveiled the complex role of fatty acid metabolism (FAM) in both healthy and pathological states. However, the specific roles of fatty acid metabolism-related genes (FAMGs) in shaping therapeutic approaches, especially in AS, remain largely unexplored and are a subject of ongoing research.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!