Purpose Data mining algorithms using electronic health records (EHRs) are useful in large-scale population-wide studies to classify etiology and comorbidities (Casey et al., 2016). Here, we apply this approach to developmental language disorder (DLD), a prevalent communication disorder whose risk factors and epidemiology remain largely undiscovered. Method We first created a reliable system for manually identifying DLD in EHRs based on speech-language pathologist (SLP) diagnostic expertise. We then developed and validated an automated algorithmic procedure, called, Automated Phenotyping Tool for identifying DLD cases in health systems data (APT-DLD), that classifies a DLD status for patients within EHRs on the basis of ICD (International Statistical Classification of Diseases and Related Health Problems) codes. APT-DLD was validated in a discovery sample ( = 973) using expert SLP manual phenotype coding as a gold-standard comparison and then applied and further validated in a replication sample of = 13,652 EHRs. Results In the discovery sample, the APT-DLD algorithm correctly classified 98% (concordance) of DLD cases in concordance with manually coded records in the training set, indicating that APT-DLD successfully mimics a comprehensive chart review. The output of APT-DLD was also validated in relation to independently conducted SLP clinician coding in a subset of records, with a positive predictive value of 95% of cases correctly classified as DLD. We also applied APT-DLD to the replication sample, where it achieved a positive predictive value of 90% in relation to SLP clinician classification of DLD. Conclusions APT-DLD is a reliable, valid, and scalable tool for identifying DLD cohorts in EHRs. This new method has promising public health implications for future large-scale epidemiological investigations of DLD and may inform EHR data mining algorithms for other communication disorders. Supplemental Material https://doi.org/10.23641/asha.12753578.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7890229 | PMC |
http://dx.doi.org/10.1044/2020_JSLHR-19-00397 | DOI Listing |
Sci Rep
December 2024
Department of Applied Mathematics, Faculty of Mathematical Science, Ferdowsi University of Mashhad, Mashhad, Iran.
This study presents a web application for predicting cardiovascular disease (CVD) and hypertension (HTN) among mine workers using machine learning (ML) techniques. The dataset, collected from 699 participants at the Gol-Gohar mine in Iran between 2016 and 2020, includes demographic, occupational, lifestyle, and medical information. After preprocessing and feature engineering, the Random Forest algorithm was identified as the best-performing model, achieving 99% accuracy for HTN prediction and 97% for CVD, outperforming other algorithms such as Logistic Regression and Support Vector Machines.
View Article and Find Full Text PDFNat Commun
December 2024
Department of Pharmacology, Jiangsu Provincial Key Laboratory of Critical Care Medicine, School of Medicine, Southeast University, Nanjing, China.
While circular RNAs (circRNAs) exhibit lower abundance compared to corresponding linear RNAs, they demonstrate potent biological functions. Nevertheless, challenges arise from the low concentration and distinctive structural features of circRNAs, rendering existing methods operationally intricate and less sensitive. Here, we engineer an intelligent tetrahedral DNA framework (TDF) possessing precise spatial pattern-recognition properties with exceptional sensing speed and sensitivity for circRNAs.
View Article and Find Full Text PDFSci Rep
December 2024
BAOBAB Unit, NeuroSpin center, CEA, Université Paris-Saclay, Gif-sur-Yvette, France.
Decoding states of consciousness from brain activity is a central challenge in neuroscience. Dynamic functional connectivity (dFC) allows the study of short-term temporal changes in functional connectivity (FC) between distributed brain areas. By clustering dFC matrices from resting-state fMRI, we previously described "brain patterns" that underlie different functional configurations of the brain at rest.
View Article and Find Full Text PDFNat Commun
December 2024
Institute of Pathology, Universitätsklinikum Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany.
Pathogenic activating mutations in the fibroblast growth factor receptor 3 (FGFR3) drive disease maintenance and progression in urothelial cancer. 10-15% of muscle-invasive and metastatic urothelial cancer (MIBC/mUC) are FGFR3-mutant. Selective targeting of FGFR3 hotspot mutations with tyrosine kinase inhibitors (e.
View Article and Find Full Text PDFNat Commun
December 2024
Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan, 430071, China.
Electron diffraction spectroscopy is a fundamental tool for investigating quasicrystal structures, which unveils the quasiperiodic long-range order. Nevertheless, it falls short in effectively distinguishing separate local isomorphism classes. This is a long outstanding problem.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!