Objectives: We propose and validate a domain knowledge-driven classification model for diagnosing post-acute sequelae of SARS-CoV-2 infection (PASC), also known as Long COVID, using Electronic Health Records (EHRs) data.
Materials And Methods: We developed a robust model that incorporates features strongly indicative of PASC or associated with the severity of COVID-19 symptoms as identified in our literature review. The XGBoost tree-based architecture was chosen for its ability to handle class-imbalanced data and its potential for high interpretability. Using the training data provided by the Long COVID Computation Challenge (L3C), which was a sample of the National COVID Cohort Collaborative (N3C), our models were fine-tuned and calibrated to optimize Area Under the Receiver Operating characteristic curve (AUROC) and the F1 score, following best practices for the class-imbalanced N3C data.
Results: Our age-stratified classification model demonstrated strong performance with an average 5-fold cross-validated AUROC of 0.844 and F1 score of 0.539 across the young adult, mid-aged, and older-aged populations in the training data. In an independent testing dataset, which was made available after the challenge was over, we achieved an overall AUROC score of 0.814 and F1 score of 0.545.
Discussion: The results demonstrated the utility of knowledge-driven feature engineering in a sparse EHR data and demographic stratification in model development to diagnose a complex and heterogeneously presenting condition like PASC. The model's architecture, mirroring natural clinician decision-making processes, contributed to its robustness and interpretability, which are crucial for clinical translatability. Further, the model's generalizability was evaluated over a new cross-sectional data as provided in the later stages of the L3C challenge.
Conclusion: The study proposed and validated the effectiveness of age-stratified, tree-based classification models to diagnose PASC. Our approach highlights the potential of machine learning in addressing the diagnostic challenges posed by the heterogeneity of Long-COVID symptoms.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11547948 | PMC |
http://dx.doi.org/10.1093/jamiaopen/ooae111 | DOI Listing |
J Med Internet Res
December 2024
School of Automation, Central South University, Changsha, China.
Background: Private-part skin diseases (PPSDs) can cause a patient's stigma, which may hinder the early diagnosis of these diseases. Artificial intelligence (AI) is an effective tool to improve the early diagnosis of PPSDs, especially in preventing the deterioration of skin tumors in private parts such as Paget disease. However, to our knowledge, there is currently no research on using AI to identify PPSDs due to the complex backgrounds of the lesion areas and the challenges in data collection.
View Article and Find Full Text PDFPlant Cell Rep
December 2024
Department of Biotechnology, School of Bioengineering, SRM Institute of Science and Technology, Kattankulathur, Tamil Nadu, 603 203, India.
CesA proteins response to arsenic stress in rice involves structural and regulatory mechanisms, highlighting the role of BES1/BZR1 transcript levels under arsenate exposure and significant downregulation of BZR1 protein expression. Plants interact with several hazardous metalloids during their life cycle through root and soil connection. One such metalloid, is arsenic and its perilous impact on rice cultivation is a well-known threat.
View Article and Find Full Text PDFEpilepsia
December 2024
VA Salt Lake City Health Care System, Informatics, Decision-Enhancement and Analytic Sciences Center, Salt Lake City, Utah, USA.
Objective: Traumatic brain injury (TBI) is a significant risk factor for epilepsy, but little work has explored whether risk of epilepsy after TBI may operate through intermediary mechanisms. The objective of this study was to statistically screen for potentially mediating effects among 64 comorbidities for epilepsy risk following TBI among Post-9/11 U.S.
View Article and Find Full Text PDFTomography
December 2024
Department of Medical Imaging and Radiological Science, I-Shou University, Kaohsiung City 824005, Taiwan.
Breast cancer is a leading cause of mortality among women in Taiwan and globally. Non-invasive imaging methods, such as mammography and ultrasound, are critical for early detection, yet standalone modalities have limitations in regard to their diagnostic accuracy. This study aims to enhance breast cancer detection through a cross-modality fusion approach combining mammography and ultrasound imaging, using advanced convolutional neural network (CNN) architectures.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!