Background: Generative AI, particularly large language models (LLMs), holds great potential for improving patient care and operational efficiency in healthcare. However, the use of LLMs is complicated by regulatory concerns around data security and patient privacy. This study aimed to develop and evaluate a secure infrastructure that allows researchers to safely leverage LLMs in healthcare while ensuring HIPAA compliance and promoting equitable AI.
View Article and Find Full Text PDFBackground: Pressure injuries (PIs) place a substantial burden on healthcare systems worldwide. Risk stratification of those who are at risk of developing PIs allows preventive interventions to be focused on patients who are at the highest risk. The considerable number of risk assessment scales and prediction models available underscores the need for a thorough evaluation of their development, validation, and clinical utility.
View Article and Find Full Text PDFThe genome is a sequence that encodes the DNA, RNA, and proteins that orchestrate an organism's function. We present Evo, a long-context genomic foundation model with a frontier architecture trained on millions of prokaryotic and phage genomes, and report scaling laws on DNA to complement observations in language and vision. Evo generalizes across DNA, RNA, and proteins, enabling zero-shot function prediction competitive with domain-specific language models and the generation of functional CRISPR-Cas and transposon systems, representing the first examples of protein-RNA and protein-DNA codesign with a language model.
View Article and Find Full Text PDFInformal caregivers of people with Alzheimer's disease and related dementias (ADRD) are at risk of poor mental health. This study aimed to investigate the feasibility and validity of studying caregivers' mental stressors using online caregiving forum data (March 2018-February 2022) and natural language processing and machine learning (NLP/ML). NLP/ML topic modeling generated eight prominent topics, which we compared with qualitatively defined themes and the existing caregiving framework to assess validity.
View Article and Find Full Text PDFThe increasing interest in leveraging generative AI models in healthcare necessitates secure infrastructure at academic medical centers. Without an all-encompassing secure system, researchers may create their own insecure microprocesses, risking the exposure of protected health information (PHI) to the public internet or its inadvertent incorporation into AI model training. To address these challenges, our institution implemented a secure pathway to the Azure OpenAI Service using our own private OpenAI instance which we fully control to facilitate high-throughput, secure LLM queries.
View Article and Find Full Text PDFBackground And Aims: Patient-reported outcomes (PROs) are vital in assessing disease activity and treatment outcomes in inflammatory bowel disease (IBD). However, manual extraction of these PROs from the free-text of clinical notes is burdensome. We aimed to improve data curation from free-text information in the electronic health record, making it more available for research and quality improvement.
View Article and Find Full Text PDFGovernments should evaluate advanced models and if needed impose safety measures.
View Article and Find Full Text PDFObjective: This study aims to explore and develop tools for early identification of depression concerns among cancer patients by leveraging the novel data source of messages sent through a secure patient portal.
Materials And Methods: We developed classifiers based on logistic regression (LR), support vector machines (SVMs), and 2 Bidirectional Encoder Representations from Transformers (BERT) models (original and Reddit-pretrained) on 6600 patient messages from a cancer center (2009-2022), annotated by a panel of healthcare professionals. Performance was compared using AUROC scores, and model fairness and explainability were examined.
Background And Aims: Opioid use disorder (OUD) and opioid dependence lead to significant morbidity and mortality, yet treatment retention, crucial for the effectiveness of medications like buprenorphine-naloxone, remains unpredictable. Our objective was to determine the predictability of 6-month retention in buprenorphine-naloxone treatment using electronic health record (EHR) data from diverse clinical settings and to identify key predictors.
Design: This retrospective observational study developed and validated machine learning-based clinical risk prediction models using EHR data.
Background: Timely heart failure (HF) diagnosis can lead to earlier intervention and reduced morbidity. Among historically marginalized patients, new-onset HF diagnosis is more likely to occur in acute care settings (emergency department or inpatient hospitalization) than outpatient settings. Whether inequity within outpatient clinician practices affects diagnosis settings is unknown.
View Article and Find Full Text PDFBackground And Objectives: Spinal CSF leaks lead to spontaneous intracranial hypotension (SIH). While International Classification of Headache Disorders, Third Edition (ICHD-3) criteria necessitate imaging confirmation or low opening pressure (OP) for SIH diagnosis, their sensitivity may be limited. We offered epidural blood patches (EBPs) to patients with symptoms suggestive of SIH, with and without a documented low OP or confirmed leak on imaging.
View Article and Find Full Text PDFBackground: Predictive models show promise in healthcare, but their successful deployment is challenging due to limited generalizability. Current external validation often focuses on model performance with restricted feature use from the original training data, lacking insights into their suitability at external sites. Our study introduces an innovative methodology for evaluating features during both the development phase and the validation, focusing on creating and validating predictive models for post-surgery patient outcomes with improved generalizability.
View Article and Find Full Text PDFAssessment in medical education has evolved through a sequence of eras each centering on distinct views and values. These eras include measurement (e.g.
View Article and Find Full Text PDFObjective: To measure pediatrician adherence to evidence-based guidelines in the treatment of young children with attention-deficit/hyperactivity disorder (ADHD) in a diverse healthcare system using natural language processing (NLP) techniques.
Materials And Methods: We extracted structured and free-text data from electronic health records (EHRs) of all office visits (2015-2019) of children aged 4-6 years in a community-based primary healthcare network in California, who had ≥1 visits with an ICD-10 diagnosis of ADHD. Two pediatricians annotated clinical notes of the first ADHD visit for 423 patients.
Background: Patients with cancer starting systemic treatment programs, such as chemotherapy, often develop depression. A prediction model may assist physicians and health care workers in the early identification of these vulnerable patients.
Objective: This study aimed to develop a prediction model for depression risk within the first month of cancer treatment.
AMIA Annu Symp Proc
January 2024
Ensemble learning is a powerful technique for improving the accuracy and reliability of prediction models, especially in scenarios where individual models may not perform well. However, combining models with varying accuracies may not always improve the final prediction results, as models with lower accuracies may obscure the results of models with higher accuracies. This paper addresses this issue and answers the question of when an ensemble approach outperforms individual models for prediction.
View Article and Find Full Text PDFImportance: Limited sharing of data sets that accurately represent disease and patient diversity limits the generalizability of artificial intelligence (AI) algorithms in health care.
Objective: To explore the factors associated with organizational motivation to share health data for AI development.
Design, Setting, And Participants: This qualitative study investigated organizational readiness for sharing health data across the academic, governmental, nonprofit, and private sectors.