Publications by authors named "Hernandez-Boussard T"

Background: Generative AI, particularly large language models (LLMs), holds great potential for improving patient care and operational efficiency in healthcare. However, the use of LLMs is complicated by regulatory concerns around data security and patient privacy. This study aimed to develop and evaluate a secure infrastructure that allows researchers to safely leverage LLMs in healthcare while ensuring HIPAA compliance and promoting equitable AI.

Background: Pressure injuries (PIs) place a substantial burden on healthcare systems worldwide. Risk stratification of those who are at risk of developing PIs allows preventive interventions to be focused on patients who are at the highest risk. The considerable number of risk assessment scales and prediction models available underscores the need for a thorough evaluation of their development, validation, and clinical utility.

The genome is a sequence that encodes the DNA, RNA, and proteins that orchestrate an organism's function. We present Evo, a long-context genomic foundation model with a frontier architecture trained on millions of prokaryotic and phage genomes, and report scaling laws on DNA to complement observations in language and vision. Evo generalizes across DNA, RNA, and proteins, enabling zero-shot function prediction competitive with domain-specific language models and the generation of functional CRISPR-Cas and transposon systems, representing the first examples of protein-RNA and protein-DNA codesign with a language model.

Informal caregivers of people with Alzheimer's disease and related dementias (ADRD) are at risk of poor mental health. This study aimed to investigate the feasibility and validity of studying caregivers' mental stressors using online caregiving forum data (March 2018-February 2022) and natural language processing and machine learning (NLP/ML). NLP/ML topic modeling generated eight prominent topics, which we compared with qualitatively defined themes and the existing caregiving framework to assess validity.

The increasing interest in leveraging generative AI models in healthcare necessitates secure infrastructure at academic medical centers. Without an all-encompassing secure system, researchers may create their own insecure microprocesses, risking the exposure of protected health information (PHI) to the public internet or its inadvertent incorporation into AI model training. To address these challenges, our institution implemented a secure pathway to the Azure OpenAI Service using our own private OpenAI instance, which we fully control, to facilitate high-throughput, secure LLM queries.

Article Synopsis
  • The study investigates how sex, race, and ethnicity impact the development of AI models for predicting glaucoma progression requiring surgery, emphasizing fairness and multicenter perspectives.
  • Researchers analyzed data from over 39,000 glaucoma patients across 7 academic eye centers, using different modeling approaches that either included or excluded sensitive demographic attributes.
  • Results showed that excluding sensitive attributes improved classification performance internally, but when assessed externally, including these attributes enhanced performance, highlighting the complexity between accuracy and fairness in AI predictions.

Background And Aims: Patient-reported outcomes (PROs) are vital in assessing disease activity and treatment outcomes in inflammatory bowel disease (IBD). However, manual extraction of these PROs from the free-text of clinical notes is burdensome. We aimed to improve data curation from free-text information in the electronic health record, making it more available for research and quality improvement.

Governments should evaluate advanced models and, if needed, impose safety measures.

Objective: This study aims to explore and develop tools for early identification of depression concerns among cancer patients by leveraging the novel data source of messages sent through a secure patient portal.

Materials And Methods: We developed classifiers based on logistic regression (LR), support vector machines (SVMs), and 2 Bidirectional Encoder Representations from Transformers (BERT) models (original and Reddit-pretrained) on 6600 patient messages from a cancer center (2009-2022), annotated by a panel of healthcare professionals. Performance was compared using AUROC scores, and model fairness and explainability were examined.

Article Synopsis
  • Globally, obesity is on the rise, leading to serious health issues, including heart disease, and is a significant financial burden on healthcare systems, costing over $200 billion a year.
  • This study utilized advanced AI to analyze over 390,000 Reddit discussions about GLP-1 receptor agonists (GLP-1 RAs), highlighting a wide interest in topics like weight loss results, side effects, accessibility, and psychological benefits.
  • The analysis revealed that public sentiment around GLP-1 RAs is mostly neutral to positive, suggesting these findings could help in monitoring side effects not seen in trials and addressing drug shortages.
Article Synopsis
  • Large language models (LLMs) show potential for automated fall detection by analyzing unstructured data from clinical notes, leading to promising results in two healthcare systems.
  • The Mixtral-8×7B zero-shot model performed best, achieving high positive predictive value and recall in both Stanford Health Care and the Veterans Health Administration, paving the way for future LLM applications in fall prediction and prevention.

Background And Aims: Opioid use disorder (OUD) and opioid dependence lead to significant morbidity and mortality, yet treatment retention, crucial for the effectiveness of medications like buprenorphine-naloxone, remains unpredictable. Our objective was to determine the predictability of 6-month retention in buprenorphine-naloxone treatment using electronic health record (EHR) data from diverse clinical settings and to identify key predictors.

Design: This retrospective observational study developed and validated machine learning-based clinical risk prediction models using EHR data.

Background: Timely heart failure (HF) diagnosis can lead to earlier intervention and reduced morbidity. Among historically marginalized patients, new-onset HF diagnosis is more likely to occur in acute care settings (emergency department or inpatient hospitalization) than outpatient settings. Whether inequity within outpatient clinician practices affects diagnosis settings is unknown.

Background And Objectives: Spinal CSF leaks lead to spontaneous intracranial hypotension (SIH). While International Classification of Headache Disorders, Third Edition (ICHD-3) criteria necessitate imaging confirmation or low opening pressure (OP) for SIH diagnosis, their sensitivity may be limited. We offered epidural blood patches (EBPs) to patients with symptoms suggestive of SIH, with and without a documented low OP or confirmed leak on imaging.

Article Synopsis
  • Coronary artery calcium (CAC) testing is important for assessing the risk of atherosclerotic cardiovascular disease (ASCVD), but public perception of CAC and its implications for heart health decision-making are not well understood.
  • Researchers utilized an AI model to analyze 5,606 discussions on Reddit about CAC, identifying 91 topics categorized into 14 main themes, including the influence of CAC on treatment choices and concerns over testing risks.
  • Sentiment analysis of these discussions showed that nearly half expressed neutral or negative feelings towards CAC testing, highlighting a need for better communication and education to improve public understanding and shared decision-making in cardiovascular health.

Background: Predictive models show promise in healthcare, but their successful deployment is challenging due to limited generalizability. Current external validation often focuses on model performance with restricted feature use from the original training data, lacking insights into their suitability at external sites. Our study introduces an innovative methodology for evaluating features during both the development phase and the validation, focusing on creating and validating predictive models for post-surgery patient outcomes with improved generalizability.

Assessment in medical education has evolved through a sequence of eras each centering on distinct views and values. These eras include measurement (e.g.

Objective: To measure pediatrician adherence to evidence-based guidelines in the treatment of young children with attention-deficit/hyperactivity disorder (ADHD) in a diverse healthcare system using natural language processing (NLP) techniques.

Materials And Methods: We extracted structured and free-text data from electronic health records (EHRs) of all office visits (2015-2019) of children aged 4-6 years in a community-based primary healthcare network in California, who had ≥1 visits with an ICD-10 diagnosis of ADHD. Two pediatricians annotated clinical notes of the first ADHD visit for 423 patients.

Background: Patients with cancer starting systemic treatment programs, such as chemotherapy, often develop depression. A prediction model may assist physicians and health care workers in the early identification of these vulnerable patients.

Objective: This study aimed to develop a prediction model for depression risk within the first month of cancer treatment.

Ensemble learning is a powerful technique for improving the accuracy and reliability of prediction models, especially in scenarios where individual models may not perform well. However, combining models with varying accuracies may not always improve the final prediction results, as models with lower accuracies may obscure the results of models with higher accuracies. This paper addresses this issue and answers the question of when an ensemble approach outperforms individual models for prediction.

Article Synopsis
  • Buprenorphine-naloxone is an effective treatment for opioid use disorder, but many patients don't stick with it long-term, leading to poor outcomes.
  • This study examined a machine learning model's ability to predict whether patients would stay in treatment (retention) or drop out (attrition) using electronic medical records and clinical notes.
  • The results showed the model could reasonably predict retention versus attrition, achieving an AUROC of 0.77 with combined data and 0.74 using only structured data from electronic records.

Importance: Limited sharing of data sets that accurately represent disease and patient diversity limits the generalizability of artificial intelligence (AI) algorithms in health care.

Objective: To explore the factors associated with organizational motivation to share health data for AI development.

Design, Setting, And Participants: This qualitative study investigated organizational readiness for sharing health data across the academic, governmental, nonprofit, and private sectors.
