Publications by Dev Dash

Toward expert-level medical question answering with large language models.

Karan Singhal, Tao Tu, Juraj Gottweis, Rory Sayres, Ellery Wulczyn, Dev Dash

Nat Med

January 2025

Large language models (LLMs) have shown promise in medical question answering, with Med-PaLM being the first to exceed a 'passing' score in United States Medical Licensing Examination style questions. However, challenges remain in long-form medical question answering and handling real-world workflows. Here, we present Med-PaLM 2, which bridges these gaps with a combination of base LLM improvements, medical domain fine-tuning and new strategies for improving reasoning and grounding through ensemble refinement and chain of retrieval.

Testing and Evaluation of Health Care Applications of Large Language Models: A Systematic Review.

Suhana Bedi, Yutong Liu, Lucy Orr-Ewing, Dev Dash, Sanmi Koyejo

JAMA

October 2024

Importance: Large language models (LLMs) can assist in various health care activities, but current evaluation approaches may not adequately identify the most useful application areas.

Objective: To summarize existing evaluations of LLMs in health care in terms of 5 components: (1) evaluation data type, (2) health care task, (3) natural language processing (NLP) and natural language understanding (NLU) tasks, (4) dimension of evaluation, and (5) medical specialty.

Data Sources: A systematic search of PubMed and Web of Science was performed for studies published between January 1, 2022, and February 19, 2024.

AI-Enabled Assessment of Cardiac Function and Video Quality in Emergency Department Point-of-Care Echocardiograms.

Bryan He, Dev Dash, Youyou Duanmu, Ting Xu Tan, David Ouyang

J Emerg Med

February 2024

Article Synopsis

The study focuses on the use of point-of-care ultrasound (POCUS) to improve assessments of unstable patients in emergency departments, particularly through echocardiograms to evaluate heart function.
A new deep learning system named EchoNet-POCUS was developed to help emergency physicians interpret echocardiogram videos and minimize variability between operators.
Results show EchoNet-POCUS has high accuracy in predicting abnormal cardiac function (AUROC of 0.92) and decent accuracy in evaluating video quality (AUROC of 0.81), demonstrating its potential for real-time application in clinical settings.

Using an artificial intelligence software improves emergency medicine physician intracranial haemorrhage detection to radiologist levels.

Pranav Warman, Anmol Warman, Roshan Warman, Andrew Degnan, Johan Blickman, Dev Dash

Emerg Med J

April 2024

Background: Tools to increase the turnaround speed and accuracy of imaging reports could positively influence ED logistics. The Caire ICH is an artificial intelligence (AI) software developed for ED physicians to recognise intracranial haemorrhages (ICHs) on non-contrast enhanced cranial CT scans to manage the clinical care of these patients in a timelier fashion.

Methods: A dataset of 532 non-contrast cranial CT scans was reviewed by five board-certified emergency physicians (EPs) with an average of 14.

Investigating real-world consequences of biases in commonly used clinical calculators.

Richard M Yoo, Dev Dash, Jonathan H Lu, Julian Z Genkins, Naveed Rabbani

Am J Manag Care

January 2023

Objectives: To evaluate whether one summary metric of calculator performance sufficiently conveys equity across different demographic subgroups, as well as to evaluate how calculator predictive performance affects downstream health outcomes.

Study Design: We evaluate 3 commonly used clinical calculators (Model for End-Stage Liver Disease [MELD], CHA2DS2-VASc, and simplified Pulmonary Embolism Severity Index [sPESI]) on the cohort extracted from the Stanford Medicine Research Data Repository, following the cohort selection process described in the respective calculator derivation papers.

Methods: We quantified the predictive performance of the 3 clinical calculators across sex and race.

Paging the Clinical Informatics Community: Respond STAT to Dobbs v. Jackson Women's Health Organization.

Simone Arvisais-Anhalt, Akshay Ravi, Benjamin Weia, Jos Aarts, Hasan B Ahmad, Dev Dash

Appl Clin Inform

January 2023

Deep Learning System Boosts Radiologist Detection of Intracranial Hemorrhage.

Roshan Warman, Anmol Warman, Pranav Warman, Andrew Degnan, Johan Blickman, Dev Dash

Cureus

October 2022

Background: Intracranial hemorrhage (ICH) requires emergent medical treatment for positive outcomes. While previous artificial intelligence (AI) solutions achieved rapid diagnostics, none were shown to improve the performance of radiologists in detecting ICHs. Here, we show that the Caire ICH artificial intelligence system enhances a radiologist's ICH diagnosis performance.

Assessment of Adherence to Reporting Guidelines by Commonly Used Clinical Prediction Models From a Single Vendor: A Systematic Review.

Jonathan H Lu, Alison Callahan, Birju S Patel, Keith E Morse, Dev Dash

JAMA Netw Open

August 2022

Importance: Various model reporting guidelines have been proposed to ensure clinical prediction models are reliable and fair. However, no consensus exists about which model details are essential to report, and commonalities and differences among reporting guidelines have not been characterized. Furthermore, how well documentation of deployed models adheres to these guidelines has not been studied.

Building a Learning Health System: Creating an Analytical Workflow for Evidence Generation to Inform Institutional Clinical Care Guidelines.

Dev Dash, Arjun Gokhale, Birju S Patel, Alison Callahan, Jose Posada

Appl Clin Inform

January 2022

Background: One key aspect of a learning health system (LHS) is utilizing data generated during care delivery to inform clinical care. However, institutional guidelines that utilize observational data are rare and require months to create, making current processes impractical for more urgent scenarios such as those posed by the COVID-19 pandemic. There exists a need to rapidly analyze institutional data to drive guideline creation where evidence from randomized controlled trials is unavailable.
