Publications by Dmitriy Dligach | LitMetric

Publications by authors named "Dmitriy Dligach"

Page 1 of 3

The TRIPOD-LLM reporting guideline for studies using large language models.

Jack Gallifant Majid Afshar Saleem Ameen Yindalon Aphinyanaphongs Shan Chen Dmitriy Dligach

Nat Med

January 2025

Large language models (LLMs) are rapidly being adopted in healthcare, necessitating standardized reporting guidelines. We present transparent reporting of a multivariable model for individual prognosis or diagnosis (TRIPOD)-LLM, an extension of the TRIPOD + artificial intelligence statement, addressing the unique challenges of LLMs in biomedical applications. TRIPOD-LLM provides a comprehensive checklist of 19 main items and 50 subitems, covering key aspects from title to discussion.

View Article and Find Full Text PDF

Lessons learned on information retrieval in electronic health records: a comparison of embedding models and pooling strategies.

Skatje Myers Timothy A Miller Yanjun Gao Matthew M Churpek Anoop Mayampurath Dmitriy Dligach

J Am Med Inform Assoc

December 2024

Objectives: Applying large language models (LLMs) to the clinical domain is challenging due to the context-heavy nature of processing medical records. Retrieval-augmented generation (RAG) offers a solution by facilitating reasoning over large text sources. However, there are many parameters to optimize in just the retrieval system alone.

View Article and Find Full Text PDF

LCD benchmark: long clinical document benchmark on mortality prediction for language models.

WonJin Yoon Shan Chen Yanjun Gao Zhanzhan Zhao Dmitriy Dligach

J Am Med Inform Assoc

November 2024

Objectives: The application of natural language processing (NLP) in the clinical domain is important due to the rich unstructured information in clinical documents, which often remains inaccessible in structured data. When applying NLP methods to a certain domain, the role of benchmark datasets is crucial as benchmark datasets not only guide the selection of best-performing models but also enable the assessment of the reliability of the generated outputs. Despite the recent availability of language models capable of longer context, benchmark datasets targeting long clinical document classification tasks are absent.

View Article and Find Full Text PDF

Outcomes and Cost-Effectiveness of an EHR-Embedded AI Screener for Identifying Hospitalized Adults at Risk for Opioid Use Disorder.

Majid Afshar Felice Resnik Cara Joyce Madeline Oguss Dmitriy Dligach

Res Sq

October 2024

Unlabelled: Hospitalized adults with opioid use disorder (OUD) are at high risk for adverse events and rehospitalizations. This pre-post quasi-experimental study evaluated whether an AI-driven OUD screener embedded in the electronic health record (EHR) was non-inferior to usual care in identifying patients for Addiction Medicine consults, aiming to provide a similarly effective but more scalable alternative to human-led ad hoc consultations. The AI screener analyzed EHR notes in real-time with a convolutional neural network to identify patients at risk and recommend consultation.

View Article and Find Full Text PDF

The TRIPOD-LLM Statement: A Targeted Guideline For Reporting Large Language Models Use.

Jack Gallifant Majid Afshar Saleem Ameen Yindalon Aphinyanaphongs Shan Chen Dmitriy Dligach

medRxiv

July 2024

Article Synopsis

TRIPOD-LLM is a new set of reporting guidelines specifically designed for the use of Large Language Models (LLMs) in biomedical research, aiming to standardize transparency and quality in healthcare applications.
The guidelines include a checklist with 19 main items and 50 subitems, adaptable to various research designs, emphasizing the importance of human oversight and task-specific performance.
An interactive website is provided to help researchers easily complete the guidelines and generate submissions, with the intention of continually updating the document as the field evolves.

View Article and Find Full Text PDF

Family history as the strongest predictor of aortic and peripheral aneurysms in patients with intracranial aneurysms.

Pui Man Rosalind Lai Elliot Akama-Garren Anil Can Selena-Rae Tirado Victor M Castro Dmitriy Dligach

J Clin Neurosci

August 2024

Objective: Intracranial aneurysms (IA) and aortic aneurysms (AA) are both abnormal dilations of arteries with familial predisposition and have been proposed to share co-prevalence and pathophysiology. Associations of IA and non-aortic peripheral aneurysms are less well-studied. The goal of the study was to understand the patterns of aortic and peripheral (extracranial) aneurysms in patients with IA, and risk factors associated with the development of these aneurysms.

View Article and Find Full Text PDF

Automated stratification of trauma injury severity across multiple body regions using multi-modal, multi-class machine learning models.

Jifan Gao Guanhua Chen Ann P O'Rourke John Caskey Kyle A Carey Dmitriy Dligach

J Am Med Inform Assoc

May 2024

Objective: The timely stratification of trauma injury severity can enhance the quality of trauma care but it requires intense manual annotation from certified trauma coders. The objective of this study is to develop machine learning models for the stratification of trauma injury severity across various body regions using clinical text and structured electronic health records (EHRs) data.

Materials And Methods: Our study utilized clinical documents and structured EHR variables linked with the trauma registry data to create 2 machine learning models with different approaches to representing text.

View Article and Find Full Text PDF

LCD Benchmark: Long Clinical Document Benchmark on Mortality Prediction for Language Models.

WonJin Yoon Shan Chen Yanjun Gao Zhanzhan Zhao Dmitriy Dligach

medRxiv

July 2024

Objective: The application of Natural Language Processing (NLP) in the clinical domain is important due to the rich unstructured information in clinical documents, which often remains inaccessible in structured data. When applying NLP methods to a certain domain, the role of benchmark datasets is crucial as benchmark datasets not only guide the selection of best-performing models but also enable the assessment of the reliability of the generated outputs. Despite the recent availability of language models (LMs) capable of longer context, benchmark datasets targeting long clinical document classification tasks are absent.

View Article and Find Full Text PDF

Development of a Human Evaluation Framework and Correlation with Automated Metrics for Natural Language Generation of Medical Diagnoses.

Emma Croxford Yanjun Gao Brian Patterson Daniel To Samuel Tesch Dmitriy Dligach

medRxiv

April 2024

In the evolving landscape of clinical Natural Language Generation (NLG), assessing abstractive text quality remains challenging, as existing methods often overlook generative task complexities. This work aimed to examine the current state of automated evaluation metrics in NLG in healthcare. To have a robust and well-validated baseline with which to examine the alignment of these metrics, we created a comprehensive human evaluation framework.

View Article and Find Full Text PDF

Development and external validation of multimodal postoperative acute kidney injury risk machine learning models.

George K Karway Jay L Koyner John Caskey Alexandra B Spicer Kyle A Carey Dmitriy Dligach

JAMIA Open

December 2023

Objectives: To develop and externally validate machine learning models using structured and unstructured electronic health record data to predict postoperative acute kidney injury (AKI) across inpatient settings.

Materials And Methods: Data for adult postoperative admissions to the Loyola University Medical Center (2009-2017) were used for model development and admissions to the University of Wisconsin-Madison (2009-2020) were used for validation. Structured features included demographics, vital signs, laboratory results, and nurse-documented scores.

View Article and Find Full Text PDF

Improving the Transferability of Clinical Note Section Classification Models with BERT and Large Language Model Ensembles.

Weipeng Zhou Dmitriy Dligach Majid Afshar Yanjun Gao Timothy A Miller

Proc Conf Assoc Comput Linguist Meet

July 2023

Text in electronic health records is organized into sections, and classifying those sections into section categories is useful for downstream tasks. In this work, we attempt to improve the transferability of section classification models by combining the dataset-specific knowledge in supervised learning models with the world knowledge inside large language models (LLMs). Surprisingly, we find that zero-shot LLMs out-perform supervised BERT-based models applied to out-of-domain data.

View Article and Find Full Text PDF

End-to-end clinical temporal information extraction with multi-head attention.

Timothy Miller Steven Bethard Dmitriy Dligach Guergana Savova

Proc Conf Assoc Comput Linguist Meet

July 2023

Understanding temporal relationships in text from electronic health records can be valuable for many important downstream clinical applications. Since Clinical TempEval 2017, there has been little work on end-to-end systems for temporal relation extraction, with most work focused on the setting where gold standard events and time expressions are given. In this work, we make use of a novel multi-headed attention mechanism on top of a pre-trained transformer encoder to allow the learning process to attend to multiple aspects of the contextualized embeddings.

View Article and Find Full Text PDF

Overview of the Problem List Summarization (ProbSum) 2023 Shared Task on Summarizing Patients' Active Diagnoses and Problems from Electronic Health Record Progress Notes.

Yanjun Gao Dmitriy Dligach Timothy Miller Matthew M Churpek Majid Afshar

Proc Conf Assoc Comput Linguist Meet

July 2023

The BioNLP Workshop 2023 initiated the launch of a shared task on Problem List Summarization (ProbSum) in January 2023. The aim of this shared task is to attract future research efforts in building NLP models for real-world diagnostic decision support applications, where a system generating relevant and accurate diagnoses will augment the healthcare providers' decision-making process and improve the quality of care for patients. The goal for participants is to develop models that generated a list of diagnoses and problems using input from the daily care notes collected from the hospitalization of critically ill patients.

View Article and Find Full Text PDF

Multi-Task Training with In-Domain Language Models for Diagnostic Reasoning.

Brihat Sharma Yanjun Gao Timothy Miller Matthew M Churpek Majid Afshar Dmitriy Dligach

Proc Conf Assoc Comput Linguist Meet

July 2023

Generative artificial intelligence (AI) is a promising direction for augmenting clinical diagnostic decision support and reducing diagnostic errors, a leading contributor to medical errors. To further the development of clinical AI systems, the Diagnostic Reasoning Benchmark (DR.BENCH) was introduced as a comprehensive generative AI framework, comprised of six tasks representing key components in clinical reasoning.

View Article and Find Full Text PDF

Deployment of Real-time Natural Language Processing and Deep Learning Clinical Decision Support in the Electronic Health Record: Pipeline Implementation for an Opioid Misuse Screener in Hospitalized Adults.

Majid Afshar Sabrina Adelaine Felice Resnik Marlon P Mundt John Long Dmitriy Dligach

JMIR Med Inform

April 2023

Background: The clinical narrative in electronic health records (EHRs) carries valuable information for predictive analytics; however, its free-text form is difficult to mine and analyze for clinical decision support (CDS). Large-scale clinical natural language processing (NLP) pipelines have focused on data warehouse applications for retrospective research efforts. There remains a paucity of evidence for implementing NLP pipelines at the bedside for health care delivery.

View Article and Find Full Text PDF

Progress Note Understanding - Assessment and Plan Reasoning: Overview of the 2022 N2C2 Track 3 shared task.

Yanjun Gao Dmitriy Dligach Timothy Miller Matthew M Churpek Ozlem Uzuner

J Biomed Inform

June 2023

Daily progress notes are a common note type in the electronic health record (EHR) where healthcare providers document the patient's daily progress and treatment plans. The EHR is designed to document all the care provided to patients, but it also enables note bloat with extraneous information that distracts from the diagnoses and treatment plans. Applications of natural language processing (NLP) in the EHR is a growing field with the majority of methods in information extraction.

View Article and Find Full Text PDF

DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing.

Yanjun Gao Dmitriy Dligach Timothy Miller John Caskey Brihat Sharma

J Biomed Inform

February 2023

The meaningful use of electronic health records (EHR) continues to progress in the digital era with clinical decision support systems augmented by artificial intelligence. A priority in improving provider experience is to overcome information overload and reduce the cognitive burden so fewer medical errors and cognitive biases are introduced during patient care. One major type of medical error is diagnostic error due to systematic or predictable errors in judgement that rely on heuristics.

View Article and Find Full Text PDF

The Evaluation of a Clinical Decision Support Tool Using Natural Language Processing to Screen Hospitalized Adults for Unhealthy Substance Use: Protocol for a Quasi-Experimental Design.

Cara Joyce Talar W Markossian Jenna Nikolaides Elisabeth Ramsey Hale M Thompson Dmitriy Dligach

JMIR Res Protoc

December 2022

Background: Automated and data-driven methods for screening using natural language processing (NLP) and machine learning may replace resource-intensive manual approaches in the usual care of patients hospitalized with conditions related to unhealthy substance use. The rigorous evaluation of tools that use artificial intelligence (AI) is necessary to demonstrate effectiveness before system-wide implementation. An NLP tool to use routinely collected data in the electronic health record was previously validated for diagnostic accuracy in a retrospective study for screening unhealthy substance use.

View Article and Find Full Text PDF

Summarizing Patients' Problems from Hospital Progress Notes Using Pre-trained Sequence-to-Sequence Models.

Yanjun Gao Timothy Miller Dongfang Xu Dmitriy Dligach Matthew M Churpek

Proc Int Conf Comput Ling

October 2022

Automatically summarizing patients' main problems from daily progress notes using natural language processing methods helps to battle against information and cognitive overload in hospital settings and potentially assists providers with computerized diagnostic decision support. Problem list summarization requires a model to understand, abstract, and generate clinical documentation. In this work, we propose a new NLP task that aims to generate a list of problems in a patient's daily care plan using input from the provider's progress notes during hospitalization.

View Article and Find Full Text PDF

Hierarchical Annotation for Building A Suite of Clinical Natural Language Processing Tasks: Progress Note Understanding.

Yanjun Gao Dmitriy Dligach Timothy Miller Samuel Tesch Ryan Laffin

LREC Int Conf Lang Resour Eval

June 2022

Applying methods in natural language processing on electronic health records (EHR) data is a growing field. Existing corpus and annotation focus on modeling textual features and relation prediction. However, there is a paucity of annotated corpus built to model clinical diagnostic thinking, a process involving text understanding, domain knowledge abstraction and reasoning.

View Article and Find Full Text PDF

A scoping review of publicly available language tasks in clinical natural language processing.

Yanjun Gao Dmitriy Dligach Leslie Christensen Samuel Tesch Ryan Laffin

J Am Med Inform Assoc

September 2022

Objective: To provide a scoping review of papers on clinical natural language processing (NLP) shared tasks that use publicly available electronic health record data from a cohort of patients.

Materials And Methods: We searched 6 databases, including biomedical research and computer science literature databases. A round of title/abstract screening and full-text screening were conducted by 2 reviewers.

View Article and Find Full Text PDF

Development and multimodal validation of a substance misuse algorithm for referral to treatment using artificial intelligence (SMART-AI): a retrospective deep learning study.

Majid Afshar Brihat Sharma Dmitriy Dligach Madeline Oguss Randall Brown

Lancet Digit Health

June 2022

Background: Substance misuse is a heterogeneous and complex set of behavioural conditions that are highly prevalent in hospital settings and frequently co-occur. Few hospital-wide solutions exist to comprehensively and reliably identify these conditions to prioritise care and guide treatment. The aim of this study was to apply natural language processing (NLP) to clinical notes collected in the electronic health record (EHR) to accurately screen for substance misuse.

View Article and Find Full Text PDF

Correction: Identifying COVID-19 Outbreaks From Contact-Tracing Interview Forms for Public Health Departments: Development of a Natural Language Processing Pipeline.

John Caskey Iain L McConnell Madeline Oguss Dmitriy Dligach Rachel Kulikoff

JMIR Public Health Surveill

March 2022

[This corrects the article DOI: 10.2196/36119.].

View Article and Find Full Text PDF

Identifying COVID-19 Outbreaks From Contact-Tracing Interview Forms for Public Health Departments: Development of a Natural Language Processing Pipeline.

John Caskey Iain L McConnell Madeline Oguss Dmitriy Dligach Rachel Kulikoff

JMIR Public Health Surveill

March 2022

Background: In Wisconsin, COVID-19 case interview forms contain free-text fields that need to be mined to identify potential outbreaks for targeted policy making. We developed an automated pipeline to ingest the free text into a pretrained neural language model to identify businesses and facilities as outbreaks.

Objective: We aimed to examine the precision and recall of our natural language processing pipeline against existing outbreaks and potentially new clusters.

View Article and Find Full Text PDF

Geometric Features Associated with Middle Cerebral Artery Bifurcation Aneurysm Formation: A Matched Case-Control Study.

Jian Zhang Anil Can Pui Man Rosalind Lai Srinivasan Mukundan Victor M Castro Dmitriy Dligach

J Stroke Cerebrovasc Dis

March 2022

Objectives: The pathogenesis of intracranial aneurysms is multifactorial and includes genetic, environmental, and anatomic influences. We aimed to identify image-based morphological parameters that were associated with middle cerebral artery (MCA) bifurcation aneurysms.

Materials And Methods: We evaluated three-dimensional morphological parameters obtained from CT angiography (CTA) or digital subtraction angiography (DSA) from 317 patients with unilateral MCA bifurcation aneurysms diagnosed at the Brigham and Women's Hospital and Massachusetts General Hospital between 1990 and 2016.

View Article and Find Full Text PDF