Publications by Uzuner O

Publications by authors named "Uzuner O"

Page 1 of 4

CACER: Clinical concept Annotations for Cancer Events and Relations.

Yujuan Velvin Fu Giridhar Kaushik Ramachandran Ahmad Halwani Bridget T McInnes Fei Xia

J Am Med Inform Assoc

November 2024

Article Synopsis

The study aims to analyze the relationship between cancer drugs and their associated symptoms by extracting structured information from oncology clinical notes using a new corpus, CACER, which includes detailed annotations of over 48,000 medical problems and drug events.
Transformer-based models such as BERT, Llama3, Flan-T5, and GPT-4 were evaluated for their ability to extract events and relationships from clinical narratives, with BERT and Llama3 performing the best overall.
The research concludes that while large language models like GPT-4 are capable, they did not outperform smaller models like BERT, emphasizing the effectiveness of well-annotated training data.

View Article and Find Full Text PDF

Large language models for biomedicine: foundations, opportunities, challenges, and best practices.

Satya S Sahoo Joseph M Plasek Hua Xu Özlem Uzuner Trevor Cohen

J Am Med Inform Assoc

September 2024

Objectives: Generative large language models (LLMs) are a subset of transformers-based neural network architecture models. LLMs have successfully leveraged a combination of an increased number of parameters, improvements in computational efficiency, and large pre-training datasets to perform a wide spectrum of natural language processing (NLP) tasks. Using a few examples (few-shot) or no examples (zero-shot) for prompt-tuning has enabled LLMs to achieve state-of-the-art performance in a broad range of NLP applications.

View Article and Find Full Text PDF

Clinical natural language processing for secondary uses.

Yanjun Gao Diwakar Mahajan Özlem Uzuner Meliha Yetisgen

J Biomed Inform

February 2024

View Article and Find Full Text PDF

LeafAI: query generator for clinical cohort discovery rivaling a human programmer.

Nicholas J Dobbins Bin Han Weipeng Zhou Kristine F Lan H Nina Kim

J Am Med Inform Assoc

November 2023

Objective: Identifying study-eligible patients within clinical databases is a critical step in clinical research. However, accurate query design typically requires extensive technical and biomedical expertise. We sought to create a system capable of generating data model-agnostic queries while also providing novel logical reasoning capabilities for complex clinical trial eligibility criteria.

View Article and Find Full Text PDF

Advancements in extracting social determinants of health information from narrative text.

Kevin Lybarger Oliver J Bear Don't Walk Meliha Yetisgen Özlem Uzuner

J Am Med Inform Assoc

July 2023

View Article and Find Full Text PDF

Overview of the 2022 n2c2 shared task on contextualized medication event extraction in clinical notes.

Diwakar Mahajan Jennifer J Liang Ching-Huei Tsou Özlem Uzuner

J Biomed Inform

August 2023

Background: An accurate medication history, foundational for providing quality medical care, requires understanding of medication change events documented in clinical notes. However, extracting medication changes without the necessary clinical context is insufficient for real-world applications.

Methods: To address this need, Track 1 of the 2022 National NLP Clinical Challenges focused on extracting the context for medication changes documented in clinical notes using the Contextualized Medication Event Dataset.

View Article and Find Full Text PDF

Leveraging natural language processing to augment structured social determinants of health data in the electronic health record.

Kevin Lybarger Nicholas J Dobbins Ritche Long Angad Singh Patrick Wedgeworth

J Am Med Inform Assoc

July 2023

Objective: Social determinants of health (SDOH) impact health outcomes and are documented in the electronic health record (EHR) through structured data and unstructured clinical notes. However, clinical notes often contain more comprehensive SDOH information, detailing aspects such as status, severity, and temporality. This work has two primary objectives: (1) develop a natural language processing information extraction model to capture detailed SDOH information and (2) evaluate the information gain achieved by applying the SDOH extractor to clinical narratives and combining the extracted representations with existing structured data.

View Article and Find Full Text PDF

Progress Note Understanding - Assessment and Plan Reasoning: Overview of the 2022 N2C2 Track 3 shared task.

Yanjun Gao Dmitriy Dligach Timothy Miller Matthew M Churpek Ozlem Uzuner

J Biomed Inform

June 2023

Daily progress notes are a common note type in the electronic health record (EHR) where healthcare providers document the patient's daily progress and treatment plans. The EHR is designed to document all the care provided to patients, but it also enables note bloat with extraneous information that distracts from the diagnoses and treatment plans. Applications of natural language processing (NLP) in the EHR is a growing field with the majority of methods in information extraction.

View Article and Find Full Text PDF

The 2022 n2c2/UW shared task on extracting social determinants of health.

Kevin Lybarger Meliha Yetisgen Özlem Uzuner

J Am Med Inform Assoc

July 2023

Objective: The n2c2/UW SDOH Challenge explores the extraction of social determinant of health (SDOH) information from clinical notes. The objectives include the advancement of natural language processing (NLP) information extraction techniques for SDOH and clinical information more broadly. This article presents the shared task, data, participating teams, performance results, and considerations for future work.

View Article and Find Full Text PDF

Extracting medication changes in clinical narratives using pre-trained language models.

Giridhar Kaushik Ramachandran Kevin Lybarger Yaya Liu Diwakar Mahajan Jennifer J Liang

J Biomed Inform

March 2023

An accurate and detailed account of patient medications, including medication changes within the patient timeline, is essential for healthcare providers to provide appropriate patient care. Healthcare providers or the patients themselves may initiate changes to patient medication. Medication changes take many forms, including prescribed medication and associated dosage modification.

View Article and Find Full Text PDF

Challenges and opportunities for mining adverse drug reactions: perspectives from pharma, regulatory agencies, healthcare providers and consumers.

Graciela Gonzalez-Hernandez Martin Krallinger Monica Muñoz Raul Rodriguez-Esteban Özlem Uzuner

Database (Oxford)

September 2022

Monitoring drug safety is a central concern throughout the drug life cycle. Information about toxicity and adverse events is generated at every stage of this life cycle, and stakeholders have a strong interest in applying text mining and artificial intelligence (AI) methods to manage the ever-increasing volume of this information. Recognizing the importance of these applications and the role of challenge evaluations to drive progress in text mining, the organizers of BioCreative VII (Critical Assessment of Information Extraction in Biology) convened a panel of experts to explore 'Challenges in Mining Drug Adverse Reactions'.

View Article and Find Full Text PDF

Call for papers: Special issue on clinical natural language processing for secondary use applications.

Meliha Yetisgen Ozlem Uzuner Yanjun Gao Diwakar Mahajan

J Biomed Inform

September 2022

View Article and Find Full Text PDF

The Leaf Clinical Trials Corpus: a new resource for query generation from clinical trial eligibility criteria.

Nicholas J Dobbins Tony Mullen Özlem Uzuner Meliha Yetisgen

Sci Data

August 2022

Identifying cohorts of patients based on eligibility criteria such as medical conditions, procedures, and medication use is critical to recruitment for clinical trials. Such criteria are often most naturally described in free-text, using language familiar to clinicians and researchers. In order to identify potential participants at scale, these criteria must first be translated into queries on clinical databases, which can be labor-intensive and error-prone.

View Article and Find Full Text PDF

A scoping review of publicly available language tasks in clinical natural language processing.

Yanjun Gao Dmitriy Dligach Leslie Christensen Samuel Tesch Ryan Laffin

J Am Med Inform Assoc

September 2022

Objective: To provide a scoping review of papers on clinical natural language processing (NLP) shared tasks that use publicly available electronic health record data from a cohort of patients.

Materials And Methods: We searched 6 databases, including biomedical research and computer science literature databases. A round of title/abstract screening and full-text screening were conducted by 2 reviewers.

View Article and Find Full Text PDF

Extracting Radiological Findings With Normalized Anatomical Information Using a Span-Based BERT Relation Extraction Model.

Kevin Lybarger Aashka Damani Martin Gunn O Zlem Uzuner Meliha Yetisgen

AMIA Jt Summits Transl Sci Proc

June 2023

Medical imaging is critical to the diagnosis and treatment of numerous medical problems, including many forms of cancer. Medical imaging reports distill the findings and observations of radiologists, creating an unstructured textual representation of unstructured medical images. Large-scale use of this text-encoded information requires converting the unstructured text to a structured, semantic representation.

View Article and Find Full Text PDF

A Context-Enhanced De-identification System.

Kahyun Lee Mehmet Kayaalp Sam Henry Özlem Uzuner

ACM Trans Comput Healthc

January 2022

Many modern entity recognition systems, including the current state-of-the-art de-identification systems, are based on bidirectional long short-term memory (biLSTM) units augmented by a conditional random field (CRF) sequence optimizer. These systems process the input sentence by sentence. This approach prevents the systems from capturing dependencies over sentence boundaries and makes accurate sentence boundary detection a prerequisite.

View Article and Find Full Text PDF

Transferability of neural network clinical deidentification systems.

Kahyun Lee Nicholas J Dobbins Bridget McInnes Meliha Yetisgen Özlem Uzuner

J Am Med Inform Assoc

November 2021

Objective: Neural network deidentification studies have focused on individual datasets. These studies assume the availability of a sufficient amount of human-annotated data to train models that can generalize to corresponding test data. In real-world situations, however, researchers often have limited or no in-house training data.

View Article and Find Full Text PDF

Corrigendum to: The 2019 National Natural language processing (NLP) Clinical Challenges (n2c2)/Open Health NLP (OHNLP) shared task on clinical concept normalization for clinical records.

Sam Henry Yanshan Wang Feichen Shen Ozlem Uzuner

J Am Med Inform Assoc

October 2021

View Article and Find Full Text PDF

MT-clinical BERT: scaling clinical information extraction with multitask learning.

Andriy Mulyar Ozlem Uzuner Bridget McInnes

J Am Med Inform Assoc

September 2021

Objective: Clinical notes contain an abundance of important, but not-readily accessible, information about patients. Systems that automatically extract this information rely on large amounts of training data of which there exists limited resources to create. Furthermore, they are developed disjointly, meaning that no information can be shared among task-specific systems.

View Article and Find Full Text PDF

Family History Extraction From Synthetic Clinical Narratives Using Natural Language Processing: Overview and Evaluation of a Challenge Data Set and Solutions for the 2019 National NLP Clinical Challenges (n2c2)/Open Health Natural Language Processing (OHNLP) Competition.

Feichen Shen Sijia Liu Sunyang Fu Yanshan Wang Sam Henry

JMIR Med Inform

January 2021

Background: As a risk factor for many diseases, family history (FH) captures both shared genetic variations and living environments among family members. Though there are several systems focusing on FH extraction using natural language processing (NLP) techniques, the evaluation protocol of such systems has not been standardized.

Objective: The n2c2/OHNLP (National NLP Clinical Challenges/Open Health Natural Language Processing) 2019 FH extraction task aims to encourage the community efforts on a standard evaluation and system development on FH extraction from synthetic clinical narratives.

View Article and Find Full Text PDF

The 2019 n2c2/OHNLP Track on Clinical Semantic Textual Similarity: Overview.

Yanshan Wang Sunyang Fu Feichen Shen Sam Henry Ozlem Uzuner

JMIR Med Inform

November 2020

Background: Semantic textual similarity is a common task in the general English domain to assess the degree to which the underlying semantics of 2 text segments are equivalent to each other. Clinical Semantic Textual Similarity (ClinicalSTS) is the semantic textual similarity task in the clinical domain that attempts to measure the degree of semantic equivalence between 2 snippets of clinical text. Due to the frequent use of templates in the Electronic Health Record system, a large amount of redundant text exists in clinical notes, making ClinicalSTS crucial for the secondary use of clinical text in downstream clinical natural language processing applications, such as clinical text summarization, clinical semantics extraction, and clinical information retrieval.

View Article and Find Full Text PDF

The 2019 National Natural language processing (NLP) Clinical Challenges (n2c2)/Open Health NLP (OHNLP) shared task on clinical concept normalization for clinical records.

Sam Henry Yanshan Wang Feichen Shen Ozlem Uzuner

J Am Med Inform Assoc

October 2020

Objective: The 2019 National Natural language processing (NLP) Clinical Challenges (n2c2)/Open Health NLP (OHNLP) shared task track 3, focused on medical concept normalization (MCN) in clinical records. This track aimed to assess the state of the art in identifying and matching salient medical concepts to a controlled vocabulary. In this paper, we describe the task, describe the data set used, compare the participating systems, present results, identify the strengths and limitations of the current state of the art, and identify directions for future research.

View Article and Find Full Text PDF

Adverse drug event detection using reason assignments in FDA drug labels.

Corey Sutphin Kahyun Lee Antonio Jimeno Yepes Özlem Uzuner Bridget T McInnes

J Biomed Inform

October 2020

Adverse drug events (ADEs) are unintended incidents that involve the taking of a medication. ADEs pose significant health and financial problems worldwide. Information about ADEs can inform health care and improve patient safety.

View Article and Find Full Text PDF

Normalizing Adverse Events using Recurrent Neural Networks with Attention.

Kahyun Lee Özlem Uzuner

AMIA Jt Summits Transl Sci Proc

May 2020

Adverse events (AEs) are undesirable outcomes of medication administration and cause many hospitalizations as well as even deaths per year. Information about AEs can enable their prevention. Natural language processing (NLP) techniques can identify AEs from narratives and match them to a structured terminology.

View Article and Find Full Text PDF

Extraction and Analysis of Clinically Important Follow-up Recommendations in a Large Radiology Dataset.

Wilson Lau Thomas H Payne Ozlem Uzuner Meliha Yetisgen

AMIA Jt Summits Transl Sci Proc

May 2020

Communication of follow-up recommendations when abnormalities are identified on imaging studies is prone to error. In this paper, we present a natural language processing approach based on deep learning to automatically identify clinically important recommendations in radiology reports. Our approach first identifies the recommendation sentences and then extracts reason, test, and time frame of the identified recommendations.

View Article and Find Full Text PDF