Defining Patient-Oriented Natural Language Processing: A New Paradigm for Research and Development to Facilitate Adoption and Use by Medical Experts.

Abeed Sarker Mohammed Ali Al-Garadi Yuan-Chi Yang Jinho Choi Arshed A Quyyumi Greg S Martin

JMIR Med Inform

Predictive Health Institute and Center for Health Discovery and Well Being, Department of Medicine, School of Medicine, Emory University, Atlanta, GA, United States.

Published: September 2021

The capabilities of natural language processing (NLP) methods have expanded significantly in recent years, and progress has been particularly driven by advances in data science and machine learning. However, NLP is still largely underused in patient-oriented clinical research and care (POCRC). A key reason behind this is that clinical NLP methods are typically developed, optimized, and evaluated with narrowly focused data sets and tasks (eg, those for the detection of specific symptoms in free texts). Such research and development (R&D) approaches may be described as problem oriented, and the developed systems perform specialized tasks well. As standalone systems, however, they generally do not comprehensively meet the needs of POCRC. Thus, there is often a gap between the capabilities of clinical NLP methods and the needs of patient-facing medical experts. We believe that to increase the practical use of biomedical NLP, future R&D efforts need to be broadened to a new research paradigm-one that explicitly incorporates characteristics that are crucial for POCRC. We present our viewpoint about 4 such interrelated characteristics that can increase NLP systems' suitability for POCRC (3 that represent NLP system properties and 1 associated with the R&D process)-(1) interpretability (the ability to explain system decisions), (2) patient centeredness (the capability to characterize diverse patients), (3) customizability (the flexibility for adapting to distinct settings, problems, and cohorts), and (4) multitask evaluation (the validation of system performance based on multiple tasks involving heterogeneous data sets). By using the NLP task of clinical concept detection as an example, we detail these characteristics and discuss how they may result in the increased uptake of NLP systems for POCRC.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512184	PMC
http://dx.doi.org/10.2196/18471	DOI Listing

Publication Analysis

Top Keywords

nlp methods

nlp

natural language

language processing

medical experts

clinical nlp

data sets

pocrc

defining patient-oriented

patient-oriented natural

Similar Publications

High-dimensional multiple imputation (HDMI) for partially observed confounders including natural language processing-derived auxiliary covariates.

Am J Epidemiol

January 2025

Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.

Janick Weberpals Pamela A Shaw Kueiyu Joshua Lin Richard Wyss Joseph M Plasek

Multiple imputation (MI) models can be improved with auxiliary covariates (AC), but their performance in high-dimensional data remains unclear. We aimed to develop and compare high-dimensional MI (HDMI) methods using structured and natural language processing (NLP)-derived AC in studies with partially observed confounders. We conducted a plasmode simulation with acute kidney injury as outcome and simulated 100 cohorts with a null treatment effect, incorporating creatinine labs, atrial fibrillation (AFib), and other investigator-derived confounders in the outcome generation.

View Article and Find Full Text PDF

Similar Publications

Artificial Intelligence (AI) and Liquid Biopsy Transforming Early Detection of Liver Metastases in Gastrointestinal Cancers.

Curr Cancer Drug Targets

January 2025

Department of Pharmacology, Sri Shanmugha College of Pharmacy, Sankari, Salem, 637304, Tamil Nadu, India.

Thilagesh P Anand Kumar S Aiswarya Nair U Rabiniraj S Shobana P

Liver metastases from Gastrointestinal (GI) cancers present significant challenges in oncology, often signaling poor prognosis. Traditional detection methods like imaging and tissue biopsies have limitations in sensitivity, specificity, and tumor heterogeneity represen-tation. The advent of artificial intelligence (AI) in healthcare, driven by advancements in ma-chine learning, algorithms, and data science, offers a promising frontier for early detection and management of liver metastases.

View Article and Find Full Text PDF

Similar Publications

Automated Detection of Cancer-Suspicious Findings in Japanese Radiology Reports with Natural Language Processing: A Multicenter Study.

J Imaging Inform Med

January 2025

Department of Medical Informatics, Osaka University Graduate School of Medicine, 2-2 Yamadaoka, Suita, 565-0871, Osaka, Japan.

Kento Sugimoto Shoya Wada Shozo Konishi Junya Sato Katsuki Okada

Missed critical imaging findings, particularly those indicating cancer, are a common issue that can result in delays in patient follow-up and treatment. To address this, we developed a rule-based natural language processing (NLP) algorithm to detect cancer-suspicious findings from Japanese radiology reports. The dataset used consisted of chest and abdomen CT reports from six institutions.

View Article and Find Full Text PDF

Similar Publications

Speak and you shall predict: evidence that speech at initial cocaine abstinence is a biomarker of long-term drug use behavior.

Biol Psychiatry

January 2025

Psychiatry and Neuroscience Departments, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Place, New York City, NY, 10029; Psychiatry and Neuroscience Departments, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Place, New York City, NY, 10029. Electronic address:

Carla Agurto Guillermo Cecchi Sarah King Elif K Eyigoz Muhammad A Parvaz

Background: Valid scalable biomarkers for predicting longitudinal clinical outcomes in psychiatric research are crucial for optimizing intervention and prevention efforts. Here we recorded spontaneous speech from initially abstinent individuals with cocaine use disorder (iCUD) for use in predicting drug use outcomes.

Methods: At baseline, 88 iCUD provided 5-minute speech samples describing the positive consequences of quitting drug use and negative consequences of using drugs.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!