Objective: Natural language processing (NLP) tasks are commonly decomposed into subtasks, chained together to form processing pipelines. The residual error produced in these subtasks propagates, adversely affecting the end objectives. Limited availability of annotated clinical data remains a barrier to reaching state-of-the-art operating characteristics using statistically based NLP tools in the clinical domain. Here we explore the unique linguistic constructions of clinical texts and demonstrate the loss in operating characteristics when out-of-the-box part-of-speech (POS) tagging tools are applied to the clinical domain. We test a domain adaptation approach integrating a novel lexical-generation probability rule used in a transformation-based learner to boost POS performance on clinical narratives.

Methods: Two target corpora from independent healthcare institutions were constructed from high frequency clinical narratives. Four leading POS taggers with their out-of-the-box models trained from general English and biomedical abstracts were evaluated against these clinical corpora. A high performing domain adaptation method, Easy Adapt, was compared to our newly proposed method ClinAdapt.

Results: The evaluated POS taggers drop in accuracy by 8.5-15% when tested on clinical narratives. The highest performing tagger reports an accuracy of 88.6%. Domain adaptation with Easy Adapt reports accuracies of 88.3-91.0% on clinical texts. ClinAdapt reports 93.2-93.9%.

Conclusions: ClinAdapt successfully boosts POS tagging performance through domain adaptation requiring a modest amount of annotated clinical data. Improving the performance of critical NLP subtasks is expected to reduce pipeline error propagation leading to better overall results on complex processing tasks.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3756264PMC
http://dx.doi.org/10.1136/amiajnl-2012-001453DOI Listing

Publication Analysis

Top Keywords

domain adaptation
20
clinical narratives
12
clinical
11
improving performance
8
natural language
8
language processing
8
annotated clinical
8
clinical data
8
operating characteristics
8
clinical domain
8

Similar Publications

Bruton's tyrosine kinase (BTK) is a major drug target in immune cells. The membrane-binding pleckstrin homology and tec homology (PH-TH) domains of BTK are required for signaling. Dimerization of the PH-TH module strongly stimulates the kinase activity of BTK in vitro.

View Article and Find Full Text PDF

Tumor development often requires cellular adaptation to a unique, high metabolic state; however, the molecular mechanisms that drive such metabolic changes in TFE3-rearranged renal cell carcinoma (TFE3-RCC) remain poorly understood. TFE3-RCC, a rare subtype of RCC, is defined by the formation of chimeric proteins involving the transcription factor TFE3. In this study, we analyzed cell lines and genetically engineered mice, demonstrating that the expression of the chimeric protein PRCC-TFE3 induced a hypoxia-related signature by transcriptionally upregulating HIF1α and HIF2α.

View Article and Find Full Text PDF

Transapical beating heart septal myectomy learning curve and training of future surgeons: an observational study.

Int J Surg

December 2024

Division of Cardiovascular Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, People's Republic of China.

Background: Description of the learning curve for transapical beating heart septal myectomy (TA-BSM) helps to understand the potential for wider adaptability. The authors elaborate and examine a competency-based training assessment for TA-BSM that could serve to disseminate septal myectomy expertise.

Materials And Methods: Data on 177 consecutive patients who underwent the TA-BSM for hypertrophic obstructive cardiomyopathy (HOCM) between April 2022 and June 2023 was collected prospectively, which was registered on ClinicalTrials.

View Article and Find Full Text PDF

Objectives: Efficient performance evaluation is essential for driving improvement, ensuring accountability and optimisation of outcomes in healthcare delivery. However, its complexity often leads to ineffective implementation. This article aims to advance the field of performance measurement within alternative healthcare delivery models of care through the development and validation of a comprehensive evaluation framework.

View Article and Find Full Text PDF

To ensure that an eHealth technology fits with its intended users, other stakeholders, and the context within which it will be used, thorough development, implementation, and evaluation processes are necessary. The CeHRes (Centre for eHealth and Wellbeing Research) Roadmap is a framework that can help shape these processes. While it has been successfully used in research and practice, new developments and insights have arisen since the Roadmap's first publication in 2011, not only within the domain of eHealth but also within the different disciplines in which the Roadmap is grounded.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!