Objective: Information extraction and classification of clinical data are current challenges in natural language processing. This paper presents a cascaded method to deal with three different extractions and classifications in clinical data: concept annotation, assertion classification and relation classification.
Materials And Methods: A pipeline system was developed for clinical natural language processing that includes a proofreading process, with gold-standard reflexive validation and correction. The information extraction system is a combination of a machine learning approach and a rule-based approach. The outputs of this system are used for evaluation in all three tiers of the fourth i2b2/VA shared-task and workshop challenge.
Results: Overall concept classification attained an F-score of 83.3% against a baseline of 77.0%, the optimal F-score for assertions about the concepts was 92.4% and relation classifier attained 72.6% for relationships between clinical concepts against a baseline of 71.0%. Micro-average results for the challenge test set were 81.79%, 91.90% and 70.18%, respectively.
Discussion: The challenge in the multi-task test requires a distribution of time and work load for each individual task so that the overall performance evaluation on all three tasks would be more informative rather than treating each task assessment as independent. The simplicity of the model developed in this work should be contrasted with the very large feature space of other participants in the challenge who only achieved slightly better performance. There is a need to charge a penalty against the complexity of a model as defined in message minimalisation theory when comparing results.
Conclusion: A complete pipeline system for constructing language processing models that can be used to process multiple practical detection tasks of language structures of clinical records is presented.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3168325 | PMC |
http://dx.doi.org/10.1136/amiajnl-2011-000302 | DOI Listing |
Pilot Feasibility Stud
January 2025
Academic Unit for Ageing and Stroke Research, Leeds Institute of Health Sciences, University of Leeds, Leeds, LS2 9JT, UK.
Background: There is a growing evidence base to support the use of self-management interventions for improving quality of life after stroke. However, stroke survivors with aphasia have been underrepresented in research to date. It is therefore unclear if self-management is an appropriate or effective approach for this group.
View Article and Find Full Text PDFThe Mendelian Phenotype Search Engine (MPSE), a clinical decision support tool using Natural Language Processing and Machine Learning, helped neonatologists expedite decisions to whole genome sequencing (WGS) to diagnose patients in the neonatal intensive care unit. After the MPSE was introduced, utilization of WGS increased, time to ordering WGS decreased, and WGS diagnostic yield increased.
View Article and Find Full Text PDFSci Rep
January 2025
Department of Computer Science and Engineering, University Institute of Technology, Rajiv Gandhi Proudyogiki Vishwavidyalaya (State Technological University of Madhya Pradesh), Madhya Pradesh, Bhopal, 462033, India.
The most common carcinoma-related cause of death among women is breast cancer. Early detection is crucial, and the manual screening method may lead to a delayed diagnosis, which would delay treatment and put lives at risk. Mammography imaging is advised for routine screening to diagnose breast cancer at an early stage.
View Article and Find Full Text PDFHead Neck Pathol
January 2025
Oral Diagnosis Department, Semiology and Oral Pathology Areas, Piracicaba Dental School, University of Campinas (UNICAMP), Av. Limeira, 901, Bairro Areão, Piracicaba, São Paulo, Brazil.
Introduction: Solitary fibrous tumor (SFT) is a fibroblastic neoplasm of uncertain biological origin that is rare in the tongue.
Case Report: A 42-year-old woman presented with a painless, submucosal nodule in the tongue. Based on the clinical hypothesis of benign mesenchymal neoplasia, the lesion was excised, and the specimen was submitted for histopathological analysis.
BMC Bioinformatics
January 2025
Department of Information Technology, Vardhaman College of Engineering, Shamshabad, Hyderabad, India.
Background: Biomedical text mining is a technique that extracts essential information from scientific articles using named entity recognition (NER). Traditional NER methods rely on dictionaries, rules, or curated corpora, which may not always be accessible. To overcome these challenges, deep learning (DL) methods have emerged.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!