A neural network multi-task learning approach to biomedical named entity recognition.

BMC Bioinformatics

Language Technology Laboratory, DTAL, University of Cambridge, 9 West Road, Cambridge, CB39DB, UK.

Published: August 2017

Background: Named Entity Recognition (NER) is a key task in biomedical text mining. Accurate NER systems require task-specific, manually annotated datasets, which are expensive to develop and thus limited in size. Since such datasets contain related but different information, an interesting question is whether they can be used together to improve NER performance. To investigate this, we develop supervised, multi-task, convolutional neural network models and apply them to a large number of varied existing biomedical named entity datasets. Additionally, we investigate the effect of dataset size on performance in both single- and multi-task settings.
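One common way to train on several related datasets together is hard parameter sharing: every task passes through the same encoder, and each dataset gets its own output layer. The sketch below illustrates that idea only. It is not the authors' convolutional architecture: the class `MultiOutputTagger`, its dimensions, the random untrained weights and the label sets are all hypothetical and chosen for illustration.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Small random weight matrix (illustrative initialisation, no training)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class MultiOutputTagger:
    """Hypothetical toy model: one shared hidden layer, one softmax head per dataset."""

    def __init__(self, input_dim, hidden_dim, labels_per_task, seed=0):
        rng = random.Random(seed)
        # Parameters shared by every task.
        self.shared = make_matrix(hidden_dim, input_dim, rng)
        # One task-specific output layer per dataset.
        self.heads = {
            task: make_matrix(len(labels), hidden_dim, rng)
            for task, labels in labels_per_task.items()
        }
        self.labels = labels_per_task

    def encode(self, token_vector):
        """Shared representation of one token, identical for all tasks."""
        return [math.tanh(h) for h in matvec(self.shared, token_vector)]

    def predict(self, task, token_vector):
        """Return the most probable label and the full distribution for one task."""
        hidden = self.encode(token_vector)
        probs = softmax(matvec(self.heads[task], hidden))
        best = max(range(len(probs)), key=probs.__getitem__)
        return self.labels[task][best], probs


# Two hypothetical tasks, each with its own BIO label set.
model = MultiOutputTagger(
    input_dim=4,
    hidden_dim=3,
    labels_per_task={
        "disease": ["O", "B-Disease", "I-Disease"],
        "chemical": ["O", "B-Chemical", "I-Chemical"],
    },
)
label, probs = model.predict("disease", [0.2, -0.1, 0.4, 0.0])
```

In a trained version of such a model, gradients from every dataset would update the shared layer, which is how a small dataset could benefit from the larger ones.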

Results: We present a single-task model for NER, a Multi-output multi-task model and a Dependent multi-task model. We apply the three models to 15 biomedical datasets containing multiple named entities including Anatomy, Chemical, Disease, Gene/Protein and Species. Each dataset represents a task. The results from the single-task model and the multi-task models are then compared for evidence of benefits from Multi-task Learning. With the Multi-output multi-task model we observed an average F-score improvement of 0.8% over the single-task model, from an average baseline of 78.4%. Although there was a significant drop in performance on one dataset, performance improved significantly for five datasets, by up to 6.3%. For the Dependent multi-task model we observed an average improvement of 0.4% over the single-task model. There were no significant drops in performance on any dataset, and performance improved significantly for six datasets, by up to 1.1%. The dataset size experiments found that as dataset size decreased, the Multi-output model's advantage over the single-task model increased. Using 50, 25 and 10% of the training data resulted in an average drop of approximately 3.4, 8 and 16.7% respectively for the single-task model, but approximately 0.2, 3.0 and 9.8% for the multi-task model.
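For readers unfamiliar with the metric, the sketch below shows one common way to compute an entity-level F-score for NER. The abstract does not state the evaluation protocol, so two things are assumptions here: that labels use BIO tagging, and that a predicted entity counts as correct only when its span and type match the gold annotation exactly. The function names are hypothetical.

```python
def extract_entities(tags):
    """Return the set of (start, end, type) spans encoded by a BIO tag sequence."""
    spans = set()
    start, kind = None, None
    # A sentinel "O" closes any span still open at the end of the sequence.
    for i, tag in enumerate(list(tags) + ["O"]):
        inside = tag.startswith("I-") and kind == tag[2:]
        if start is not None and not inside:
            spans.add((start, i, kind))
            start, kind = None, None
        if tag.startswith("B-"):
            start, kind = i, tag[2:]
    return spans


def f_score(gold_tags, predicted_tags):
    """Exact-match entity-level F1 (assumed protocol, see the lead-in)."""
    gold = extract_entities(gold_tags)
    pred = extract_entities(predicted_tags)
    true_positives = len(gold & pred)
    precision = true_positives / len(pred) if pred else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy example: the prediction finds the disease mention but misses the chemical.
gold = ["B-Disease", "I-Disease", "O", "B-Chemical"]
predicted = ["B-Disease", "I-Disease", "O", "O"]
score = f_score(gold, predicted)
```

In this toy example precision is 1.0 and recall is 0.5, so the F-score is 2/3.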

Conclusions: Our results show that, on average, the multi-task models produced better NER results than the single-task models trained on a single NER dataset. We also found that Multi-task Learning is beneficial for small datasets. Across the various settings the improvements are significant, demonstrating the benefit of Multi-task Learning for this task.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5558737
DOI: http://dx.doi.org/10.1186/s12859-017-1776-8

Publication Analysis

Top Keywords

single-task model: 20
multi-task learning: 16
multi-task model: 16
multi-task: 13
named entity: 12
dataset size: 12
compared single-task: 12
model: 9
neural network: 8
biomedical named: 8

Similar Publications

Biomarkers.

Alzheimers Dement

December 2024

Janssen Research & Development, A Division of Janssen Pharmaceutica, Neuroscience Therapeutic Area, Beerse, Belgium.

Background: Neurodegenerative diseases are a heterogeneous group of illnesses. Differences across patients exist in the underlying biological drivers of disease. Furthermore, cross-diagnostic disease mechanisms exist, and different pathologies often co-occur in the brain.

Background: Dementia has a worldwide prevalence of 55 million people, with 60 to 70% of cases attributed to Alzheimer's Disease (AD). In Antioquia, Colombia, there is a group of families with early-onset AD associated with PSEN1-E280A, a genetic variant with an autosomal dominant inheritance pattern and a penetrance over 99%, which enables the study of individuals across different disease stages. Electroencephalography (EEG) is a non-invasive, portable, and low-cost technique that allows the study of electrophysiological changes associated with neurodegeneration.

Background: Dementia compromises physical function, posing risks for falls. People living with dementia (PWD) have been historically excluded from intervention trials due to researchers' eligibility criteria. Exercise shows potential in enhancing physical function, but more evidence is needed.

Novel approach for quality control testing of medical displays using deep learning technology.

Biomed Phys Eng Express

January 2025

Gunma Prefectural College of Health Sciences, 323-1, Kamioki-machi, Maebashi, Gunma, 371-0052, Japan.

In digital image diagnosis using medical displays, it is crucial to rigorously manage display devices to ensure appropriate image quality and diagnostic safety. The aim of this study was to develop a model for the efficient quality control (QC) of medical displays, specifically addressing the measurement items of contrast response and maximum luminance as part of constancy testing, and to evaluate its performance. In addition, the study focused on whether these tasks could be addressed using a multitasking strategy.

Eye diseases such as age-related macular degeneration (AMD) are major causes of irreversible vision loss. Early and accurate detection of these diseases is essential for effective management. Optical coherence tomography (OCT) imaging provides clinicians with in vivo, cross-sectional views of the retina, enabling the identification of key pathological features.
