Background: Annotating scientific literature with ontology concepts is a critical task for knowledge discovery in biology and several other domains. Ontology-based annotations can power large-scale comparative analyses in applications ranging from evolutionary phenotypes to rare human diseases to the study of protein functions. Computational methods for tagging scientific text with ontology terms have included lexical/syntactic methods, traditional machine learning, and, most recently, deep learning.

Results: Here, we present state-of-the-art deep learning architectures based on Gated Recurrent Units for annotating text with ontology concepts. We use the Colorado Richly Annotated Full Text Corpus (CRAFT) as a gold standard for training and testing. To increase prediction accuracy, we explore a number of additional information sources, including NCBI's BioThesaurus and the Unified Medical Language System (UMLS), to augment the information from CRAFT. Our best model achieves an F1 score and a semantic similarity of 0.84.
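
To make the two reported metrics concrete, the following is a minimal sketch of how a token-level F1 score and an ontology-aware semantic similarity might be computed. It is not the authors' evaluation code: the toy ontology `PARENTS`, the concept identifiers, the outside tag `"O"`, and the choice of Jaccard similarity over ancestor sets are all assumptions made for illustration.

```python
from typing import Dict, List, Set

# Hypothetical toy ontology (assumption): each concept maps to its direct parents.
PARENTS: Dict[str, List[str]] = {
    "GO:root": [],
    "GO:binding": ["GO:root"],
    "GO:protein_binding": ["GO:binding"],
    "GO:dna_binding": ["GO:binding"],
}


def ancestors(concept: str) -> Set[str]:
    """Return the concept together with all of its ancestors."""
    seen: Set[str] = set()
    stack = [concept]
    while stack:
        node = stack.pop()
        if node not in seen:
            seen.add(node)
            stack.extend(PARENTS.get(node, []))
    return seen


def jaccard_similarity(gold: str, predicted: str) -> float:
    """Jaccard overlap of the two ancestor sets (assumed similarity measure)."""
    a, b = ancestors(gold), ancestors(predicted)
    return len(a & b) / len(a | b)


def f1_score(gold: List[str], predicted: List[str], outside: str = "O") -> float:
    """Token-level F1 over all tags except the outside tag."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == p and g != outside)
    fp = sum(1 for g, p in pairs if p != outside and g != p)
    fn = sum(1 for g, p in pairs if g != outside and g != p)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# One exact match, one near miss (parent concept), one spurious prediction.
gold_tags = ["O", "GO:protein_binding", "GO:dna_binding", "O"]
pred_tags = ["O", "GO:protein_binding", "GO:binding", "GO:binding"]
print(f1_score(gold_tags, pred_tags))
print(jaccard_similarity("GO:dna_binding", "GO:binding"))
```

In this sketch F1 gives no credit for predicting a parent of the gold concept, whereas the similarity score gives partial credit, which is one reason to report both.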

Conclusion: The results shown here underscore the impact of using deep learning architectures to automatically recognize ontology concepts in the literature. Augmenting the models with biological information beyond that present in the gold standard corpus yields a distinct improvement in prediction accuracy.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9516808
DOI: http://dx.doi.org/10.1186/s13040-022-00310-0

Publication Analysis

Top Keywords

ontology concepts: 16
gated recurrent: 8
recognizing ontology: 8
text ontology: 8
deep learning: 8
learning architectures: 8
gold standard: 8
prediction accuracy: 8
ontology: 6
recurrent unit: 4

Similar Publications

Background: Research about anxiety, depression and psychosis and their treatments is often reported using inconsistent language, and different aspects of the overall research may be conducted in separate silos. This leads to challenges in evidence synthesis and slows down the development of more effective interventions to prevent and treat these conditions. To address these challenges, the Global Alliance for Living Evidence on aNxiety, depressiOn and pSychosis (GALENOS) Project is conducting a series of living systematic reviews about anxiety, depression and psychosis.

Apples and oranges: Conceptual review as task analysis method.

Eur J Neurosci

January 2025

Department of Philosophy, Radboud University, Nijmegen, The Netherlands.

Conceptual review is a method to address issues of task comparability and task validity in cognitive neuroscience. Meta-analyses within cognitive neuroscience (CNS) as well as integration of neuroscientific findings with findings from adjacent disciplines both involve gathering studies that have purportedly investigated the same mental concept. After all, it is no use comparing apples and oranges.

Background And Objective: Despite significant investments in the normalization and the standardization of Electronic Health Records (EHRs), free text is still the rule rather than the exception in clinical notes. The use of free text has implications in data reuse methods used for supporting clinical research since the query mechanisms used in cohort definition and patient matching are mainly based on structured data and clinical terminologies. This study aims to develop a method for the secondary use of clinical text by: (a) using Natural Language Processing (NLP) for tagging clinical notes with biomedical terminology; and (b) designing an ontology that maps and classifies all the identified tags to various terminologies and allows for running phenotyping queries.

Expanding the concept of ID conversion in TogoID by introducing multi-semantic and label features.

J Biomed Semantics

January 2025

Database Center for Life Science, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Kashiwa, Chiba, Japan.

Background: TogoID ( https://togoid.dbcls.jp/ ) is an identifier (ID) conversion service designed to link IDs across diverse categories of life science databases.

Within the reductionist framework, researchers in the special sciences formulate key terms and concepts and try to explain them with lower-level science terms and concepts. For example, behavioural vision scientists describe contrast perception with a psychometric function, in which the perceived brightness increases logarithmically with the physical contrast of a light patch (the Weber-Fechner law). Visual neuroscientists describe the output of neural circuits with neurometric functions.
