AI Article Synopsis

  • The rapid growth of biomedical literature requires automated methods to understand relationships between concepts, leading to the LitCoin NLP challenge aimed at developing and benchmarking these techniques.
  • The study employed ensemble learning with specialized models like BioBERT and PubMedBERT for named entity recognition (NER), while also finetuning a large model for improved relation extraction tasks.
  • The developed NLP system achieved first place in NER and second in relation extraction, demonstrating that specialized models significantly outperform general models like ChatGPT in biomedical contexts, supporting future research initiatives.

Article Abstract

Objectives: The rapid expansion of biomedical literature necessitates automated techniques to discern relationships between biomedical concepts from extensive free text. Such techniques facilitate the development of detailed knowledge bases and highlight research deficiencies. The LitCoin Natural Language Processing (NLP) challenge, organized by the National Center for Advancing Translational Science, aims to evaluate such potential and provides a manually annotated corpus for methodology development and benchmarking.

Materials And Methods: For the named entity recognition (NER) task, we utilized ensemble learning to merge predictions from three domain-specific models, namely BioBERT, PubMedBERT, and BioM-ELECTRA, devised a rule-driven detection method for cell line and taxonomy names and annotated 70 more abstracts as additional corpus. We further finetuned the T0pp model, with 11 billion parameters, to boost the performance on relation extraction and leveraged entites' location information (eg, title, background) to enhance novelty prediction performance in relation extraction (RE).

Results: Our pioneering NLP system designed for this challenge secured first place in Phase I-NER and second place in Phase II-relation extraction and novelty prediction, outpacing over 200 teams. We tested OpenAI ChatGPT 3.5 and ChatGPT 4 in a Zero-Shot setting using the same test set, revealing that our finetuned model considerably surpasses these broad-spectrum large language models.

Discussion And Conclusion: Our outcomes depict a robust NLP system excelling in NER and RE across various biomedical entities, emphasizing that task-specific models remain superior to generic large ones. Such insights are valuable for endeavors like knowledge graph development and hypothesis formulation in biomedical research.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11339500PMC
http://dx.doi.org/10.1093/jamia/ocae061DOI Listing

Publication Analysis

Top Keywords

performance relation
8
relation extraction
8
novelty prediction
8
nlp system
8
place phase
8
biomedical
5
ensemble pretrained
4
pretrained language
4
language models
4
models extract
4

Similar Publications

TRPV4 as a Novel Regulator of Ferroptosis in Colon Adenocarcinoma: Implications for Prognosis and Therapeutic Targeting.

Dig Dis Sci

January 2025

Ningxia Medical University, Xing Qing Block, Shengli Street No.1160, Yin Chuan City, 750004, Ningxia Province, People's Republic of China.

Background: Colon adenocarcinoma (COAD) is a leading cause of cancer-related mortality worldwide. Transient receptor potential vanilloid 4 (TRPV4), a calcium-permeable non-selective cation channel, has been implicated in various cancers, including COAD. This study investigates the role of TRPV4 in colon adenocarcinoma and elucidates its potential mechanism via the ferroptosis pathway.

View Article and Find Full Text PDF

A prediction model for electrical strength of gaseous medium based on molecular reactivity descriptors and machine learning method.

J Mol Model

January 2025

Hubei Key Laboratory·for High-Efficiency-Utilization of Solar Energy and Operation, Control of Energy-Storage System, Hubei-University of Technology, Wuhan, 430068, China.

Context: Ionization and adsorption in gas discharge are similar to electrophilic and nucleophilic reactions. The molecular descriptors characterizing reactions such as electrostatic potential descriptors are useful in predicting the electrical strength of environmentally friendly gases. In this study, descriptors of 73 molecules are employed for correlation analysis with electrical strength.

View Article and Find Full Text PDF

Objective: Rheumatoid arthritis (RA) is an autoimmune condition that causes severe joint deformities and impaired functionality, affecting the well-being and daily life of individuals. Consequently, there is a pressing demand for identifying viable therapeutic targets for treating RA. This study aimed to explore the molecular mechanisms of osteoclast differentiation in PBMC from patients with RA through transcriptome sequencing and bioinformatics analysis.

View Article and Find Full Text PDF

Purpose Of Review: Male breast cancer (MBC) is a rare entity which often arises in elderly people. Aim of this review is to evaluate the principal issues related to MBC in elderly, because the therapeutic management of disease is not only related to the biological behavior of the tumor, but also to the comorbidities and frailty of older population. A scoping literature review was performed on Pubmed and Cochrane Database using the following keywords: therapeutic management/ male/ breast cancer/ elderly patients.

View Article and Find Full Text PDF

Role of immune cell homeostasis in research and treatment response in hepatocellular carcinoma.

Clin Exp Med

January 2025

Department of Thoracic Surgery, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200127, China.

Introduction Recently, immune cells within the tumor microenvironment (TME) have become crucial in regulating cancer progression and treatment responses. The dynamic interactions between tumors and immune cells are emerging as a promising strategy to activate the host's immune system against various cancers. The development and progression of hepatocellular carcinoma (HCC) involve complex biological processes, with the role of the TME and tumor phenotypes still not fully understood.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!