From zero to hero: Harnessing transformers for biomedical named entity recognition in zero- and few-shot contexts.

Miloš Košprdić Nikola Prodanović Adela Ljajić Bojana Bašaragin Nikola Milošević

Artif Intell Med

Institute for Artificial Intelligence Research and Development of Serbia, Fruškogorska 1, Novi Sad, 21000, Serbia; Bayer A.G., Research and Development, Mullerstrasse 173, Berlin, 13342, Germany. Electronic address:

Published: October 2024

Supervised named entity recognition (NER) in the biomedical domain depends on large sets of annotated texts with the given named entities. The creation of such datasets can be time-consuming and expensive, while extraction of new entities requires additional annotation tasks and retraining the model. This paper proposes a method for zero- and few-shot NER in the biomedical domain to address these challenges. The method is based on transforming the task of multi-class token classification into binary token classification and pre-training on a large number of datasets and biomedical entities, which allows the model to learn semantic relations between the given and potentially novel named entity labels. We have achieved average F1 scores of 35.44% for zero-shot NER, 50.10% for one-shot NER, 69.94% for 10-shot NER, and 79.51% for 100-shot NER on 9 diverse evaluated biomedical entities with fine-tuned PubMedBERT-based model. The results demonstrate the effectiveness of the proposed method for recognizing new biomedical entities with no or limited number of examples, outperforming previous transformer-based methods, and being comparable to GPT3-based models using models with over 1000 times fewer parameters. We make models and developed code publicly available.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.artmed.2024.102970	DOI Listing

Publication Analysis

Top Keywords

named entity

biomedical entities

entity recognition

zero- few-shot

ner biomedical

biomedical domain

token classification

biomedical

ner

entities

Similar Publications

Heterogeneous entity representation for medicinal synergy prediction.

Bioinformatics

January 2025

School of Data Science and Society, University of North Carolina at Chapel Hill, NC 27599, United States.

Jiawei Wu Jun Wen Mingyuan Yan Anqi Dong Shuai Gao

Motivation: Forecasting the synergistic effects of drug combinations facilitates drug discovery and development, especially regarding cancer therapeutics. While numerous computational methods have emerged, most of them fall short in fully modeling the relationships among clinical entities including drugs, cell lines, and diseases, which hampers their ability to generalize to drug combinations involving unseen drugs. These relationships are complex and multidimensional, requiring sophisticated modeling to capture nuanced interplay that can significantly influence therapeutic efficacy.

View Article and Find Full Text PDF

Similar Publications

Light-driven in-situ synthesis of nano-sulfur and graphene oxide composites for efficient removal of heavy metal ions.

J Hazard Mater

January 2025

State Key Lab of Geohazard prevention & Geoenvironment protection, College of Materials and Chemistry & Chemical Engineering, Chengdu University of Technology, Chengdu 610059, China. Electronic address:

Wentong Fan Sheng Li Qiaomei Yuan Peng Wu Xinfeng Zhang

Sulfur nanoparticles (SNPs) and their composites are promising for heavy metal adsorption, yet current SNPs often lack surface S, leading to low affinity toward heavy metal and ease of aggregation. Here, we report a simple light-driven method for facile prepare SNPs with surfaces enriched with S and in-situ load them onto graphene oxide (GO) to fabricate GO-S composites. Under illumination, the O generated by photosensitizer phloxine B was able to oxidize S into elemental SNPs.

View Article and Find Full Text PDF

Similar Publications

A comprehensive dataset and neural network approach for named entity recognition in the Uzbek language.

Data Brief

February 2025

Tashkent institute of textile and light industry, 5, Shoxdjaxon str., Tashkent city 100100, Uzbekistan.

Davlatyor Mengliev Vladimir Barakhnin Mukhriddin Eshkulov Bahodir Ibragimov Shohrux Madirimov

In this study, the authors presented a dataset for named entity recognition in the Uzbek language. The dataset consists of 2000 sentences and 25,865 words, and the sources were legal documents and hand-crafted sentences annotated using the BIOES scheme. The study is complemented by the fact that the authors demonstrated the applications of the created dataset by training a language model using the CNN + LSTM architecture, which achieves high accuracy in NER tasks, with an F1 score of 90.

View Article and Find Full Text PDF

Similar Publications

Complexed hyaluronic acid-based nanoparticles in cancer therapy and diagnosis: Research trends by natural language processing.

Heliyon

January 2025

Pharmaceutical Sciences and Technology Program, Faculty of Pharmaceutical Sciences, Chulalongkorn University, Bangkok, 10330, Thailand.

Abd Kakhar Umar Patanachai K Limpikirati Bachtiar Rivai Ilham Ardiansah Sriwidodo Sriwidodo

Hyaluronic acid (HA) is a popular surface modifier in targeted cancer delivery due to its receptor-binding abilities. However, HA alone faces limitations in lipid solubility, biocompatibility, and cell internalization, making it less effective as a standalone delivery system. This comprehensive study aimed to explore a dynamic landscape of complexation in HA-based nanoparticles in cancer therapy, examining diverse aspects from influential modifiers to emerging trends in cancer diagnostics.

View Article and Find Full Text PDF

Similar Publications

The Impact of Temperature on Extracting Information From Clinical Trial Publications Using Large Language Models.

Cureus

December 2024

Department of Radiation Oncology, Cantonal Hospital Winterthur, Winterthur, CHE.

Paul Windisch Fabio Dennstädt Carole Koechli Christina Schröder Daniel M Aebersold

Introduction The application of natural language processing (NLP) for extracting data from biomedical research has gained momentum with the advent of large language models (LLMs). However, the effect of different LLM parameters, such as temperature settings, on biomedical text mining remains underexplored and a consensus on what settings can be considered "safe" is missing. This study evaluates the impact of temperature settings on LLM performance for a named entity recognition and a classification task in clinical trial publications.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!