A multimodal approach for few-shot biomedical named entity recognition in low-resource languages.

J Biomed Inform

Department of Population Health Sciences, Weill Cornell Medicine, New York 10022, USA. Electronic address:

Published: January 2025

In this study, we revisit named entity recognition (NER) in the biomedical domain from a multimodal perspective, with a particular focus on applications in low-resource languages. Existing research primarily relies on unimodal methods for NER, which limits the potential for capturing diverse information. To address this limitation, we propose a novel method that integrates a cross-modal generation module to transform unimodal data into multimodal data, thereby enabling the use of enriched multimodal information for NER. Additionally, we design a cross-modal filtering module to mitigate the adverse effects of text-image mismatches in multimodal NER. We validate our proposed method on two biomedical datasets specifically curated for low-resource languages. Experimental results demonstrate that our method significantly enhances the performance of NER, highlighting its effectiveness and potential for broader applications in biomedical research and low-resource language contexts.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jbi.2024.104754DOI Listing

Publication Analysis

Top Keywords

low-resource languages
12
named entity
8
entity recognition
8
multimodal ner
8
multimodal
5
ner
5
multimodal approach
4
approach few-shot
4
biomedical
4
few-shot biomedical
4

Similar Publications

In task-shared, mental health, and psychosocial support interventions, monitoring the quality of delivery (fidelity and competence) of nonspecialist providers is critical. Quality of delivery is frequently reported in brief, summary statistics, and while both fidelity and competence scores tend to be high, rarely have factors associated with quality of delivery in low-resource, mental health, and psychosocial support interventions been examined using inferential statistics. Understanding both modifiable and non-modifiable predictors of quality of delivery is important for adapting training and supervision approaches throughout intervention delivery.

View Article and Find Full Text PDF

In this study, the authors presented a dataset for named entity recognition in the Uzbek language. The dataset consists of 2000 sentences and 25,865 words, and the sources were legal documents and hand-crafted sentences annotated using the BIOES scheme. The study is complemented by the fact that the authors demonstrated the applications of the created dataset by training a language model using the CNN + LSTM architecture, which achieves high accuracy in NER tasks, with an F1 score of 90.

View Article and Find Full Text PDF
Article Synopsis
  • Red cell distribution width (RDW) is a measure of variability in red blood cell sizes and may indicate cardiovascular disease (CVD) risk, especially in low-resource areas, according to a study in Ghana and Nigeria.
  • The study involved 319 adults with hypertension who underwent assessments like blood pressure and RDW measurement, revealing an average RDW of 13.96% and a median CVD risk score of 8.11%.
  • Findings showed that RDW is positively correlated with age, systolic and diastolic blood pressure, and WHO CVD risk scores, suggesting RDW could be a useful predictor of CVD risk in this population, particularly with an RDW cutoff of >14
View Article and Find Full Text PDF

Background: The increasing use of social media to share lived and living experiences of substance use presents a unique opportunity to obtain information on side effects, use patterns, and opinions on novel psychoactive substances. However, due to the large volume of data, obtaining useful insights through natural language processing technologies such as large language models is challenging.

Objective: This paper aims to develop a retrieval-augmented generation (RAG) architecture for medical question answering pertaining to clinicians' queries on emerging issues associated with health-related topics, using user-generated medical information on social media.

View Article and Find Full Text PDF

Background: Engaging fathers(to-be) can improve maternal, newborn, and child health outcomes. However, father-focused interventions in low-resource settings are under-researched. As part of an integrated early childhood development pilot cluster randomised trial in Nairobi's informal settlements, this study aimed to test the feasibility of a text-only intervention for fathers (SMS4baba) adapted from one developed in Australia (SMS4dads).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!