Mapping vaccine names in clinical trials to vaccine ontology using cascaded fine-tuned domain-specific language models.

J Biomed Semantics

Department of Artificial Intelligence and Informatics, Mayo Clinic, Jacksonville, FL, 32224, USA.

Published: August 2024

Background: Vaccines have revolutionized public health by providing protection against infectious diseases. They stimulate the immune system and generate memory cells to defend against targeted diseases. Clinical trials evaluate vaccine performance, including dosage, administration routes, and potential side effects.

ClinicalTrials.gov is a valuable repository of clinical trial information, but its vaccine data lacks standardization, which complicates automatic concept mapping, vaccine-related knowledge development, evidence-based decision-making, and vaccine surveillance.

Results: In this study, we developed a cascaded framework that draws on multiple domain knowledge sources, including clinical trials, the Unified Medical Language System (UMLS), and the Vaccine Ontology (VO), to improve domain-specific language models for automated mapping of vaccine names in clinical trials to VO. VO is a community-based ontology developed to promote vaccine data standardization, integration, and computer-assisted reasoning. Our methodology involved extracting and annotating data from these sources. We first continued pre-training the PubMedBERT model on clinical trial data, producing CTPubMedBERT. We then enhanced CTPubMedBERT by incorporating SAPBERT, which was pretrained using the UMLS, resulting in CTPubMedBERT + SAPBERT. Fine-tuning on the Vaccine Ontology corpus and vaccine data from clinical trials yielded the CTPubMedBERT + SAPBERT + VO model. Finally, we combined a collection of pre-trained models through a weighted rule-based ensemble to normalize the vaccine corpus and improve mapping accuracy. In concept normalization, ranking orders the candidate concepts so that the most suitable match for a given mention appears first. We ranked the top 10 candidate concepts for each mention, and our experimental results show that the proposed cascaded framework consistently outperformed existing effective baselines on vaccine mapping, achieving 71.8% top-1 accuracy and 90.0% top-10 accuracy.
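The ranking and top-k evaluation described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes each model returns a similarity score per candidate concept and that the ensemble is a plain weighted sum of those scores, which is only one possible reading of "weighted rule-based ensemble". The model names, weights, and concept identifiers (C1, C2, C3) are hypothetical placeholders, not real VO identifiers.

```python
from collections import defaultdict


def ensemble_rank(model_scores, weights, k=10):
    """Combine per-model candidate scores into a single ranked top-k list.

    model_scores: {model_name: {concept_id: similarity_score}}
    weights:      {model_name: weight}  (hypothetical values, set by hand)
    """
    combined = defaultdict(float)
    for model_name, scores in model_scores.items():
        weight = weights[model_name]
        for concept_id, score in scores.items():
            combined[concept_id] += weight * score
    # Sort by descending combined score; break ties by concept id for determinism.
    ranked = sorted(combined.items(), key=lambda item: (-item[1], item[0]))
    return [concept_id for concept_id, _ in ranked[:k]]


def top_k_accuracy(predictions, gold, k):
    """Fraction of mentions whose gold concept appears in the first k candidates."""
    hits = sum(1 for ranked, answer in zip(predictions, gold) if answer in ranked[:k])
    return hits / len(gold)


# Hypothetical scores for one vaccine mention from two models.
model_scores = {
    "model_a": {"C1": 0.9, "C2": 0.8, "C3": 0.1},
    "model_b": {"C1": 0.2, "C2": 0.7, "C3": 0.3},
}
weights = {"model_a": 0.6, "model_b": 0.4}

ranking = ensemble_rank(model_scores, weights, k=2)
print(ranking)  # ['C2', 'C1']
```

Under this reading, top-1 accuracy counts a mention as correct only when the gold concept is ranked first, while top-10 accuracy counts it as correct when the gold concept appears anywhere among the first ten candidates, which is why the second figure is always at least as high as the first.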

Conclusion: This study provides detailed insight into how a cascaded framework of fine-tuned domain-specific language models improves the mapping of vaccine names in clinical trials to VO. By effectively leveraging domain-specific information and applying weighted rule-based ensembles of different pre-trained BERT models, our framework can significantly enhance this mapping.


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11316402
DOI: http://dx.doi.org/10.1186/s13326-024-00318-x


