Processing unstructured clinical texts is often necessary to support tasks in biomedicine, such as matching patients to clinical trials. Among other methods, domain-specific language models have been built to make use of free-text information. This study evaluated the performance of Bidirectional Encoder Representations from Transformers (BERT) models in assessing the similarity between clinical trial texts. We compared an unstructured aggregated summary of clinical trials reviewed at the Johns Hopkins Molecular Tumor Board with the corresponding ClinicalTrials.gov records, focusing on the titles and eligibility criteria. Seven pretrained BERT-based models were used in our analysis. Of the six biomedical-domain-specific models, only SciBERT outperformed the original BERT model by accurately assigning higher similarity scores to matched than to mismatched trials. This finding is promising and suggests that BERT and, likely, other language models may support patient-trial matching.
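
The comparison described above can be illustrated with a minimal sketch. The abstract does not state how the similarity scores were computed, so this example assumes cosine similarity over mean-pooled token embeddings, which is one common choice and not necessarily the authors' method. The function names and the toy vectors are hypothetical; in practice the token vectors would come from a pretrained BERT-based model.

```python
import math


def mean_pool(token_vectors):
    """Average a list of equal-length token vectors into one text vector.

    Assumption: mean pooling is used to turn token embeddings into a
    single text embedding; the source does not specify the pooling step.
    """
    count = len(token_vectors)
    dim = len(token_vectors[0])
    return [sum(vec[i] for vec in token_vectors) / count for i in range(dim)]


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def separates_matched(matched_scores, mismatched_scores):
    """True if matched pairs score higher on average than mismatched pairs."""
    matched_mean = sum(matched_scores) / len(matched_scores)
    mismatched_mean = sum(mismatched_scores) / len(mismatched_scores)
    return matched_mean > mismatched_mean


# Hypothetical toy embeddings standing in for model output.
summary_tokens = [[0.9, 0.1, 0.0], [0.7, 0.3, 0.0]]      # tumor board summary
matched_tokens = [[0.8, 0.2, 0.0], [0.8, 0.2, 0.1]]      # corresponding record
mismatched_tokens = [[0.0, 0.1, 0.9], [0.1, 0.0, 0.8]]   # unrelated record

summary_vec = mean_pool(summary_tokens)
matched_score = cosine_similarity(summary_vec, mean_pool(matched_tokens))
mismatched_score = cosine_similarity(summary_vec, mean_pool(mismatched_tokens))

print(separates_matched([matched_score], [mismatched_score]))
```

Under this reading, a model performs well when scores for matched summary-record pairs are consistently higher than scores for mismatched pairs; comparing mean scores is only one simple way to check that.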

Source
http://dx.doi.org/10.3233/SHTI210848

Similar Publications

Background: Self-narratives about traumatic experiences and symptoms are informative for early identification of potential patients; however, their use in clinical screening is limited. This study aimed to develop an automated screening method that analyzes self-narratives of early adolescent earthquake survivors to screen for PTSD in a timely and effective manner.

Methods: An inquiry-based questionnaire consisting of a series of open-ended questions about trauma history and psychological symptoms was designed to simulate clinical structured interviews based on the DSM-5 diagnostic criteria. It was used to collect self-narratives from 430 survivors of the Ya'an earthquake in Sichuan Province, China.

This study delves into the transformative potential of Machine Learning (ML) and Natural Language Processing (NLP) within the pharmaceutical industry, spotlighting their significant impact on enhancing medical research methodologies and optimizing healthcare service delivery. Utilizing a vast dataset sourced from a well-established online pharmacy, this research employs sophisticated ML algorithms and cutting-edge NLP techniques to critically analyze medical descriptions and optimize recommendation systems for drug prescriptions and patient care management. Key technological integrations include BERT embeddings, which provide nuanced contextual understanding of complex medical texts, and cosine similarity measures coupled with TF-IDF vectorization to significantly enhance the precision and reliability of text-based medical recommendations.

MCBERT: A Multi-Modal Framework for the Diagnosis of Autism Spectrum Disorder.

Biol Psychol

December 2024

Big Data Analytics and Web Intelligence Laboratory, Department of Computer Science & Engineering, Delhi Technological University, New Delhi, India.

Within the domain of neurodevelopmental disorders, autism spectrum disorder (ASD) emerges as a distinctive neurological condition characterized by multifaceted challenges. The delayed identification of ASD poses a considerable hurdle in effectively managing its impact and mitigating its severity. Addressing these complexities requires a nuanced understanding of data modalities and the underlying patterns.

Motivation: Joint extraction of entities and relations is an important research direction in information extraction. The volume of scientific and technological biomedical literature is increasing rapidly, so automatically extracting entities and their relations from this literature is a key task for advancing biomedical research.

Results: The joint entity and relation extraction model achieves both intra-sentence and cross-sentence extraction, alleviating the problem of long-distance information dependence in long documents.

Background: Automated recognition and redaction of personal identifiers in free text can enable organisations to share data while protecting privacy. This is important in the context of pharmacovigilance, since relevant detailed information on the clinical course of events, differential diagnosis, and patient-reported reflections may often only be conveyed in narrative form. The aim of this study is to develop and evaluate a method for automated redaction of person names in English narrative text in adverse event reports.
