Background: Eligibility criteria are the main strategy for screening appropriate participants for clinical trials. Automatic analysis of clinical trial eligibility criteria by digital screening, leveraging natural language processing techniques, can improve recruitment efficiency and reduce the costs involved in promoting clinical research.

Objective: We aimed to create a natural language processing model to automatically classify clinical trial eligibility criteria.

Methods: We proposed a classifier for short text eligibility criteria based on ensemble learning, where a set of pretrained models was integrated. The pretrained models included state-of-the-art deep learning methods for training and classification, including Bidirectional Encoder Representations from Transformers (BERT), XLNet, and A Robustly Optimized BERT Pretraining Approach (RoBERTa). The classification results by the integrated models were combined as new features for training a Light Gradient Boosting Machine (LightGBM) model for eligibility criteria classification.

Results: Our proposed method obtained an accuracy of 0.846, a precision of 0.803, and a recall of 0.817 on a standard data set from a shared task of an international conference. The macro F1 value was 0.807, outperforming the state-of-the-art baseline methods on the shared task.

Conclusions: We designed a model for screening short text classification criteria for clinical trials based on multimodel ensemble learning. Through experiments, we concluded that performance was improved significantly with a model ensemble compared to a single model. The introduction of focal loss could reduce the impact of class imbalance to achieve better performance.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7367522PMC
http://dx.doi.org/10.2196/17832DOI Listing

Publication Analysis

Top Keywords

eligibility criteria
20
ensemble learning
12
clinical trial
12
text classification
8
clinical trials
8
trial eligibility
8
natural language
8
language processing
8
short text
8
pretrained models
8

Similar Publications

Validating the Accuracy of Parkinson's Disease Clinical Diagnosis: A UK Brain Bank Case-Control Study.

Ann Neurol

January 2025

Research Unit of Neurology, Neurophysiology and Neurobiology, Department of Medicine and Surgery, Università Campus Bio-Medico di Roma, Rome, Italy.

Objective: Despite diagnostic criteria refinements, Parkinson's disease (PD) clinical diagnosis still suffers from a not satisfying accuracy, with the post-mortem examination as the gold standard for diagnosis. Seminal clinicopathological series highlighted that a relevant number of patients alive-diagnosed with idiopathic PD have an alternative post-mortem diagnosis. We evaluated the diagnostic accuracy of PD comparing the in-vivo clinical diagnosis with the post-mortem diagnosis performed through the pathological examination in 2 groups.

View Article and Find Full Text PDF

Objectives: To assess the therapeutic effects and safety of Tongxie Yaofang (TXYF) granules vs placebo as an alternative treatment for diarrhoea-predominant irritable bowel syndrome (IBS-D). We hypothesised that TXYF would improve clinical responses among patients with IBS-D.

Design: A randomised, double-blind, placebo-controlled, phase II, superiority trial.

View Article and Find Full Text PDF

Despite progress in healthcare services for individuals living with sickle cell disease (SCD) in Africa, substantial gaps remain in advanced treatments for SCD. To help address this burden, Tanzania has established one of the largest single-centre SCD programmes in the world and developed an advanced therapy programme for SCD focused on patient engagement and advocacy, clinical activities involving exchange blood transfusion (ExBT) and haematopoietic stem cell transplant (HSCT), gene therapy (GT) preparedness, and enabling partnerships. This report describes the programme's genesis, structure and progress achieved.

View Article and Find Full Text PDF

Background:  Digital health interventions targeting behavior change are promising in adults and adolescents; however, less attention has been given to younger children. The proliferation of wearables, such as smartwatches and activity trackers, that support the collection of and reflection on personal health data highlights an opportunity to consider novel approaches to supporting health in young children (aged 5-11 y).

Objective:  This review aims to investigate how smartwatches and activity trackers have been used across child health interventions (for children aged 5-11 y) for different health areas, specifically to identify the population characteristics of those being targeted, describe the characteristics of the devices being used, and report the feasibility and acceptability of these devices for health-related applications with children.

View Article and Find Full Text PDF

Background And Objectives: Identifying genetic causes of dementia in patients visiting memory clinics is important for patient care and family planning. Traditional clinical selection criteria for genetic testing may miss carriers of pathogenic variants in dementia-related genes. This study aimed identify how many carriers we are missing and to optimize criteria for selecting patients for genetic counseling in memory clinics.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!