Introduction: The incidence of oral cavity squamous cell carcinoma (OSCC) continues to rise. OSCC is associated with a low average survival rate, and most patients have a poor disease prognosis because of delayed diagnosis. We used machine learning techniques to predict high-risk cases of OSCC by using salivary autoantibody levels and demographic and behavioral data.

Methods: We collected the salivary samples of patients recruited from a teaching hospital between September 2008 and December 2012. Ten salivary autoantibodies, sex, age, smoking, alcohol consumption, and betel nut chewing were used to build prediction models for identifying patients with a high risk of OSCC. The machine learning algorithms applied in the study were logistic regression, random forest, support vector machine with the radial basis function kernel, eXtreme Gradient Boosting (XGBoost), and a stacking model. We evaluated the performance of the models by using the area under the receiver operating characteristic curve (AUC), with simulations conducted 100 times.

Results: A total of 337 participants were enrolled in this study. The best predictive model was constructed using a stacking algorithm with original forms of age and logarithmic levels of autoantibodies (AUC = 0.795 ± 0.055). Adding autoantibody levels as a data source significantly improved the prediction capability (from 0.698 ± 0.06 to 0.795 ± 0.055, p < 0.001).

Conclusions: We successfully established a prediction model for high-risk cases of OSCC. This model can be applied clinically through an online calculator to provide additional personalized information for OSCC diagnosis, thereby reducing the disease morbidity and mortality rates.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9685866PMC
http://dx.doi.org/10.1186/s12903-022-02607-2DOI Listing

Publication Analysis

Top Keywords

prediction models
8
squamous cell
8
cell carcinoma
8
salivary autoantibody
8
machine learning
8
autoantibody levels
8
development validation
4
machine
4
validation machine
4
machine learning-based
4

Similar Publications

Background: An association exists between obesity and reduced testosterone levels in males. The propose of this research is to reveal the correlation between 15 indices linked to obesity and lipid levels with the concentration of serum testosterone, and incidence of testosterone deficiency (TD) among adult American men.

Methods: The study utilized information gathered from the National Health and Nutrition Examination Survey (NHANES) carried out from 2011 to 2016.

View Article and Find Full Text PDF

A Nomogram utilizing ECG P-wave parameters to predict recurrence risk following catheter ablation in paroxysmal atrial fibrillation.

J Cardiothorac Surg

January 2025

Department of Cardiology, Fujian Medical University Union Hospital, Fujian Heart Medical Center, Fujian Institute of Coronary Heart Disease, Fujian Clinical Medical Research Center for Heart and Macrovascular Disease, Fuzhou, 350001, China.

Objective: The objective of this study is to assess the predictive utility of perioperative P-wave parameters in patients with paroxysmal atrial fibrillation (PAF) undergoing catheter ablation, and to develop a predictive model using these parameters.

Methods: A total of 213 patients with PAF undergoing catheter ablation were retrospectively analyzed. P-wave parameters were measured within 3 days preoperatively and on the day postoperatively to determine their predictive significance for postoperative PAF recurrence.

View Article and Find Full Text PDF

Background: Stroke has emerged as an escalating public health challenge among middle-aged and older individuals in China, closely linked to glycolipid metabolic abnormalities. The Hemoglobin A1c/High-Density Lipoprotein Cholesterol (HbA1c/HDL-C) ratio, an integrated marker of glycolipid homeostasis, may serve as a novel predictor of stroke risk.

Methods: Our investigation utilized data from the China Health and Retirement Longitudinal Study cohort (2011-2018).

View Article and Find Full Text PDF

Background: The clinical manifestations and course of rheumatoid arthritis-associated interstitial lung disease (RA-ILD) exhibits considerable heterogeneity. In this study, we aimed to explore radiographic progression over a defined period, employing the Warrick score as a semi-quantitative measure in early RA-ILD, and to assess the associated risk factors for progression.

Methods: RA-ILD patients underwent consecutive Warrick scoring based on initial high-resolution computed tomography (HRCT) at diagnosis and the first follow-up.

View Article and Find Full Text PDF

Background: Drug-drug interactions (DDIs) especially antagonistic ones present significant risks to patient safety, underscoring the urgent need for reliable prediction methods. Recently, substructure-based DDI prediction has garnered much attention due to the dominant influence of functional groups and substructures on drug properties. However, existing approaches face challenges regarding the insufficient interpretability of identified substructures and the isolation of chemical substructures.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!