A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm.

Biomed Res Int

Department of Thoracic Surgery, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, 3 East Qing Chun Road, 310000 Hangzhou, Zhejiang, China.

Published: April 2021

Background: After curative surgical resection, about 30-75% lung adenocarcinoma (LUAD) patients suffer from recurrence with dismal survival outcomes. Identification of patients with high risk of recurrence to impose intense therapy is urgently needed.

Materials And Methods: Gene expression data of LUAD were obtained from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Differentially expressed genes (DEGs) were calculated by comparing the recurrent and primary tissues. Prognostic genes associated with the recurrence-free survival (RFS) of LUAD patients were identified using univariate analysis. LASSO Cox regression and multivariate Cox analysis were applied to extract key genes and establish the prediction model.

Results: We detected 37 DEGs between primary and recurrent LUAD tumors. Using univariate analysis, 31 DEGs were found to be significantly associated with RFS. We established the RFS prediction model including thirteen genes using the LASSO Cox regression. In the training cohort, we classified patients into high- and low-risk groups and found that patients in the high-risk group suffered from worse RFS compared to those in the low-risk group ( < 0.01). Concordant results were confirmed in the internal and external validation cohort. The efficiency of the prediction model was also confirmed under different clinical subgroups. The high-risk group was significantly identified as the risk factor of recurrence in LUAD by the multivariate Cox analysis (HR = 13.37, = 0.01). Compared to clinicopathological features, our prediction model possessed higher accuracy to identify patients with high risk of recurrence (AUC = 96.3%). Finally, we found that the G2M checkpoint pathway was enriched both in recurrent tumors and primary tumors of high-risk patients.

Conclusions: Our recurrence-specific gene-based prognostic prediction model provides extra information about the risk of recurrence in LUAD, which is conducive for clinicians to conduct individualized therapy in clinic.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7669350PMC
http://dx.doi.org/10.1155/2020/9124792DOI Listing

Publication Analysis

Top Keywords

prediction model
20
risk recurrence
12
recurrence-specific gene-based
8
lung adenocarcinoma
8
luad patients
8
patients high
8
high risk
8
gene expression
8
univariate analysis
8
lasso cox
8

Similar Publications

Comprehensive Analysis Reveals the Potential Diagnostic Value of Biomarkers Associated With Aging and Circadian Rhythm in Knee Osteoarthritis.

Orthop Surg

January 2025

Department of Orthopedics, Tianjin Medical University General Hospital, International Science and Technology Cooperation Base of Spinal Cord Injury, Tianjin Key Laboratory of Spine and Spinal Cord, Tianjin, China.

Objective: Knee osteoarthritis (KOA) is characterized by structural changes. Aging is a major risk factor for KOA. Therefore, the objective of this study was to examine the role of genes related to aging and circadian rhythms in KOA.

View Article and Find Full Text PDF

Impeding linear calibration models from accurately predicting target sample analyte amounts are the target sample-wise deviations in measurement profiles (e.g., spectra) relative to calibration samples.

View Article and Find Full Text PDF

Zero echo time (zero-TE) pulse sequences provide a quiet and artifact-free alternative to conventional functional magnetic resonance imaging (fMRI) pulse sequences. The fast readouts (<1 ms) utilized in zero-TE fMRI produce an image contrast with negligible contributions from blood oxygenation level-dependent (BOLD) mechanisms, yet the zero-TE contrast is highly sensitive to brain function. However, the precise relationship between the zero-TE contrast and neuronal activity has not been determined.

View Article and Find Full Text PDF

Background: Established risk models may not be applicable to patients at higher cardiovascular risk with a measured Lp(a) (lipoprotein[a]) level, a causal risk factor for atherosclerotic cardiovascular disease.

Methods: This was a model development study. The data source was the Nashville Biosciences Lp(a) data set, which includes clinical data from the Vanderbilt University Health System.

View Article and Find Full Text PDF

Semiparametric estimator for the covariate-specific receiver operating characteristic curve.

Stat Methods Med Res

January 2025

CITMAga and Department of Statistics and Operations Research, Universidade de Vigo, Vigo, Galicia, Spain.

The study of the predictive ability of a marker is mainly based on the accuracy measures provided by the so-called confusion matrix. Besides, the area under the receiver operating characteristic curve has become a popular index for summarizing the overall accuracy of a marker. However, the nature of the relationship between the marker and the outcome, and the role that potential confounders play in this relationship could be fundamental in order to extrapolate the observed results.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!