Bayesian variable selection for parametric survival model with applications to cancer omics data.

Hum Genomics

China International Cooperation Center for Environment and Human Health, Nanjing Medical University, 101 Longmian Avenue, Nanjing, 211166, Jiangsu, China.

Published: November 2018

Background: Modeling thousands of markers simultaneously has been of great interest in testing association between genetic biomarkers and disease or disease-related quantitative traits. Recently, an expectation-maximization (EM) approach to Bayesian variable selection (EMVS) facilitating the Bayesian computation was developed for continuous or binary outcome using a fast EM algorithm. However, it is not suitable to the analyses of time-to-event outcome in many public databases such as The Cancer Genome Atlas (TCGA).

Results: We extended the EMVS to high-dimensional parametric survival regression framework (SurvEMVS). A variant of cyclic coordinate descent (CCD) algorithm was used for efficient iteration in M-step, and the extended Bayesian information criteria (EBIC) was employed to make choice on hyperparameter tuning. We evaluated the performance of SurvEMVS using numeric simulations and illustrated the effectiveness on two real datasets. The results of numerical simulations and two real data analyses show the well performance of SurvEMVS in aspects of accuracy and computation. Some potential markers associated with survival of lung or stomach cancer were identified.

Conclusions: These results suggest that our model is effective and can cope with high-dimensional omics data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6218990PMC
http://dx.doi.org/10.1186/s40246-018-0179-xDOI Listing

Publication Analysis

Top Keywords

bayesian variable
8
variable selection
8
parametric survival
8
omics data
8
performance survemvs
8
bayesian
4
selection parametric
4
survival model
4
model applications
4
applications cancer
4

Similar Publications

Purpose: The purpose of this study was to determine if the Vocabulary Acquisition and Usage for Late Talkers (VAULT) intervention could be efficaciously applied to a new treatment target: words a child neither understood nor said. We also assessed whether the type of context variability used to encourage semantic learning (i.e.

View Article and Find Full Text PDF

With the increasing maturity of genetic profiling, an essential and routine task in cancer research is to model disease outcomes/phenotypes using genetic variables. Many methods have been successfully developed. However, oftentimes, empirical performance is unsatisfactory because of a "lack of information.

View Article and Find Full Text PDF

Intraoperative assessment of labral quality determines arthroscopic repair versus reconstruction for hip labral tear treatment. T2 mapping technology discriminates between healthy and damaged cartilage. This study investigated if T2 mapping magnetic resonance imaging (MRI) can preoperatively predict labral repair versus reconstruction.

View Article and Find Full Text PDF

Associations of urinary caffeine metabolites with sex hormones: comparison of three statistical models.

Front Nutr

January 2025

Department of Epidemiology and Health Statistics, School of Public Health, Guilin Medical University, Guilin, China.

Aims: The association between urinary caffeine and caffeine metabolites with sex hormones remains unclear. This study used three statistical models to explore the associations between urinary caffeine and its metabolites and sex hormones among adults.

Methods: We selected the participants aged ≥18 years in the National Health and Nutrition Examination Survey (NHANES) data 2013-2014 as our study subjects.

View Article and Find Full Text PDF

Recent technological advancements have enabled the collection of intensive longitudinal data (ILD), consisting of repeated measurements from the same individual. The threshold autoregressive (TAR) model is often used to capture the dynamic outcome process in ILD, with autoregressive parameters varying based on outcome variable levels. For ILD from multiple individuals, multilevel TAR (ML-TAR) models have been proposed, with Bayesian approaches typically used for parameter estimation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!