We propose a multivariate sparse group lasso variable selection and estimation method for data with high-dimensional predictors as well as high-dimensional response variables. The method is carried out through a penalized multivariate multiple linear regression model with an arbitrary group structure for the regression coefficient matrix. It suits many biology studies well in detecting associations between multiple traits and multiple predictors, with each trait and each predictor embedded in some biological functional groups such as genes, pathways or brain regions. The method is able to effectively remove unimportant groups as well as unimportant individual coefficients within important groups, particularly for large p small n problems, and is flexible in handling various complex group structures such as overlapping or nested or multilevel hierarchical structures. The method is evaluated through extensive simulations with comparisons to the conventional lasso and group lasso methods, and is applied to an eQTL association study.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4479976PMC
http://dx.doi.org/10.1111/biom.12292DOI Listing

Publication Analysis

Top Keywords

group lasso
12
multivariate sparse
8
sparse group
8
multivariate multiple
8
multiple linear
8
linear regression
8
arbitrary group
8
group structure
8
group
6
multivariate
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!