Regression-Based Network Estimation for High-Dimensional Genetic Data.

J Comput Biol

1 School of Industrial Management Engineering, Korea University, Seoul, South Korea.

Published: April 2019

Given the continuous advancement in genome sequencing technology, large volumes of gene expression data can be easily obtained. However, the corresponding increase in genetic information necessitates adoption of a new approach for network estimation. Data dimensions increase with the progress in genome sequencing technology, thereby making it difficult to estimate gene networks by causing multicollinearity. Furthermore, such a problem also occurs when hub nodes exist, where gene networks are known to have regulator genes that can be interpreted as hub nodes. This study aims at developing methods that demonstrate good performance when handling high-dimensional data with hub nodes. We propose regression-based approaches as feasible solutions in this article. Elastic-net and adaptive elastic-net penalty regressions were applied to compensate for the disadvantages of existing regression-based approaches employing LASSO or adaptive LASSO. Experiments were performed to compare the proposed regression-based approaches with other conventional methods. We confirmed the superior performance of the regression-based approaches and applied it to actual genetic data to verify the suitability to estimate gene networks. As results, robustness of the proposed methods was demonstrated with respect to high-dimensional gene expression data.

Download full-text PDF

Source
http://dx.doi.org/10.1089/cmb.2018.0225DOI Listing

Publication Analysis

Top Keywords

regression-based approaches
16
gene networks
12
hub nodes
12
network estimation
8
genetic data
8
genome sequencing
8
sequencing technology
8
gene expression
8
expression data
8
estimate gene
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!