AI Article Synopsis

  • - The research addresses the challenge of analyzing omics data, which is complex due to its high dimensions and correlated features, by enhancing the ISIS algorithm with different elastic-net variants for better feature selection in omics studies.
  • - Using DNA methylation data from a large study on American Indians, the authors found that ISIS-AEnet provided superior predictive accuracy and identified significant epigenomic markers related to body mass index (BMI) compared to traditional methods.
  • - The extended ISIS R package also includes options for logistic and Cox regression, enabling it to handle various research scenarios, while maintaining efficient feature selection and supporting biological discoveries.

Article Abstract

The statistical analysis of omics data poses a great computational challenge given their ultra-high-dimensional nature and frequent between-features correlation. In this work, we extended the iterative sure independence screening (ISIS) algorithm by pairing ISIS with elastic-net (Enet) and 2 versions of adaptive elastic-net (adaptive elastic-net (AEnet) and multistep adaptive elastic-net (MSAEnet)) to efficiently improve feature selection and effect estimation in omics research. We subsequently used genome-wide human blood DNA methylation data from American Indian participants in the Strong Heart Study (n = 2235 participants; measured in 1989-1991) to compare the performance (predictive accuracy, coefficient estimation, and computational efficiency) of ISIS-paired regularization methods with that of a bayesian shrinkage and traditional linear regression to identify an epigenomic multimarker of body mass index (BMI). ISIS-AEnet outperformed the other methods in prediction. In biological pathway enrichment analysis of genes annotated to BMI-related differentially methylated positions, ISIS-AEnet captured most of the enriched pathways in common for at least 2 of all the evaluated methods. ISIS-AEnet can favor biological discovery because it identifies the most robust biological pathways while achieving an optimal balance between bias and efficient feature selection. In the extended SIS R package, we also implemented ISIS paired with Cox and logistic regression for time-to-event and binary endpoints, respectively, and a bootstrap approach for the estimation of regression coefficients.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11228868PMC
http://dx.doi.org/10.1093/aje/kwae006DOI Listing

Publication Analysis

Top Keywords

feature selection
12
adaptive elastic-net
12
selection extended
8
extended sis
8
sis package
8
body mass
8
strong heart
8
heart study
8
omics feature
4
package identification
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!