Motivation: Recent technological developments have enabled the possibility of genetic and genomic integrated data analysis approaches, where multiple omics datasets from various biological levels are combined and used to describe (disease) phenotypic variations. The main goal is to explain and ultimately predict phenotypic variations by understanding their genetic basis and the interaction of the associated genetic factors. Therefore, understanding the underlying genetic mechanisms of phenotypic variations is an ever increasing research interest in biomedical sciences. In many situations, we have a set of variables that can be considered to be the outcome variables and a set that can be considered to be explanatory variables. Redundancy analysis (RDA) is an analytic method to deal with this type of directionality. Unfortunately, current implementations of RDA cannot deal optimally with the high dimensionality of omics data (p≫n). The existing theoretical framework, based on Ridge penalization, is suboptimal, since it includes all variables in the analysis. As a solution, we propose to use Elastic Net penalization in an iterative RDA framework to obtain a sparse solution.

Results: We proposed sparse redundancy analysis (sRDA) for high dimensional omics data analysis. We conducted simulation studies with our software implementation of sRDA to assess the reliability of sRDA. Both the analysis of simulated data, and the analysis of 485 512 methylation markers and 18,424 gene-expression values measured in a set of 55 patients with Marfan syndrome show that sRDA is able to deal with the usual high dimensionality of omics data.

Availability And Implementation: http://uva.csala.me/rda.

Contact: a.csala@amc.uva.nl.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btx374DOI Listing

Publication Analysis

Top Keywords

redundancy analysis
12
data analysis
12
phenotypic variations
12
sparse redundancy
8
analysis
8
genetic genomic
8
high dimensionality
8
dimensionality omics
8
omics data
8
data
6

Similar Publications

Structure of Plant Populations in Constructed Wetlands and Their Ability for Water Purification.

Plants (Basel)

January 2025

Core Botanical Gardens/Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China.

In constructed wetlands (CWs) with multiple plant communities, population structure may change over time and these variations may ultimately influence water quality. However, in CWs with multiple plant communities, it is still unclear how population structure may change over time and how these variations ultimately influence water quality. Here, we established a CW featuring multiple plant species within a polder to investigate the variation in plant population structure and wastewater treatment effect for drainage water over the course of one year.

View Article and Find Full Text PDF

The characteristics of heartwood and sapwood not only reflect tree growth and site quality but also provide insights into habitat changes. This study examines the natural Oliv. forest in the Arghan section of the lower Tarim River, comparing the heartwood and sapwood characteristics of at different distances from the river, as well as at varying trunk heights and diameters at breast height (DBH).

View Article and Find Full Text PDF

The basic leucine zipper (bZIP) transcription factors play essential roles in multiple stress responses and have been identified and functionally characterized in many plant species. However, the bZIP family members in blueberry are unclear. In this study, we identified 102 genes in .

View Article and Find Full Text PDF

The significant microbiota variability represents a key feature that makes the full comprehension of the functional interaction between microbiota and the host an ongoing challenge. To overcome this limitation, in this study, fish intestinal microbiota was analyzed through a meta-analysis, identifying the core microbiota and constructing stochastic Bayesian network (BN) models with SAMBA. We combined three experiments performed with gilthead sea bream juveniles of the same hatchery batch, reared at the same season/location, and fed with diets enriched on processed animal proteins (PAP) and other alternative ingredients (NOPAP-PP, NOPAP-SCP).

View Article and Find Full Text PDF

Fertilization Induced Soil Microbial Shifts Show Minor Effects on Yield.

Microorganisms

January 2025

Key Laboratory of Silviculture and Conservation of the Ministry of Education, College of Forestry, Beijing Forestry University, Beijing 100083, China.

Fertilization can improve soil nutrition and increase the yield of , but the response of soil microbial communities to fertilization treatments and their correlation with soil nutrition and yield are unclear. In order to investigate the characteristics of soil physicochemical qualities and the bacterial community, we carried out a field experiment comparing various quantities of nitrogen (N), phosphorus (P), and potassium (K) fertilizers to the unfertilized control treatments and the yield of in raw material forests in response to different applications of fertilizers and to try to clarify the interrelation among the three. Results showed that (1) there are significant differences in the effects of different fertilization treatments on the soil properties of raw material forests.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!