Background: Independent Component Analysis (ICA) is a method that models gene expression data as an action of a set of statistically independent hidden factors. The output of ICA depends on a fundamental parameter: the number of components (factors) to compute. The optimal choice of this parameter, related to determining the effective data dimension, remains an open question in the application of blind source separation techniques to transcriptomic data.

Results: Here we address the question of optimizing the number of statistically independent components in the analysis of transcriptomic data for reproducibility of the components in multiple runs of ICA (within the same or within varying effective dimensions) and in multiple independent datasets. To this end, we introduce ranking of independent components based on their stability in multiple ICA computation runs and define a distinguished number of components (Most Stable Transcriptome Dimension, MSTD) corresponding to the point of the qualitative change of the stability profile. Based on a large body of data, we demonstrate that a sufficient number of dimensions is required for biological interpretability of the ICA decomposition and that the most stable components with ranks below MSTD have more chances to be reproduced in independent studies compared to the less stable ones. At the same time, we show that a transcriptomics dataset can be reduced to a relatively high number of dimensions without losing the interpretability of ICA, even though higher dimensions give rise to components driven by small gene sets.

Conclusions: We suggest a protocol of ICA application to transcriptomics data with a possibility of prioritizing components with respect to their reproducibility that strengthens the biological interpretation. Computing too few components (much less than MSTD) is not optimal for interpretability of the results. The components ranked within MSTD range have more chances to be reproduced in independent studies.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5594474PMC
http://dx.doi.org/10.1186/s12864-017-4112-9DOI Listing

Publication Analysis

Top Keywords

independent components
12
components
11
independent
8
transcriptomic data
8
statistically independent
8
number components
8
number dimensions
8
interpretability ica
8
chances reproduced
8
reproduced independent
8

Similar Publications

Screening and identification of evaluation indicators of low phosphorus tolerant germplasm in Gleditsia sinensis Lam.

Sci Rep

December 2024

Institute for Forest Resources and Environment of Guizhou, College of Forestry, Guizhou University, Guiyang, 550025, Guizhou, China.

This study aims to explore the low phosphorus (P) tolerance of saplings from different Gleditsia sinensis Lam. families. It also seeks to screen for Gleditsia sinensis families with strong low P tolerance and identify key indicators for evaluating their tolerance.

View Article and Find Full Text PDF

Using Fourier Transform Infrared spectroscopy (FTIR), it is possible to show chemical composition of materials and / or profile chemical changes occurring in tissues, cells, and body fluids during onset and progression of diseases. For diagnostic application, the use of blood would be the most appropriate in biospectroscopy studies since, (i) it is easily accessible and, (ii) enables frequent analyses of biochemical changes occurring in pathological states. At present, different studies have investigated potential of serum, plasma and sputum being alternative biofluids for lung cancer detection using FTIR.

View Article and Find Full Text PDF

NS1 binding protein regulates stress granule dynamics and clearance by inhibiting p62 ubiquitination.

Nat Commun

December 2024

Department of Biological Sciences and Biotechnology, College of Life Sciences and Nanotechnology, Hannam University, Daejeon, Korea.

The NS1 binding protein, known for interacting with the influenza A virus protein, is involved in RNA processing, cancer, and nerve cell growth regulation. However, its role in stress response independent of viral infections remains unclear. This study investigates NS1 binding protein's function in regulating stress granules during oxidative stress through interactions with GABARAP subfamily proteins.

View Article and Find Full Text PDF

Concurrent optoacoustic tomography and magnetic resonance imaging of resting-state functional connectivity in the mouse brain.

Nat Commun

December 2024

Institute for Biomedical Engineering and Institute of Pharmacology and Toxicology, Faculty of Medicine, University of Zurich, Zurich, Switzerland.

Resting-state functional connectivity (rsFC) has been essential to elucidate the intricacy of brain organization, further revealing clinical biomarkers of neurological disorders. Although functional magnetic resonance imaging (fMRI) remains a cornerstone in the field of rsFC recordings, its interpretation is often hindered by the convoluted physiological origin of the blood-oxygen-level-dependent (BOLD) contrast affected by multiple factors. Here, we capitalize on the unique concurrent multiparametric hemodynamic recordings of a hybrid magnetic resonance optoacoustic tomography platform to comprehensively characterize rsFC in female mice.

View Article and Find Full Text PDF

Purpose: The relationship between sagittal lumbopelvic alignment and the bony pathomorphology of hip dysplasia is currently at the forefront of clinical and scientific interest. The aim of this study was to determine whether there is a compensatory lumbopelvic aspect associated with the two major acetabular phenotypes in dysplastic hips.

Methods: From September 2022 to March 2024, a total of 145 patients with symptomatic bilateral hip dysplasia were included in the study.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!