Motivation: Statistical analyses of high-throughput sequencing data have re-shaped the biological sciences. In spite of myriad advances, recovering interpretable biological signal from data corrupted by technical noise remains a prevalent open problem. Several classes of procedures, among them classical dimensionality reduction techniques and others incorporating subject-matter knowledge, have provided effective advances. However, no procedure currently satisfies the dual objectives of recovering stable and relevant features simultaneously.

Results: Inspired by recent proposals for making use of control data in the removal of unwanted variation, we propose a variant of principal component analysis (PCA), sparse contrastive PCA that extracts sparse, stable, interpretable and relevant biological signal. The new methodology is compared to competing dimensionality reduction approaches through a simulation study and via analyses of several publicly available protein expression, microarray gene expression and single-cell transcriptome sequencing datasets.

Availability And Implementation: A free and open-source software implementation of the methodology, the scPCA R package, is made available via the Bioconductor Project. Code for all analyses presented in this article is also available via GitHub.

Contact: philippe_boileau@berkeley.edu.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btaa176DOI Listing

Publication Analysis

Top Keywords

sparse contrastive
8
principal component
8
component analysis
8
biological signal
8
dimensionality reduction
8
data
5
exploring high-dimensional
4
biological
4
high-dimensional biological
4
biological data
4

Similar Publications

Baseline and outcome stereoacuity of children with anisometropic amblyopia undergoing dichoptic amblyopia treatment.

J AAPOS

January 2025

Pediatric Vision Laboratory, Retina Foundation of the Southwest, Dallas, Texas; Optometry & Vision Science, University of Waterloo, Waterloo, Ontario, Canada.

Background: One rationale for dichoptic amblyopia therapy is that it may promote recovery of binocular function. Yet data on binocular outcomes in anisometropic amblyopia following dichoptic therapy are sparse. We report factors associated with pre- and post-treatment binocular function in anisometropic amblyopia, and examine binocular function in children who recover normal visual acuity compared to those with residual amblyopia.

View Article and Find Full Text PDF

The antibiotic metronidazole (MNZ) has gained interest as a potential MRI contrast agent for imaging hypoxia. N-labeled MNZ can be efficiently hyperpolarized via SABRE-SHEATH (Signal Amplification By Reversible Exchange in SHield Enables Alignment Transfer to Heteronuclei), but the envisioned MRI approach requires that MNZ rapidly undergoes structural changes in hypoxic environments with significant N frequency differences manifested in its downstream metabolic products. We have performed NMR studies of the anticipated metabolic product amino-MNZ (despite anticipated stability concerns) accompanied by computational density functional theory (DFT) studies to predict the N chemical shifts of different relevant species.

View Article and Find Full Text PDF

Contrastive independent subspace analysis network for multi-view spatial information extraction.

Neural Netw

January 2025

College of Mechatronics and Control Engineering, Shenzhen University, Shenzhen, Guangdong, China.

Multi-view classification integrates features from different views to optimize classification performance. Most of the existing works typically utilize semantic information to achieve view fusion but neglect the spatial information of data itself, which accommodates data representation with correlation information and is proven to be an essential aspect. Thus robust independent subspace analysis network, optimized by sparse and soft orthogonal optimization, is first proposed to extract the latent spatial information of multi-view data with subspace bases.

View Article and Find Full Text PDF

Discovering non-associated pressure-sensitive plasticity models with EUCLID.

Adv Model Simul Eng Sci

January 2025

Department of Mechanical and Process Engineering, Institute for Mechanical Systems, ETH Zürich, Zürich, 8092 Switzerland.

We extend (EUCLID Efficient Unsupervised Constitutive Law Identification and Discovery)-a data-driven framework for automated material model discovery-to pressure-sensitive plasticity models, encompassing arbitrarily shaped yield surfaces with convexity constraints and non-associated flow rules. The method only requires full-field displacement and boundary force data from one single experiment and delivers constitutive laws as interpretable mathematical expressions. We construct a material model library for pressure-sensitive plasticity models with non-associated flow rules in four steps: (1) a Fourier series describes an arbitrary yield surface shape in the deviatoric stress plane; (2) a pressure-sensitive term in the yield function defines the shape of the shear failure surface and determines plastic deformation under tension; (3) a compression cap term determines plastic deformation under compression; (4) a non-associated flow rule may be adopted to avoid the excessive dilatancy induced by plastic deformations.

View Article and Find Full Text PDF

Background: Disturbances in calcium and phosphorus homeostasis resulting from chronic kidney disease (CKD) may lead to atherosclerotic changes in blood vessels, potentially altering bone marrow perfusion. Our study aimed to investigate vertebral bone marrow perfusion using dynamic contrast-enhanced (DCE) MRI with a pharmacokinetic model. We also measured possible changes in water and fat content and bony trabeculae using T2* quantification, MR spectroscopy (MRS), and microcomputed tomography (μCT).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!