AI Article Synopsis

  • Clustering analysis of cancer genomics has challenges due to limits in traditional methods, which struggle to capture complex data structures and handle noise/outliers.
  • * A new approach, called hypergraph regularized tensor robust principal component analysis (HTRPCA), addresses these issues by using a tensor representation and hypergraph regularization to preserve geometric information in cancer omics data.
  • * HTRPCA effectively decomposes data into low-rank and sparse components, allowing for improved clustering of cancer samples, and has shown better performance than existing methods on TCGA datasets.

Article Abstract

In recent years, clustering analysis of cancer genomics data has gained widespread attention. However, limited by the dimensions of the matrix, the traditional methods cannot fully mine the underlying geometric structure information in the data. Besides, noise and outliers inevitably exist in the data. To solve the above two problems, we come up with a new method which uses tensor to represent cancer omics data and applies hypergraph to save the geometric structure information in original data. This model is called hypergraph regularized tensor robust principal component analysis (HTRPCA). The data processed by HTRPCA becomes two parts, one of which is a low-rank component that contains pure underlying structure information between samples, and the other is some sparse interference points. So we can use the low-rank component for clustering. This model can retain complex geometric information between more sample points due to the addition of the hypergraph regularization. Through clustering, we can demonstrate the effectiveness of HTRPCA, and the experimental results on TCGA datasets demonstrate that HTRPCA precedes other advanced methods. This paper proposes a new method of using tensors to represent cancer omics data and introduces hypergraph items to save the geometric structure information of the original data. At the same time, the model decomposes the original tensor into low-order tensors and sparse tensors. The low-rank tensor was used to cluster cancer samples to verify the effectiveness of the method.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s12539-021-00441-8DOI Listing

Publication Analysis

Top Keywords

omics data
12
geometric structure
12
data
9
hypergraph regularized
8
regularized tensor
8
tensor robust
8
robust principal
8
principal component
8
component analysis
8
represent cancer
8

Similar Publications

Unveiling the role of PANoptosis-related genes in breast cancer: an integrated study by multi-omics analysis and machine learning algorithms.

Breast Cancer Res Treat

January 2025

Department of Breast Surgery, Thyroid Surgery, Huangshi Central Hospital, Affiliated Hospital of Hubei Polytechnic University, No.141, Tianjin Road, Huangshi, 435000, Hubei, China.

Background: The heterogeneity of breast cancer (BC) necessitates the identification of novel subtypes and prognostic models to enhance patient stratification and treatment strategies. This study aims to identify novel BC subtypes based on PANoptosis-related genes (PRGs) and construct a robust prognostic model to guide individualized treatment strategies.

Methods: The transcriptome data along with clinical data of BC patients were sourced from the TCGA and GEO databases.

View Article and Find Full Text PDF

Multiplexed spatial mapping of chromatin features, transcriptome and proteins in tissues.

Nat Methods

January 2025

Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.

The phenotypic and functional states of cells are modulated by a complex interactive molecular hierarchy of multiple omics layers, involving the genome, epigenome, transcriptome, proteome and metabolome. Spatial omics approaches have enabled the study of these layers in tissue context but are often limited to one or two modalities, offering an incomplete view of cellular identity. Here we present spatial-Mux-seq, a multimodal spatial technology that allows simultaneous profiling of five different modalities: two histone modifications, chromatin accessibility, whole transcriptome and a panel of proteins at tissue scale and cellular level in a spatially resolved manner.

View Article and Find Full Text PDF

Omics data provide a plethora of quantifiable information that can potentially be used to identify biomarkers targeting the physiological processes and ecological phenomena of organisms. However, omics data have not been fully utilized because current prediction methods in biomarker construction are susceptible to data multidimensionality and noise. We developed OmicSense, a quantitative prediction method that uses a mixture of Gaussian distributions as the probability distribution, yielding the most likely objective variable predicted for each biomarker.

View Article and Find Full Text PDF

Integrative multi-omics analysis of autism spectrum disorder reveals unique microbial macromolecules interactions.

J Adv Res

January 2025

Proteomics and Metabolomics Unit, Basic Research Department, Children's Cancer Hospital, 57357 Cairo, (CCHE-57357), Egypt; Department of Physiology, Faculty of Veterinary Medicine, Suez Canal University, 41522 Ismailia, Egypt. Electronic address:

Introduction: Gut microbiota alterations have been implicated in Autism Spectrum Disorder (ASD), yet the mechanisms linking these changes to ASD pathophysiology remain unclear.

Objectives: This study utilized a multi-omics approach to uncover mechanisms linking gut microbiota to ASD by examining microbial diversity, bacterial metaproteins, associated metabolic pathways and host proteome.

Methods: The gut microbiota of 30 children with severe ASD and 30 healthy controls was analyzed.

View Article and Find Full Text PDF

[Gene coexpression networks: concepts and applications].

Biol Aujourdhui

January 2025

Sorbonne Université, CNRS, Inserm U1156, Institut de Biologie Paris Seine, Laboratoire de Biologie du Développement/UMR7622, 9 Quai St-Bernard, 75005 Paris, France.

The advent of high-throughput omics data and the generation of new algorithms provide the biologists with the opportunity to explore living processes in the context of systems biology aiming at revealing the gene interactions, the networks underlying complex cellular functions. In this article, we discuss two methods for gene network reconstruction, WGCNA (Weighted Gene Correlation Network Analysis) developed by Steve Horvath and collaborators in 2008, and MIIC (Multivariate Information-based Inductive Causation) developed by Hervé Isambert and his team in 2017 and 2024. These two methods are complementary, WGCNA generating undirected networks in which most gene-to-gene interactions are indirect, while MIIC reveals direct interactions and some causal links.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!