AI Article Synopsis

Article Abstract

With the recent advancement of technologies, it is progressively easier to collect diverse types of genome-wide data. It is commonly expected that by analyzing these data in an integrated way, one can improve the understanding of a complex biological system. Current methods, however, are prone to overfitting heavy noise such that their applications are limited. High noise is one of the major challenges for multiomics data integration. This may be the main cause of overfitting and poor performance in generalization. A sample reweighting strategy is typically used to cope with this problem. In this article, we propose a robust multimodal data integration method, called SMSPL, which can simultaneously predict subtypes of cancers and identify potentially significant multiomics signatures. Especially, the proposed method leverages the linkages between different types of data to interactively recommend high-confidence samples, adopts a new soft weighting scheme to assign weights to the training samples of each type, and then iterates between weights recalculating and classifiers updating. Simulation and five real experiments substantiate the capability of the proposed method for classification and identification of significant multiomics signatures with heavy noise. We expect SMSPL to take a small step in the multiomics data integration and help researchers comprehensively understand the biological process.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCYB.2020.3006240DOI Listing

Publication Analysis

Top Keywords

multiomics data
12
data integration
12
robust multimodal
8
heavy noise
8
multiomics signatures
8
proposed method
8
data
7
multiomics
5
smspl robust
4
multimodal approach
4

Similar Publications

Background: The metabolism of stearoyl-GPE plays a key role in the liver metastasis of gastric cancer. This investigation delves into the mechanisms underlying the intricate tumor microenvironment (TME) heterogeneity triggered by stearoyl metabolism in gastric cancer with liver metastasis (LMGC), offering novel perspectives for LMGC.

Objective: Utilizing Mendelian randomization, we determined that stearoyl metabolism significantly contributes to the progression of gastric cancer (GC).

View Article and Find Full Text PDF

Background: Cancer immunotherapy has transformed metastatic cancer treatment, yet challenges persist regarding therapeutic efficacy. RECQL4, a RecQ-like helicase, plays a central role in DNA replication and repair as part of the DNA damage response, a pathway implicated in enhancing efficacy of immune checkpoint inhibitor (ICI) therapies. However, its role in patient response to ICI remains unclear.

View Article and Find Full Text PDF

Identifying Disease Associated Multi-Omics Network With Mixed Graphical Models Based on Markov Random Field Model.

Genet Epidemiol

January 2025

Interdisciplinary Program of Bioinformatics, College of Natural Science, Seoul National University, Seoul, South Korea.

In this article, we proposed a new method named fused mixed graphical model (FMGM), which can infer network structures associated with dichotomous phenotypes. FMGM is based on a pairwise Markov random field model, and statistical analyses including the proposed method were conducted to find biological markers and underlying network structures of the atopic dermatitis (AD) from multiomics data of 6-month-old infants. The performance of FMGM was evaluated with simulations by using synthetic datasets of power-law networks, showing that FMGM had superior performance for identifying the differences of the networks compared to the separate inference with the previous method, causalMGM (F1-scores 0.

View Article and Find Full Text PDF

Integrating multi-omics data may help researchers understand the genetic underpinnings of complex traits and diseases. However, the best ways to integrate multi-omics data and use them to address pressing scientific questions remain a challenge. One important and topical problem is how to assess the aggregate effect of multiple genomic data types (e.

View Article and Find Full Text PDF

Crohn's disease (CD) and ulcerative colitis (UC) are chronic relapsing inflammatory bowel disorders (IBD), the pathogenesis of which is uncertain but includes genetic susceptibility factors, immune-mediated tissue injury and environmental influences, most of which appear to act via the gut microbiome. We hypothesized that host-microbe alterations could be used to prognostically stratify patients experiencing relapses up to four years after endoscopy. We therefore examined multiple omics data, including published and new datasets, generated from paired inflamed and non-inflamed mucosal biopsies from 142 patients with IBD (54 CD; 88 UC) and from 34 control (non-diseased) biopsies.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!