Identification of biomarkers by integrating multiple omics together is important because complex diseases occur due to an intricate interplay of various genetic materials. Traditional single-omics association tests neither explore this crucial interomics dependence nor identify moderately weak signals due to the multiple-testing burden. Conversely, multiomics data integration imparts complementary information but suffers from an increased multiple-testing burden, data diversity inherent with different omics features, high-dimensionality, and so forth. Most of the available methods address subtype classification using dimension-reduction techniques to circumvent the sample size issue but interacting multiomics biomarker identification methods are unavailable. We propose a two-step model that first investigates phenotype-omics association using logistic regression. Then, selects disease-associated omics using sparse principal components which explores the interrelationship of multiple variables from two omics in a multivariate multiple regression framework. On the basis of this model, we developed a multiomics biomarker identification algorithm, interacting omics search (ioSearch), that jointly tests the effect of multiple omics with disease and between-omics associations by using pathway information that subsequently reduces the multiple-testing burden. Further, inference in terms of p values potentially makes it an easily interpretable biomarker identification tool. Extensive simulation demonstrates ioSearch as statistically powerful with a controlled Type-I error rate. Its application to publicly available breast cancer data sets identified relevant omics features in important pathways.

Download full-text PDF

Source
http://dx.doi.org/10.1002/gepi.22536DOI Listing

Publication Analysis

Top Keywords

multiple-testing burden
12
biomarker identification
12
interacting multiomics
8
breast cancer
8
cancer data
8
data sets
8
multiple omics
8
omics features
8
multiomics biomarker
8
omics
7

Similar Publications

ADELLE: A global testing method for trans-eQTL mapping.

PLoS Genet

January 2025

Department of Human Genetics, The University of Chicago, Chicago, Illinois, United States of America.

Understanding the genetic regulatory mechanisms of gene expression is an ongoing challenge. Genetic variants that are associated with expression levels are readily identified when they are proximal to the gene (i.e.

View Article and Find Full Text PDF

Objective: Fenfluramine (FFA), stiripentol (STP), and cannabidiol (CBD) are approved add-on therapies for seizures in Dravet syndrome (DS). We report on the long-term safety and health care resource utilization (HCRU) of patients with DS treated with FFA under an expanded access program (EAP).

Methods: A cohort of 124 patients received FFA for a median of 2.

View Article and Find Full Text PDF

PsyRiskMR: a comprehensive resource for identifying psychiatric disorders risk factors through Mendelian randomization.

Biol Psychiatry

December 2024

Information Materials and Intelligent Sensing Laboratory of Anhui Province, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui 230601, China. Electronic address:

Background: Psychiatric disorders pose an enormous economic and emotional burden on individuals, their families and society. Given that the current analysis of the pathogenesis of psychiatric disorders remains challenging and time-consuming, elucidating the modifiable risk factors becomes crucial for the diagnosis and management of psychiatric disorders. However, inferring the causal risk factors in these disorders from disparate data sources is challenging due to constraints in data collection and analytical capabilities.

View Article and Find Full Text PDF

Background: The permutation test has been widely used to provide the p-values of statistical tests when the standard test statistics do not follow parametric null distributions. However, the permutation test may require huge numbers of iterations, especially when the detection of very small p-values is required for multiple testing adjustments in the analysis of datasets with a large number of features.

Objective: To overcome this computational burden, we suggest a novel enhanced adaptive permutation test that estimates p-values using the negative binomial (NB) distribution.

View Article and Find Full Text PDF

Classical gene-by-environment interaction (GxE) analysis can be used to characterize genetic effect heterogeneity but has a high multiple testing burden in the context of genome-wide association studies (GWAS). We adapt a colocalization method, SharePro, to account for effect heterogeneity in fine-mapping and identify candidates for GxE analysis with reduced multiple testing burden. SharePro demonstrates improved power for both fine-mapping and GxE analysis compared to existing methods as well as well-controlled false type I error in simulations.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!