AI Article Synopsis

  • - The text discusses the challenges of detecting complex genetic interactions (epistasis) that influence human traits, pointing out that traditional regression methods struggle with high-order interactions in large genomic datasets due to computational limitations and inadequacies in modeling biological interactions properly.
  • - It introduces the epiTree pipeline, built on a framework called Predictability, Computability, Stability (PCS), which utilizes tree-based models to identify higher-order interactions in genomic data by selecting relevant variants based on tissue-specific gene expression and employing iterative random forests.
  • - The efficacy of the epiTree pipeline is validated through two case studies from the UK Biobank, demonstrating its ability to reveal both known and novel genetic interactions in predicting traits like red hair and multiple sclerosis, thus potentially

Article Abstract

Detecting epistatic drivers of human phenotypes is a considerable challenge. Traditional approaches use regression to sequentially test multiplicative interaction terms involving pairs of genetic variants. For higher-order interactions and genome-wide large-scale data, this strategy is computationally intractable. Moreover, multiplicative terms used in regression modeling may not capture the form of biological interactions. Building on the Predictability, Computability, Stability (PCS) framework, we introduce the epiTree pipeline to extract higher-order interactions from genomic data using tree-based models. The epiTree pipeline first selects a set of variants derived from tissue-specific estimates of gene expression. Next, it uses iterative random forests (iRF) to search training data for candidate Boolean interactions (pairwise and higher-order). We derive significance tests for interactions, based on a stabilized likelihood ratio test, by simulating Boolean tree-structured null (no epistasis) and alternative (epistasis) distributions on hold-out test data. Finally, our pipeline computes PCS epistasis p-values that probabilisticly quantify improvement in prediction accuracy via bootstrap sampling on the test set. We validate the epiTree pipeline in two case studies using data from the UK Biobank: predicting red hair and multiple sclerosis (MS). In the case of predicting red hair, epiTree recovers known epistatic interactions surrounding MC1R and novel interactions, representing non-linearities not captured by logistic regression models. In the case of predicting MS, a more complex phenotype than red hair, epiTree rankings prioritize novel interactions surrounding HLA-DRB1, a variant previously associated with MS in several populations. Taken together, these results highlight the potential for epiTree rankings to help reduce the design space for follow up experiments.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11020961PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0298906PLOS

Publication Analysis

Top Keywords

epitree pipeline
12
red hair
12
interactions
9
boolean interactions
8
higher-order interactions
8
predicting red
8
case predicting
8
hair epitree
8
interactions surrounding
8
novel interactions
8

Similar Publications

Article Synopsis
  • - The text discusses the challenges of detecting complex genetic interactions (epistasis) that influence human traits, pointing out that traditional regression methods struggle with high-order interactions in large genomic datasets due to computational limitations and inadequacies in modeling biological interactions properly.
  • - It introduces the epiTree pipeline, built on a framework called Predictability, Computability, Stability (PCS), which utilizes tree-based models to identify higher-order interactions in genomic data by selecting relevant variants based on tissue-specific gene expression and employing iterative random forests.
  • - The efficacy of the epiTree pipeline is validated through two case studies from the UK Biobank, demonstrating its ability to reveal both known and novel genetic interactions in predicting traits like red hair and multiple sclerosis, thus potentially
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!