BowSaw: Inferring Higher-Order Trait Interactions Associated With Complex Biological Phenotypes.

Front Mol Biosci

Bioinformatics Graduate Program, Boston University, Boston, MA, United States.

Published: June 2021

AI Article Synopsis

Article Abstract

Machine learning is helping the interpretation of biological complexity by enabling the inference and classification of cellular, organismal and ecological phenotypes based on large datasets, e.g., from genomic, transcriptomic and metagenomic analyses. A number of available algorithms can help search these datasets to uncover patterns associated with specific traits, including disease-related attributes. While, in many instances, treating an algorithm as a black box is sufficient, it is interesting to pursue an enhanced understanding of how system variables end up contributing to a specific output, as an avenue toward new mechanistic insight. Here we address this challenge through a suite of algorithms, named BowSaw, which takes advantage of the structure of a trained random forest algorithm to identify combinations of variables ("rules") frequently used for classification. We first apply BowSaw to a simulated dataset and show that the algorithm can accurately recover the sets of variables used to generate the phenotypes through complex Boolean rules, even under challenging noise levels. We next apply our method to data from the integrative Human Microbiome Project and find previously unreported high-order combinations of microbial taxa putatively associated with Crohn's disease. By leveraging the structure of trees within a random forest, BowSaw provides a new way of using decision trees to generate testable biological hypotheses.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8245782PMC
http://dx.doi.org/10.3389/fmolb.2021.663532DOI Listing

Publication Analysis

Top Keywords

random forest
8
bowsaw
4
bowsaw inferring
4
inferring higher-order
4
higher-order trait
4
trait interactions
4
interactions associated
4
associated complex
4
complex biological
4
biological phenotypes
4

Similar Publications

Ecosystem functioning and management are primarily concerned with addressing climate change and biodiversity loss, which are closely linked to carbon stock and species diversity. This research aimed to quantify forest understory (shrub and herb) diversity, tree biomass and carbon sequestration in the Binsar Wildlife Sanctuary. Using random sampling methods, data were gathered from six distinct forest communities.

View Article and Find Full Text PDF

The aim of this study was to examine the adherence, changes in weight, and, waist circumference associated with the daily consumption of a culturally preferred food, namely an avocado, among Hispanic/Latina females in the Habitual Diet and Avocado Trial (HAT). HAT was a multisite, randomized controlled trial conducted between 2018 and 2020. Participants in the Avocado-Supplemented Diet Group were provided with and instructed to consume one avocado/day (~2.

View Article and Find Full Text PDF

Background/objectives: With the improvement of living standards, alcoholic liver disease caused by long-term drinking has been a common multiple disease. Probiotic interventions may help mitigate liver damage caused by alcohol intake, but the mechanisms need more investigation.

Methods: This study involved 70 long-term alcohol drinkers (18-65 years old, alcohol consumption ≥20 g/day, lasting for more than one year) who were randomly assigned to either the BC99 group or the placebo group.

View Article and Find Full Text PDF

A Simple Machine Learning-Based Quantitative Structure-Activity Relationship Model for Predicting pIC Inhibition Values of FLT3 Tyrosine Kinase.

Pharmaceuticals (Basel)

January 2025

Centro de Química Médica, Facultad de Medicina Clínica Alemana, Universidad del Desarrollo, Santiago 7780272, Chile.

Acute myeloid leukemia (AML) presents significant therapeutic challenges, particularly in cases driven by mutations in the FLT3 tyrosine kinase. This study aimed to develop a robust and user-friendly machine learning-based quantitative structure-activity relationship (QSAR) model to predict the inhibitory potency (pIC values) of FLT3 inhibitors, addressing the limitations of previous models in dataset size, diversity, and predictive accuracy. Using a dataset which was 14 times larger than those employed in prior studies (1350 compounds with 1269 molecular descriptors), we trained a random forest regressor, chosen due to its superior predictive performance and resistance to overfitting.

View Article and Find Full Text PDF

Depression Recognition Using Daily Wearable-Derived Physiological Data.

Sensors (Basel)

January 2025

Department of Psychological and Cognitive Sciences, Tsinghua University, Beijing 100084, China.

The objective identification of depression using physiological data has emerged as a significant research focus within the field of psychiatry. The advancement of wearable physiological measurement devices has opened new avenues for the identification of individuals with depression in everyday-life contexts. Compared to other objective measurement methods, wearables offer the potential for continuous, unobtrusive monitoring, which can capture subtle physiological changes indicative of depressive states.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!