Publications by authors named "Bonneau R"

is the causative agent of the venereal disease trichomoniasis which infects men and women globally and is associated with serious outcomes during pregnancy and cancers of the human reproductive tract. Trichomonads parasitize a range of hosts in addition to humans including birds, livestock, and domesticated animals. Recent genetic analysis of trichomonads recovered from columbid birds has provided evidence that these parasite species undergo frequent host-switching, and that a current epoch spillover event from columbids likely gave rise to in humans.

View Article and Find Full Text PDF
Article Synopsis
  • Measuring the impact of online misinformation is hard because not everyone who sees false information believes it!
  • Researchers created a new method to figure out how many people not only see misinformation but also believe it, using surveys and Twitter data!
  • Their findings show that people who are more likely to believe false news often have extreme opinions and are exposed to wrong information faster, which makes it tricky to fix the problem on social media!
View Article and Find Full Text PDF

Phosphotriesterases (PTEs) represent a class of enzymes capable of efficient neutralization of organophosphates (OPs), a dangerous class of neurotoxic chemicals. PTEs suffer from low catalytic activity, particularly at higher temperatures, due to low thermostability and low solubility. Supercharging, a protein engineering approach via selective mutation of surface residues to charged residues, has been successfully employed to generate proteins with increased solubility and thermostability by promoting charge-charge repulsion between proteins.

View Article and Find Full Text PDF

Unlabelled: Microbiome studies have revealed gut microbiota's potential impact on complex diseases. However, many studies often focus on one disease per cohort. We developed a meta-analysis workflow for gut microbiome profiles and analyzed shotgun metagenomic data covering 11 diseases.

View Article and Find Full Text PDF

AlphaFold2 revolutionized structural biology with the ability to predict protein structures with exceptionally high accuracy. Its implementation, however, lacks the code and data required to train new models. These are necessary to (1) tackle new tasks, like protein-ligand complex structure prediction, (2) investigate the process by which the model learns and (3) assess the model's capacity to generalize to unseen regions of fold space.

View Article and Find Full Text PDF

Tissue structure and molecular circuitry in the colon can be profoundly impacted by systemic age-related effects, but many of the underlying molecular cues remain unclear. Here, we built a cellular and spatial atlas of the colon across three anatomical regions and 11 age groups, encompassing ~1,500 mouse gut tissues profiled by spatial transcriptomics and ~400,000 single nucleus RNA-seq profiles. We developed a new computational framework, cSplotch, which learns a hierarchical Bayesian model of spatially resolved cellular expression associated with age, tissue region, and sex, by leveraging histological features to share information across tissue samples and data modalities.

View Article and Find Full Text PDF

Inferring gene regulatory networks (GRNs) from single-cell data is challenging due to heuristic limitations. Existing methods also lack estimates of uncertainty. Here we present Probabilistic Matrix Factorization for Gene Regulatory Network Inference (PMF-GRN).

View Article and Find Full Text PDF

Microbiome studies have revealed gut microbiota's potential impact on complex diseases. However, many studies often focus on one disease per cohort. We developed a meta-analysis workflow for gut microbiome profiles and analyzed shotgun metagenomic data covering 11 diseases.

View Article and Find Full Text PDF

Organophosphates (OPs) are a class of neurotoxic acetylcholinesterase inhibitors including widely used pesticides as well as nerve agents such as VX and VR. Current treatment of these toxins relies on reactivating acetylcholinesterase, which remains ineffective. Enzymatic scavengers are of interest for their ability to degrade OPs systemically before they reach their target.

View Article and Find Full Text PDF
Article Synopsis
  • - The NIAID organized a workshop focusing on the use of various omics approaches (like genomics, transcriptomics, and microbiomics) to study asthma and allergic diseases, bringing together experts from different fields.
  • - Participants discussed the current trends, challenges, and emerging strategies in asthma and allergy research, emphasizing the need for integrated and rigorous analytic frameworks.
  • - The workshop highlighted the importance of cross-disciplinary collaboration to enhance understanding and improve care for asthma and allergic conditions.
View Article and Find Full Text PDF

Background: Modeling of gene regulatory networks (GRNs) is limited due to a lack of direct measurements of genome-wide transcription factor activity (TFA) making it difficult to separate covariance and regulatory interactions. Inference of regulatory interactions and TFA requires aggregation of complementary evidence. Estimating TFA explicitly is problematic as it disconnects GRN inference and TFA estimation and is unable to account for, for example, contextual transcription factor-transcription factor interactions, and other higher order features.

View Article and Find Full Text PDF

Protein hydrogels represent an important and growing biomaterial for a multitude of applications, including diagnostics and drug delivery. We have previously explored the ability to engineer the thermoresponsive supramolecular assembly of coiled-coil proteins into hydrogels with varying gelation properties, where we have defined important parameters in the coiled-coil hydrogel design. Using Rosetta energy scores and Poisson-Boltzmann electrostatic energies, we iterate a computational design strategy to predict the gelation of coiled-coil proteins while simultaneously exploring five new coiled-coil protein hydrogel sequences.

View Article and Find Full Text PDF

The sequence-structure-function relationships that ultimately generate the diversity of extant observed proteins is complex, as proteins bridge the gap between multiple informational and physical scales involved in nearly all cellular processes. One limitation of existing protein annotation databases such as UniProt is that less than 1% of proteins have experimentally verified functions, and computational methods are needed to fill in the missing information. Here, we demonstrate that a multi-aspect framework based on protein language models can learn sequence-structure-function representations of amino acid sequences, and can provide the foundation for sensitive sequence-structure-function aware protein sequence search and annotation.

View Article and Find Full Text PDF

Theranostic materials research is experiencing rapid growth driven by the interest in integrating both therapeutic and diagnostic modalities. These materials offer the unique capability to not only provide treatment but also track the progression of a disease. However, to create an ideal theranostic biomaterial without compromising drug encapsulation, diagnostic imaging must be optimized for improved sensitivity and spatial localization.

View Article and Find Full Text PDF

Cells respond to environmental and developmental stimuli by remodeling their transcriptomes through regulation of both mRNA transcription and mRNA decay. A central goal of biology is identifying the global set of regulatory relationships between factors that control mRNA production and degradation and their target transcripts and construct a predictive model of gene expression. Regulatory relationships are typically identified using transcriptome measurements and causal inference algorithms.

View Article and Find Full Text PDF

Exploiting sequence-structure-function relationships in biotechnology requires improved methods for aligning proteins that have low sequence similarity to previously annotated proteins. We develop two deep learning methods to address this gap, TM-Vec and DeepBLAST. TM-Vec allows searching for structure-structure similarities in large sequence databases.

View Article and Find Full Text PDF

Multiple sequence alignments (MSAs) of proteins encode rich biological information and have been workhorses in bioinformatic methods for tasks like protein design and protein structure prediction for decades. Recent breakthroughs like AlphaFold2 that use transformers to attend directly over large quantities of raw MSAs have reaffirmed their importance. Generation of MSAs is highly computationally intensive, however, and no datasets comparable to those used to train AlphaFold2 have been made available to the research community, hindering progress in machine learning for proteins.

View Article and Find Full Text PDF

Autism spectrum disorder (ASD) is a neurodevelopmental disorder characterized by heterogeneous cognitive, behavioral and communication impairments. Disruption of the gut-brain axis (GBA) has been implicated in ASD although with limited reproducibility across studies. In this study, we developed a Bayesian differential ranking algorithm to identify ASD-associated molecular and taxa profiles across 10 cross-sectional microbiome datasets and 15 other datasets, including dietary patterns, metabolomics, cytokine profiles and human brain gene expression profiles.

View Article and Find Full Text PDF

Background: Disruption of the microbial community in the respiratory tract due to infections, like influenza, could impact transmission of bacterial pathogens. Using samples from a household study, we determined whether metagenomic-type analyses of the microbiome provide the resolution necessary to track transmission of airway bacteria. Microbiome studies have shown that the microbial community across various body sites tends to be more similar between individuals who cohabit in the same household than between individuals from different households.

View Article and Find Full Text PDF

Labeled protein-based biomaterials have become a popular for various biomedical applications such as tissue-engineered, therapeutic, or diagnostic scaffolds. Labeling of protein biomaterials, including with ultrasmall super-paramagnetic iron oxide (USPIO) nanoparticles, has enabled a wide variety of imaging techniques. These USPIO-based biomaterials are widely studied in magnetic resonance imaging (MRI), thermotherapy, and magnetically-driven drug delivery which provide a method for direct and non-invasive monitoring of implants or drug delivery agents.

View Article and Find Full Text PDF

For the past half-century, structural biologists relied on the notion that similar protein sequences give rise to similar structures and functions. While this assumption has driven research to explore certain parts of the protein universe, it disregards spaces that don't rely on this assumption. Here we explore areas of the protein universe where similar protein functions can be achieved by different sequences and different structures.

View Article and Find Full Text PDF

Comprehensive protein function annotation is essential for understanding microbiome-related disease mechanisms in the host organisms. However, a large portion of human gut microbial proteins lack functional annotation. Here, we have developed a new metagenome analysis workflow integrating genome reconstruction, taxonomic profiling, and deep learning-based functional annotations from DeepFRI.

View Article and Find Full Text PDF

Structures of membrane proteins are challenging to determine experimentally and currently represent only about 2% of the structures in the Protein Data Bank. Because of this disparity, methods for modeling membrane proteins are fewer and of lower quality than those for modeling soluble proteins. However, better expression, crystallization, and cryo-EM techniques have prompted a recent increase in experimental structures of membrane proteins, which can act as templates to predict the structure of closely related proteins through homology modeling.

View Article and Find Full Text PDF

The modeling of gene regulatory networks (GRNs) is limited due to a lack of direct measurements of regulatory features in genome-wide screens. Most GRN inference methods are therefore forced to model relationships between regulatory genes and their targets with expression as a proxy for the upstream independent features, complicating validation and predictions produced by modeling frameworks. Separating covariance and regulatory influence requires aggregation of independent and complementary sets of evidence, such as transcription factor (TF) binding and target gene expression.

View Article and Find Full Text PDF