Machine-learning models are key to modern biology, yet models trained on one dataset are often not generalizable to other datasets from different cohorts or laboratories due to both technical and biological differences. Domain adaptation, a type of transfer learning, alleviates this problem by aligning different datasets so that models can be applied across them. However, most state-of-the-art domain adaptation methods were designed for large-scale data such as images, whereas biological datasets are smaller and have more features, and these are also complex and heterogeneous.
View Article and Find Full Text PDFPreeclampsia is a severe obstetrical syndrome which contributes to 10-15% of all maternal deaths. Although the mechanisms underlying systemic damage in preeclampsia-such as impaired placentation, endothelial dysfunction, and immune dysregulation-are well studied, the initial triggers of the condition remain largely unknown. Furthermore, although the pathogenesis of preeclampsia begins early in pregnancy, there are no early diagnostics for this life-threatening syndrome, which is typically diagnosed much later, after systemic damage has already manifested.
View Article and Find Full Text PDFGenomic diversity within species can be driven by both point mutations and larger structural variations, but so far, strain-tracking approaches have focused mostly on the former. In a recent issue of Nature Biotechnology, Ley and colleagues introduce SynTracker, a tool designed for scalable strain tracking with microsynteny in low-coverage metagenomic settings.
View Article and Find Full Text PDFAs investigations of low-biomass microbial communities have become more common, so too has the recognition of major challenges affecting these analyses. These challenges have been shown to compromise biological conclusions and have contributed to several controversies. Here, we review some of the most common and influential challenges in low-biomass microbiome research.
View Article and Find Full Text PDFCross-validation is a common method for estimating the predictive performance of machine learning models. In a data-scarce regime, where one typically wishes to maximize the number of instances used for training the model, an approach called 'leave-one-out cross-validation' is often used. In this design, a separate model is built for predicting each data instance after training on all other instances.
View Article and Find Full Text PDFEvery step in common microbiome profiling protocols has variable efficiency for each microbe. For example, different DNA extraction kits may have different efficiency for Gram-positive and -negative bacteria. These variable efficiencies, combined with technical variation, create strong processing biases, which impede the identification of signals that are reproducible across studies and the development of generalizable and biologically interpretable prediction models.
View Article and Find Full Text PDFBackground: Esophageal adenocarcinoma (EAC) is rising in incidence, and established risk factors do not explain this trend. Esophageal microbiome alterations have been associated with Barrett's esophagus (BE) and dysplasia and EAC. The oral microbiome is tightly linked to the esophageal microbiome; this study aimed to identify salivary microbiome-related factors associated with BE, dysplasia, and EAC.
View Article and Find Full Text PDFPreterm birth (PTB) is the leading cause of neonatal morbidity and mortality. The vaginal microbiome has been associated with PTB, yet the mechanisms underlying this association are not fully understood. Understanding microbial genetic adaptations to selective pressures, especially those related to the host, may yield insights into these associations.
View Article and Find Full Text PDFEsophageal adenocarcinoma (EAC) is rising in incidence and associated with poor survival, and established risk factors do not explain this trend. Microbiome alterations have been associated with progression from the precursor Barrett's esophagus (BE) to EAC, yet the oral microbiome, tightly linked to the esophageal microbiome and easier to sample, has not been extensively studied in this context. We aimed to assess the relationship between the salivary microbiome and neoplastic progression in BE to identify microbiome-related factors that may drive EAC development.
View Article and Find Full Text PDFSequencing-based approaches for the analysis of microbial communities are susceptible to contamination, which could mask biological signals or generate artifactual ones. Methods for in silico decontamination using controls are routinely used, but do not make optimal use of information shared across samples and cannot handle taxa that only partially originate in contamination or leakage of biological material into controls. Here we present Source tracking for Contamination Removal in microBiomes (SCRuB), a probabilistic in silico decontamination method that incorporates shared information across multiple samples and controls to precisely identify and remove contamination.
View Article and Find Full Text PDFPreterm birth (PTB) is the leading cause of neonatal morbidity and mortality. The vaginal microbiome has been associated with PTB, yet the mechanisms underlying this association are not fully understood. Understanding microbial genetic adaptations to selective pressures, especially those related to the host, may yield new insights into these associations.
View Article and Find Full Text PDFSpontaneous preterm birth (sPTB) is a leading cause of maternal and neonatal morbidity and mortality, yet its prevention and early risk stratification are limited. Previous investigations have suggested that vaginal microbes and metabolites may be implicated in sPTB. Here we performed untargeted metabolomics on 232 second-trimester vaginal samples, 80 from pregnancies ending preterm.
View Article and Find Full Text PDFObjective: Previous studies have demonstrated an association between gut microbiota composition and type 1 diabetes (T1D) pathogenesis. However, little is known about the composition and function of the gut microbiome in adults with longstanding T1D or its association with host glycemic control.
Research Design And Methods: We performed a metagenomic analysis of the gut microbiome obtained from fecal samples of 74 adults with T1D, 14.
Patterns of sequencing coverage along a bacterial genome-summarized by a peak-to-trough ratio (PTR)-have been shown to accurately reflect microbial growth rates, revealing a new facet of microbial dynamics and host-microbe interactions. Here, we introduce Compute PTR (CoPTR): a tool for computing PTRs from complete reference genomes and assemblies. Using simulations and data from growth experiments in simple and complex communities, we show that CoPTR is more accurate than the current state of the art while also providing more PTR estimates overall.
View Article and Find Full Text PDFBiomechanical and molecular processes of premature cervical remodeling preceding spontaneous preterm birth (sPTB) likely result from interactions between the cervicovaginal microbiota and host immune responses. A non-optimal cervicovaginal microbiota confers increased risk of sPTB. The cervicovaginal space is metabolically active in pregancy; microbiota can produce, modify, and degrade metabolites within this ecosystem.
View Article and Find Full Text PDFObjective: Despite technological advances, results from various clinical trials have repeatedly shown that many individuals with type 1 diabetes (T1D) do not achieve their glycemic goals. One of the major challenges in disease management is the administration of an accurate amount of insulin for each meal that will match the expected postprandial glycemic response (PPGR). The objective of this study was to develop a prediction model for PPGR in individuals with T1D.
View Article and Find Full Text PDFBackground: Increasing evidence points to the esophageal microbiome as an important co-factor in esophageal neoplasia. Esophageal microbiome composition is strongly influenced by the oral microbiome. Salivary microbiome assessment has emerged as a potential non-invasive tool to identify patients at risk for esophageal cancer, but key host and environmental factors that may affect the salivary microbiome have not been well-defined.
View Article and Find Full Text PDFA central paradigm in microbiome data analysis, which we term the genome-centric paradigm, is that a linear (non-branching) DNA sequence is the ideal representation of a microbial genome. This representation is natural, as microbes indeed have non-branching genomes. Tremendous discoveries in microbiology were made under this paradigm, but is it always optimal for microbiome research? In this Commentary, we claim that the realization of this paradigm in metagenomic assembly, a fundamental step in the "metagenomics analysis pipeline," suboptimally models the extensive genomic variability present in the microbiome.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
February 2021
The COVID-19 pandemic has the potential to affect the human microbiome in infected and uninfected individuals, having a substantial impact on human health over the long term. This pandemic intersects with a decades-long decline in microbial diversity and ancestral microbes due to hygiene, antibiotics, and urban living (the hygiene hypothesis). High-risk groups succumbing to COVID-19 include those with preexisting conditions, such as diabetes and obesity, which are also associated with microbiome abnormalities.
View Article and Find Full Text PDF