Unlabelled: The composition of bacterial transcriptomes is determined by the transcriptional regulatory network (TRN). The TRN regulates the transition from one physiological state to another. Here, we use independent component analysis to monitor the composition of the transcriptome during the transition from the exponential growth phase to the stationary phase.
View Article and Find Full Text PDFThe exponential growth of microbial genome data presents unprecedented opportunities for unlocking the potential of microorganisms. The burgeoning field of pangenomics offers a framework for extracting insights from this big biological data. Recent advances in microbial pangenomic research have generated substantial data and literature, yielding valuable knowledge across diverse microbial species.
View Article and Find Full Text PDFThe transcriptional regulatory network (TRN) in bacteria is thought to rapidly evolve in response to selection pressures, modulating transcription factor (TF) activities and interactions. In order to probe the limits and mechanisms surrounding the short-term adaptability of the TRN, we generated, evolved, and characterized knockout (KO) strains in Escherichia coli for 11 regulators selected based on measured growth impact on glucose minimal media. All but one knockout strain (Δlrp) were able to recover growth and did so requiring few convergent mutations.
View Article and Find Full Text PDFPublic gene expression databases are a rapidly expanding resource of organism responses to diverse perturbations, presenting both an opportunity and a challenge for bioinformatics workflows to extract actionable knowledge of transcription regulatory network function. Here, we introduce a five-step computational pipeline, called iModulonMiner, to compile, process, curate, analyze, and characterize the totality of RNA-seq data for a given organism or cell type. This workflow is centered around the data-driven computation of co-regulated gene sets using Independent Component Analysis, called iModulons, which have been shown to have broad applications.
View Article and Find Full Text PDFMetabologenomics integrates metabolomics with other omics data types to comprehensively study the genetic and environmental factors that influence metabolism. These multi-omics data can be incorporated into genome-scale metabolic models (GEMs), which are highly curated knowledge bases that explicitly account for genes, transcripts, proteins and metabolites. By including all known biochemical reactions catalysed by enzymes and transporters encoded in the human genome, GEMs analyse and predict the behaviour of complex metabolic networks.
View Article and Find Full Text PDFHydrogen sulfide (HS), mainly produced from L-cysteine (Cys), renders bacteria highly resistant to oxidative stress and potentially increases antimicrobial resistance (AMR). CyuR is a Cys-dependent transcription regulator, responsible for the activation of the cyuPA operon and generation of HS. Despite its potential importance, its regulatory network remains poorly understood.
View Article and Find Full Text PDFis an important cyanobacterium that serves as a versatile and robust model for studying circadian biology and photosynthetic metabolism. Its transcriptional regulatory network (TRN) is of fundamental interest, as it orchestrates the cell's adaptation to the environment, including its response to sunlight. Despite the previous characterization of constituent parts of the TRN, a comprehensive layout of its topology remains to be established.
View Article and Find Full Text PDFStreptomyces produces diverse secondary metabolites of biopharmaceutical importance, yet the rate of biosynthesis of these metabolites is often hampered by complex transcriptional regulation. Therefore, a fundamental understanding of transcriptional regulation in Streptomyces is key to fully harness its genetic potential. Here, independent component analysis (ICA) of 454 high-quality gene expression profiles of the model species Streptomyces coelicolor is performed, of which 249 profiles are newly generated for S.
View Article and Find Full Text PDFvalorization is a promising strategy for climate adaptation and transitioning towards a circular carbon economy. Here, we present a multi-scale, integrated systems approach for designing biomanufacturing systems that can utilize as a feedstock, focusing on the Wood-Ljungdahl and reductive glycine pathways. This approach relies on first principles, coupling the optimization of pathway and process variables.
View Article and Find Full Text PDFUnlabelled: Microorganisms with simplified genomes represent interesting cell chassis for systems and synthetic biology. However, genome reduction can lead to undesired traits, such as decreased growth rate and metabolic imbalances. To investigate the impact of genome reduction on strain DGF-298, a strain in which ~ 36% of the genome has been removed, we reconstructed a strain-specific metabolic model (AC1061), investigated the regulation of gene expression using iModulon-based transcriptome analysis, and performed adaptive laboratory evolution to let the strain correct potential imbalances that arose during its simplification.
View Article and Find Full Text PDFSynthetic biology enables the reprogramming of cellular functions for various applications. However, challenges in scalability and predictability persist due to context-dependent performance and complex circuit-host interactions. This study introduces an iModulon-based engineering approach, utilizing machine learning-defined co-regulated gene groups (iModulons) as design parts containing essential genes for specific functions.
View Article and Find Full Text PDFis responsible for a range of diseases in humans contributing significantly to morbidity and mortality. Among more than 200 serotypes of , serotype M1 strains hold the greatest clinical relevance due to their high prevalence in severe human infections. To enhance our understanding of pathogenesis and discovery of potential therapeutic approaches, we have developed the first genome-scale metabolic model (GEM) for a serotype M1 strain, which we name iYH543.
View Article and Find Full Text PDFMature red blood cells (RBCs) lack mitochondria and thus exclusively rely on glycolysis to generate adenosine triphosphate (ATP) during aging in vivo or storage in blood banks. Here, we leveraged 13,029 volunteers from the Recipient Epidemiology and Donor Evaluation Study to identify associations between end-of-storage levels of glycolytic metabolites and donor age, sex, and ancestry-specific genetic polymorphisms in regions encoding phosphofructokinase 1, platelet (detected in mature RBCs); hexokinase 1 (HK1); and ADP-ribosyl cyclase 1 and 2 (CD38/BST1). Gene-metabolite associations were validated in fresh and stored RBCs from 525 Diversity Outbred mice and via multi-omics characterization of 1,929 samples from 643 human RBC units during storage.
View Article and Find Full Text PDFFilamentous Actinobacteria, recently renamed Actinomycetia, are the most prolific source of microbial bioactive natural products. Studies on biosynthetic gene clusters benefit from or require chromosome-level assemblies. Here, we provide DNA sequences from >1000 isolates: 881 complete genomes and 153 near-complete genomes, representing 28 genera and 389 species, including 244 likely novel species.
View Article and Find Full Text PDFIt has proved challenging to quantitatively relate the proteome to the transcriptome on a per-gene basis. Recent advances in data analytics have enabled a biologically meaningful modularization of the bacterial transcriptome. We thus investigate whether matched datasets of transcriptomes and proteomes from bacteria under diverse conditions can be modularized in the same way to reveal novel relationships between their compositions.
View Article and Find Full Text PDFFast growth phenotypes are achieved through optimal transcriptomic allocation, in which cells must balance tradeoffs in resource allocation between diverse functions. One such balance between stress readiness and unbridled growth in has been termed the fear versus greed (f/g) tradeoff. Two specific RNA polymerase (RNAP) mutations observed in adaptation to fast growth have been previously shown to affect the f/g tradeoff, suggesting that genetic adaptations may be primed to control f/g resource allocation.
View Article and Find Full Text PDFUnderstanding diverse bacterial nutritional requirements and responses is foundational in microbial research and biotechnology. In this study, we employed knowledge-enriched transcriptomic analytics to decipher complex stress responses of Vibrio natriegens to supplied nutrients, aiming to enhance microbial engineering efforts. We computed 64 independently modulated gene sets that comprise a quantitative basis for transcriptome dynamics across a comprehensive transcriptomics dataset containing a broad array of nutrient conditions.
View Article and Find Full Text PDFMicrobial genome sequences are rapidly accumulating, enabling large-scale studies of sequence variation. Existing studies primarily focus on coding regions to study amino acid substitution patterns in proteins. However, non-coding regulatory regions also play a distinct role in determining physiologic responses.
View Article and Find Full Text PDFUnlabelled: A critical body of knowledge has developed through advances in protein microscopy, protein-fold modeling, structural biology software, availability of sequenced bacterial genomes, large-scale mutation databases, and genome-scale models. Based on these recent advances, we develop a computational framework that; i) identifies the oligomeric structural proteome encoded by an organism's genome from available structural resources; ii) maps multi-strain alleleomic variation, resulting in the structural proteome for a species; and iii) calculates the 3D orientation of proteins across subcellular compartments with residue-level precision. Using the platform, we; iv) compute the quaternary K-12 MG1655 structural proteome; v) use a dataset of 12,000 mutations to build Random Forest classifiers that can predict the severity of mutations; and, in combination with a genome-scale model that computes proteome allocation, vi) obtain the spatial allocation of the proteome.
View Article and Find Full Text PDFKinetic models of metabolism are promising platforms for studying complex metabolic systems and designing production strains. Given the availability of enzyme kinetic data from historical experiments and machine learning estimation tools, a straightforward modeling approach is to assemble kinetic data enzyme by enzyme until a desired scale is reached. However, this type of 'bottom up' parameterization of kinetic models has been difficult due to a number of issues including gaps in kinetic parameters, the complexity of enzyme mechanisms, inconsistencies between parameters obtained from different sources, and differences.
View Article and Find Full Text PDFGenome mining is revolutionizing natural products discovery efforts. The rapid increase in available genomes demands comprehensive computational platforms to effectively extract biosynthetic knowledge encoded across bacterial pangenomes. Here, we present BGCFlow, a novel systematic workflow integrating analytics for large-scale genome mining of bacterial pangenomes.
View Article and Find Full Text PDFMicrobes have inherent capacities for utilizing various carbon sources, however they often exhibit sub-par fitness due to low metabolic efficiency. To test whether a bacterial strain can optimally utilize multiple carbon sources, Escherichia coli was serially evolved in L-lactate and glycerol. This yielded two end-point strains that evolved first in L-lactate then in glycerol, and vice versa.
View Article and Find Full Text PDFMachine learning applied to large compendia of transcriptomic data has enabled the decomposition of bacterial transcriptomes to identify independently modulated sets of genes, such iModulons represent specific cellular functions. The identification of iModulons enables accurate identification of genes necessary and sufficient for cross-species transfer of cellular functions. We demonstrate cross-species transfer of: 1) the biotransformation of vanillate to protocatechuate, 2) a malonate catabolic pathway, 3) a catabolic pathway for 2,3-butanediol, and 4) an antimicrobial resistance to ampicillin found in multiple Pseudomonas species to Escherichia coli.
View Article and Find Full Text PDFJ Ind Microbiol Biotechnol
January 2024
Unlabelled: The demand for discovering novel microbial secondary metabolites is growing to address the limitations in bioactivities such as antibacterial, antifungal, anticancer, anthelmintic, and immunosuppressive functions. Among microbes, the genus Streptomyces holds particular significance for secondary metabolite discovery. Each Streptomyces species typically encodes approximately 30 secondary metabolite biosynthetic gene clusters (smBGCs) within its genome, which are mostly uncharacterized in terms of their products and bioactivities.
View Article and Find Full Text PDF