A necessary transformation for a sustainable economy is the transition from fossil-derived plastics to polymers derived from biomass and waste resources. While renewable feedstocks can enhance material performance through unique chemical moieties, probing the vast material design space by experiment alone is not practically feasible. Here, we develop a machine-learning-based tool, PolyID, to reduce the design space of renewable feedstocks to enable efficient discovery of performance-advantaged, biobased polymers.
View Article and Find Full Text PDFThe discovery of new materials in unexplored chemical spaces necessitates quick and accurate prediction of thermodynamic stability, often assessed using density functional theory (DFT), and efficient search strategies. Here, we develop a new approach to finding stable inorganic functional materials. We start by defining an upper bound to the fully relaxed energy obtained via DFT as the energy resulting from a constrained optimization over only cell volume.
View Article and Find Full Text PDFMuconic acid is a bioprivileged molecule that can be converted into direct replacement chemicals for incumbent petrochemicals and performance-advantaged bioproducts. In this study, Pseudomonas putida KT2440 is engineered to convert glucose and xylose, the primary carbohydrates in lignocellulosic hydrolysates, to muconic acid using a model-guided strategy to maximize the theoretical yield. Using adaptive laboratory evolution (ALE) and metabolic engineering in a strain engineered to express the D-xylose isomerase pathway, we demonstrate that mutations in the heterologous D-xylose:H symporter (XylE), increased expression of a major facilitator superfamily transporter (PP_2569), and overexpression of aroB encoding the native 3-dehydroquinate synthase, enable efficient muconic acid production from glucose and xylose simultaneously.
View Article and Find Full Text PDFLong-lived organic radicals are promising candidates for the development of high-performance energy solutions such as organic redox batteries, transistors, and light-emitting diodes. However, "stable" organic radicals that remain unreactive for an extended time and that can be stored and handled under ambient conditions are rare. A necessary but not sufficient condition for organic radical stability is the presence of thermodynamic stabilization, such as conjugation with an adjacent π-bond or lone-pair, or hyperconjugation with a σ-bond.
View Article and Find Full Text PDFNuclear magnetic resonance (NMR) is one of the primary techniques used to elucidate the chemical structure, bonding, stereochemistry, and conformation of organic compounds. The distinct chemical shifts in an NMR spectrum depend upon each atom's local chemical environment and are influenced by both through-bond and through-space interactions with other atoms and functional groups. The prediction of NMR chemical shifts using quantum mechanical (QM) calculations is now commonplace in aiding organic structural assignment since spectra can be computed for several candidate structures and then compared with experimental values to find the best possible match.
View Article and Find Full Text PDFOptimizing the metabolism of microbial cell factories for yields and titers is a critical step for economically viable production of bioproducts and biofuels. In this process, tuning the expression of individual enzymes to obtain the desired pathway flux is a challenging step, in which data from separate multiomics techniques must be integrated with existing biological knowledge to determine where changes should be made. Following a design-build-test-learn strategy, building on recent advances in Bayesian metabolic control analysis, we identify key enzymes in the oleaginous yeast that correlate with the production of itaconate by integrating a metabolic model with multiomics measurements.
View Article and Find Full Text PDFPrior engineering of the ethanologen has enabled it to metabolize xylose and to produce 2,3-butanediol (2,3-BDO) as a dominant fermentation product. When co-fermenting with xylose, glucose is preferentially utilized, even though xylose metabolism generates ATP more efficiently during 2,3-BDO production on a BDO-mol basis. To gain a deeper understanding of metabolism, we first estimated the kinetic parameters of the glucose facilitator protein of by fitting a kinetic uptake model, which shows that the maximum transport capacity of glucose is seven times higher than that of xylose, and glucose is six times more affinitive to the transporter than xylose.
View Article and Find Full Text PDFMachine-readable chemical structure representations are foundational in all attempts to harness machine learning for the prediction of reactivities, selectivities, and chemical properties directly from molecular structure. The featurization of discrete chemical structures into a continuous vector space is a critical phase undertaken before model selection, and the development of new ways to quantitatively encode molecules is an active area of research. In this Account, we highlight the application and suitability of different representations, from expert-guided "engineered" descriptors to automatically "learned" features, in different prediction tasks relevant to organic and organometallic chemistry, where differing amounts of training data are available.
View Article and Find Full Text PDFComput Struct Biotechnol J
December 2020
Microorganisms rely on protein interactions to transmit signals, react to stimuli, and grow. One of the best ways to understand these protein interactions is through structural characterization. However, in the past, structural knowledge was limited to stable, high-affinity complexes that could be crystallized.
View Article and Find Full Text PDFAs genetic engineering of organisms has grown easier and more precise, computational modeling of metabolic systems has played an increasingly important role in both guiding experimental interventions and in understanding the results of metabolic perturbations.
View Article and Find Full Text PDFThe stabilities of radicals play a central role in determining the thermodynamics and kinetics of many reactions in organic chemistry. In this data descriptor, we provide consistent and validated quantum chemical calculations for over 200,000 organic radical species and 40,000 associated closed-shell molecules containing C, H, N and O atoms. These data consist of optimized 3D geometries, enthalpies, Gibbs free energy, vibrational frequencies, Mulliken charges and spin densities calculated at the M06-2X/def2-TZVP level of theory, which was previously found to have a favorable trade-off between experimental accuracy and computational efficiency.
View Article and Find Full Text PDFThe hazards to health and the environment associated with the transportation sector include smog, particulate matter, and greenhouse gas emissions. Conversion of lignocellulosic biomass into biofuels has the potential to provide significant amounts of infrastructure-compatible liquid transportation fuels that reduce those hazardous materials. However, the development of these technologies is inefficient, due to: (i) the lack of a priori fuel property consideration, (ii) poor shared vocabulary between process chemists and fuel engineers, and (iii) modern and future engines operating outside the range of traditional autoignition metrics such as octane or cetane numbers.
View Article and Find Full Text PDFAn amendment to this paper has been published and can be accessed via a link at the top of the paper.
View Article and Find Full Text PDFBond dissociation enthalpies (BDEs) of organic molecules play a fundamental role in determining chemical reactivity and selectivity. However, BDE computations at sufficiently high levels of quantum mechanical theory require substantial computing resources. In this paper, we develop a machine learning model capable of accurately predicting BDEs for organic molecules in a fraction of a second.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
December 2019
Lignocellulosic biomass offers a renewable carbon source which can be anaerobically digested to produce short-chain carboxylic acids. Here, we assess fuel properties of oxygenates accessible from catalytic upgrading of these acids a priori for their potential to serve as diesel bioblendstocks. Ethers derived from C and C carboxylic acids are identified as advantaged fuel candidates with significantly improved ignition quality (>56% cetane number increase) and reduced sooting (>86% yield sooting index reduction) when compared to commercial petrodiesel.
View Article and Find Full Text PDFModern biological tools generate a wealth of data on metabolite and protein concentrations that can be used to help inform new strain designs. However, learning from these data to predict how a cell will respond to genetic changes, a key need for engineering, remains challenging. A promising technique for leveraging omics measurements in metabolic modeling involves the construction of kinetic descriptions of the enzymatic reactions that occur within a cell.
View Article and Find Full Text PDFMachine learning methods have shown promise in predicting molecular properties, and given sufficient training data, machine learning approaches can enable rapid high-throughput virtual screening of large libraries of compounds. Graph-based neural network architectures have emerged in recent years as the most successful approach for predictions based on molecular structure and have consistently achieved the best performance on benchmark quantum chemical datasets. However, these models have typically required optimized 3D structural information for the molecule to achieve the highest accuracy.
View Article and Find Full Text PDFModern omics analyses are able to effectively characterize the genetic, regulatory, and metabolic phenotypes of engineered microbes, yet designing genetic interventions to achieve a desired phenotype remains challenging. With recent developments in genetic engineering techniques, timelines associated with building and testing strain designs have been greatly reduced, allowing for the first time an efficient closed loop iteration between experiment and analysis. However, the scale and complexity associated with multi-omics datasets complicates manual biological reasoning about the mechanisms driving phenotypic changes.
View Article and Find Full Text PDFSuccinate is a precursor of multiple commodity chemicals and bio-based succinate production is an active area of industrial bioengineering research. One of the most important microbial strains for bio-based production of succinate is the capnophilic gram-negative bacterium Actinobacillus succinogenes, which naturally produces succinate by a mixed-acid fermentative pathway. To engineer A.
View Article and Find Full Text PDF, a Gram-negative facultative anaerobe, exhibits the native capacity to convert pentose and hexose sugars to succinic acid (SA) with high yield as a tricarboxylic acid (TCA) cycle intermediate. In addition, is capnophilic, incorporating CO into SA, making this organism an ideal candidate host for conversion of lignocellulosic sugars and CO to an emerging commodity bioproduct sourced from renewable feedstocks. In this work, we report the development of facile metabolic engineering capabilities in , enabling examination of SA flux determinants via knockout of the primary competing pathways-namely, acetate and formate production-and overexpression of the key enzymes in the reductive branch of the TCA cycle leading to SA.
View Article and Find Full Text PDFBiotechnol Biofuels
January 2017
Background: Production of chemicals from engineered organisms in a batch culture involves an inherent trade-off between productivity, yield, and titer. Existing strategies for strain design typically focus on designing mutations that achieve the highest yield possible while maintaining growth viability. While these methods are computationally tractable, an optimum productivity could be achieved by a dynamic strategy in which the intracellular division of resources is permitted to change with time.
View Article and Find Full Text PDFThe production of chemicals alongside fuels will be essential to enhance the feasibility of lignocellulosic biorefineries. Succinic acid (SA), a naturally occurring C4-diacid, is a primary intermediate of the tricarboxylic acid cycle and a promising building block chemical that has received significant industrial attention. Basfia succiniciproducens is a relatively unexplored SA-producing bacterium with advantageous features such as broad substrate utilization, genetic tractability, and facultative anaerobic metabolism.
View Article and Find Full Text PDFIn the mammalian suprachiasmatic nucleus (SCN), noisy cellular oscillators communicate within a neuronal network to generate precise system-wide circadian rhythms. Although the intracellular genetic oscillator and intercellular biochemical coupling mechanisms have been examined previously, the network topology driving synchronization of the SCN has not been elucidated. This network has been particularly challenging to probe, due to its oscillatory components and slow coupling timescale.
View Article and Find Full Text PDFStochastic noise at the cellular level has been shown to play a fundamental role in circadian oscillations, influencing how groups of cells entrain to external cues and likely serving as the mechanism by which cell-autonomous rhythms are generated. Despite this importance, few studies have investigated how clock perturbations affect stochastic noise-even as increasing numbers of high-throughput screens categorize how gene knockdowns or small molecules can change clock period and amplitude. This absence is likely due to the difficulty associated with measuring cell-autonomous stochastic noise directly, which currently requires the careful collection and processing of single-cell data.
View Article and Find Full Text PDFBioluminescence rhythms from cellular reporters have become the most common method used to quantify oscillations in circadian gene expression. These experimental systems can reveal phase and amplitude change resulting from circadian disturbances, and can be used in conjunction with mathematical models to lend further insight into the mechanistic basis of clock amplitude regulation. However, bioluminescence experiments track the mean output from thousands of noisy, uncoupled oscillators, obscuring the direct effect of a given stimulus on the genetic regulatory network.
View Article and Find Full Text PDF