Mechanistic models explicitly represent hypothesized biological knowledge. As such, they offer more generalizability than data-driven models. However, identifying model curation efforts that improve performance for mechanistic models is nontrivial. Here, we develop a solution to this problem for genome-scale metabolic models. We generate an ensemble of models, each equally consistent with experimental data, then perform simulations with them. We apply machine learning to the simulation output to identify model structure variation that maximally influences simulations. These variants are high-priority candidates for curation through removal, addition, or reannotation in the model. We apply this approach, automated metabolic model ensemble-driven elimination of uncertainty with statistical learning (AMMEDEUS), to 29 bacterial species to improve gene essentiality predictions. We explore targets for individual species and compile pan-species targets to improve the database used during model construction. AMMEDEUS is an automated and performance-driven recommendation system that complements intuition during curation of biochemical knowledgebases.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6975163 | PMC |
http://dx.doi.org/10.1016/j.cels.2019.11.006 | DOI Listing |
J Inherit Metab Dis
January 2025
Department of Pediatrics, Emma Children's Hospital, Amsterdam Gastroenterology Endocrinology Metabolism, Amsterdam UMC Location University of Amsterdam, Amsterdam, The Netherlands.
Inborn errors of metabolism (IEMs) are rare genetic conditions with significant morbidity and mortality. Technological advances have increased therapeutic options, making it challenging to remain up to date. A centralized therapy knowledgebase is needed for early diagnosis and targeted treatment.
View Article and Find Full Text PDFThe profound stability of bacterial spores makes them a promising platform for biotechnological applications like biocatalysis, bioremediation, drug delivery, etc. However, though the spore is composed of >40 proteins, only ∼12 have been explored as fusion carriers for protein display. Here, we assessed the suitability of 33 spore proteins (SPs) as enzyme display carriers by direct allele tagging at native genomic loci.
View Article and Find Full Text PDFNucleic Acids Res
January 2025
Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA.
BindingDB (bindingdb.org) is a public, web-accessible database of experimentally measured binding affinities between small molecules and proteins, which supports diverse applications including medicinal chemistry, biochemical pathway annotation, training of artificial intelligence models and computational chemistry methods development. This update reports significant growth and enhancements since our last review in 2016.
View Article and Find Full Text PDFGenetics
November 2024
Department of Genetics, Stanford University, 3165 Porter Dr, Palo Alto, CA 94304, USA.
Budding yeast (Saccharomyces cerevisiae) is the most extensively characterized eukaryotic model organism and has long been used to gain insight into the fundamentals of genetics, cellular biology, and the functions of specific genes and proteins. The Saccharomyces Genome Database (SGD) is a scientific resource that provides information about the genome and biology of S. cerevisiae.
View Article and Find Full Text PDFNat Commun
October 2024
Laboratory of Experimental Cancer Research, Department of Human Structure and Repair, Ghent University, Ghent, Belgium.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!