Engineering microorganisms into biological factories that convert renewable feedstocks into valuable materials is a major goal of synthetic biology; however, for many nonmodel organisms, we do not yet have the genetic tools, such as suites of strong promoters, necessary to effectively engineer them. In this work, we developed a computational framework that can leverage standard RNA-seq data sets to identify sets of constitutive, strongly expressed genes and predict strong promoter signals within their upstream regions. The framework was applied to a diverse collection of RNA-seq data measured for the methanotroph 5GB1 and identified 25 genes that were constitutively, strongly expressed across 12 experimental conditions. For each gene, the framework predicted short (27-30 nucleotide) sequences as candidate promoters and derived -35 and -10 consensus promoter motifs (TTGACA and TATAAT, respectively) for strong expression in . This consensus closely matches the canonical sigma-70 motif and was found to be enriched in promoter regions of the genome. A subset of promoter predictions was experimentally validated in a XylE reporter assay, including the consensus promoter, which showed high expression. The , , and promoter predictions were additionally screened in an experiment that scrambled the -35 and -10 signal sequences, confirming that transcription initiation was disrupted when these specific regions of the predicted sequence were altered. These results indicate that the computational framework can make biologically meaningful promoter predictions and identify key pieces of regulatory systems that can serve as foundational tools for engineering diverse microorganisms for biomolecule production.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acssynbio.1c00017DOI Listing

Publication Analysis

Top Keywords

computational framework
12
rna-seq data
12
promoter predictions
12
promoter
8
nonmodel organisms
8
data sets
8
-35 -10
8
consensus promoter
8
framework identifying
4
identifying promoter
4

Similar Publications

Background: X-ray grating-based dark-field imaging can sense the small angle scattering caused by object's micro-structures. This technique is sensitive to the porous microstructure of lung alveoli and has the potential to detect lung diseases at an early stage. Up to now, a human-scale dark-field CT (DF-CT) prototype has been built for lung imaging.

View Article and Find Full Text PDF

We introduce EmoAtlas, a computational library/framework extracting emotions and syntactic/semantic word associations from texts. EmoAtlas combines interpretable artificial intelligence (AI) for syntactic parsing in 18 languages and psychologically validated lexicons for detecting the eight emotions in Plutchik's theory. We show that EmoAtlas can match or surpass transformer-based natural language processing techniques, BERT or large language models like ChatGPT 3.

View Article and Find Full Text PDF

The construction industry is generally characterized by high emissions, making its transition to low-carbon practices essential for achieving a low-carbon economy. However, due to information asymmetry, there remains a gap in research regarding the strategic interactions and reward/punishment mechanisms between governments and firms throughout this transition. This paper addresses this gap by investigating probabilistic and static reward and punishment evolutionary games.

View Article and Find Full Text PDF

Service transformation plays a pivotal role in achieving the sustainable development of the sports industry. This study originates from the interactive relationships among sports enterprises, consumers, and regulatory authorities, proposing a logical framework for the service transformation of the sports industry. Furthermore, a three-party evolutionary game model is constructed to explore the strategic evolution and stability conditions under both single-agent and multi-agent scenarios.

View Article and Find Full Text PDF

Reservoir computing is a machine learning method that is well-suited for complex time series prediction tasks. Both delay embedding and the projection of input data into a higher-dimensional space play important roles in enabling accurate predictions. We establish simple post-processing methods that train on past node states at uniformly or randomly-delayed timeshifts.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!