Modeling coding sequence design for virus-based expression in tobacco.

Synth Syst Biotechnol

Department of Biomedical Engineering, The Iby and Aladar Fleischman Faculty of Engineering, Tel Aviv, Israel.

Published: June 2025

Transient expression in tobacco is a popular way to produce recombinant proteins in plants. The design of various expression vectors, delivered into the plant by Agrobacterium, has enabled high production levels of some proteins. To further enhance expression, researchers often adapt the coding sequence of heterologous genes to the host, but this strategy has produced mixed results in tobacco. To study the effects of different sequence features on protein yield, we compile a dataset of the yields and coding sequences from previously published expression studies covering more than 200 coding sequences. We evaluate various established gene expression models on a subset of the expression studies. We find that use of tobacco codons is only moderately predictive of protein yield, as informative sequence features likely extend over multiple codons. Additionally, we show that the codon usage of organisms that use tobacco as a host for expressing their own proteins in a way similar to the synthetic system, such as viruses and agrobacteria, can be used to predict heterologous expression. Other predictive features are related to tRNA supply and demand, the inclusion of a translational ramp of codons with lower adaptation to the tRNA pool at the beginning of the coding region, and the amino acid composition of the recombinant protein. A model based on all the features achieved a correlation of 0.57 with protein yield. We believe that our study provides a practical guideline for coding sequence design for efficient expression in tobacco.
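To make two of the feature families named in the abstract more concrete, the following is a minimal Python sketch, not the authors' model. It assumes a classic codon-adaptation-style score: each codon is weighted by its frequency relative to the most frequent synonymous codon in a reference set of coding sequences (for example host, viral, or agrobacterial genes), and a sequence is scored by the geometric mean of those weights. The "ramp ratio" compares that score over the first few codons with the rest of the sequence. The function names, the pseudocount, and the default ramp length of 30 codons are illustrative assumptions, not values taken from the study.

```python
import math
from collections import Counter

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def split_codons(cds):
    """Split a coding sequence (DNA or RNA) into a list of codons."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return [cds[i:i + 3] for i in range(0, len(cds), 3)]


def relative_adaptiveness(reference_cds_list, pseudocount=0.5):
    """Weight each codon by its count relative to the most used synonym.

    Stop codons and single-codon amino acids (Met, Trp) are left out,
    since they carry no codon-choice information.
    The pseudocount (an assumption) keeps unseen codons above zero.
    """
    counts = Counter()
    for cds in reference_cds_list:
        counts.update(c for c in split_codons(cds) if c in CODON_TABLE)
    weights = {}
    for aa in set(CODON_TABLE.values()):
        synonymous = [c for c, a in CODON_TABLE.items() if a == aa]
        if aa == "*" or len(synonymous) < 2:
            continue
        best = max(counts[c] + pseudocount for c in synonymous)
        for c in synonymous:
            weights[c] = (counts[c] + pseudocount) / best
    return weights


def geometric_mean_weight(codons, weights):
    """Geometric mean of the weights of the scored codons (NaN if none)."""
    logs = [math.log(weights[c]) for c in codons if c in weights]
    return math.exp(sum(logs) / len(logs)) if logs else float("nan")


def cds_features(cds, weights, ramp_codons=30):
    """Return a whole-sequence score and a ramp-versus-body ratio.

    ramp_codons is a hypothetical window length chosen for illustration.
    A ratio below 1 means the first codons are less adapted than the rest.
    """
    codons = split_codons(cds)
    ramp = geometric_mean_weight(codons[:ramp_codons], weights)
    body = geometric_mean_weight(codons[ramp_codons:], weights)
    return {
        "adaptation": geometric_mean_weight(codons, weights),
        "ramp_ratio": ramp / body,
    }
```

Swapping the reference set, for instance from host genes to viral genes, changes only the `weights` passed to `cds_features`, which is one simple way to compare how well different codon-usage references track measured yield.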


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11718241
DOI: http://dx.doi.org/10.1016/j.synbio.2024.12.002

