New protein-coding genes are well known to originate by modification of pre-existing genes, for example through duplication or horizontal transfer. In contrast, many viruses generate protein-coding genes de novo by overprinting a new reading frame onto an existing ("ancestral") frame. This mechanism is thought to play an important role in viral pathogenicity but has been poorly explored, perhaps because identifying de novo frames is very challenging; a new approach to detect them was therefore needed. We assembled a reference set of overlapping genes for which we could reliably determine the ancestral frame, and found that the codon usage of ancestral frames was significantly closer to that of the rest of the viral genome than was the codon usage of de novo frames. Based on this observation, we designed a method that identifies de novo frames from their codon usage with very good specificity but intermediate sensitivity. Using this method, we predicted that the Rex gene of deltaretroviruses originated de novo by overprinting the Tax gene. Intriguingly, several genes in the same genomic region have also originated de novo and encode proteins that regulate the functions of Tax. Such "gene nurseries" may be common in viral genomes. Finally, our results confirm that genomic GC content is not the only determinant of codon usage in viruses and suggest that a constraint linked to translation must influence codon usage.
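The core comparison described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state which similarity statistic or significance test was used, so the Pearson correlation below, the function names (`codon_frequencies`, `pearson`, `candidate_de_novo_frame`), and the toy sequences are all assumptions made for illustration. The idea is only that, within an overlap, the frame whose codon usage is less similar to the rest of the genome is the candidate de novo frame.

```python
from collections import Counter
from itertools import product
from math import sqrt

# All 64 codons in a fixed order, so frequency vectors are comparable.
CODONS = ["".join(p) for p in product("ACGT", repeat=3)]
CODON_SET = set(CODONS)


def codon_frequencies(seq, frame=0):
    """Relative frequencies of the 64 codons, reading `seq` from offset `frame`."""
    seq = seq.upper()
    triplets = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    # Triplets containing ambiguous bases (e.g. N) are skipped.
    counts = Counter(t for t in triplets if t in CODON_SET)
    total = sum(counts.values())
    return [counts[c] / total if total else 0.0 for c in CODONS]


def pearson(x, y):
    """Pearson correlation of two equal-length vectors (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def candidate_de_novo_frame(overlap, shift, reference_cds):
    """Return (frame, similarities) for an overlap read in frame 0 and frame `shift`.

    `reference_cds` stands for the in-frame, non-overlapping coding sequence
    of the rest of the genome (an assumption of this sketch). The frame with
    the lower similarity to the reference is reported as the candidate
    de novo frame; no significance test is applied here.
    """
    ref = codon_frequencies(reference_cds)
    similarity = {f: pearson(codon_frequencies(overlap, f), ref) for f in (0, shift)}
    return min(similarity, key=similarity.get), similarity


# Toy example with made-up sequences: frame 0 of the overlap uses the same
# codons as the reference, while frame 1 reads entirely different codons.
reference = "GCTGAA" * 50
overlap = "GCTGAA" * 10
frame, scores = candidate_de_novo_frame(overlap, 1, reference)
print(frame, scores)
```

A real analysis would need a statistical test on the difference between the two similarities, since the abstract reports very good specificity but only intermediate sensitivity, which implies that many overlaps yield no confident call.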

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3744397
DOI: http://dx.doi.org/10.1371/journal.pcbi.1003162
