MgCod: Gene Prediction in Phage Genomes with Multiple Genetic Codes.

J Mol Biol

Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA; School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA. Electronic address:

Published: July 2023

Massive sequencing of microbiomes has led to the discovery of a large number of phage genomes with intermittent stop codon recoding. We have developed a computational tool, MgCod, that identifies genomic regions (blocks) with distinct stop codon recoding simultaneously with the prediction of protein-coding regions. When MgCod was used to scan a large volume of human metagenomic contigs hundreds of viral contigs with intermittent stop codon recoding were revealed. Many of these contigs originated from genomes of known crAssphages. Further analyses had shown that intermittent recoding was associated with subtle patterns in the organization of protein-coding genes, such as 'single-coding' and 'dual-coding'. The dual-coding genes, clustered into blocks, could be translated by two alternative codes producing nearly identical proteins. It was observed that the dual-coded blocks were enriched with the early-stage phage genes, while the late-stage genes were residing in the single-coded blocks. MgCod can identify types of stop codon recoding in novel genomic sequences in parallel with gene prediction. It is available for download from https://github.com/gatech-genemark/MgCod.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jmb.2023.168159DOI Listing

Publication Analysis

Top Keywords

codon recoding
16
gene prediction
8
phage genomes
8
intermittent codon
8
recoding
5
mgcod
4
mgcod gene
4
prediction phage
4
genomes multiple
4
multiple genetic
4

Similar Publications

Defining serine tRNA knockout as a strategy for effective repression of gene expression in organisms with a recoded genome.

Nucleic Acids Res

January 2025

Division of Pharmacoengineering and Molecular Pharmaceutics, The University of North Carolina at Chapel Hill, 125 Mason Farm Rd. Chapel Hill, NC 27599, USA.

Whole genome codon compression-the reassignment of all instances of a specific codon to synonymous codons-can generate organisms capable of tolerating knockout of otherwise essential transfer RNAs (tRNAs). As a result, such knockout strains enable numerous unique applications, such as high-efficiency production of DNA encoding extremely toxic genes or non-canonical proteins. However, achieving stringent control over protein expression in these organisms remains challenging, particularly with proteins where incomplete repression results in deleterious phenotypes.

View Article and Find Full Text PDF

RNA viruses possess small genomes encoding a limited repertoire of essential and often multifunctional proteins. Although genetically tagging viral proteins provides a powerful tool for dissecting mechanisms of viral replication and infection, it remains a challenge. Here, we leverage genetic code expansion to develop a recoded strain of respiratory syncytial virus (RSV) in which the multifunctional nucleoprotein is site-specifically modified with a noncanonical amino acid.

View Article and Find Full Text PDF

Engineered tRNAs efficiently suppress CDKL5 premature termination codons.

Sci Rep

December 2024

Department of Medical Biotechnology and Translational Medicine, University of Milan, Segrate (Milan), 20054, Italy.

Article Synopsis
  • The CDKL5 deficiency disorder (CDD) is a severe neurodevelopmental condition with symptoms including early epilepsy, intellectual disabilities, and motor/visual dysfunction, caused by mutations in the CDKL5 gene.
  • Currently, there is no cure for CDD; treatments focus on managing seizures, though some genetic therapies show promise in addressing the disorder's root causes.
  • Recent studies using Anticodon-edited tRNAs (ACE-tRNAs) have shown potential in restoring full-length CDKL5 protein synthesis, opening up possibilities for effective new treatments for patients with nonsense mutations.
View Article and Find Full Text PDF

Engineering of the genetic code.

Curr Opin Biotechnol

December 2024

Department of Life Sciences, Ilse Katz Institute for Nanoscale Science and Technology, Ben-Gurion University of the Negev, Beer-Sheva 8410501, Israel. Electronic address:

The genetic code is a universally conserved mechanism that translates genetic information into proteins, consisting of 64 codons formed by four nucleotide bases. With a few exceptions, the genetic code universally encodes 20 canonical amino acids (AAs) and three stop signals, with many AAs represented by multiple codons. Genetic engineering has expanded this system through approaches like codon reassignment and synthetic base pair introduction, allowing for the incorporation of noncanonical AAs (ncAAs) into proteins, known as genetic code expansion (GCE).

View Article and Find Full Text PDF

The genetic code is not universal.

Biosystems

January 2025

The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy. Electronic address:

Recently, a new genetic code with 62 sense codons, coding for 21 amino acids, and only 2 termination codons has been identified in archaea. The authors argue that the appearance of this variant of the genetic code is due to the relatively recent and complete recoding of all UAG stop codons to codons encoding for pyrrolysine. I re-evaluate this discovery by presenting arguments that favour the early, i.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!