The majority of bacteriophage diversity remains uncharacterised, and new intriguing mechanisms of their biology are being continually described. Members of some phage lineages, such as the , repurpose stop codons to encode an amino acid by using alternate genetic codes. Here, we investigated the prevalence of stop codon reassignment in phage genomes and subsequent impacts on functional annotation. We predicted 76 genomes within INPHARED and 712 vOTUs from the Unified Human Gut Virome catalogue (UHGV) that repurpose a stop codon to encode an amino acid. We re-annotated these sequences with modified versions of Pharokka and Prokka, called Pharokka-gv and Prokka-gv, to automatically predict stop codon reassignment prior to annotation. Both tools significantly improved the quality of annotations, with Pharokka-gv performing best. For sequences predicted to repurpose TAG to glutamine (translation table 15), Pharokka-gv increased the median gene length (median of per genome medians) from 287 to 481 bp for UHGV sequences (67.8% increase) and from 318 to 550 bp for INPHARED sequences (72.9% increase). The re-annotation increased mean coding density from 66.8% to 90.0%, and from 69.0% to 89.8% for UHGV and INPHARED sequences. Furthermore, the proportion of genes that could be assigned functional annotation increased, including an increase in the number of major capsid proteins that could be identified. We propose that automatic prediction of stop codon reassignment before annotation is beneficial to downstream viral genomic and metagenomic analyses.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10769273 | PMC |
http://dx.doi.org/10.1101/2023.12.19.572299 | DOI Listing |
Nat Struct Mol Biol
January 2025
Laboratory of Regulation of Gene Expression, Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic.
Transfer RNAs (tRNAs) serve as a dictionary for the ribosome translating the genetic message from mRNA into a polypeptide chain. In addition to this canonical role, tRNAs are involved in other processes such as programmed stop codon readthrough (SC-RT). There, tRNAs with near-cognate anticodons to stop codons must outcompete release factors and incorporate into the ribosomal decoding center to prevent termination and allow translation to continue.
View Article and Find Full Text PDFNucleic Acids Res
January 2025
Division of Pharmacoengineering and Molecular Pharmaceutics, The University of North Carolina at Chapel Hill, 125 Mason Farm Rd. Chapel Hill, NC 27599, USA.
Whole genome codon compression-the reassignment of all instances of a specific codon to synonymous codons-can generate organisms capable of tolerating knockout of otherwise essential transfer RNAs (tRNAs). As a result, such knockout strains enable numerous unique applications, such as high-efficiency production of DNA encoding extremely toxic genes or non-canonical proteins. However, achieving stringent control over protein expression in these organisms remains challenging, particularly with proteins where incomplete repression results in deleterious phenotypes.
View Article and Find Full Text PDFImmunity
January 2025
Division of Oncogenomics, Oncode institute, the Netherlands Cancer Institute, Amsterdam, the Netherlands; Erasmus MC, Department of Genetics, Rotterdam University, Rotterdam, the Netherlands. Electronic address:
Prolonged exposure to interferon-gamma (IFNγ) and the associated increased expression of the enzyme indoleamine 2,3-dioxygenase 1 (IDO1) create an intracellular shortage of tryptophan in the cancer cells, which stimulates ribosomal frameshifting and tryptophan to phenylalanine (W>F) codon reassignments during protein synthesis. Here, we investigated whether such neoepitopes can be useful targets of adoptive T cell therapy. Immunopeptidomic analyses uncovered hundreds of W>F neoepitopes mainly presented by the HLA-A24:02 allele.
View Article and Find Full Text PDFCurr Opin Biotechnol
December 2024
Department of Life Sciences, Ilse Katz Institute for Nanoscale Science and Technology, Ben-Gurion University of the Negev, Beer-Sheva 8410501, Israel. Electronic address:
The genetic code is a universally conserved mechanism that translates genetic information into proteins, consisting of 64 codons formed by four nucleotide bases. With a few exceptions, the genetic code universally encodes 20 canonical amino acids (AAs) and three stop signals, with many AAs represented by multiple codons. Genetic engineering has expanded this system through approaches like codon reassignment and synthetic base pair introduction, allowing for the incorporation of noncanonical AAs (ncAAs) into proteins, known as genetic code expansion (GCE).
View Article and Find Full Text PDFPLoS Genet
December 2024
Earlham Institute, Norwich Research Park, Norwich, United Kingdom.
The translation of nucleotide sequences into amino acid sequences, governed by the genetic code, is one of the most conserved features of molecular biology. The standard genetic code, which uses 61 sense codons to encode one of the 20 standard amino acids and 3 stop codons (UAA, UAG, and UGA) to terminate translation, is used by most extant organisms. The protistan phylum Ciliophora (the 'ciliates') are the most prominent exception to this norm, exhibiting the grfeatest diversity of nuclear genetic code variants and evidence of repeated changes in the code.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!