CoreTracker: accurate codon reassignment prediction, applied to mitochondrial genomes.

Bioinformatics

Département d'Informatique et de Recherche Opérationnelle (DIRO), Université de Montréal, Montréal, QC CP 6128, Canada.

Published: November 2017

Motivation: Codon reassignments have been reported across all domains of life. With the increasing number of sequenced genomes, the development of systematic approaches for genetic code detection is essential for accurate downstream analyses. Three automated prediction tools exist so far: FACIL, GenDecoder and Bagheera; the last two respectively restricted to metazoan mitochondrial genomes and CUG reassignments in yeast nuclear genomes. These tools can only analyze a single genome at a time and are often not followed by a validation procedure, resulting in a high rate of false positives.

Results: We present CoreTracker, a new algorithm for the inference of sense-to-sense codon reassignments. CoreTracker identifies potential codon reassignments in a set of related genomes, then uses statistical evaluations and a random forest classifier to predict those that are the most likely to be correct. Predicted reassignments are then validated through a phylogeny-aware step that evaluates the impact of the new genetic code on the protein alignment. Handling a set of genomes simultaneously in a phylogenetic framework makes it possible to trace back the evolution of each reassignment, which provides information on its underlying mechanism. Applied to metazoan and yeast genomes, CoreTracker significantly outperforms existing methods on both precision and sensitivity.
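
The first step described above, spotting candidate reassignments across a set of related genomes, can be illustrated with a minimal sketch. This is not CoreTracker's implementation: the function name, the thresholds, the toy input and the five-codon excerpt of the standard code are all assumptions made for illustration, and the statistical tests, random forest classifier and phylogeny-aware validation are left out. The sketch assumes gap-free, codon-aligned sequences from at least two genomes, and it assumes the other genomes use the standard code. A codon is flagged in one genome when it repeatedly sits in columns where all other genomes agree on an amino acid different from the codon's standard translation.

```python
from collections import Counter, defaultdict

# Excerpt of the standard genetic code (assumption: enough for the toy data;
# a real analysis needs all 64 codons).
STANDARD_CODE = {"CTG": "L", "TCT": "S", "ATG": "M", "AAA": "K", "GGT": "G"}


def candidate_reassignments(codon_seqs, min_support=3, min_fraction=0.8):
    """Flag codons whose aligned context disagrees with the standard code.

    codon_seqs: {genome: list of codons}, aligned, gap-free, equal length,
    with at least two genomes.
    Returns {(genome, codon): (standard_aa, observed_aa, support)}.
    min_support and min_fraction are hypothetical thresholds.
    """
    genomes = list(codon_seqs)
    length = len(next(iter(codon_seqs.values())))
    # observed[(genome, codon)] counts the amino acid shared by all OTHER genomes
    observed = defaultdict(Counter)
    for col in range(length):
        for g in genomes:
            codon = codon_seqs[g][col]
            others = Counter(
                STANDARD_CODE[codon_seqs[o][col]] for o in genomes if o != g
            )
            aa, count = others.most_common(1)[0]
            # Use only columns where the other genomes agree unanimously.
            if count == len(genomes) - 1:
                observed[(g, codon)][aa] += 1

    candidates = {}
    for (g, codon), counts in observed.items():
        aa, support = counts.most_common(1)[0]
        total = sum(counts.values())
        standard_aa = STANDARD_CODE[codon]
        if aa != standard_aa and support >= min_support and support / total >= min_fraction:
            candidates[(g, codon)] = (standard_aa, aa, support)
    return candidates


# Toy input (fabricated for illustration only): genome "gx" uses CTG
# wherever the three other genomes have a serine codon.
example = {
    "g1": ["ATG", "TCT", "AAA", "TCT", "GGT", "TCT"],
    "g2": ["ATG", "TCT", "AAA", "TCT", "GGT", "TCT"],
    "g3": ["ATG", "TCT", "AAA", "TCT", "GGT", "TCT"],
    "gx": ["ATG", "CTG", "AAA", "CTG", "GGT", "CTG"],
}
print(candidate_reassignments(example))
# {('gx', 'CTG'): ('L', 'S', 3)}
```

A raw count like this produces false positives on its own, for example at fast-evolving sites, which is why the abstract describes follow-up statistical evaluation, classification and validation steps.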

Availability And Implementation: CoreTracker is written in Python and available at https://github.com/UdeM-LBIT/CoreTracker.

Contact: mabrouk@iro.umontreal.ca.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Source
http://dx.doi.org/10.1093/bioinformatics/btx421

Similar Publications

Transfer RNAs (tRNAs) serve as a dictionary for the ribosome translating the genetic message from mRNA into a polypeptide chain. In addition to this canonical role, tRNAs are involved in other processes such as programmed stop codon readthrough (SC-RT). There, tRNAs with near-cognate anticodons to stop codons must outcompete release factors and incorporate into the ribosomal decoding center to prevent termination and allow translation to continue.

Defining serine tRNA knockout as a strategy for effective repression of gene expression in organisms with a recoded genome.

Nucleic Acids Res

January 2025

Division of Pharmacoengineering and Molecular Pharmaceutics, The University of North Carolina at Chapel Hill, 125 Mason Farm Rd. Chapel Hill, NC 27599, USA.

Whole genome codon compression (the reassignment of all instances of a specific codon to synonymous codons) can generate organisms capable of tolerating knockout of otherwise essential transfer RNAs (tRNAs). As a result, such knockout strains enable numerous unique applications, such as high-efficiency production of DNA encoding extremely toxic genes or non-canonical proteins. However, achieving stringent control over protein expression in these organisms remains challenging, particularly with proteins where incomplete repression results in deleterious phenotypes.

Adoptive T cell therapy targeting an inducible and broadly shared product of aberrant mRNA translation.

Immunity

January 2025

Division of Oncogenomics, Oncode institute, the Netherlands Cancer Institute, Amsterdam, the Netherlands; Erasmus MC, Department of Genetics, Rotterdam University, Rotterdam, the Netherlands.

Prolonged exposure to interferon-gamma (IFNγ) and the associated increased expression of the enzyme indoleamine 2,3-dioxygenase 1 (IDO1) create an intracellular shortage of tryptophan in the cancer cells, which stimulates ribosomal frameshifting and tryptophan to phenylalanine (W>F) codon reassignments during protein synthesis. Here, we investigated whether such neoepitopes can be useful targets of adoptive T cell therapy. Immunopeptidomic analyses uncovered hundreds of W>F neoepitopes mainly presented by the HLA-A*24:02 allele.

Engineering of the genetic code.

Curr Opin Biotechnol

December 2024

Department of Life Sciences, Ilse Katz Institute for Nanoscale Science and Technology, Ben-Gurion University of the Negev, Beer-Sheva 8410501, Israel.

The genetic code is a universally conserved mechanism that translates genetic information into proteins, consisting of 64 codons formed by four nucleotide bases. With a few exceptions, the genetic code universally encodes 20 canonical amino acids (AAs) and three stop signals, with many AAs represented by multiple codons. Genetic engineering has expanded this system through approaches like codon reassignment and synthetic base pair introduction, allowing for the incorporation of noncanonical AAs (ncAAs) into proteins, known as genetic code expansion (GCE).

The translation of nucleotide sequences into amino acid sequences, governed by the genetic code, is one of the most conserved features of molecular biology. The standard genetic code, which uses 61 sense codons to encode one of the 20 standard amino acids and 3 stop codons (UAA, UAG, and UGA) to terminate translation, is used by most extant organisms. The protistan phylum Ciliophora (the 'ciliates') is the most prominent exception to this norm, exhibiting the greatest diversity of nuclear genetic code variants and evidence of repeated changes in the code.
