Fake IDs? Widespread misannotation of DNA transposons as a general transcription factor.

Genome Biol

School of Biological Sciences, University of Adelaide, North Terrace, Adelaide, South Australia, 5005, Australia.

Published: November 2023

Accurate annotation of genes and transposable elements (TEs) is vital for understanding genomes, but current annotation pipelines often misannotate TEs as genes. This study reveals how the general transcription factor II-I repeat domain-containing protein 2 (GTF2IRD2) erroneously annotated DNA transposons in non-mammalian species, as it contains a 3' fused hAT transposase domain. We also demonstrate the generality of this problem by identifying misannotated TEs as genes in other vertebrate genomes. Such misannotations can lead to errors in phylogenetic analyses and wasted time for investigators. The study proposes adding a final TE-check to gene annotation pipelines to mitigate this problem.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10641963PMC
http://dx.doi.org/10.1186/s13059-023-03102-9DOI Listing

Publication Analysis

Top Keywords

dna transposons
8
general transcription
8
transcription factor
8
annotation pipelines
8
tes genes
8
fake ids?
4
ids? widespread
4
widespread misannotation
4
misannotation dna
4
transposons general
4

Similar Publications

Cytidine analogs in plant epigenetic research and beyond.

J Exp Bot

December 2024

Centre of Plant Structural and Functional Genomics, Institute of Experimental Botany, Czech Acad Sci, Šlechtitelů 31, Olomouc 77900, Czech Republic.

Cytosine (DNA) methylation plays important roles in silencing transposable elements, plant development, genomic imprinting, stress responses, and maintenance of genome stability. To better understand the functions of this epigenetic modification, several tools have been developed to manipulate DNA methylation levels. These include mutants of DNA methylation writers and readers, targeted manipulation of locus-specific methylation, and the use of chemical inhibitors.

View Article and Find Full Text PDF

The beta-rhizobial strain Paraburkholderia phymatum STM815 is noteworthy for its wide host range in nodulating legumes, primarily mimosoids (over 50 different species) but also some papilionoids. It cannot, however, nodulate soybean (Glycine max [L.] Merr.

View Article and Find Full Text PDF

The genome of the solitary bee Tetrapedia diversipes (Hymenoptera, Apidae).

G3 (Bethesda)

December 2024

Departamento de Genética e Biologia Evolutiva, Instituto de Biociências, Universidade de São Paulo, Rua do Matão, 277, CEP 05508-090, São Paulo, SP, Brazil.

Tetrapedia diversipes is a Neotropical solitary bee commonly found in trap-nests, known for its morphological adaptations for floral oil collection and prepupal diapause during the cold and dry season. Here, we present the genome assembly of T. diversipes (332 Mbp), comprising 2,575 scaffolds, with 15,028 predicted protein-coding genes.

View Article and Find Full Text PDF

Nanopore sequencing enables detection of DNA methylation at the same time as identification of canonical sequence. A recent study validated low-pass nanopore sequencing to accurately estimate global methylation levels in vertebrates with sequencing coverage as low as 0.01x.

View Article and Find Full Text PDF

Unlabelled: strain E264 ( E264) and close relatives stochastically duplicate a 208.6 kb region of chromosome I via RecA-dependent recombination between two nearly identical insertion sequence elements. Because homologous recombination occurs at a constant, low level, populations of E264 are always heterogeneous, but cells containing two or more copies of the region (Dup+) have an advantage, and hence predominate, during biofilm growth, while those with a single copy (Dup-) are favored during planktonic growth.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!