Many of the sequenced bacterial and archaeal genomes encode regions of viral provenance. Yet, not all of these regions encode bona fide viruses. Gene transfer agents (GTAs) are thought to be former viruses that are now maintained in genomes of some bacteria and archaea and are hypothesized to enable exchange of DNA within bacterial populations. In Alphaproteobacteria, genes homologous to the "head-tail" gene cluster that encodes structural components of the Rhodobacter capsulatus GTA (RcGTA) are found in many taxa, even if they are only distantly related to Rhodobacter capsulatus. Yet, in most genomes available in GenBank RcGTA-like genes have annotations of typical viral proteins, and therefore are not easily distinguished from their viral homologs without additional analyses. Here, we report a "support vector machine" classifier that quickly and accurately distinguishes RcGTA-like genes from their viral homologs by capturing the differences in the amino acid composition of the encoded proteins. Our open-source classifier is implemented in Python and can be used to scan homologs of the RcGTA genes in newly sequenced genomes. The classifier can also be trained to identify other types of GTAs, or even to detect other elements of viral ancestry. Using the classifier trained on a manually curated set of homologous viruses and GTAs, we detected RcGTA-like "head-tail" gene clusters in 57.5% of the 1,423 examined alphaproteobacterial genomes. We also demonstrated that more than half of the in silico prophage predictions are instead likely to be GTAs, suggesting that in many alphaproteobacterial genomes the RcGTA-like elements remain unrecognized.
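
The abstract says the classifier captures differences in the amino acid composition of the encoded proteins. As a rough illustration of what such an input could look like, the sketch below turns a protein sequence into a 20-element vector of relative amino acid frequencies. This is an assumption made for illustration only: the abstract does not specify the exact features used, and the function name `aa_composition` and the toy sequence are hypothetical, not taken from the published software.

```python
from collections import Counter
from typing import List

# The 20 standard amino acids, in a fixed order so that vectors
# computed for different proteins are directly comparable.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def aa_composition(protein: str) -> List[float]:
    """Return the relative frequency of each standard amino acid.

    Hypothetical helper: characters outside the 20 standard residues
    (for example X or *) are ignored when counting.
    """
    counts = Counter(ch for ch in protein.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


if __name__ == "__main__":
    toy_sequence = "MKKLAAGG"  # made-up sequence, not a real RcGTA protein
    vector = aa_composition(toy_sequence)
    for aa, freq in zip(AMINO_ACIDS, vector):
        if freq > 0:
            print(aa, round(freq, 3))
```

Vectors of this kind, computed for curated GTA and viral homologs, are the sort of fixed-length numeric input that a support vector machine implementation could be trained on.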


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6821227
DOI: http://dx.doi.org/10.1093/gbe/evz206


