The initial objective of this study was to shed light on the evolution of small DNA tumor viruses by analyzing assemblies of publicly available deep sequencing datasets. The survey generated a searchable database of contig snapshots representing more than 100,000 Sequence Read Archive records. Using modern structure-aware search tools, we iteratively broadened the search to include an increasingly wide range of other virus families. The analysis revealed a surprisingly diverse range of chimeras involving different virus groups. In some instances, genes resembling known DNA-replication modules or known virion protein operons were paired with unrecognizable sequences that structural predictions suggest may represent previously unknown replicases and novel virion architectures. Discrete clades of an emerging group called adintoviruses were discovered in datasets representing humans and other primates. As a proof of concept, we show that the contig database is also useful for discovering RNA viruses and candidate archaeal phages. The ancillary searches revealed additional examples of chimerization between different virus groups. The observations support a gene-centric taxonomic framework that should be useful for future virus-hunting efforts.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11071296 | PMC |
http://dx.doi.org/10.1101/2024.03.25.586562 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!