VISTA: A Tool for Fast Taxonomic Assignment of Viral Genome Sequences.

Genomics Proteomics Bioinformatics

National Genomics Data Center, China National Center for Bioinformation, Beijing 100101, China.

Published: November 2024

The rapid expansion of the number of viral genome sequences in public databases necessitates a scalable, universal, and automated preliminary taxonomic framework for comprehensive virus studies. Here, we introduce VISTA (Virus Sequence-based Taxonomy Assignment), a computational tool that employs a novel pairwise sequence comparison system and an automatic demarcation threshold identification framework for virus taxonomy. Leveraging physio-chemical property sequences, k-mer profiles, and machine learning techniques, VISTA constructs a robust distance-based framework for taxonomic assignment. Functionally similar to PASC (Pairwise Sequence Comparison), a widely used virus assignment tool based on pairwise sequence comparison, VISTA demonstrates superior performance by providing significantly improved separation for taxonomic groups, more objective taxonomic demarcation thresholds, greatly enhanced speed, and a wider application scope. We successfully applied VISTA to 38 virus families, as well as to the class Caudoviricetes. This demonstrates VISTA's scalability, robustness, and ability to automatically and accurately assign taxonomy to both prokaryotic and eukaryotic viruses. Furthermore, the application of VISTA to 679 unclassified prokaryotic virus genomes recovered from metagenomic data identified 46 novel virus families. VISTA is available as both a command line tool and a user-friendly web portal at https://ngdc.cncb.ac.cn/vista.

Download full-text PDF

Source
http://dx.doi.org/10.1093/gpbjnl/qzae082DOI Listing

Publication Analysis

Top Keywords

pairwise sequence
12
sequence comparison
12
taxonomic assignment
8
viral genome
8
genome sequences
8
vista virus
8
virus families
8
vista
7
virus
7
taxonomic
5

Similar Publications

Genetic Mechanism Analysis Related to Cold Tolerance of Red Swamp Crayfish, Procambarus clarkii.

Mar Biotechnol (NY)

January 2025

Key Laboratory of Efficient Utilization of Non-grain Feed Resources (Co-construction by Ministry and Province) of Ministry of Agriculture and Rural Affairs, Shandong Agricultural University, Taian, Shandong, China.

In China, the red swamp crayfish (Procambarus clarkii), a notorious invasive species, has become an important economic freshwater species. In order to compare the genetic diversity and population structure of crayfish from northern and southern China, we collected 60 crayfish individuals from 4 crayfish populations in northern China and 2 populations in southern China for sequencing using the 2b-RAD technique. Additionally, the whole genome sequence information obtained by 2b-RAD of 90 individuals from 2 populations in northern China and 7 populations in southern China were downloaded from NCBI.

View Article and Find Full Text PDF

Genotypic and phenotypic diversity of Mycobacterium tuberculosis strains from eastern India.

Infect Genet Evol

January 2025

Immunogenomics & Systems Biology group, Institute of Life Sciences (ILS), Bhubaneswar, Odisha, India; School of Biotechnology, Kalinga Institute of Industrial Technology (KIIT), Bhubaneswar, Odisha, India. Electronic address:

Whole genome sequencing has been used to investigate the genomic diversity of M. tuberculosis in the northern and southern states of India, but information about the eastern part of the country is still limited. Through a sequencing-based strategy, this study seeks to comprehend the diversity and drug resistance pattern in the eastern region.

View Article and Find Full Text PDF

is a complex species incorporating a great variety of vegetable types, including cabbage, cauliflower, broccoli, kale, and others. Southern Italy, and especially the Puglia region, is rich in landraces. In this study, genotyping-by-sequencing (GBS) was applied to a germplasm panel of 82 samples, mostly landraces and some commercial varieties, belonging to various morphotypes of .

View Article and Find Full Text PDF

With their diverse species, mosquitoes are known to transmit the causal agents of diseases such as malaria, dengue, and yellow fever. Their high adaptability, attraction to humans, and variable adult behaviors make them a significant health concern. The focus on Aedes aegypti is significant for reducing vector-human contacts, monitoring insecticide resistance, and developing innovative vector management strategies.

View Article and Find Full Text PDF

Purpose: Henneguya sp. is a crucial myxosporean parasite known to cause milky flesh or tapioca disease in the freshwater fish population, leading to heavy mortality. Studies to investigate its host range and to monitor their prevalence in wild and aquacultured fish are necessary.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!