Nucleotide and dinucleotide preference of segmented viruses are shaped more by segment: In case study of tomato spotted wilt virus.

Infect Genet Evol

College of Plant Protection, Yangzhou University, Yangzhou 225009, China; Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education of China, Yangzhou University, Yangzhou 225009, China.

Published: August 2024

AI Article Synopsis

  • Several studies suggest that the nucleotide composition of viruses may be linked to their host species or protein coding regions, but the role of viral segments remains unclear.
  • In the analysis of the tomato spotted wilt virus (TSWV), researchers found a consistent over-representation of adenine (A) in the first two codon positions across all viral segments, while the third codon position varied.
  • Dinucleotide preferences were identified, with specific dinucleotides overrepresented or underrepresented across the virus’s genomic sequences and segments; this indicates that, for TSWV, nucleotide composition is influenced more by viral segment and protein coding region than by host species.
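
The codon-position observation in the synopsis can be illustrated with a minimal sketch. It assumes the input is an in-frame coding sequence given as a plain string; the function name `codon_position_composition` is hypothetical and is not taken from the study, whose actual pipeline is not described on this page.

```python
from collections import Counter

BASES = "ACGU"

def codon_position_composition(cds: str) -> list[dict[str, float]]:
    """Return nucleotide frequencies at codon positions 1, 2 and 3.

    Assumption: `cds` starts in frame. DNA input is converted to RNA
    notation (T -> U) so results match the A/C/G/U labels in the text.
    """
    seq = cds.upper().replace("T", "U")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    composition = []
    for pos in range(3):
        # every third base starting at this codon position; skip ambiguous symbols
        bases = [b for b in seq[pos:usable:3] if b in BASES]
        counts = Counter(bases)
        total = len(bases)
        composition.append(
            {b: (counts[b] / total if total else 0.0) for b in BASES}
        )
    return composition

if __name__ == "__main__":
    # toy sequence for illustration only, not TSWV data
    for i, freqs in enumerate(codon_position_composition("ATGAAAGCT"), start=1):
        print(f"codon position {i}: {freqs}")
```

Running the same function per segment or per coding region and comparing the three frequency tables is one way to see whether A is consistently favoured at positions 1 and 2.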

Article Abstract

Several studies have shown that the nucleotide and dinucleotide composition of viruses may follow that of their host species or depend on the protein coding region. Nevertheless, the influence of the viral segment on viral nucleotide and dinucleotide composition is still unknown. Here, we explored this question using tomato spotted wilt virus (TSWV), a segmented virus that seriously threatens tomato production worldwide. Through nucleotide composition analysis, we found the same over-representation of A across all viral segments at the first and second codon positions, but the pattern differed among segments at the third codon position. Interestingly, protein coding regions, whether encoded by the same segment or by different segments, exhibited clearly distinct nucleotide preferences. We then found that the dinucleotides UpG and CpU were overrepresented and the dinucleotides UpA, CpG and GpU were underrepresented, not only in the complete genomic sequences but also across segments, protein coding regions and host species. Notably, 100% of the data investigated here were correctly predicted for viral segment and protein coding region, whereas only 67% were correctly predicted for viral host species. In conclusion, in this case study of TSWV, the nucleotide composition and dinucleotide preference of a segmented virus depend more strongly on segment and protein coding region than on host species. This research provides a novel perspective on the molecular evolutionary mechanisms of TSWV and a reference for future research on the genetic diversity of segmented viruses.
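
Over- and under-representation of a dinucleotide is commonly quantified as an odds ratio: the observed dinucleotide frequency divided by the product of the two single-nucleotide frequencies. The abstract does not state which statistic the authors used, so the following is only a sketch of that common approach; the function name `dinucleotide_odds_ratios` is hypothetical.

```python
from collections import Counter
from itertools import product

BASES = "ACGU"

def dinucleotide_odds_ratios(sequence: str) -> dict[str, float]:
    """Return observed/expected ratios for all 16 dinucleotides.

    Assumption: expected frequency of XpY is f(X) * f(Y), computed from
    the same sequence. Values above 1 suggest over-representation and
    values below 1 suggest under-representation.
    """
    seq = sequence.upper().replace("T", "U")
    mono = Counter(b for b in seq if b in BASES)
    # overlapping pairs; pairs containing ambiguous symbols are skipped
    pairs = Counter(
        seq[i:i + 2]
        for i in range(len(seq) - 1)
        if seq[i] in BASES and seq[i + 1] in BASES
    )
    n_mono = sum(mono.values())
    n_pairs = sum(pairs.values())
    ratios = {}
    for x, y in product(BASES, repeat=2):
        expected = (mono[x] / n_mono) * (mono[y] / n_mono) if n_mono else 0.0
        observed = pairs[x + y] / n_pairs if n_pairs else 0.0
        # undefined when a base is absent from the sequence
        ratios[f"{x}p{y}"] = observed / expected if expected else float("nan")
    return ratios

if __name__ == "__main__":
    # toy sequence for illustration only, not TSWV data
    for name, value in sorted(dinucleotide_odds_ratios("AUGGCUAUGCUGUGA").items()):
        print(name, round(value, 2))
```

Computing these ratios separately for each segment, each coding region and each host group would give the kind of comparison the abstract summarizes.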

Source
http://dx.doi.org/10.1016/j.meegid.2024.105608

Publication Analysis

Top Keywords

  • protein coding: 20
  • host species: 16
  • nucleotide dinucleotide: 12
  • coding region: 12
  • dinucleotide preference: 8
  • segmented viruses: 8
  • case study: 8
  • tomato spotted: 8
  • spotted wilt: 8
  • wilt virus: 8

Similar Publications

Thunb. (1784) is primarily distributed in eastern Asia, has a total length of 152,778 bp and consists of a large single copy (LSC) region of 84,517 bp, a small single copy (SSC) region of 18,277 bp, and two inverted repeat (IR) regions of 24,992 bp. The GC content is 37.

The complete mitochondrial genome of the was sequenced by the Sanger platform. The circular mitogenome of (16,512 bp) encoded the typical 37 genes and one non-coding region. All of the protein-encoding genes were located on the H chain except ND6.

(Cucurbitaceae) is an endemic species native to the Shennongjia forestry district of China, whose plastid genome was reported in this study. The whole genome exhibits the typical quadripartite structure with 156,906 bp in size. A total of 130 genes were identified, containing 85 protein-coding genes (CDS), 37 tRNA, and 8 rRNA genes.

(Compositae) is a perennial herbaceous plant with high economic, forage and medicinal value. It is widely distributed in desertified and saline-alkali areas. The complete chloroplast genome is reported for the first time in this study.
