Alignment-free sequence comparison for virus genomes based on location correlation coefficient.

Infect Genet Evol

School of Life Sciences, Tsinghua University, Beijing 100084, PR China. Electronic address:

Published: December 2021

Coronaviruses (especially SARS-CoV-2) are characterized by rapid mutation and wide spread. As these characteristics easily lead to global pandemics, studying the evolutionary relationship between viruses is essential for clinical diagnosis. DNA sequencing has played an important role in evolutionary analysis. Recent alignment-free methods can overcome the problems of traditional alignment-based methods, which consume both time and space. This paper proposes a novel alignment-free method called the correlation coefficient feature vector (CCFV), which defines a correlation measure of the L-step delay of a nucleotide location from its location in the original DNA sequence. The numerical feature is a 16×L-dimensional numerical vector describing the distribution characteristics of the nucleotide positions in a DNA sequence. The proposed L-step delay correlation measure is interestingly related to some types of L+1 spaced mers. Unlike traditional gene comparison, our method avoids the computational complexity of multiple sequence alignment, and hence improves the speed of sequence comparison. Our method is applied to evolutionary analysis of the common human viruses including SARS-CoV-2, Dengue virus, Hepatitis B virus, and human rhinovirus and achieves the same or even better results than alignment-based methods. Especially for SARS-CoV-2, our method also confirms that bats are potential intermediate hosts of SARS-CoV-2.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8493760PMC
http://dx.doi.org/10.1016/j.meegid.2021.105106DOI Listing

Publication Analysis

Top Keywords

sequence comparison
8
correlation coefficient
8
evolutionary analysis
8
alignment-based methods
8
correlation measure
8
l-step delay
8
dna sequence
8
comparison method
8
alignment-free sequence
4
comparison virus
4

Similar Publications

This research evaluated the effectiveness of an online simulation-based serious game as a learning tool in diagnosis and treatment planning for oral lesions (SimOL) in comparison to a pre-recorded lecture-based approach and to determine its appropriate integration into the undergraduate dental curriculum. A crossover randomized control trial was conducted with a cohort of 77 dental undergraduates. They were randomly assigned into two groups.

View Article and Find Full Text PDF

Social media generates vast amounts of spatio-temporal sequential data. However, current methods often ignore the complex spatio-temporal correlations within these data. This oversight makes it difficult to fully capture the dynamic features of the data.

View Article and Find Full Text PDF

The WRINKLED1 (WRI1) transcription factor controls carbon flow in plants through regulating the expression of glycolysis and fatty acid biosynthesis genes. The role of Gossypium hirsutum WRINKLED1 (GhWRI1) in seed-oil accumulation still needs to be explored. Multiple sequence alignment of WRI1 proteins confirmed the presence of two conserved AP2 domains.

View Article and Find Full Text PDF

Customer churn prediction model based on hybrid neural networks.

Sci Rep

December 2024

College of Computer Science and Engineering, Guangxi Normal University, Guilin, 541000, China.

In today's competitive market environment, accurately identifying potential churn customers and taking effective retention measures are crucial for improving customer retention and ensuring the sustainable development of an organization. However, traditional machine learning algorithms and single deep learning models have limitations in extracting complex nonlinear and time-series features, resulting in unsatisfactory prediction results. To address this problem, this study proposes a hybrid neural network-based customer churn prediction model, CCP-Net.

View Article and Find Full Text PDF

Purpose: To quantitatively and qualitatively compare the magnitude of metal total hip arthroplasty-induced imaging artifacts in vivo between 1.5T and 0.55T MRI.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!