Single-nucleotide conservation state annotation of the SARS-CoV-2 genome.

Commun Biol

Bioinformatics Interdepartmental Program, University of California, Los Angeles, CA, USA.

Published: June 2021

Given the global impact and severity of COVID-19, there is a pressing need to better understand the SARS-CoV-2 genome and its mutations. Multi-strain sequence alignments of coronaviruses (CoV) provide important information for interpreting the genome and its variation. We apply a comparative genomics method, ConsHMM, to multi-strain alignments of CoV to annotate every base of the SARS-CoV-2 genome with a conservation state based on sequence alignment patterns among CoV. The learned conservation states show distinct enrichment patterns for genes, protein domains, and other regions of interest. Certain states are strongly enriched for or depleted of SARS-CoV-2 mutations, a pattern that can be used to predict potentially consequential mutations. We expect the conservation states to be a resource for interpreting the SARS-CoV-2 genome and its mutations.
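
To make the notion of a state being "enriched" for or "depleted" of mutations concrete, here is a minimal sketch of one common way to compute fold enrichment. It is not the ConsHMM implementation or the authors' analysis code. The function name `fold_enrichment`, the input layout (one state label per base, a set of 0-based mutated positions), and the definition used (the mutated fraction within a state divided by the mutated fraction genome-wide) are all assumptions made for illustration.

```python
from collections import Counter


def fold_enrichment(states, mutated_positions):
    """Hypothetical helper: per-state fold enrichment of mutations.

    states: sequence of conservation-state labels, one per base
            (index = 0-based genome position).
    mutated_positions: iterable of 0-based positions carrying a mutation.

    Returns {state: enrichment}, where enrichment is
    (mutated fraction of bases in the state) / (mutated fraction overall).
    Values above 1 suggest enrichment, values below 1 suggest depletion.
    """
    mutated = set(mutated_positions)
    n_bases = len(states)
    if n_bases == 0 or not mutated:
        raise ValueError("need at least one base and one mutated position")
    if any(p < 0 or p >= n_bases for p in mutated):
        raise ValueError("mutated position outside the annotated genome")

    bases_per_state = Counter(states)
    mutations_per_state = Counter(states[p] for p in mutated)
    background_rate = len(mutated) / n_bases

    return {
        state: (mutations_per_state[state] / n_in_state) / background_rate
        for state, n_in_state in bases_per_state.items()
    }


# Toy input (made-up data, not from the paper): 8 bases, 2 states.
toy_states = [1, 1, 1, 1, 2, 2, 2, 2]
toy_mutations = {0, 1, 2, 4}
toy_result = fold_enrichment(toy_states, toy_mutations)
```

On the toy input, state 1 holds three of the four mutations in half of the genome, so its enrichment is above 1, while state 2 falls below 1. A real analysis would also need a significance test and a choice of mutation set, neither of which this sketch addresses.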


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8175581
DOI: http://dx.doi.org/10.1038/s42003-021-02231-w

