On the Verge of Life: Distribution of Nucleotide Sequences in Viral RNAs.

Biosemiotics

Department for Theoretical Physics, Ivan Franko National University of Lviv, 12 Drahomanov St, UA-79005 Lviv, Ukraine.

Published: February 2021

The aim of the study is to analyze viruses using parameters obtained from distributions of nucleotide sequences in the viral RNA. Seeking for the input data homogeneity, we analyze single-stranded RNA viruses only. Two approaches are used to obtain the nucleotide sequences; In the first one, chunks of equal length (four nucleotides) are considered. In the second approach, the whole RNA genome is divided into parts by adenine or the most frequent nucleotide as a "space". Rank-frequency distributions are studied in both cases. The defined nucleotide sequences are signs comparable to a certain extent to syllables or words as seen from the nature of their rank-frequency distributions. Within the first approach, the Pólya and the negative hypergeometric distribution yield the best fit. For the distributions obtained within the second approach, we have calculated a set of parameters, including entropy, mean sequence length, and its dispersion. The calculated parameters became the basis for the classification of viruses. We observed that proximity of viruses on planes spanned on various pairs of parameters corresponds to related species. In certain cases, such a proximity is observed for unrelated species as well calling thus for the expansion of the set of parameters used in the classification. We also observed that the fifth most frequent nucleotide sequences obtained within the second approach are of different nature in case of human coronaviruses (different nucleotides for MERS, SARS-CoV, and SARS-CoV-2 versus identical nucleotides for four other coronaviruses). We expect that our findings will be useful as a supplementary tool in the classification of diseases caused by RNA viruses with respect to severity and contagiousness.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7887720PMC
http://dx.doi.org/10.1007/s12304-021-09403-5DOI Listing

Publication Analysis

Top Keywords

nucleotide sequences
20
second approach
12
sequences viral
8
rna viruses
8
frequent nucleotide
8
rank-frequency distributions
8
set parameters
8
nucleotide
6
sequences
5
viruses
5

Similar Publications

Purpose: This work described a new species of Ceratomyxa, based on morphological and phylogenetic analyzes of myxospores collected from the gallbladder of the fish Astyanax mexicanus.

Methods: Sixty-two specimens were captured, between December 2022 and February 2024, in the Flexal River, in the community of Tessalônica, state of Amapá. The specimens were transported alive to the Laboratory of Morphophysiology and Animal Health, at the State University of Amapá, where the studies were carried out.

View Article and Find Full Text PDF

Crohn's disease (CD) is a chronic inflammatory bowel disease with an unknown etiology. Ubiquitination plays a significant role in the pathogenesis of CD. This study aimed to explore the functional roles of ubiquitination-related genes in CD.

View Article and Find Full Text PDF

Alu-Sc-mediated exonization generated a mitochondrial LKB1 gene variant found only in higher order primates.

Sci Rep

January 2025

Singapore Immunology Network (SIgN), Agency for Science, Technology and Research (A*STAR), 8A Biomedical Grove, #04-06 Immunos, Singapore, 138648, Singapore.

The tumor suppressor LKB1/STK11 plays important roles in regulating cellular metabolism and stress responses and its mutations are associated with various cancers. We recently identified a novel exon 1b within intron 1 of human LKB1/STK11, which generates an alternatively spliced, mitochondria-targeting LKB1 isoform important for regulating mitochondrial oxidative stress. Here we examined the formation of this novel exon 1b and uncovered its relatively late emergence during evolution.

View Article and Find Full Text PDF

Omics data provide a plethora of quantifiable information that can potentially be used to identify biomarkers targeting the physiological processes and ecological phenomena of organisms. However, omics data have not been fully utilized because current prediction methods in biomarker construction are susceptible to data multidimensionality and noise. We developed OmicSense, a quantitative prediction method that uses a mixture of Gaussian distributions as the probability distribution, yielding the most likely objective variable predicted for each biomarker.

View Article and Find Full Text PDF

Gene enhancers often form long-range contacts with promoters, but it remains unclear if the activity of enhancers and their chromosomal contacts are mediated by the same DNA sequences and recruited factors. Here, we study the effects of expression quantitative trait loci (eQTLs) on enhancer activity and promoter contacts in primary monocytes isolated from 34 male individuals. Using eQTL-Capture Hi-C and a Bayesian approach considering both intra- and inter-individual variation, we initially detect 19 eQTLs associated with enhancer-eGene promoter contacts, most of which also associate with enhancer accessibility and activity.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!