Mining and analysis of microsatellites in human coronavirus genomes using the in-house built Java pipeline.

Genomics Inform

Department of Computer Science & IT, MJP Rohilkhand University, Bareilly 243006, Uttar Pradesh, India.

Published: September 2022

Microsatellites, or simple sequence repeats (SSRs), are motifs of 1 to 6 nucleotides in length present in both coding and non-coding regions of DNA. They are widely distributed throughout the genomes of prokaryotes, eukaryotes, and viruses and are used as molecular markers in studies of DNA variation, gene regulation, genetic diversity, and evolution. However, in vitro microsatellite identification is time-consuming and expensive. The present research therefore used an in-house built Java pipeline to identify and analyse perfect and compound microsatellites, design primers, and compute related statistics in seven complete coronavirus genome sequences, including the genome of coronavirus disease 2019, where the host is Homo sapiens. Under the search criteria applied to the seven genomic sequences, the number of perfect SSRs per genome ranged from 76 to 118 and the number of compound SSRs from 1 to 10, reflecting a low conversion of perfect SSRs to compound repeats. Furthermore, the incidence of SSRs was positively but not significantly correlated with genome size (R² = 0.45, p > 0.05), with SSR relative abundance (R² = 0.18, p > 0.05), and with relative density (R² = 0.23, p > 0.05). Dinucleotide repeats were the most abundant in the coding region of the genome, followed by tri-, mono-, and tetranucleotide repeats. This comparative study would help in understanding evolutionary relationships, genetic diversity, and hypervariability in minimal time and at minimal cost.
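The kind of search described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' Java pipeline: the minimum repeat counts in `MIN_REPEATS`, the maximum gap `dmax` used to merge neighbouring repeats into a compound SSR, and all function names are hypothetical, since the abstract does not state the search criteria. Relative abundance and relative density are taken here, by assumption, as SSR count per kb and SSR base pairs per kb of genome, the definitions commonly used in SSR surveys.

```python
BASES = set("ACGT")

# Hypothetical minimum repeat counts per motif length (1 to 6 nt); the
# abstract does not give the thresholds used by the authors' pipeline.
MIN_REPEATS = {1: 6, 2: 3, 3: 3, 4: 3, 5: 3, 6: 3}


def _is_primitive(motif):
    """True if the motif is not a repetition of a shorter unit (e.g. 'ATAT')."""
    return motif not in (motif + motif)[1:-1]


def find_perfect_ssrs(seq, min_repeats=MIN_REPEATS):
    """Left-to-right scan for non-overlapping perfect SSRs.

    Returns a list of (start, end, motif, repeats) with 0-based, end-exclusive
    coordinates. At each position the shortest qualifying motif wins.
    """
    seq = seq.upper()
    hits, i = [], 0
    while i < len(seq):
        for size in sorted(min_repeats):
            motif = seq[i:i + size]
            if len(motif) < size or not set(motif) <= BASES or not _is_primitive(motif):
                continue
            # Count how many times the motif repeats back to back from i.
            repeats = 1
            while seq[i + repeats * size:i + (repeats + 1) * size] == motif:
                repeats += 1
            if repeats >= min_repeats[size]:
                hits.append((i, i + repeats * size, motif, repeats))
                i += repeats * size  # continue after the repeat tract
                break
        else:
            i += 1  # no SSR starts here
    return hits


def find_compound_ssrs(hits, dmax=10):
    """Group consecutive perfect SSRs separated by at most dmax bp (assumed value)."""
    compounds, group = [], []
    for hit in hits:
        if group and hit[0] - group[-1][1] > dmax:
            if len(group) > 1:
                compounds.append(group)
            group = []
        group.append(hit)
    if len(group) > 1:
        compounds.append(group)
    return compounds


def relative_abundance(hits, genome_len):
    """SSRs per kb of sequence (assumed definition)."""
    return 1000 * len(hits) / genome_len


def relative_density(hits, genome_len):
    """Base pairs covered by SSRs per kb of sequence (assumed definition)."""
    return 1000 * sum(end - start for start, end, _, _ in hits) / genome_len


if __name__ == "__main__":
    # Toy sequence for illustration only, not coronavirus data.
    toy = "GCT" + "A" * 6 + "CG" + "AT" * 4 + "GCTAGCCTGACGTTCA" + "CAG" * 3 + "TC"
    perfect = find_perfect_ssrs(toy)
    print(perfect)
    print(find_compound_ssrs(perfect))
    print(relative_abundance(perfect, len(toy)), relative_density(perfect, len(toy)))
```

The scan reports the shortest motif at each position and skips non-primitive motifs, so a run such as AAAAAA is counted once as a mononucleotide repeat rather than again as (AA)3. A production tool would also need to handle ambiguous bases, both strands, and coding versus non-coding annotation, none of which is attempted here.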

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9576472
DOI: http://dx.doi.org/10.5808/gi.20033

Publication Analysis

Top Keywords

simple sequence: 16
sequence repeats: 12
in-house built: 8
built java: 8
java pipeline: 8
genetic diversity: 8
perfect simple: 8
repeats: 5
genome: 5
mining analysis: 4

Similar Publications

Background: Neonatal mice are frequently used to model diseases that affect human infants. Microbial community composition has been shown to impact disease progression in these models. Despite this, the maturation of the early-life murine microbiome has not been well-characterized.

The competition for resources is a defining feature of microbial communities. In many contexts, from soils to host-associated communities, highly diverse microbes are organized into metabolic groups or guilds with similar resource preferences. The resource preferences of individual taxa that give rise to these guilds are critical for understanding fluxes of resources through the community and the structure of diversity in the system.

Accurate drug-target binding affinity (DTA) prediction is crucial in drug discovery. Recently, deep learning methods for DTA prediction have made significant progress. However, two challenges remain: (1) recent models ignore the correlations in drug and target data during the drug/target representation process, and (2) the interaction learning of drug-target pairs is done by simple concatenation, which is insufficient to explore their fusion.

SSR marker-based genetic diversity and structure analyses of var. from different populations.

PeerJ

January 2025

Guangxi Key Laboratory of Efficacy Study on Chinese Materia Medica, Nanning, Guangxi, China.

Background: var. is a variety in the section of the genus of the family Theaceae which is native to Fangchenggang, Guangxi, China. To date, the genetic diversity and structure of this variety remains to be understood.

Mitochondrial genome of : features, RNA editing, and insights into male sterility.

Front Plant Sci

January 2025

Bio-resource Research and Utilization Joint Key Laboratory of Sichuan and Chongqing, Chongqing Institute of Medicinal Plant Cultivation, Nanchuan, Chongqing, China.

Introduction: Mitochondria are essential organelles that provide energy for plants. They are semi-autonomous, maternally inherited, and closely linked to cytoplasmic male sterility (CMS) in plants. , a widely used medicinal plant from the Caprifoliaceae family, is rich in chlorogenic acid (CGA) and its analogues, which are known for their antiviral and anticancer properties.
