The SARS-CoV-2 virus has caused the largest pandemic of the 21st century, with hundreds of millions of cases and tens of millions of fatalities. Scientists around the world are racing to develop vaccines and new pharmaceuticals to overcome the pandemic and offer effective treatments for COVID-19. Consequently, there is an essential need to better understand how the pathogenesis of SARS-CoV-2 is affected by viral mutations and to determine the conserved segments in the viral genome that can serve as stable targets for novel therapeutics. Here, we introduce a text-mining method to estimate the mutability of genomic segments directly from a reference (ancestral) whole genome sequence. The method calculates the importance of genomic segments from their spatial distribution and frequency over the whole genome. To validate our approach, we perform a large-scale analysis of the viral mutations in nearly 80,000 publicly available SARS-CoV-2 predecessor whole genome sequences and show that these results are highly correlated with the segments predicted by the statistical method used for keyword detection. Importantly, these correlations are found to hold at the codon and gene levels, as well as for gene coding regions. Using the text-mining method, we further identify codon sequences that are potential candidates for siRNA-based antiviral drugs. Significantly, one of the candidates identified in this work corresponds to the first seven codons of an epitope of the spike glycoprotein, which is the only SARS-CoV-2 immunogenic peptide without a match to a human protein.
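The abstract describes scoring genomic segments by their frequency and spatial distribution, in the manner of statistical keyword detection. It does not state the exact statistic, so the following is only a minimal sketch under an assumption: it uses the normalized standard deviation of the spacings between consecutive occurrences, a measure commonly used in keyword detection. The function names, the `min_count` threshold, and the toy sequence are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from statistics import mean, pstdev


def kmer_positions(sequence, k=3, step=3):
    """Map each k-mer to the start positions where it occurs.

    With k=3 and step=3 the sequence is read as non-overlapping
    codons in a single reading frame (an illustrative choice).
    """
    positions = defaultdict(list)
    for start in range(0, len(sequence) - k + 1, step):
        positions[sequence[start:start + k]].append(start)
    return positions


def clustering_scores(sequence, k=3, step=3, min_count=3):
    """Return {kmer: (count, sigma)} for k-mers seen at least min_count times.

    sigma = std(spacings) / mean(spacings), where spacings are the
    distances between consecutive occurrences. Evenly spread k-mers
    give sigma close to 0; clustered k-mers give larger values.
    This statistic is an assumption, not the authors' stated method.
    """
    scores = {}
    for kmer, pos in kmer_positions(sequence, k, step).items():
        if len(pos) < min_count:
            continue  # too few occurrences for a meaningful spacing statistic
        gaps = [b - a for a, b in zip(pos, pos[1:])]
        scores[kmer] = (len(pos), pstdev(gaps) / mean(gaps))
    return scores


if __name__ == "__main__":
    # Made-up toy sequence, used only to show the call pattern.
    toy = "AAAAAAAAA" + "CCC" * 5 + "AAA"
    ranked = sorted(clustering_scores(toy).items(), key=lambda kv: -kv[1][1])
    for kmer, (count, sigma) in ranked:
        print(kmer, count, round(sigma, 3))
```

In this sketch, a segment whose occurrences are bunched together receives a higher score than one spread evenly along the sequence. How such a score relates to mutability is the question the abstract addresses by comparison with observed mutations.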
Download full-text PDF | Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8795775 | PMC |
http://dx.doi.org/10.1017/qrd.2021.13 | DOI Listing |
Elucidating ancestry-specific structures in admixed populations is crucial for comprehending population history and mitigating confounding effects in genome-wide association studies. Existing methods for elucidating ancestry-specific structures generally rely on frequency-based estimates of the genetic relationship matrix (GRM) among admixed individuals after masking segments from ancestry components not being targeted for investigation. However, these approaches disregard linkage information between markers, potentially limiting their resolution in revealing structure within an ancestry component.
The Flaviviridae are a family of non-segmented positive-sense enveloped RNA viruses containing significant pathogens including hepatitis C virus and yellow fever virus. Recent large-scale metagenomic surveys have identified many diverse RNA viruses related to classical orthoflaviviruses and pestiviruses but with quite different genome lengths and configurations, and with a hugely expanded host range that spans multiple animal phyla, including molluscs, cnidarians and stramenopiles, and plants. Grouping of RNA-directed RNA polymerase (RdRP) hallmark gene sequences of flavivirus and 'flavi-like' viruses into four divergent clades and multiple lineages within them was congruent with helicase gene phylogeny, PPHMM profile comparisons, and comparison of RdRP protein structure predicted by AlphaFold2.
Establishing the anterior-posterior body axis is a fundamental process during embryogenesis, and the fruit fly, Drosophila melanogaster, provides one of the best-known case studies of this process. In Drosophila, localized mRNA of bicoid serves as the anterior determinant (AD). Bicoid engages in a concentration-dependent competition with nucleosomes and initiates symmetry-breaking along the AP axis by promoting chromatin accessibility at the loci of transcription factor (TF) genes that are expressed in the anterior of the embryo.
Tandem repeats are a highly polymorphic class of genomic variation that play causal roles in rare diseases but are notoriously difficult to sequence using short-read techniques. Most previous studies profiling tandem repeats genome-wide have reduced the description of each locus to the singular value of the length of the entire repetitive locus. Here we introduce a comprehensive database of 3.
Heliyon
January 2025
Tehran Azad University of Medical Sciences, Faculty of Pharmaceutical Sciences, Iran.
The significance of vaccine development has gained heightened importance in light of the pandemic. In such critical circumstances, global citizens anticipate researchers in this field to swiftly identify a vaccine candidate to combat the pandemic's root cause. It is widely recognized that the vaccine design process is traditionally both time-consuming and costly.