A pervasive question in biological research studying gene regulation, chromatin structure, or genomics is where, and to what extent, does a signal of interest arise genome-wide? This question is addressed using a variety of methods relying on high-throughput sequencing data as their final output, including ChIP-seq for protein-DNA interactions, GapR-seq for measuring supercoiling, and HBD-seq or DRIP-seq for R-loop positioning. Current computational methods to calculate genome-wide enrichment of the signal of interest usually do not properly handle the count-based nature of sequencing data, they often do not make use of the local correlation structure of sequencing data, and they do not apply any regularization of enrichment estimates. This can result in unrealistic estimates of the true underlying biological enrichment of interest, unrealistically low estimates of confidence in point estimates of enrichment (or no estimates of confidence at all), unrealistic gyrations in enrichment estimates at very close (<10 bp) genomic loci due to noise inherent in sequencing data, and in a multiple-hypothesis testing problem during interpretation of genome-wide enrichment estimates. We developed a tool called Enricherator to infer genome-wide enrichments from sequencing count data. Enricherator uses the variational Bayes algorithm to fit a generalized linear model to sequencing count data and to sample from the approximate posterior distribution of enrichment estimates (https://github.com/jwschroeder3/enricherator). Enrichments inferred by Enricherator more precisely identify known binding sites in cases where low coverage between binding sites leads to false-positive peak calls in these noisy regions of the genome; these benefits extend to published datasets.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11377166 | PMC |
http://dx.doi.org/10.1016/j.jmb.2024.168567 | DOI Listing |
Viruses
December 2024
Foundation Plant Services, University of California-Davis, Davis, CA 95616, USA.
Among the cultivated crop species, the economically and culturally important grapevine plays host to the greatest number of distinctly characterized viruses. A critical component of the management and containment of these viral diseases in grapevine is both the identification of infected vines and the characterization of new pathogens. Next-generation high-throughput sequencing technologies, i.
View Article and Find Full Text PDFViruses
December 2024
Global Health and Tropical Medicine, GHTM, Associate Laboratory in Translation and Innovation Towards Global Health, LA-REAL, Instituto de Higiene e Medicina Tropical, IHMT, Universidade NOVA de Lisboa, Rua da Junqueira 100, 1349-008 Lisboa, Portugal.
The high genetic variability of HIV-1 and the emergence of transmitted drug resistance (TDR) can impact treatment efficacy. In this study, we investigated the prevalent HIV-1 genotypes and drug-resistance-associated mutations in drug-naïve HIV-1 individuals in Cabo Verde. The study, conducted between 2018 and 2019, included drug-naïve HIV-1 individuals from the São Vicente, Boa Vista, Fogo, and Santiago islands.
View Article and Find Full Text PDFViruses
December 2024
Wadsworth Center, David Axelrod Institute, New York State Department of Health, Albany, NY 12208, USA.
A historical perspective of more than one hundred years of influenza surveillance in New York State demonstrates the progression from anecdotes and case counts to next-generation sequencing and electronic database management, greatly improving pandemic preparedness and response. Here, we determined if influenza virologic surveillance at the New York State public health laboratory (NYS PHL) tests sufficient specimen numbers within preferred confidence limits to assess situational awareness and detect novel viruses that pose a pandemic risk. To this end, we analyzed retrospective electronic data on laboratory test results for the influenza seasons 1997-1998 to 2021-2022 according to sample sizes recommended in the Influenza Virologic Surveillance Right Size Roadmap issued by the Association of Public Health Laboratories and Centers for Disease Control and Prevention.
View Article and Find Full Text PDFViruses
December 2024
Federal Centre for Animal Health, 600901 Vladimir, Russia.
The lack of data on the whole-genome analysis of genotype II African swine fever virus (ASFV) isolates significantly hinders our understanding of its molecular evolution, and as a result, the range of single nucleotide polymorphisms (SNPs) necessary to describe a more accurate and complete scheme of its circulation. In this regard, this study aimed to identify unique SNPs, conduct phylogenetic analysis, and determine the level of homology of isolates obtained in the period from 2019 to 2022 in the central and eastern regions of Russia. Twenty-one whole-genome sequences of genotype II ASFV isolates were assembled, analyzed, and submitted to GenBank.
View Article and Find Full Text PDFViruses
November 2024
Chantal BIYA International Reference Centre for Research on HIV/AIDS Prevention and Management, Yaoundé P.O. Box 3077, Cameroon.
Dual therapies (DT) combining integrase strand transfer inhibitors (INSTIs) with second-generation non-nucleoside reverse transcriptase inhibitors (2nd-Gen-NNRTIs) offer new possibilities for HIV treatment to improve adherence. However, drug resistance associated mutations (RAMs) to prior antiretrovirals may jeopardize the efficacy of DT. We herein describe the predicted efficacy of DT combining INSTIs + 2nd-Gen-NNRTI following treatment failure among Cameroonian patients.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!