An annotation is a set of genomic intervals sharing a particular function or property. Examples include genes or their exons, evolutionarily conserved elements, and regions with a particular epigenetic state. A common task is to compare two annotations to determine if one is enriched or depleted in the regions covered by the other. We study the problem of assigning statistical significance to such a comparison based on a null model representing two random unrelated annotations. To incorporate more background information into such analyses,we propose a new null model based on a Markov chain which differentiates among several genomic contexts. These contexts can capture various confounding factors, such as GC content or assembly gaps. We then develop a new algorithm for estimating p-values by computing the exact expectation and variance of the test statistics and then estimating the p-value using a normal approximation. Compared to the previous algorithm by Gafurov et al., the new algorithm provides three advances: (1) the running time is improved from quadratic to linear or quasi-linear, (2) the algorithm can handle two different test statistics, and (3) the algorithm can handle both simple and context-dependent Markov chain null models. We demonstrate the efficiency and accuracy of our algorithm on synthetic and real data sets, including the recent human telomere-to-telomere assembly. In particular, our algorithm computed p-values for 450 pairs of human genome annotations using 24 threads in under three hours. Moreover, the use of genomic contexts to correct for GC bias resulted in the reversal of some previously published findings.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10690252 | PMC |
http://dx.doi.org/10.1101/2023.11.22.568259 | DOI Listing |
Nat Commun
January 2025
Gene Regulation Laboratory, MRC Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, OX3 9DS, Oxford, UK.
Individual enhancers are defined as short genomic regulatory elements, bound by transcription factors, and able to activate cell-specific gene expression at a distance, in an orientation-independent manner. Within mammalian genomes, enhancer-like elements may be found individually or within clusters referred to as locus control regions or super-enhancers (SEs). While these behave similarly to individual enhancers with respect to cell specificity, distribution and distance, their orientation-dependence has not been formally tested.
View Article and Find Full Text PDFViruses
December 2024
INSERM U1052, CNRS UMR5286, Université Claude Bernard Lyon 1, Hospices Civils de Lyon, Lyon Hepatology Institute (IHU Everest), 69003 Lyon, France.
Cyclophilin (Cyp) inhibitors are of clinical interest in respect to their antiviral activities in the context of many viral infections including chronic hepatitis B and C. Cyps are a group of enzymes with peptidyl-prolyl isomerase activity (PPIase), known to be required for replication of diverse viruses including hepatitis B and C viruses (HBV and HCV). Amongst the Cyp family, the molecular mechanisms underlying the antiviral effects of CypA have been investigated in detail, but potential roles of other Cyps are less well studied in the context of viral hepatitis.
View Article and Find Full Text PDFInt J Mol Sci
January 2025
Department of Neurology, Virginia Commonwealth University, Richmond, VA 23298, USA.
Acute ischemic stroke with large vessel occlusion (LVO) continues to present a considerable challenge to global health, marked by substantial morbidity and mortality rates. Although definitive diagnostic markers exist in the form of neuroimaging, their expense, limited availability, and potential for diagnostic delay can often result in missed opportunities for life-saving interventions. Despite several past attempts, research efforts to date have been fraught with challenges likely due to multiple factors, such as the inclusion of diverse stroke types, variable onset intervals, differing pathobiologies, and a range of infarct sizes, all contributing to inconsistent circulating biomarker levels.
View Article and Find Full Text PDFInt J Mol Sci
January 2025
St. Catherine Specialty Hospital, 10000 Zagreb, Croatia.
Pharmacogenetics is a branch of genomic medicine aiming to personalize drug prescription guidelines based on individual genetic information. This concept might lead to a reduction in adverse drug reactions, which place a heavy burden on individual patients' health and the economy of the healthcare system. The aim of this study was to present insights gained from the pharmacogenetics-based clustering of over 500 patients from the Croatian population.
View Article and Find Full Text PDFInt J Mol Sci
January 2025
Department of Medical Oncology, CRO di Aviano, National Cancer Institute, IRCCS, 33081 Aviano, Italy.
Non-small cell lung cancer (NSCLC) remains a leading cause of cancer-related mortality worldwide. The discovery of specific driver mutations has revolutionized the treatment landscape of oncogene-addicted NSCLC through targeted therapies, significantly improving patient outcomes. However, immune checkpoint inhibitors (ICIs) have demonstrated limited effectiveness in this context.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!