AI Article Synopsis

  • DNA methylation is a key epigenetic modification used in various fields, enabling insights into transcription regulation, aging, and disease prediction through techniques like bisulfite and enzymatic methyl sequencing.
  • Computational methods have been created to analyze methylation heterogeneity, revealing important cellular patterns; however, limitations exist due to incomplete data from shotgun sequencing that restrict comprehensive analysis.
  • The developed probabilistic imputation method enhances the recovery of methylation patterns using neighboring information, increasing the regions evaluated and providing a user-friendly pipeline for genome-wide screening of methylation levels and patterns, which has significant implications for research and early disease detection.

Article Abstract

DNA methylation is one of the most studied epigenetic modifications that has applications ranging from transcriptional regulation to aging, and can be assessed by bisulfite sequencing (BS-seq) or enzymatic methyl sequencing (EM-seq) at single base-pair resolution. The permutations of methylation statuses given by aligned reads reflect the methylation patterns of individual cells. These patterns at specific genomic locations are sought to be indicative of cellular heterogeneity within a cellular population, which are predictive of developments and diseases; therefore, methylation heterogeneity has potentials in early detection of these changes. Computational methods have been developed to assess methylation heterogeneity using methylation patterns formed by four consecutive CpGs, but the nature of shotgun sequencing often give partially observed patterns, which makes very limited data available for downstream analysis. While many programs are developed to impute genome-wide methylation levels, currently there is only one method developed for recovering partially observed methylation patterns; however, the program needs lots of data to train and cannot be used directly; therefore, we developed a probabilistic-based imputation method that uses information from neighbouring sites to recover partially observed methylation patterns speedily. It is demonstrated to allow for the evaluation of methylation heterogeneity at 15% more regions genome-wide with high accuracy for data with moderate depth. To make it more user-friendly we also provide a computational pipeline for genome-screening, which can be used in both evaluating methylation levels and profiling methylation patterns genomewide for all cytosine contexts, which is the first of its kind. Our method allows for accurate estimation of methylation levels and makes evaluating methylation heterogeneity available for much more data with reasonable coverage, which has important implications in using methylation heterogeneity for monitoring changes within the cellular populations that were impossible to detect for the assessment of development and diseases.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580945PMC
http://dx.doi.org/10.3389/fbinf.2022.815289DOI Listing

Publication Analysis

Top Keywords

methylation patterns
24
methylation heterogeneity
24
methylation
17
partially observed
16
observed methylation
12
evaluating methylation
12
methylation levels
12
patterns
8
heterogeneity
7
bsimp imputing
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!