On the prediction of non-CG DNA methylation using machine learning.

NAR Genom Bioinform

Department of Computer Science and Engineering, University of California, Riverside, CA 92521, USA.

Published: June 2023

DNA methylation can be detected and measured using sequencing instruments after sodium bisulfite conversion, but experiments can be expensive for large eukaryotic genomes. Sequencing nonuniformity and mapping biases can leave parts of the genome with low or no coverage, thus hampering the ability of obtaining DNA methylation levels for all cytosines. To address these limitations, several computational methods have been proposed that can predict DNA methylation from the DNA sequence around the cytosine or from the methylation level of nearby cytosines. However, most of these methods are entirely focused on CG methylation in humans and other mammals. In this work, we study, for the first time, the problem of predicting cytosine methylation for CG, CHG and CHH contexts on six plant species, either from the DNA primary sequence around the cytosine or from the methylation levels of neighboring cytosines. In this framework, we also study the cross-species prediction problem and the cross-context prediction problem (within the same species). Finally, we show that providing gene and repeat annotations allows existing classifiers to significantly improve their prediction accuracy. We introduce a new classifier called AMPS (annotation-based methylation prediction from sequence) that takes advantage of genomic annotations to achieve higher accuracy.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10189801PMC
http://dx.doi.org/10.1093/nargab/lqad045DOI Listing

Publication Analysis

Top Keywords

dna methylation
16
cytosine methylation
12
methylation
9
methylation levels
8
sequence cytosine
8
prediction problem
8
dna
6
prediction
5
prediction non-cg
4
non-cg dna
4

Similar Publications

The ribophorin family, including RPN1, has been associated with tumor progression, but its specific role in pan-cancer dynamics remains unclear. Using data from TCGA, GTEx, and Ualcan databases, we investigated the relationship of RPN1 with prognosis, genomic alterations, and epigenetic modifications across various cancers. Differential analysis revealed elevated RPN1 expression in multiple cancer types, indicating a potential prognostic value.

View Article and Find Full Text PDF

Background: Parkinson's disease (PD) is the second most common neurodegenerative disorder, after Alzheimer's disease. Many individuals with PD experience cognitive impairment, significantly threatening both their physical and mental well-being. Research has shown that abnormal DNA methylation is closely linked to neurodegenerative conditions such as Alzheimer's and Parkinson's disease.

View Article and Find Full Text PDF

This review addresses the significant challenges and technological developments in cancer screening and early diagnosis in the context of India's diverse and resource-constrained healthcare landscape. Selected cancers like breast, cervical, oral, lung, and colorectal cancers are focused on, and established screening methods such as clinical breast examination (CBE), mammography, visual inspection with acetic acid (VIA), HPV DNA testing, and oral visual inspection (OVI) are reviewed. These are cost-effective strategies that are proven to reduce mortality.

View Article and Find Full Text PDF

Prognostic Role of Global DNA Methylation in Renal Cancer Reveals Decitabine Treatment Benefit.

Curr Med Chem

March 2025

School of Biological Science & Medical Engineering, Southeast University, Nanjing, Jiangsu, 210000, China.

Background: Renal cancer presents a significant global health challenge due to its rising incidence and mortality rates. Often undetected in early stages, it complicates diagnosis and treatment. Current therapies face resistance and limited effectiveness, especially in advanced stages.

View Article and Find Full Text PDF

Background/objectives: Tartrazine (TRZ), a synthetic red azo dye derived from coal tar, is widely used as a food colorant in various food products, pharmaceuticals, and cosmetics. This study aims to investigate the impact of TRZ on the expression levels of DNA methyltransferases (, , and ) and histone deacetylases ( and ). Additionally, we evaluate genomic DNA stability using the alkaline comet assay in three human cell lines: immortalized human keratinocyte (HaCaT), human hepatocellular carcinoma (HepG2), and human lung adenocarcinoma (A549).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!