Background: The combination of chromatin immunoprecipitation with two-channel microarray technology enables genome-wide mapping of binding sites of DNA-interacting proteins (ChIP-on-chip) or sites with methylated CpG di-nucleotides (DNA methylation microarray). These powerful tools are the gateway to understanding gene transcription regulation. Since the goals of such studies, the sample preparation procedures, the microarray content and study design are all different from transcriptomics microarrays, the data pre-processing strategies traditionally applied to transcriptomics microarrays may not be appropriate. Particularly, the main challenge of the normalization of "regulation microarrays" is (i) to make the data of individual microarrays quantitatively comparable and (ii) to keep the signals of the enriched probes, representing DNA sequences from the precipitate, as distinguishable as possible from the signals of the un-enriched probes, representing DNA sequences largely absent from the precipitate.

Results: We compare several widely used normalization approaches (VSN, LOWESS, quantile, T-quantile, Tukey's biweight scaling, Peng's method) applied to a selection of regulation microarray datasets, ranging from DNA methylation to transcription factor binding and histone modification studies. Through comparison of the data distributions of control probes and gene promoter probes before and after normalization, and assessment of the power to identify known enriched genomic regions after normalization, we demonstrate that there are clear differences in performance between normalization procedures.

Conclusion: T-quantile normalization applied separately on the channels and Tukey's biweight scaling outperform other methods in terms of the conservation of enriched and un-enriched signal separation, as well as in identification of genomic regions known to be enriched. T-quantile normalization is preferable as it additionally improves comparability between microarrays. In contrast, popular normalization approaches like quantile, LOWESS, Peng's method and VSN normalization alter the data distributions of regulation microarrays to such an extent that using these approaches will impact the reliability of the downstream analysis substantially.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3293711PMC
http://dx.doi.org/10.1186/1471-2164-13-42DOI Listing

Publication Analysis

Top Keywords

dna methylation
12
normalization
10
methylation microarray
8
transcriptomics microarrays
8
probes representing
8
representing dna
8
dna sequences
8
normalization approaches
8
tukey's biweight
8
biweight scaling
8

Similar Publications

The Impact of Modifiable Risk Factors on the Endothelial Cell Methylome and Cardiovascular Disease Development.

Front Biosci (Landmark Ed)

January 2025

School of Cardiovascular and Metabolic Medicine & Sciences, British Heart Foundation Centre of Research Excellence, King's College London, SE5 9NU London, UK.

Cardiovascular disease (CVD) is the most prevalent cause of mortality and morbidity in the Western world. A common underlying hallmark of CVD is the plaque-associated arterial thickening, termed atherosclerosis. Although the molecular mechanisms underlying the aetiology of atherosclerosis remain unknown, it is clear that both its development and progression are associated with significant changes in the pattern of DNA methylation within the vascular cell wall.

View Article and Find Full Text PDF

Background/objectives: The DNA methylation of neonatal cord blood can be used to accurately estimate gestational age. This is known as epigenetic gestational age. The greater the difference between epigenetic and chronological gestational age, the greater the association with an inappropriate perinatal fetal environment and development.

View Article and Find Full Text PDF

A Guinea Pig Model of Pediatric Metabolic Dysfunction-Associated Steatohepatitis: Poor Vitamin C Status May Advance Disease.

Nutrients

January 2025

Section of Preclinical Disease Biology, Department of Veterinary and Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, 1870 Frederiksberg, Denmark.

Children and teenagers display a distinct metabolic dysfunction-associated steatohepatitis (MASH) phenotype, yet studies of childhood MASH are scarce and validated animal models lacking, limiting the development of treatments. Poor vitamin C (VitC) status may affect MASH progression and often co-occurs with high-fat diets and related metabolic imbalances. As a regulator of DNA methylation, poor VitC status may further contribute to MASH by regulating gene expression This study investigated guinea pigs-a species that, like humans, depends on vitC in the diet-as a model of pediatric MASH, examining the effects of poor VitC status on MASH hallmarks and global DNA methylation levels.

View Article and Find Full Text PDF

DNA methylation has been widely studied with the goal of correlating the genome profiles of various diseases with epigenetic mechanisms. Multiple approaches have been developed that employ extensive steps, such as bisulfite treatments, polymerase chain reactions (PCR), restriction digestion, sequencing, mass analysis, etc., to identify DNA methylation.

View Article and Find Full Text PDF

Serum cystatin C is a well-established marker of renal function and a valuable predictor of health risks and mortality. DNA methylation-predicted cystatin C (DNAmCystatinC), an advanced epigenetic biomarker, serves as a proxy for serum cystatin C levels. However, the relationships between serum cystatin C, DNAmCystatinC, renal function, and mortality outcomes have not been previously examined.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!