Technical and biological sources of unreliability of Infinium probes on Illumina methylation microarrays.

Tatiana Nazarenko Charlotte Dafni Vavourakis Allison Jones Iona Evans Lena Schreiberhuber Christine Kastner Isma Ishaq-Parveen Elisa Redl Anthony W Watson Kirsten Brandt Clive Carter Alexey Zaikin Chiara Maria Stella Herzog Martin Widschwendter

Clin Epigenetics

Research Institute for Biomedical Aging Research, Universität Innsbruck, 6020, Innsbruck, Austria.

Published: September 2024

The study examines the reliability of the Illumina Methylation array platform in measuring DNA methylation by evaluating the consistency of repeated measures using both type I and type II Infinium probes.
A new method is proposed to identify unreliable probes by using dynamic thresholds for mean intensity (MI) and unreliability scores based on simulations that factor in technical noise.
Validation across multiple datasets indicates that probes with low MI tend to show higher variability in β values, and an R package is introduced to help researchers calculate MI and unreliability scores for better data analysis.

The Illumina Methylation array platform has facilitated countless epigenetic studies on DNA methylation (DNAme) in health and disease, yet relatively few studies have so studied its reliability, i.e., the consistency of repeated measures. Here we investigate the reliability of both type I and type II Infinium probes. We propose a method for excluding unreliable probes based on dynamic thresholds for mean intensity (MI) and 'unreliability', estimated by probe-level simulation of the influence of technical noise on methylation β values using the background intensities of negative control probes. We validate our method in several datasets, including newly generated Illumina MethylationEPIC BeadChip v1.0 data from paired whole blood samples taken six weeks apart and technical replicates spanning multiple sample types. Our analysis revealed that specifically probes with low MI exhibit higher β value variability between repeated samples. MI was associated with the number of C-bases in the respective probe sequence and correlated negatively with unreliability scores. The unreliability scores were substantiated through validation in a new EPIC v1.0 (blood and cervix) and a publicly available 450k (blood) dataset, as they effectively captured the variability observed in β values between technical replicates. Finally, despite promising higher robustness, the newer version v2.0 of the MethylationEPIC BeadChip retained a substantial number of probes with poor unreliability scores. To enhance current pre-processing pipelines, we developed an R package to calculate MI and unreliability scores and provide guidance on establishing optimal dynamic score thresholds for a given dataset.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11409515	PMC
http://dx.doi.org/10.1186/s13148-024-01739-2	DOI Listing

Publication Analysis

Top Keywords

unreliability scores

infinium probes

illumina methylation

methylationepic beadchip

technical replicates

probes

unreliability

technical

technical biological

biological sources

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!