Evaluation of marker selection methods and statistical models for chronological age prediction based on DNA methylation.

Leg Med (Tokyo)

Department of Statistics and Actuarial Science, The University of Hong Kong, Pokfulam Road, Hong Kong, China. Electronic address:

Published: November 2020

In forensic investigation, retrieving biological information from DNA evidence is a promising field of interest. One of the applications is on the estimation of the age of the donor based on DNA methylation. A large number of studies focused on age prediction using the 450 K Human Methylation Beadchip. Various marker selection methods and prediction models have been considered. However, there is a lack of research evaluating different high-dimensional variable selection methods of CpG sites with various models for age prediction. The aim of this study is to evaluate four variable selection methods (forward selection, LASSO, elastic net and SCAD) combined with a classical statistical model and sophisticated machine learning models based on the mean absolute deviation (MAD) and the root-mean-square error (RMSE). We used publicly available 450 K data set containing 991 whole blood samples (age 19-101 years). We found that the multiple linear regression model with 16 markers selected from the forward selection method performed very well in age prediction (MAD = 3.76 years and RMSE = 5.01 years). On the other hand, the highly advanced ultrahigh dimensional variable selection methods and sophisticated machine learning algorithms appeared unnecessary for age prediction based on DNA methylation.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.legalmed.2020.101744DOI Listing

Publication Analysis

Top Keywords

selection methods
20
age prediction
20
based dna
12
dna methylation
12
variable selection
12
marker selection
8
prediction based
8
forward selection
8
sophisticated machine
8
machine learning
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!