Identifying blood-specific age-related DNA methylation markers on the Illumina MethylationEPIC® BeadChip

Alsaleh, Hussain and Haddrill, Penelope R. (2019) Identifying blood-specific age-related DNA methylation markers on the Illumina MethylationEPIC® BeadChip. Forensic Science International, 303. 109944. ISSN 0379-0738 (https://doi.org/10.1016/j.forsciint.2019.109944)

[thumbnail of Alsaleh-Haddrill-FSI-2019-Identifying-blood-specific-age-related-DNA-methylation-markers]
Preview
Text. Filename: Alsaleh_Haddrill_FSI_2019_Identifying_blood_specific_age_related_DNA_methylation_markers.pdf
Accepted Author Manuscript
License: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 logo

Download (1MB)| Preview

Abstract

The past decade has seen rapid development in DNA methylation (DNAm) microarrays, including the Illumina HumanMethylation27 and HumanMethylation450 (450 K) chips, which have played an essential role in identifying and evaluating age-related (AR) DNAm markers in different tissues. Recently, a new array, the Illumina MethylationEPIC (EPIC) was introduced, with nearly double the number of probes as the 450 K (∼850,000 probes). In this study, we test these newly added probes for age association using a large cohort of 754 DNAm profiles from blood samples assayed on the EPIC BeadChip, for individuals aged 0–88 years old. 52 AR CpG sites (Spearman’s abs(rho) >0.6 and P-value <10−83) were identified, 21 of which were novel sites and mapped to 18 genes, nine of which (LHFPL4, SLC12A8, EGFEM1P, GPR158, TAL1, KIAA1755, LOC730668, DUSP16, and FAM65C) have never previously been reported to be associated with age. The data were subsequently split into a 527-sample training set and a 227-sample testing set to build and validate two age prediction models using elastic net regression and multivariate regression. Elastic net regression selected 425 CpG markers with a mean absolute deviation (MAD) of 2.6 years based on the testing set. To build a multivariate linear regression model, AR CpG sites with R2 > 0.5 at FDR < 0.05 were input into stepwise regression to select the best subset for age prediction. The resulting six CpG markers were linearly modelled with age and explained 81% of age-correlated variation in DNAm levels. Age estimation accuracy using bootstrap analysis was 4.5 years, with 95% confidence intervals of 4.56 to 4.57 years based on the testing set. These results suggest that EPIC BeadChip probes for age estimation fall within the range of probes found on the previous Illumina HumanMethylation platforms in terms of their age-prediction ability.