Cluster mean-field theory accurately predicts statistical properties of large-scale DNA methylation patterns

Kerr, Lyndsay and Sproul, Duncan and Grima, Ramon (2022) Cluster mean-field theory accurately predicts statistical properties of large-scale DNA methylation patterns. Journal of the Royal Society Interface, 19 (186). 20210707. ISSN 1742-5689 (https://doi.org/10.1098/rsif.2021.0707)

[thumbnail of Kerr-etal-JRSI-2022-Cluster-mean-field-theory-accurately-predicts-statistical-properties-of-large-scale-DNA-methylation-patterns]
Preview
Text. Filename: Kerr_etal_JRSI_2022_Cluster_mean_field_theory_accurately_predicts_statistical_properties_of_large_scale_DNA_methylation_patterns.pdf
Final Published Version
License: Creative Commons Attribution 4.0 logo

Download (1MB)| Preview

Abstract

The accurate establishment and maintenance of DNA methylation patterns is vital for mammalian development and disruption to these processes causes human disease. Our understanding of DNA methylation mechanisms has been facilitated by mathematical modelling, particularly stochastic simulations. Megabase-scale variation in DNA methylation patterns is observed in development, cancer and ageing and the mechanisms generating these patterns are little understood. However, the computational cost of stochastic simulations prevents them from modelling such large genomic regions. Here, we test the utility of three different mean-field models to predict summary statistics associated with large-scale DNA methylation patterns. By comparison to stochastic simulations, we show that a cluster mean-field model accurately predicts the statistical properties of steady-state DNA methylation patterns, including the mean and variance of methylation levels calculated across a system of CpG sites, as well as the covariance and correlation of methylation levels between neighbouring sites. We also demonstrate that a cluster mean-field model can be used within an approximate Bayesian computation framework to accurately infer model parameters from data. As mean-field models can be solved numerically in a few seconds, our work demonstrates their utility for understanding the processes underpinning large-scale DNA methylation patterns.