bioRxiv | 2021

A computational solution for bolstering reliability of epigenetic clocks: Implications for clinical trials and longitudinal tracking

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Abstract


Epigenetic clocks are widely used aging biomarkers calculated from DNA methylation data. Unfortunately, measurements for individual CpGs can be surprisingly unreliable due to technical noise, and this may limit the utility of epigenetic clocks. We report that noise produces deviations up to 3 to 9 years between technical replicates for six major epigenetic clocks. The elimination of low-reliability CpGs does not ameliorate this issue. Here, we present a novel computational multi-step solution to address this noise, involving performing principal component analysis on the CpG-level data followed by biological age prediction using principal components as input. This method extracts shared systematic variation in DNAm while minimizing random noise from individual CpGs. Our novel principal-component versions of six clocks show agreement between most technical replicates within 0 to 1.5 years, equivalent or improved prediction of outcomes, and more stable trajectories in longitudinal studies and cell culture. This method entails only one additional step compared to traditional clocks, does not require prior knowledge of CpG reliabilities, and can improve the reliability of any existing or future epigenetic biomarker. The high reliability of principal component-based epigenetic clocks will make them particularly useful for applications in personalized medicine and clinical trials evaluating novel aging interventions.

Volume None
Pages None
DOI 10.1101/2021.04.16.440205
Language English
Journal bioRxiv

Full Text