Introduction

Degenerative biological changes and functional decline occur with advancing age, increasing the incidence of disorders, diseases, and mortality. Thus, biological aging proceeds in tandem with chronological age. However, the pace and extent of age changes vary among individuals at any given chronological age (Mitnitski et al. 2002a; Karasik et al. 2005). Thus, the term biological aging has been developed to conceptualize the fact that individuals differ in their manifestation of age changes as they age chronologically. The heterogeneity in biological aging among chronological age peers necessitates a reliable measure of biological or functional age separate from chronological age.

A good measure of biological age should reflect age-related changes occurring at various biological levels. One of the best characterized measures of biological age is the frailty (or deficit) index (Rockwood et al. 1994, 1999; Fried et al. 2001; Mitnitski et al. 2001; Kulminski et al. 2007a, b). This composite index is expressed as the proportion of health deficits accumulated by individuals among a set of health items surveyed throughout the body, and it can be calculated at any given chronological age (Mitnitski et al. 2001). The health items (variables) include various signs, symptoms, laboratory measurements, disabilities, and diseases. A frailty index calculated from about 20 to 100 health variables gives reliable and comparable results (Mitnitski et al. 2006; Rockwood et al. 2007; Rockwood and Mitnitski 2007; Searle et al. 2008). Frailty indexes have been extensively examined and used in various studies (Mitnitski et al. 2001, 2002a, b; Kulminski et al. 2007a, b; Matteini et al. 2010).

We developed a frailty index called FI34 (Kim et al. 2013; Kim and Jazwinski 2015). Composed of 34 common health and function variables, FI34 increases exponentially with age, indicating declining health and function ability. The rate of increase accelerates approximately 2~3 % annually, and the rate of increase and the pattern of aging displayed by the hierarchical clustering of the component variables differ among offspring of long-lived versus short-lived parents (Kim et al. 2013), which indicates a genetic basis of the frailty index. Indeed, an estimate of the heritability (narrow sense) of FI34 is relatively high (0.39). A survival analysis indicates that FI34 predicts mortality better than does chronological age (Kim et al. 2013). This is a critical determination. Frailty and deficit indices, such as FI34, increase with chronological age. Thus, they could simply be a surrogate for the passage of calendar time. The fact that FI34 predicts mortality/survival better than does the simple passage of time indicates that it is more than a naive chronometer. Rather, it is a metric of biological age, a complex and intrinsic feature of an organism.

Physiologic factors associated with FI34 have been identified. Elevated levels of resting metabolic rate are linked to higher FI34 scores in nonagenarians, indicating an increased energy demand for basic body functioning with declining health (Kim et al. 2014). This association of energy metabolism with healthy aging of the oldest old has various underlying factors that operate in a gender-specific manner (Kim et al. 2016a, b). In female nonagenarians, fat mass and fat-free mass are important contributors to healthy aging; in male nonagenarians, however, tissue quality rather than body composition is important. In addition, genetic factors that are associated with FI34 have been identified, which include UCP2 and UCP3, in females, and XRCC6, and LASS1, in males. Also, non-coding genomic regions at 12q13-14 that appear to have regulatory function are related to healthy aging (Kim et al. 2015). Thus, FI34 has been useful in identifying both genetic and phenotypic factors related to healthy aging.

A different type of age measure was proposed based on leukocyte DNA methylation. DNA methylation levels at many CpG sites in the genome are correlated with chronological age (Fraga et al. 2005; Rakyan et al. 2010; Bocklandt et al. 2011; Koch and Wagner 2011; Lin et al. 2016), and subsets of such CpG sites have been used in epigenetic models of aging. Hannum et al. obtained a “predicted age” from DNA methylation levels at 71 CpG sites and calculated an “apparent methylomic aging rate (AMAR)” for each individual by dividing the predicted age by the chronological age (Hannum et al. 2013). AMAR greater than 1 was interpreted to mean “fast aging,” whereas AMAR less than 1 to mean “slow aging.” Similarly, Horvath selected 353 CpG sites in which DNA methylation levels are highly correlated with chronological age (Horvath 2013). He transformed subjects’ chronological ages (1 and 2 below), used a linear regression to describe the relationship between the transformed age and the DNA methylation levels of the 353 CpG sites that are highly correlated with chronological age “weighted” by their regression coefficients to maximize the overall relationship (3), and then took the “inverse” of this linear regression to calculate the DNAmAge of each subject (4):

$$ F\left(\mathrm{age}\right)= \log \left(\mathrm{age}+1\right)- \log \left(\mathrm{adult}\kern0.30em \mathrm{age}+1\right),\kern0.30em \mathrm{if}\kern0.30em \mathrm{age}\le \mathrm{adult}\kern0.30em \mathrm{age}\kern0.30em \left(=20\right) $$
(1)
$$ F\left(\mathrm{age}\right)=\left(\mathrm{age}-\mathrm{adult}\kern0.30em \mathrm{ag}\mathrm{e}\right)/\left(\mathrm{adult}\kern0.30em \mathrm{ag}\mathrm{e}+1\right),\kern0.30em \mathrm{i}\mathrm{f}\kern0.30em \mathrm{a}\mathrm{g}\mathrm{e}>\mathrm{adult}\kern0.30em \mathrm{ag}\mathrm{e}\kern0.30em \left(=20\right) $$
(2)
$$ F\left(\mathrm{age}\right)={b}_0+{b}_1{CpG}_1+{b}_2{CpG}_2\dots {b}_{353}{CpG}_{353}+\mathrm{error} $$
(3)
$$ \mathrm{DNAmAge}=\mathrm{the}\kern0.30em ``\mathrm{inverse}"\kern0.30em \mathrm{of}\kern0.30em \mathrm{the}\kern0.30em \mathrm{function}\kern0.30em \mathrm{F}\kern0.30em \mathrm{in}\kern0.30em (3) $$
(4)

Not surprisingly based on its dependence on chronological age, the DNAmAge is an excellent correlate of chronological age. Indeed, it is almost as accurate as the age on drivers’ licenses. Subsequently, the potential of DNAmAge-derived measures as indicators of biological age have been explored (Horvath et al. 2015; Marioni et al. 2015; Breitling et al. 2016; Chen et al. 2016; Christiansen et al. 2016; Perna et al. 2016). One of them, AgeAccelerationDiff (AgeDiff hereafter), is the difference between DNAmAge and chronological age. AgeDiff was associated with mortality (Marioni et al. 2015; Christiansen et al. 2016; Perna et al. 2016). It was also associated with a frailty index similar to our FI34 but having a much greater emphasis on activities of daily living and a substantially lesser focus on cognitive function (Breitling et al. 2016). Likewise, the residual of the linear regression of DNAmAge on chronological age, AgeAccelerationResidual (AgeResid hereafter), was associated with longevity and mortality (Horvath et al. 2015; Chen et al. 2016). These findings indicate that these DNAmAge-derived measures may represent biological age.

In this article, we compared DNAmAge measures with FI34 side by side and assessed the effectiveness of each of these measures in predicting mortality. Our results indicate that FI34 uniformly outperforms the DNAmAge measures.

Materials and methods

Subjects and health data

Subjects in this study are 262 unrelated individuals randomly selected from the Louisiana Healthy Aging Study cohort (Table 1). Age was based on documentary evidence supported by demographic questionnaires. Only Caucasians, inferred genetically, were included in data analyses to avoid population confounding (Jazwinski et al. 2010). Details of data collection and calculation and characterization of FI34 were described elsewhere (Kim et al. 2013). All participants provided informed consent according to the protocol approved by the respective Institutional Review Boards.

Table 1 Summary statistics of the study sample (N = 262; 206 deceased)

DNA methylation data and analysis

Genomic DNA was isolated from blood specimens, and 500 ng of each genomic DNA sample was treated with bisulfite using the EZ-96 DNA methylation Kit (Zymo Research). DNA methylation data were obtained using the Infinium HumanMethylation450K BeadChip Kit (Illumina) at the University of Utah Genomics Core Facility. Data preprocessing and quality control were done using the R package RnBeads (Assenov et al. 2014). DNA methylation probes with the detection P value greater 0.05 were excluded from beta value calculation. DNA methylation age (“DNAmAge”) of each subject was obtained using the Online Age Calculator (https://dnamage.genetics.ucla.edu/) (Horvath 2013), with the recommended default setting. The DNAmAge was calculated directly without retraining the model on the DNA methylation data from our study population, as the online calculator was developed using a large collection of genome-wide methylation datasets. The output from the online calculator also contains cell count estimates of various leukocyte types, which were used as additional covariates in Cox proportional hazard regression analysis. All the statistical analyses were performed using R (R_Core_Team 2016).

Results

Characteristics of the study cohort

Our study sample consists of 262 Caucasians whose ages range from 60 to 103 (Table 1). Of these, 206 (79 %) were deceased at the time of follow-up just prior to this analysis, and the average elapsed time-to-death from entry into the study was 4.4 years. Females and males did not differ in mean chronological ages and mean DNAmAges, but the two gender groups significantly differed in FI34, AgeDiff, and AgeResid. FI34 was higher in females (P = 5.2e-5), as described previously (Kim and Jazwinski 2015). On the other hand, AgeDiff and AgeResid were higher in males (P = 0.0025 and 0.0086, respectively), which concurs with the higher AMAR in males (Hannum et al. 2013).

DNAmAge and FI34 are correlated with age

DNAmAge is derived from transformed chronological age in a statistical model in which the chronological age is regressed on the methylation status of 353 CpG sites throughout the genome (Horvath 2013). Thus, DNAmAge is expected to be highly correlated with chronological age. For example, the average correlation coefficients were 0.97 and 0.96 for all training and test data sets, respectively (Horvath 2013). Likewise, in our sample, DNAmAge was significantly correlated with chronological age (r = 0.63, P < 2.0e-16; Fig. 1a). FI34 was also correlated linearly with chronological age as health and function deficits tend to increase with age (r = 0.31, P = 4.4e-7; Fig. 1b), although an exponential model describes the relationship better (Kim et al. 2013; Kim and Jazwinski 2015). AgeDiff was also correlated with chronological age (r = −0.20, P = 9.3e-4; Fig. 1c), while AgeResid was not (Fig. 1d).

Fig. 1
figure 1

Scatter plots of chronological age (Age) by DNAmAge (a), FI34 (b), AgeDiff (c), and AgeResid (d). Each regression line is from the corresponding standardized simple linear regression, whose β value equals the correlation coefficient r

As both FI34 and DNAmAge were correlated with chronological age, FI34 and DNAmAge were correlated with each other (r = 0.20, P = 0.0013; Fig. 2a). However, the correlation disappeared when adjusted for chronological age (P > 0.5 by partial correlation), indicating that without the age dependence, DNAmAge, and FI34 are unrelated. AgeDiff and AgeResid were not correlated with FI34 (Fig. 2b, c).

Fig. 2
figure 2

Scatter plots of FI34 by DNAmAge (a), AgeDiff (b), and AgeResid (c). Each regression line is from the corresponding standardized simple linear regression, whose β value equals the correlation coefficient r

DNAmAge is not a significant predictor of survival when adjusted for age

The performance of a measure of biological age is best assessed by its ability to predict mortality, which is the ultimate consequence of aging. For this purpose, most of the studies of DNAmAge measures used Cox proportional hazard regression. We applied the same statistical method to our survival data (Table 2). As expected, chronological age was a significant predictor of survival: The hazard of death was estimated to increase 13 % annually (P < 2.0e-16; model 1). DNAmAge increased the hazard of death by 5 % for a unit increase in DNAmAge (P = 3.3e-16; model 2). Likewise, FI34 was estimated to increase the mortality hazard by 5 % for an increase of FI34 by 0.01 (P = 4.8e-10; model 3). When present together as explanatory variables, DNAmAge and FI34 remained significant without affecting each other much (P = 2.6e-13 and 8.2e-7, respectively; model 4). In the presence of chronological age as an additional covariate, however, DNAmAge was no longer a significant mortality predictor (P = 0.63 and 0.61 in models 5 and 7, respectively), while FI34 still remained significant (P = 0.0054 and 0.0053 in models 6 and 7, respectively). These results confirm that DNAmAge and chronological age largely overlap with each other, but FI34 is a separate measure distinct from the two.

Table 2 Cox regression for time-to-death as a function of age, DNAmAge, or FI34 (N = 262)

Different types of leukocytes exist in blood, and their proportions may vary, depending on individuals’ health conditions at the time of blood collection. This leukocyte heterogeneity may confound estimation of intrinsic DNA methylation levels (Reinius et al. 2012). Methods to estimate leukocyte type counts were developed and have been often used to adjust DNA methylation measurements (Houseman et al. 2012, 2014; Accomando et al. 2014). Therefore, leukocyte type estimates were obtained using Horvath’s Online Age Calculator and included as additional covariates in our Cox regression analysis. The outcome of the leukocyte composition adjustment (model 8) was very similar to the outcome without the adjustment (model 7). The only noticeable difference is a slight decrease in the Z value of FI34 from 2.8 to 2.4, resulting in a higher P value (P from 0.0053 to 0.016). This small change is likely due to the redundancy of FI34 and leukocyte composition in reflecting health conditions.

The Cox regression analysis was repeated with AgeDiff (Table 3). A unit increase in AgeDiff was estimated to reduce the hazard by 2 % (P < 0.020; model 1). When present together, AgeDiff and FI34 remained significant without affecting each other much (P = 0.028 and 6.1e-10, respectively; model 2). Along with chronological age, however, AgeDiff was not significant at all, whereas FI34 maintained its significance (models 3–5). The Cox regression analysis was also repeated with AgeResid (Table 4), but AgeResid was not significant under any of the covariate combinations. As before, FI34 remained significant in all the models considered in Table 4.

Table 3 Cox regression for time-to-death as a function of AgeDiff or FI34 (N = 262)
Table 4 Cox regression for time-to-death as a function of AgeResid or FI34 (N = 262)

FI34 outperforms chronological age and DNAmAge-related measures in predicting mortality of nonagenarians

In all the Cox regression models presented so far with the whole study cohort, which includes subjects of ages from 60 to 103, chronological age was the best predictor of mortality (Fig. 3a). When the Cox regression was limited to nonagenarians only, however, FI34 was a better predictor of mortality than chronological age (P = 0.035 vs. P = 0.054, respectively; Fig. 3b). This indicates that FI34 is a better measure of biological age at later years when accumulation of health deficits accelerates differentially among the oldest old.

Fig. 3
figure 3

Bar plots of effect sizes (Z scores) from Cox proportional hazards regressions. a Z scores in model 7 of Table 2, model 4 of Table 3, and model 4 of Table 4 were plotted, with * representing 0.01 < P =< 0.05, ** 0.001 < P =< 0.01, and P =< 0.001. b Z scores from the same Cox regression models applied to nonagenarians only (N = 161)

Discussion

Since Horvath’s calculation of DNAmAge based on genomic DNA methylation levels (beta values) and chronological age (Horvath 2013), two DNAmAge-derived measures have been used: AgeDiff and AgeResid. AgeDiff has been associated with mortality (Marioni et al. 2015; Christiansen et al. 2016; Perna et al. 2016) and a frailty index (Breitling et al. 2016), after adjustment for various sets of covariates, including the leukocyte type composition. More recently, Chen et al. expanded the original observation of Horvath using more than 13,000 individuals from 13 different cohorts, including three racial/ethnic groups, and found AgeResid to be a significant predictor of mortality (Horvath 2013; Chen et al. 2016). They also found that incorporation of leukocyte composition information greatly enhanced the significance of AgeResid in predicting mortality.

A true measure of biological age should predict mortality/survival with high accuracy. Our study showed that FI34 is a significant predictor of mortality, whereas DNAmAge, AgeDiff, and AgeResid are not, regardless of adjustment for leukocyte type composition. Our study sample consists of 262 Caucasians whose ages range from 60 to 103, and by comparing FI34 and each of the DNAmAge measures side by side, we clearly showed that FI34 is a far better predictor of mortality than the DNAmAge measures. DNAmAge was significant when the Cox regression was unadjusted for chronological age, but DNAmAge became nonsignificant when the regression was adjusted for this variable. This confirms a high degree of redundancy between DNAmAge and chronological age. The same is true with AgeDiff, the difference between DNAmAge and chronological age. Because DNAmAge and age are highly correlated with each other, AgeDiff is also correlated with age, though the correlation is not as strong as that of DNAmAge with chronological age. The negative correlation of AgeDiff with chronological age (Fig. 1c) indicates that the difference between DNAmAge and chronological age decreases as chronological age increases, which is due to decreasing DNAmAge with increasing chronological age. A similar observation was made in a longitudinal twin study (Christiansen et al. 2016). The cause of this leveling of DNAmAge relative to chronological age in later years of life remains to be determined. On the other hand, AgeResid, the residual of the linear regression of DNAmAge on chronological age, is not correlated with chronological age or DNAmAge. This is not surprising because residuals from linear regressions are not correlated with either of the variables used to calculate them. However, AgeResid was not a significant predictor of mortality in all the models we examined in our study. A larger sample size would likely give better linear regression fitting, which would yield better residual estimates. This could be one reason why our study could not detect a significance association of AgeResid with survival.

Our study used a cohort consisting of 262 subjects of European origin, whereas all the studies of DNAmAge used meta-analyses of the results from multiple cohorts involving many more subjects. A meta-analysis of combinable studies, if based on accurate statistical procedures without any bias, should give a higher statistical power than individual studies. Thus, it is possible that the meta-analysis studies involving large numbers of subjects were able to detect the significance of the various DNAmAge measures that our study was unable to. However, this consideration points to the fact that FI34 is a much more robust predictor of survival and measure of biological age than any of the DNAmAge measures proposed thus far. This is because it assesses biological factors that have large effects on survival, whereas the DNAmAge measures only detect small statistical differences which require very large samples. It is worth noting that this usually involves the use of leukocyte type estimates, which perhaps introduce some biological meaning that is sufficient to make them perform. FI34 is a particularly strong measure of biological age because it is a better predictor of survival than is chronological age (Fig. 3) in the oldest old.