Background

Glomerular filtration rate (GFR) measures an important aspect of kidney function. Measurement of GFR requires infusion of inulin, iothalamate, iohexol, 51Cr-EDTA, gadolinium-DTPA or gadolinium-DOTA [1]. The adjusted GFR (aGFR) is the measured GFR divided by body surface area (derived from height and weight) and indexed to 1.73m2.

Many organizations recommend use of estimated aGFR (eGFR) using the Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI) equation [2], which is based on sex, race, age and serum creatinine (SCR), in clinical practice [3,4,5]. Since kidney disease caused by T2DM is the most frequent cause of kidney disease in the United States [6, 7], clinicians often estimate aGFR using the CKD-EPI equation to make healthcare decisions in people with diabetes.

The CKD-EPI equation generally tends to underestimate aGFR, and the higher the aGFR the more severe the extent of underestimation [8]; this underestimation is often attributed to differences between the populations in which the CKD-EPI equation was derived and those in which it has been subsequently applied. Due to this inaccuracy at high aGFRs, when the eGFR is equal to or greater than 60 ml/min/1.73m2, clinical labs often report results as “≥60 ml/min/1.73m2”. An additional potential limitation of the CKD-EPI equation for estimating of aGFR in individuals with diabetes is that it does not consider measures of glucose which can drive aGFR up [9,10,11,12]. This inaccuracy in identifying hyperfiltration is a potential limitation of the CKD-EPI equation [13]. Although there is little direct evidence that restraint of hyperfiltration decreases incidence of diabetic kidney disease [14] some studies are suggestive that this is the case [15,16,17,18,19,20,21,22]. Thus, individuals with hyperfiltration might be candidates for renoprotective therapy.

The purpose of the present study was to determine the extent to which glycemic variables, along with other standard clinical variables, account for the difference between the CKD-EPI eGFR and measured aGFR in diabetic Pima people, a Native American population with a high prevalence of type 2 diabetes [23, 24]. The other purpose of the study was to develop equations that more closely model aGFR to identify hyperfiltration in individuals in this cohort.

Methods

Study subjects and design

From 1965 to 2007, the Pima from Arizona participated in a longitudinal study of diabetes. All individuals age ≥ 5 years were invited to have a biennial health examination, which included a 75 g oral glucose tolerance test for diagnosis of diabetes. In addition, adults (ages 18–65) with type 2 diabetes from this population were invited to participate in studies that included repeated measurements of GFR in the context of several studies of the natural history of diabetic kidney disease and one that tested the renoprotective efficacy of losartan in early diabetic nephropathy (ClinicalTrials.gov number, NCT00340678) [25,26,27,28]. All GFR studies were performed between 1988 and 2014.

Clinical and anthropometric measures

Fasting plasma glucose was measured on several autoanalyzers over the years 1988–2014 using a hexokinase method (Boerhinger Mannheim/HK). HbA1c was measured by high performance liquid chromatography (HPLC). Serum specimens for measurement of creatinine concentration were stored at − 80 °C until the assay which was performed within 30 days of sample collection. SCR was measured by a modified Jaffé reaction until 2011 [29], and by an enzymatic method thereafter. Samples were calibrated initially to the laboratory at the Cleveland Clinic and later became traceable to an isotope-dilution mass spectrometry measurement procedure. Glomerular filtration rate (GFR) was measured in the morning after an overnight fast by infusion of iothalamate. An HPLC system was used to measure iothalamate concentrations. Serum and urinary measurements were made in four different collection periods and an average of these was taken to get the GFR [30]. Average urinary clearance of iothalamate was equated with the GFR. Height was measured at the first examination for each protocol and weight was measured at each measurement of GFR. aGFR was calculated as measured GFR divided by BSA, calculated using the Du Bois equation [31].

eGFR (in ml/min/1.73m2) was calculated according to the CKD-EPI formulae for non-black race with SCR measured in mg/dl [2]:

$$ {\displaystyle \begin{array}{ll}\mathrm{eGFR}=144\times {\left(\mathrm{SCR}/0.7\right)}^{\hbox{-} 0.329}\times {(0.993)}^{\mathrm{age}}& \mathrm{if}\ \mathrm{female}\ \mathrm{and}\ \mathrm{SCR}\le 0.7\ \mathrm{mg}/\mathrm{dl}\\ {}\mathrm{eGFR}=144\times {\left(\mathrm{SCR}/0.7\right)}^{\hbox{-} 1.209}\times {(0.993)}^{\mathrm{age}}& \mathrm{if}\ \mathrm{female}\ \mathrm{and}\ \mathrm{SCR}>0.7\ \mathrm{mg}/\mathrm{dl}\\ {}\mathrm{eGFR}=141\times {\left(\mathrm{SCR}/0.9\right)}^{\hbox{-} 0.411}\times {(0.993)}^{\mathrm{age}}& \mathrm{if}\ \mathrm{male}\ \mathrm{and}\ \mathrm{SCR}\le 0.9\ \mathrm{mg}/\mathrm{dl}\\ {}\mathrm{eGFR}=141\times {\left(\mathrm{SCR}/0.9\right)}^{\hbox{-} 1.209}\times {(0.993)}^{\mathrm{age}}& \mathrm{if}\ \mathrm{male}\ \mathrm{and}\ \mathrm{SCR}>0.9\ \mathrm{mg}/\mathrm{dl}\end{array}} $$

For comparison we also calculated eGFR according to the Modification of Diet in Renal Disease (MDRD) formulae [32]:

$$ {\displaystyle \begin{array}{ll}\mathrm{eGFR}=175\times {\left(\mathrm{SCR}\right)}^{\hbox{-} 1.154}\times {\left(\mathrm{age}\right)}^{\hbox{-} 0.203}\times 0.742& \mathrm{if}\ \mathrm{female}\\ {}\mathrm{eGFR}=175\times {\left(\mathrm{SCR}\right)}^{\hbox{-} 1.154}\times {\left(\mathrm{age}\right)}^{\hbox{-} 0.203}& \mathrm{if}\ \mathrm{male}\end{array}} $$

Statistical analysis

Since one of the goals was to characterize hyperfiltation, which by definition does not occur in those with established chronic kidney disease, examinations where eGFR was < 60 ml/min/1.73m2 were excluded from analysis. Because of the longitudinal nature of these studies, there were often multiple aGFR measurements made in the same participant.

Multiple linear regression models were used to assess associations between independent variables (age, sex, SCR, BSA, diabetes duration, FPG and HbA1c) and the dependent variable, which was aGFR, the inverse of SCR, or the difference between aGFR and CKD-EPI eGFR (defined as aGFR - CKD-EPI eGFR). In these analyses to determine variables associated with aGFR, men and women were analyzed together, and sex was one of the independent variables. The purpose of regressions modelling the inverse of SCR was to find which of the independent variables (excluding the SCR) were most closely associated with creatinine as a clinical measure of GFR. Models were fit using a linear mixed model procedure to account for dependence among multiple measurements in the same individual, with the assumption of an autoregressive correlation structure. Akaike’s Information Criterion (AIC) was used as a measure of goodness of fit for each model; all combinations of variables were tested. The best model has the lowest AIC. These analyses were used to select variables for inclusion in the approximating equations for predicting aGFR. HbA1c and FPG, although highly correlated (r = 0.71 in the present data), may have different effects on GFR, thus both were available for selection in the model.

Using the selected variables, an equation for approximating aGFR was developed. First these same data were divided into 8 groups defined by sex and quartiles of SCR. A separate equation was derived for each of the 8 groups; this accounts for nonlinearity over the range of SCR values. (For clinical utility, we did not include diabetes duration in these analyses because it is frequently unknown in patients with type 2 diabetes.) To reduce “overfitting” caused by including a participant’s own data in developing the prediction equation, a 10-fold cross-validation procedure was used. The set of individuals was split into 10 groups of approximately equal size. The models were fit for prediction of aGFR in 90% of the data leaving out one group at a time. The predicted aGFR in each of the 10% of individuals “left out” was calculated from the equation derived in the other 90% of individuals.

To assess fit of the prediction equation, we calculated the correlation coefficient between this predicted GFR and the aGFR. Root mean square error and r2 (the proportion of variance in aGFR explained by the model) were also calculated. Differences in predictive properties of different equations were assessed by comparison of the correlation coefficients of each estimate with aGFR; the method of Kleinbaum was used to account for the correlation among the estimates [33]. For purposes of presentation, we show the predicted values for each individual as those derived from the cross-validation procedure, while regression coefficients for the final equations are shown as the average across all 10 “training” sets. The P30 is defined as the probability that an estimated GFR that is within 30% of the aGFR [10].

We also calculated the area under the receiver operating characteristic (ROC) curve for detecting hyperfiltration (defined as aGFR> 120 ml/min/m2). This area represents the probability that an equation can distinguish correctly between an individual with hyperfiltration and one without [34]. The difference between the area under ROC curve for the new equation and that for the CKD-EPI equation was tested by the method of DeLong et al. [35]. All statistical analyses were performed with SAS version 9.4 (SAS Institute, Cary, NC).

Results

Glomerular filtration rate was measured at 2798 examinations in 269 individuals where eGFR was greater or equal to 60 ml/min/1.73m2. For participants who had aGFR measured more than once, mean time between adjacent aGFR studies was 3.3 ± 1.9 years. Clinical and biological characteristics of participants are summarized in Table 1 and the frequency distribution of measured aGFR is shown in Fig. 1. Figure 2a shows a plot of aGFR versus CKD-EPI eGFR.

Table 1 Characteristics of the 269 subjects who participated in 2798 GFR studies over the years 1988–2014
Fig. 1
figure 1

Frequency distribution of measured, aGFRs in 2798 measurements of 269 people in the study cohort over the years 1988–2014

Fig. 2
figure 2

Measured aGFR [(mGFR/BSA) per 1.73m2] on the x-axis and the (a) CKD-EPI estimated and (b) approximated GFR on the y-axis. Black diamonds represent estimations by the CKD-EPI equation and open squares represent approximating equations by new equation. There is better agreement between the measured aGFR and the approximated GFR than there is using the estimating equation CKD-EPI

The best multivariate regression for predicting aGFR, as shown by the smallest AIC, was one that included all variables except BSA; the regression equation is shown in Table 2A. The established variables, age, sex and SCR explained 44.8% of the variance in the logarithm of aGFR, while HbA1c and FPG explained an additional 2.7%. We performed similar analysis to find the best predictors of the inverse of SCR. The inverse of SCR was positively associated with HbA1c as expected, but slightly inversely associated with fasting glucose, as shown in Table 2B. While FPG and HbA1c explained 5.4% of variation in aGFR after adjustment for age and sex, they only explained 1.7% of the reciprocal of SCR. Finally, we used multivariate regression to find which variables best accounted for the difference between CKD-EPI eGFR and aGFR (aGFR−eGFR) (Table 2C). The best model was one that included all the variables except BSA. Although there was a tendency for eGFR from the CKD-EPI equation to underestimate aGFR, high fasting plasma glucose and HbA1c were associated with a larger difference.

Table 2 Regression models for aGFR, 1/SCR and aGFR-eGFR

These same variables were used to develop a prediction equation for aGFR. The final prediction equations are shown in Table 3. The relationship between aGFR and its predicted value using the new approximating equations is shown in Fig. 2b. The correlation between aGFR estimated by the best approximating equations and measured aGFR (iothalamate GFR / BSA) was 0.648 and r2 = 0.420. The correlation between the aGFR estimated by age and serum creatinine (without glucose variables), on the other hand, was 0.619 and r2 = 0.383. In comparison, the CKD-EPI equation had a correlation with the aGFR of 0.620 and r2 of 0.384; the correlation between aGFR and the new approximating equations was significantly higher than that between aGFR and the CKD-EPI equation (p = 1.2 × 10− 6 for significance of difference). The correlation between aGFR and an equation derived by 10-fold cross-validation excluding FPG and HbA1c was not significantly different from that between the CKD-EPI equation and aGFR (p = 0.63). The MDRD equation had a correlation of 0.554 (r2 = 0.307) with aGFR. In women, the correlation between aGFR and the new approximating equations was 0.67 (r2 = 0.44), while that between aGFR and the CKD-EPI equation was 0.63 (r2 = 0.39, p = 8.4 × 10− 9 for difference). In men, the corresponding numbers were 0.60 (r2 = 0.36) and 0.61 (r2 = 0.37, p = 0.52 for difference).

Table 3 Intercepts and coefficients of variables in the final approximating equations

Plots of aGFR and the difference between the aGFR and the estimates of the aGFR by each equation show that the CKD-EPI equation overestimates aGFR when it is low and underestimates it when it is high. The mean difference between aGFR and eGFR estimated by the CKD-EPI equation is 25.7 ml/min/1.73m2. However, the new approximating equation distributes error over a wider range of means, aGFRs (Fig. 3). The mean difference between aGFR and eGFR by the new approximating equations is 3.3 ml/min/1.73m2 (which still underestimates the aGFR on average, but to a much smaller degree). The P30 of the new approximating equation was 82%. By comparison, the CKD-EPI calculated eGFR was within 30% of the aGFR among 75% of measurements.

Fig. 3
figure 3

Plots of the (a) difference between the CKD-EPI equation and measured aGFR on the y-axis (b) difference between the new equation and measured aGFR on the y-axis, both analyzed against the measured aGFR on the x-axis. The CKD-EPI systematically underestimates the aGFR especially at higher values. The new equation distributes error over a wider range of means but also underestimates the aGFR slightly. The mean aGFR was 130.45 ± 40.47 ml/min/1.73m2. The mean GFR using the new estimating equations was 126.25 ml/min/1.73m2 whereas the mean aGFR using the CKDEPI equation was 104.73 ± 17.75 ml/min/m2

When the new approximating equations were tested against the CKD-EPI equation to find which more accurately estimates hyperfiltration defined as aGFR ≥120 ml/min/1.73m2, the area under the ROC curve for the new approximating equation was 0.836 versus 0.825 for the CKD-EPI equation (Fig. 4). This difference was highly statistically significant (p = 0.000001). For example, the threshold for detecting hyperfiltration with 80% specificity is ≥109 ml/min/1.73m2 by the CKD-EPI equation, and this has sensitivity of 64%. By contrast, the threshold for 80% specificity with the new approximating equations is ≥127 ml/min/1.73m2 with a sensitivity of 72%. This represents a statistically significant, but modest, improvement in prediction of hyperfiltration.

Fig. 4
figure 4

Receiver-operator curve (ROC) showing the false positive rate (1-specificity) on the x-axis and the sensitivity on the y-axis of the new approximating equations versus the CKD-EPI equation

Discussion

In Pimas with T2DM, glycemic variables including FPG, and HbA1c were modestly associated with aGFR, measured by urinary clearance of iothalamate. Thus, the glycemic variables provide additional information about aGFR, along with the usual variables of age, sex and SCR, and these associations can account, in part, for the differences between aGFR and eGFR. The new approximating equations modeled aGFR better than did the CKD-EPI equation, in that they explained significantly more of the variance. Although the additional contribution of glycemia to variance explained is relatively small, it may have clinical importance. However, such clinical relevance has not been demonstrated.

In general, the CKD-EPI eGFR systematically underestimates aGFR and this underestimation is greater in those with hyperglycemia [8,9,10,11]. Our analysis was conducted among individuals with eGFR ≥60/ml/min/1.73m2 by the CDK-EPI equation, since the CDK-EPI performs well in clinical practice when it is < 60 ml/min/1.73m2. This could have biased our results by truncating the range of the CKD-EPI values, but in a sensitivity analysis, elimination of those with aGFR < 60 ml/min/1.73m2 gave almost the same results.

The utility of a prediction equation is reflected in accuracy and precision. An equation will generally fit better in the data from which it was derived than in a set of independent individuals. To minimize this “overfitting” bias, we used 10-fold cross-validation. With this approach, there was improvement in the correlation (which largely reflects precision) with the new approximating equations over the CKD-EPI equation, but the correlation of a Pima-specific equation based on the same variables used in CKD-EPI was not significantly different from the CKD-EPI equation. This suggests that the increased precision of the new approximating equations over the CKD-EPI equation is largely due to the use of glycemic information, and not simply due to fitting the model to our own data (and that the 10-fold cross-validation procedure has largely removed any overfitting bias). The correlation of the new equations with the glycemic variables is, of course, strongly significantly higher than the Pima equation without these variables (p = 1.9 × 10− 10). Inaccuracy of the CDK-EPI equation for predicting aGFR has often been attributed to differences between the populations in which it was derived and those in which it has subsequently been applied [36, 37]. The systematic underestimation of aGFR in the present study is largely due to the population difference. The difference between the aGFR and that by CKD-EPI (which largely reflects accuracy) is 25.7 ml/min/1.73m2 higher. This difference is mostly removed when using either of the Pima equations derived from the 10-fold cross-validation procedure. The small difference between the actual and predicted aGFR that we observe with the new equations likely reflects a modest residual nonlinearity in the relationship.

In our analyses SCR was not correlated with glycemic variables to the same extent as was aGFR, so when creatinine-based estimates of GFR are employed in diabetic individuals, information about glycemia-induced hyperfiltration is not captured. Our observation that the difference between eGFR and aGFR is associated with the degree of hyperglycemia is consistent with studies that show that the extent to which eGFR underestimates aGFR depends on the degree of glycemia [38]. Since glycemic variables show more correlation with aGFR than with SCR, it underscores the importance of the non-renal variables used to calculate eGFR. It is possible that the body composition of diabetic people, for example, modifies the relationship between SCR and eGFR, potentially explaining the different directions in correlations of SCR with HbA1c and FPG. However, this study does not include variables such as body mass index, body composition and lipid levels; these might have added to the predictive model. No equation, including the CKD-EPI eGFR, has been validated in a cohort of diabetic subjects with eGFR ≥60 ml/min/m2 [38, 39].

We propose that the new approximating equations, which take into account measures of glycemia that the CKD-EPI does not, may be useful when the diabetic patient is NOT approaching end-stage renal disease but might actually be hyperfiltering, i.e. early in diabetes. Using the current reporting system, hyperfiltration might never be uncovered. Using the new approximation of aGFR, the actual value would be reported (rather than “eGFR ≥60 ml/min/1.73m2”) so that people with diabetes who are hyperfiltering could be identified and medical decisions made appropriately. One such decision might regard treatment. Newer SGLT2 inhibitors can lower HbA1c and also offer renal protection [40, 41]. Although the Food and Drug Administration does not yet recommend SGLT2 inhibitors for a renal indication, it has been proposed that they may exert renoprotective properties due to the reduction in hyperfiltration [42]. It might be useful to recruit individuals with hyperfiltration into clinical trials to test this hypothesis. If this proves to be the case, it might also be sensible to prescribe SGLT2 inhibitors to those who are hyperfiltering, as one might expect them to have the greatest benefit. The new approximating equations developed here may be useful for identifying such individuals.

Limitations of the present study include the use of the CKD-EPI equation to identify those with low eGFR in order to remove them from the cohort of people with higher aGFRs. Ideally one equation should suffice in the identification of both individuals with low eGFR and those who are hyperfiltering. With the discontinuity in the equation developed here, its optimal use may be in identifying those with hyperfiltration rather than clinical tracking. This equation is actually eight equations, making use complicated though less so with the help of a computer or smart phone. Additionally, many of the assays of creatinine were done before standardization. The effects of drugs like inhibitors of the renin-angiotensin-aldosterone system that can lower the aGFR were not accounted for in this study.

The plots of predicted versus observed aGFR show that the new equation is more accurate than the CKD-EPI equation, but the improvement in terms of variance explained is modest. It is probable that aGFR could be better modeled if there were an endogenous marker of renal function better than creatinine, which has several problems. Not only does it reflect muscle bulk and therefore varies by sex and age, it also shows the effects of hyperglycemia imperfectly, as shown in the present study.

The strengths of this study include the use of a wide range of aGFRs measured by the urinary clearance of iothalamate, division by BSA, and standardization to 1.73m2. Another strength of the study is that hyperfiltration was defined at aGFR greater than or equal to 120 ml/min/m2 which captures older adults with type 2 diabetes. Since diabetes is the most common cause of kidney failure in the US and, indeed, worldwide, the new equations approximating aGFR represent a potential advance. However, they were developed in a single ethnic group, and applicability to other populations of people with T2DM requires further study and validation.

Conclusions

In conclusion, aGFRs in Pima people with type 2 diabetes are increased by high FPG and high HbA1c. This information about glycemia-induced hyperfiltration is not reflected in measurements of serum creatinine. These glycemic variables are significant, but of modest effect, in explaining the difference between aGFR and CKD-EPI eGFR. Thus, measures of glycemia are important in approximating aGFR, and possibly for making clinical decisions regarding hyperfiltration.