Introduction

The etiology of chronic kidney disease (CKD) varies considerably across the world. In high-income and middle-income countries, diabetes and hypertension are the primary drivers of CKD. The global prevalence of diabetes mellitus and hypertension is witnessing a rapid upsurge due to an aging population [1, 2]. Consequently, it is imperative to consistently monitor renal function in the elderly demographic.

At present, the glomerular filtration rate (GFR) serves as the principal indicator for evaluating the severity of kidney damage [3]. Ideally, GFR is measured using an exogenous marker (such as inulin, iohexol, iothalamate, Cr-ethylenediaminetetraacetic acid, etc.) [4]. Nonetheless, measurements employing exogenous markers are often cumbersome and expensive. Consequently, estimated GFR (eGFR) based on a patient's serum creatinine (Scr) and serum cystatin C (Cysc) remains the most widely utilized indicator in clinical settings [5]. Factors, such as gender, age, and muscle volume, can influence Scr [6], while CysC, an endogenous protein, boasts relatively stable production and serum levels not impacted by variables other than renal function [7]. However, Cysc measurements are costly and not routinely conducted in many primary hospitals. As a result, several equations have been developed based on one or both endogenous markers.

In 1999, the MDRD equation was developed by the American Modification of Diet in Renal Disease (MDRD) working group using data from 1070 CKD patients in Western countries [8]. This equation was later simplified by considering four variables: gender, age, creatinine, and race, and became widely used for estimating GFR in Western populations [9, 10]. In 2006, the MDRD equation was modified by the Chinese eGFR Investigation Collaboration to better suit the characteristics of Chinese CKD patients, resulting in the MDRDc equation [11]. However, the MDRD equation tends to underestimate higher levels of GFR. To address this issue, the Kidney Disease/Improving Global Outcomes (KDIGO) organization developed the CKD-EPI creatinine equation in 2009 [12]. This equation, based on serum creatinine values and GFR levels from 5504 participants of diverse ethnicities, including Asians, provides more accurate estimations for both high and low GFR levels. In 2012, the CKD-EPI creatinine-cystatin C equation and cystatin C equation were introduced, incorporating cystatin C as an alternative filtration marker for estimating GFR [13]. Additionally, the Berlin Initiative Study (BIS) equations were introduced in the same year, specifically tailored for elderly individuals [14]. Finally, in 2016, the Full Age Spectrum (FAS) equation was developed to estimate GFR across all age groups [15]. To date, no consensus has been reached regarding the efficacy of these equations in evaluating the eGFR of elderly individuals in China. This study, employing the gold standard GFR measured by 99m-Technetium-diethylenetriamine-pentaacetic acid (99m Tc-DTPA) renal dynamic imaging, aims to assess the applicability of the aforementioned equations in the Chinese elderly population.

Materials and methods

Study population and design

In accordance with Chinese legislation, the age threshold for senior citizens starts at 60 years old [16]. In this comprehensive retrospective study, we enrolled a total of 444 elderly Chinese participants (aged 60 years and above) who were hospitalized at the Division of Nephrology, Jiangsu Province Hospital of Chinese Medicine in China. The study period spanned a decade, from January 2010 to December 2020. All subjects met the diagnostic criteria for CKD as specified in the KDIGO guideline [17] and underwent 99m Tc-DTPA renal dynamic imaging for the measurement of GFR. The following inclusion criteria were used for the subjects: (i) Scr and Cysc levels were measured within 1 week before or after 99m Tc-DTPA renal dynamic imaging; (ii) Renal function of all subjects remained stable. The exclusion criteria were as follows: (i) Acute kidney injury; (ii) Severe hypoproteinemia (albumin < 30 g/L); (iii) Edema; (iv) Ascites or pleural effusion; (v) Skeletal muscle atrophy; (vi) Amputation; (vii) Severe heart failure; (viii) Ketoacidosis; (ix) Renal replacement therapy (Fig. 1). Among the 444 participants, 118 patients were not included due to incomplete Scr or cysc data within a week. Eight patients were excluded due to edema, four patients due to hypoproteinemia, five patients due to CKD with acute kidney injury, one patient due to an amputation, eight patients due to heart failure, and two patients due to pleural effusion. Finally, a total of 298 participants' clinical data were included in this study.

Fig. 1
figure 1

Flowchart of the study

Laboratory measurements

The Laboratory department of Jiangsu Province Hospital was responsible for all laboratory measurements. Fasting blood samples were collected from the subjects in the morning. The American Beckman Coulter AU5800 automatic biochemical analyzer was employed to measure Scr using the enzymatic method and CysC using latex-enhanced immunoturbidimetry.

GFR measurements

The gold standard method of renal dynamic imaging was utilized to measure GFR [18]. A single-photon emission computed tomography scanner (SPECT. SKYLIGHT), manufactured by Philips in the Netherlands, was used for the imaging process. The radiotracer employed was 99m Tc-DTPA, which was injected intravenously into the patients, followed by dynamic image acquisition. Post-image processing, the instrument's software calculated the standard GFR, taking into account the patient's weight and height.

Estimation methods of eGFR

In this study, seven equations (CKD-EPI-Scr, CKD-EPI-Cysc, CKD-EPI-Scr-Cysc, MDRD, MDRDc, FAS, BIS1) were employed to estimate eGFR based on the patients' age, blood Scr, and CysC levels [8,9,10,11,12,13,14,15]. The specific equations used are presented in Table 1.

Table 1 The expression of seven equations basd on Scr or Cysc in the study

Statistical analysis

Data with normal distribution are expressed as mean ± standard deviation (\(\overline{x }\)± s), while non-normally distributed data are presented as median (interquartile range [IQR]). IQR and median of difference were employed as assessment indices. The percentage of estimates within 30% of the measured value (P30) was defined as the proportion of eGFR values deviating within 30% of the measured GFR (mGFR), and is often used to evaluate the accuracy of the eGFR estimation equation [19, 20]. Root-mean-square error (RMSE) was also utilized to aid the analysis process. The Bland–Altman plot analysis was employed to describe the mean difference and precision between mGFR and eGFR. Non-parametric rank-sum tests and chi-square tests were used for comparing baseline, bias, and accuracy. A p-value < 0.05 was considered statistically significant. All calculations and statistical analyses were conducted using IBM SPSS Software (version 26.0, IBM Corp) and GraphPad Prism (version 9.2, Dotmatics).

Results

Participant characteristics

A total of 298 subjects aged ≥ 60 years were recruited for the study, comprising 44 subjects with mGFR < 30 ml/min/1.73 m2, 163 subjects with mGFR 30–59 ml/min/1.73 m2, and 91 subjects with mGFR ≥ 60 ml/min/1.73 m2. Of the participants, 43.96% were female. The age, Scr, and CysC were 71 (65, 77) years, 116.00 (91.53, 168.83) μmol/L, and 1.48 (1.14, 2.01) mg/L, respectively. The mGFR was 48.77 (36.50, 62.45) ml/min/1.73 m2. Detailed characteristics and eGFR values estimated by different equations are provided in Table 2.

Table 2 General characteristics of 298 Chinese subjects

Comparisons of the applicability of the examined equations across all samples

The results of these analyses revealed that, except for the MDRD and MDRDc equations, the other five equations were more likely to underestimate mGFR (bias < 0) (Table 3). The bias of the MDRDc equation significantly differed from all other equations except the MDRD equation (P < 0.05), and the bias of the MDRD equation differed from the other four equations except the MDRD and CKD-EPI-Scr equations (P < 0.01). The bias of the FAS equation was apparently different from the other equations except the MDRD and MDRDc equations (P < 0.01). There was no significant difference in the bias of other equations (P > 0.05).

Table 3 Performance of the seven equations in the overall sample and in the subgroups

Regarding IQR results, the MDRDc equation had the highest IQR value, followed by the MDRD equation. The IQR values of the remaining equations were arranged in descending order as the CKD-EPI-Scr-Cysc, CKD-EPI-Scr, FAS, BIS1, and CKD-EPI-Cysc equations. The MDRDc and FAS equations showed accuracies (P30) lower than 60% (57.7% and 59.7%, respectively) and were significantly lower than other equations (P < 0.05). The accuracy (P30) of the BIS1 Eq. (72.1%) was significantly higher than all other tested equations (P < 0.05), while the accuracy (P30) of the remaining four equations (CKD-EPI-Scr 62.4%, CKD-EPI-Cysc 67.1%, CKD-EPI-Scr-Cysc 66.4%, MDRD 64.1%) did not differ significantly from each other (P > 0.05).

The RMSE of the BIS1 equation was the smallest, while the RMSE of the MDRD equation was the largest, with the RMSE of the other equations fluctuating around 15–18. The Bland–Altman plots analysis demonstrated that the BIS1 equation exhibited the highest accuracy, with 95% confidence interval (CI) widths of 52.37. Meanwhile, FAS, CKD-EPI-Scr-Cysc, CKD-EPI-Cysc, and CKD-EPI-Scr equations exhibited slightly lower accuracies with 95% CI widths of 55.99, 59.53, 61.68, and 66.73, respectively. The MDRD and MDRDc equations exhibited the lowest accuracy with 95% CI widths of 70.12 and 80.71 (Fig. 2, Table 4).

Fig. 2
figure 2

Comparisons of mGFR and eGFR. a CKD-EPI-Scr equation; b CKD-EPI-Cysc equation; c CKD-EPI-Scr-Cysc equation; d MDRD equation; e MDRDc equation; f FAS equation; g BIS1 equation. Solid and dashed lines in the Bland–Altman plot represent the mean and 95% limits of agreement (LoA) of difference, respectively. CKD-EPI chronic kidney disease epidemiology collaboration, MDRD modification of diet in renal disease, MDRDc MDRD-China, FAS full age spectrum, BIS1 Berlin Initiative Study 1

Table 4 Diagnostic accuracy comparison based on Bland–Altman plots analysis

Comparisons of the applicability of the examined equations for different mGFR level groups

All subjects were divided into three groups based on their mGFR levels: the mGFR < 30 ml/min/1.73 m2 group, the mGFR 30 ~ 59 ml/min/1.73 m2 group, and the mGFR ≥ 60 ml/min/1.73 m2 group.

For the mGFR < 30 ml/min/1.73 m2 group, there was no significant difference in bias among the seven equations (P > 0.05). IQR results revealed that the CKD-EPI-Cysc equation had the lowest IQR value (6.77), while the MDRDc equation had the highest IQR value (15.29). The accuracy (P30) of the CKD-EPI-Cysc and BIS1 equations was the highest (59.0% and 47.7%, respectively) but did not significantly differ from the rest of the equations (CKD-EPI-Scr 38.6%, CKD-EPI-Scr-Cysc 43.6%, MDRD 43.2%, MDRDc 40.9%, FAS 38.6%, P > 0.05). In this group, the accuracy of all equations was below 60%. The RMSE results showed that the CKD-EPI-Cysc and CKD-EPI-Scr-Cysc equations had the lowest values, followed by the FAS and BIS1 equations, while the remaining three equations had values greater than 12 (Table 3).

In the mGFR 30 ~ 59 ml/min/1.73 m2 group, the bias of the FAS equation was significantly lower than the bias of the other tested equations (P < 0.05), except for the CKD-EPI-Cysc equation (P > 0.05). The bias of the MDRD and MDRDc equations was positive and significantly different from the CKD-EPI-Cysc and CKD-EPI-Scr-Cysc equations (P < 0.05). The bias of the other equations did not significantly differ (P > 0.05). IQR results indicated that the BIS1 and FAS equations had the lowest values (12.85 and 14.38, respectively), while the MDRDc and MDRD equations had the highest values (22.07 and 18.02, respectively). The MDRDc equation also showed the lowest accuracy (P30 60.7%), while the BIS1 equation showed the highest accuracy (P30 74.8%). However, the accuracy of the equations did not significantly differ (P30 CKD-EPI-Scr 63.2%, CKD-EPI-Cysc 69.4%, CKD-EPI-Scr- Cysc 68.8%, MDRD 63.8%, FAS 63.8%, P > 0.05). Similar results were obtained for RMSE, with the BIS1 equation exhibiting the lowest RMSE and the MDRDc equation having the highest RMSE (Table 2).

In the mGFR ≥ 60 ml/min/1.73 m2 group, the bias of the FAS equation was significantly smaller than the other examined equations (P < 0.05), except for the BIS1 equation. The bias of the MDRD and MDRDc equations was the highest; the bias of the MDRD equation was significantly higher than the BIS1 equation (P < 0.05), and the bias of the MDRDc equation was notably higher than the CKD-EPI-Cysc, CKD-EPI-Scr-Cysc, and BIS1 equations (P < 0.05). The bias of the remaining equations did not differ significantly (P > 0.05). The BIS1 equation had the lowest IQR value, followed by the FAS, CKD-EPI-Scr, MDRD, CKD-EPI-Scr-Cysc, and MDRDc equations, while the CKD-EPI-Cysc equation had the highest IQR value. Among the tested equations, the accuracy (P30) of the BIS1 equation was as high as 79. 1%, the accuracy of the MDRD, CKD-EPI-Scr, and CKD-EPI-Scr-Cysc equations was also above 70% (MDRD 74.7%, CKD-EPI-Scr 72.5%, CKD-EPI-Scr-Cysc 72.2%), and the accuracy of the remaining three equations fluctuated between 60 and 70% (CKD-EPI-Cysc 66.7%, MDRDc 60.4%, FAS 62.6%). However, the differences in accuracy between these equations were not significant (P > 0.05). The RMSE results revealed that BIS1, CKD-EPI-Scr-Cysc, and CKD-EPI-Scr equations had RMSE values lower than 20, the CKD-EPI-Cysc and MDRD equations had slightly increased RMSE values, and the FAS and MDRDc equations had the highest RMSE values.

Discussion

GFR is of paramount importance in drug dosing, CKD diagnosis, treatment, and prognosis evaluation. Direct measurement of GFR using 99m Tc-DTPA clearance has been shown to correlate well with inulin clearance [18]. In our research, we also used 99m Tc-DTPA renal dynamic imaging as a reference. However, 99m Tc-DTPA renal dynamic imaging is not widely available in many grassroots medical institutions in China, and the cost of this examination remains a significant economic burden for numerous patients. Consequently, developing an accurate and reliable method for estimating GFR is essential. In this study, we compared the effectiveness of seven commonly used eGFR equations in terms of bias, accuracy, and precision.

Currently, the CKD-EPI and MDRD equations are the two most frequently recommended equations for adults [21, 22]. The FAS equation, developed based on 6870 patients from Europe and the United States (including 1764 elderly individuals over 70 years old), is applicable to all age groups [15]. These equations are based on Scr levels or Cysc levels and are not specifically designed for the elderly population. However, muscle mass, protein intake, and the prevalence of chronic diseases vary among different age and ethnic groups, affecting creatinine levels [23]. Cysc is less influenced by these factors, but the elderly are underrepresented in the development and validation of Cysc-based equations [13, 24]. The BIS1 equation is tailored for individuals aged 70 and above; however, the samples included in the equation's development were all residents of Berlin [14]. Thus, the applicability of these equations to the elderly in China remains uncertain. Our research aims to shed light on the accuracy and precision of these seven equations in the Chinese population aged 60 and above, presenting different glomerular filtration rates.

In our study, the BIS1 equation demonstrated the highest accuracy and the smallest RMSE among overall subjects and in mGFR ≥ 30 ml/min/1.73 m2 groups. This finding aligns with the BIS1 developer's assertion that the BIS1 equation is more accurate in elderly patients with normal renal function or only moderate decline [14]. According to the K/DOQI guideline, the acceptable level of P30 for GFR prediction equations should exceed 70% [19]; the BIS1 equation was the only one of the seven equations that met this standard in overall subjects. The Bland–Altman diagram also indicated that the BIS1 equation had the narrowest acceptable limits.

When mGFR falls below 30 ml/min/1.73 m2, the applicability of all equations was unsatisfactory, as the highest accuracy was only 59%, and all equations underestimated the true GFR. Consequently, for the elderly population, when eGFR drops below 30 ml/min/1.73 m2, it is advisable to measure GFR or evaluate the patient's condition in conjunction with their clinical symptoms. Otherwise, there is a risk of misjudging the severity of the patient's condition, exacerbating their anxiety, and even prompting premature renal replacement therapy. When mGFR is between 30 and 59 ml/min/1.73 m2, only the BIS1 equation achieves 70% accuracy, and its precision is superior to the other six equations. When mGFR ≥ 60 ml/min/1.73 m2, the accuracy of the CKD-EPI-Scr, CKD-EPI-Scr-Cysc, MDRD, and BIS1 equations all met the standard, but the BIS1 equation's performance remained the best, boasting the lowest IQR and RMSE and the highest accuracy. Therefore, we can conclude that the BIS1 equation is the most promising for use in the Chinese elderly population with mGFR ≥ 30 ml/min/1.73 m2.

In a cohort of older adults from Iceland, Li Fan evaluated various equations for estimating the prevalence of GFR reduction (mGFR < 60 ml/min/1.73 m2) and their performance in estimating mGFR. They discovered that the CKD-EPI equations had similar or better performance than the BIS1 equation within the measurement range of the test [25]. Other studies comparing the CKD-EPI and BIS equations in the elderly have reported generally comparable performance, although some differences exist among studies [26,27,28]. However, since Asians constitute a relatively small proportion of the CKD-EPI equations' development population [12], the performance of the CKD-EPI equations in Chinese older individuals is not as robust as in the aforementioned studies.

Limitations of the study

In this study, there are several limitations that should be acknowledged. First, our research was conducted in a single-center setting, which may not fully represent the diverse elderly population in China. Second, it is important to consider the significant impact of diabetes and hypertension on CKD patients when evaluating the applicability of each equation. A more detailed analysis could be performed by stratifying patients according to their primary diseases. Lastly, serum values in this study were measured only once for each subject, which could potentially affect the accuracy of the results.

Conclusion

In this study, we aimed to assess the applicability of various GFR estimation equations, including the CKD-EPI series, MDRD, MDRDc, FAS, and BIS1, in the context of the elderly Chinese population. Our findings revealed that when mGFR was less than 30 ml/min/1.73 m2, all of the evaluated equations demonstrated suboptimal performance, indicating a need for further refinement or alternative approaches in this specific range of renal function.

On the other hand, for subjects with mGFR values greater than or equal to 30 ml/min/1.73 m2, our analysis showed that the BIS1 equation exhibited the most superior performance among the tested equations. This finding underscores the potential value of the BIS1 equation in estimating GFR for elderly Chinese individuals with mGFR levels over 30 ml/min/1.73 m2. As a result, healthcare professionals and researchers could consider the BIS1 equation as a more suitable and reliable method for GFR estimation in this population, which may ultimately contribute to improved diagnosis, treatment, and management of renal diseases among elderly Chinese patients.