Introduction

The low accuracy of formulas for estimating glomerular filtration rate (GFR) in children has long been a major challenge, with studies showing that less than 50% of the GFR levels estimated (eGFR) using formulas based on serum cystatin C, creatinine and/or urea are within ± 10% of the gold standard GFR measurement [1]. In pediatric nephrology care, more accurate determinations of kidney function are therefore needed with a feasible measured GFR (mGFR) methodology based on the plasma clearance of an exogenous GFR marker. Since the 1980s, GFR has been increasingly measured using the non-ionic agent iohexol [2,3,4,5,6,7]. To avoid extended examinations with multiple blood samples for measuring GFR, many centers have chosen to use the one-pool slope-intercept technique with a minimum of two blood samples [8,9,10,11]. Numerous single-point GFR (GFR1p) methods have been developed, and especially in pediatric care, it is clearly beneficial to reduce the number of blood draws from two or three to a single sample, provided an adequate level of accuracy can be preserved [11,12,13,14]. However, current guidelines from the British Nuclear Medicine Society (BNMS) do not endorse the routine use of a GFR1p method and recommend a one-pool slope-intercept technique requiring at least two samples [11]. The GFR1p methodology was introduced in adult patients by Fisher and colleagues in 1975 based on 51Cr-EDTA clearance [15], and an improved concept was described by Groth and colleagues in 1981 [16]. In 1983, Jacobsson published a formula for GFR1p which takes into account different distribution volumes and different sampling time points in adults based on 99TCm-DTPA clearance [12]. The Jacobsson formula has been widely used for GFR1p with different markers. Confusingly, modified versions of the Jacobsson formula have also been used but reported as being Jacobsson’s original formula [14, 17,18,19]. Here we report results for Jacobsson’s original adult single-point formula [12] and for the modified, non-iterative formula [14, 17], which does not include Jacobsson’s correction for non-uniform distribution. Groth and Aasted published the first pediatric GFR1p formula in 1984 in which they used 51Cr-EDTA clearance with a sampling point at 2 h [20]. In 1991, Ham&Piepsz published a new formula for GFR1p in children, also with sampling at 2 h and based on 51Cr-EDTA clearance. A modification of the Jacobsson formula for pediatric use was published the same year by Stake and colleagues; these authors recommended a sampling point at 3 h based on 99TCm-DTPA clearance [3, 21]. In 2005, Fleming and colleagues described a new formula for GFR1p which they developed from a cohort of 100 children and 225 adults; this formula provided GFR values consistent with those obtained by the slope-intercept technique [22]. Although the Fleming formula first and foremost was suggested as a quality control method for the slope-intercept technique [22], a recent study [19] reports results arguing for the GFR1p-Fleming as a potential stand-alone formula for pediatric nephrology care.

The aims of our study were to: (1) assess the accuracy of the different formulas for GFR1p determination by comparison with the reference iohexol seven-point plasma clearance measurements (GFR7p) and (2) determine the optimal single time point for blood sampling for GFR1p within a feasible time frame (i.e. blood sampling not later than 5 h after injection).

Patients and methods

Patients

Ninety-six children with chronic kidney disease (CKD) were recruited in a cross-sectional study (ClinicalTrials.gov Identifier NCT01092260) which has evaluated the two-point methodology [23]: 54 children at Haukeland University Hospital, Bergen, Norway, and 42 children at Oslo University Hospital, Oslo, Norway. The median age of the included children (55 males, 41 females), was 9.2 years (range 3 months to 17.5 years), the median weight was 28.3 (range 6.6–84.6) kg and the median height was 134 (range 59–177) cm. Median reference GFR based on seven blood sample time points (GFR7p) was 66 (range 6–153) mL/min/1.73 m2. The individual GFR measurements were divided between the different GFR stages, namely, from 28, 27, 23, and 18 patients in CKD stage 1, 2, 3 and 4–5, respectively.

Methods

Iohexol was administered as Omnipaque®300 mg I/mL (647 mg iohexol/mL; GE Healthcare Technologies Norway AS, Oslo, Norway) in a dose adapted to body weight. Blood samples were drawn at 10, 30 or 60, 120, 180, 210, 240 and 300 min after injection. Additional details are provided in an earlier study published on the same cohort with a focus on two-point methodology [23].

Calculations and statistics

The GFR7p was calculated according to Sapirstein, as described by Schwartz et al. [3, 24] (Tables 1, 2). A two-compartment model was fitted using linear regression of the log concentration values. For three patients the two-compartment slope-intercept method could not be used due to negative values after the slow component of the curve was removed; for these three patients, we fitted the two-compartment model directly using non-linear least squares. GFR was normalized to 1.73 m2 body surface area (BSA) by the ratio 1.73/BSA, using the formula of Haycock et al. [25]. The GFR1p was calculated with six different formulas: the Fleming formula [22], the Ham and Piepsz formula (Ham&Piepsz; [26], the Stake formula [13, 21], the Groth and Aasted formula (Groth&Aasted; [20]), the Jacobsson formula [12] and a modification of Jacobsson’s formula (GFR1p-Jacobsson-mod.) [14, 17] which is based on performing only the first step in Jacobsson’s three-step GFR calculation. Tables 1 and 2 show the formulas used in the calculation of the GFR values, along with numerical examples. One patient had an obviously incorrect value measured for the 3.5-h sample, and this value was therefore removed before the analyses; otherwise the data were complete, with no missing values.

Table 1 Methodology of glomerular filtration rate calculations
Table 2 Example data, with additional information on calculations

To compare the GFR1p methods and the reference method, we calculated the difference between the various GFR1p and the reference GFR for each patient, along with estimated bias (mean difference) and limits of agreement (bias ± twice the standard deviation of the differences). The data are presented as difference plots comparing: (1) different methods within a single sampling time point (Fig. 1), (2) different sampling time points for each method (Fig. 2), and (3) the bias for different GFR values for each method (Fig. 3). We also present the corresponding numerical estimates for the time points recommended in the original publications (Table 3) and for various subgroups (Table 5) according to age (< 10 years and ≥ 10 years), BSA group (< 1.0 m2 and < 1.45 m2) and stage of CKD (< 30 mL/min/1.73 m2, 30 to < 60 mL/min/1.73 m2, 60 to < 90 mL/min/1.73m2, ≥ 90 mL/min/1.73m2).

Fig. 1
figure 1

Plot of estimation error versus the estimation method for glomerular filtration rate (GFR) calculated by six single-sample formulas [12, 13, 17, 20, 22, 26], stratified by sampling time point (n = 96 children). The y-axis shows the difference between the single-point GFR and a reference GFR based on seven sampling time points. Each point corresponds to a combination of patient, estimation method and sample time. The solid horizontal line is the bias, i.e. the mean difference between the single-point GFR estimate and the reference GFR. The dashed lines are limits of agreement, i.e. bias ± two standard deviations of the differences. The figure can be used to compare different methods within each sampling time point

Fig. 2
figure 2

Plot of estimation error versus time point for glomerular filtration rate (GFR) calculated at five time points, stratified by estimation method (n = 96 children). The y-axis shows the difference between the single-point GFR [12, 13, 17, 20, 22, 26] and a reference GFR based on seven sampling time points. Each point corresponds to a combination of patient, estimation method and sample time. The solid horizontal line is the bias, i.e. the mean difference between the single-point GFR and the reference GFR. The dashed lines are limits of agreement, i.e. bias ± two standard deviations of the differences. For each estimation method, the figure can be used to compare the performance of the single-point GFR estimates at different sampling time-points

Fig. 3
figure 3

Plot of estimation error versus reference glomerular filtration rate (GFR) for GFR calculated by six single-sample formulas [12, 13, 17, 20, 22, 26] and at five sampling time points (n = 96 children). The y-axis shows the difference between the single-point GFR estimate and a reference GFR based on seven sampling time points. The x-axis shows the reference GFR. Each point corresponds to a combination of patient, determination method and sample time. The solid horizontal line is the bias, i.e. the mean difference between the single-point GFR and the reference GFR. The dashed lines are limits of agreement, i.e. bias ± two standard deviations of the differences. Large determination errors, i.e. errors outside the displayed range, are indicated by arrows. The figure can be used to examine patterns in how the estimation errors of the different estimation methods vary with GFR for each method and sampling time

Table 3 Effect of different formulas at their recommended time points

To further quantify the performance of the GFR1p-methods, we calculated the number of GFR values that were within 5%, 10%, 15% or 20% of the reference method for each formula, labeled as P5, P10, P15 and P20, respectively, along with confidence intervals based on the recommended Wilson method [27] (Tables 3, 4, and 5; Fig. 4). Differences between methods and between time points for these ‘Px’ values (x = 5, 10, 15 or 20) were evaluated using the McNemar test with mid-P correction.

Table 4 Percentage of GFR1p measures within 10% of reference stratified by GFR-levels
Fig. 4
figure 4

Percentage plot showing the determination accuracy of six single-sample determination methods [12, 13, 17, 20, 22, 26] at five sampling time points (n = 96 patients/children). Each symbol, labeled Px (P5, P10, P15 and P20), shows the calculated proportion of single-sample glomerular filtration rate (GFR) within x% of the reference method. The horizontal lines show the corresponding 95% confidence intervals

For comparison, subanalyses for data on the best available two-point methodology (GFR2p) (Jødal and Brøchner-Mortensen [23, 28, 29]) at 2 and 5 h (i.e. GFR2p-JBM) are also included in Tables 3, 4, and 5.

Table 5 Subgroup analysis

We used R version 3.4.0 for Windows for all statistical analyses and figures [30]. Statistical significance is defined as P values ≤ 0.05, using two-sided tests, not adjusted for multiple comparisons.

Results

The performances of six different formulas for GFR1p determination [12, 13, 20, 22, 26] compared to the reference method are shown in Tables 3, 4, and 5 and Figs. 1, 2, 3, and 4. The results of different time points of blood sampling in Table 3 are limited to the recommended time points given in the respective original publications. The formula of Fleming with sampling at 3 h (GFR1p3h-Fleming) showed the best performance, with 82% of the GFR values falling within 10% of the reference method (P10). For the samplings at 2, 3, and 3.5 hours, the results with the Fleming formula were also significantly better than all the other tested formulas for P10 (Table 3). A comparison between the performances of all tested GFR1p formulas and slope-intercept methodology revealed that the GFR2p-JBM methodology was significantly better than all GFR1p formulas in the entire cohort of children with CKD 1–5 (Tables 3, 4).

With respect to the effect of sampling time on the performance, the Fleming formula gave results for sampling at 2, 3.5 and 4 h (i.e. time frame recommended by Fleming) which were not significantly different from the results at 3 h (Table 3; Figs. 2, 4). However, when blood was drawn at 5 h (i.e. outside the time frame recommended by Fleming), this formula showed a significantly lower performance, with a P10 of 55% (P < 0.01) (Table 4; Figs. 2, 4). When sampling at 4 h, the Fleming and Jacobsson-mod. formulas performed significantly better (P10 was 75 and 71%, respectively) than the formulas of Ham & Piepsz, Groth & Aasted and Stake (Figs. 1, 4). For blood sampling at 5 h, the two Jacobsson formulas had the best performance, with a P10 of 74 and 72%, respectively, which was significantly better (P < 0.01) than the performance of all other tested formulas at 5 h (Figs. 1, 4; Tables 3, 4).

All GFR1p formulas studied showed large bias when blood was drawn outside the time frames originally described for the respective formulas (Fig. 1). Nevertheless, the formulas of Fleming and Jacobsson gave relatively good GFR1p determinations for the entire 2- to 5-h range (Figs. 2, 3). The GFR1p formulas also showed non-constant bias (and, to a lesser degree, variance) over the GFR range (Fig. 3), especially outside their recommended time frames. However, the Fleming and Jacobsson formulas at their best-performing time points (3 and 5 h, respectively) had an approximately constant bias and variance as a function of GFR (Fig. 3).

Subgroup analysis revealed that in children with CKD 1–3, GFR1p3h-Fleming scored very well, with a P10 of 92%, which was significantly better than those of all other GFR1p formulas investigated, and not significantly different from the P10 of GFR2p-JBM, which was 96% (P = 0.29) (Table 4). In those patients with a GFR < 30 mL/min/1.73 m2, much lower performances were found for all GFR1p formulas. In this subgroup, the highest P10 was 67% when the GFR1p-Fleming formula was used with blood sampling at 5 h (GFR1p5h-Fleming). However, the performance of GFR1p5h-Fleming was not significantly better than that of the GFR1p5h-Jacobsson which had a P10 of 44% (P = 0.23). In contrast, the GFR2p-JBM scored 100% for P10 (P < 0.0001) in the patients with GFR < 30 mL/min/1.73 m2 (Tables 4, 5).

Age and BSA did not seem to influence the scores of GFR1p-Fleming, whereas GFR1p-Ham&Piepsz, GFR1p-Groth&Aasted and GFR1p-Jacobsson all had better scores in the smaller children (Table 5).

Discussion

The results of our iohexol plasma clearance study of a cohort of 96 children with CKD 1–5 shows that GFR1p measurements reached acceptable precision in patients with CKD 1–3. The best formula for single-point measured GFR in children was the GFR1p-Fleming, which showed a significantly better performance than the GFR1p-Ham&Piepsz, GFR1p-Groth&Aasted and GFR1p-Stake formulas [13, 20,21,22, 26] at all tested time points (Table 3; Fig. 1). GFR1p-Fleming was also significantly better than GFR1p-Jacobsson [12] when blood samplings were done after 2, 3 and 3.5 hours, whereas no significant difference was found between these formulas at 4 h (Table 4; Fig. 4). For blood sampling at 5 h, GFR1p-Jacobsson was significantly better than all other single-point formulas (Table 4; Fig. 1). Comparison with the two-point methodology showed that GFR2p-JBM, with a P10 of 97%, was significantly better (P < 0.001) than all single-point methods investigated in this study when all CKD stages were included in the analysis. However, an interesting finding was evident from the subgroup analysis, which showed no significant difference between the best single-point method, GFR1p3h-Fleming, and GFR2p-JBM in children with CKD 1–3 (Tables 4, 5). The scores for all single-point formulas were low in children with CKD 4–5, with the best P10 of 67% compared to 100% with GFR2p-JBM (P < 0.001) (Tables 4, 5). McMeekin and colleagues recently compared the GFR1p3h-Fleming with a multi-point reference method in a combined cohort of children and adults, with a total of 411 tests (247 pediatric and 164 adult tests) [19]. These authors found that 92.7% of measures [95% confidence interval (CI) 90–95%] were within 20% of the reference. This is in accordance with the results from our cohort showing a P20 of 91% (95% CI 83–95%). Our results also support the discrepancy between formulas reported by McMeekin and colleagues who found lower P20 for GFR1p2h-Ham&Piepsz, GFR1p2h-Groth&Aasted, GFR1p3h-Stake and GFR1p4h-Jacobsson compared to GFR1p3h-Fleming in their cohort [19].

Our study clearly demonstrates the importance of using the optimal blood sampling time points adapted to each formula. This is especially evident in the methods described by Ham&Piepsz and Groth&Aasted [11, 26], where the performance scores of all time points outside the recommended are low (Figs. 2, 3; Table 4). Furthermore, variable performance across GFR levels has to be taken into account since both these formulas scored fairly well in children with CKD 1–2, whereas the scores were unacceptably low in children with CKD 3–5 (Table 4). As the GFR1p-Ham&Piepsz formula has been a recommended single-point method in guidelines [11] and was developed from a high number (n = 657) of GFR measurements [26], a higher general score should be expected. Interestingly, in our study, the P10 of GFR1p-Ham&Piepsz was very high in the group of children with CKD 2, with a P10 of 96%, but only 57% in those with CKD 3, and no patient was within 10% of the reference with a GFR of < 30 mL/min/1.73 m2 (Table 4). A plausible explanation could be that the reference method used in the Ham & Piepsz study was not a multipoint-method, and the development of the formula was based on GFR measurements mainly in the normal range [26].

The Groth & Aasted formula was developed in a cohort with a broader distribution of GFR [16], which could explain why the single-point scores with this formula were more evenly distributed across the different CKD groups in our study (Table 4). The cohort of Groth & Aasted was, however, considerably smaller, and their five-point reference GFR had the last time point set early (2 h) [20], which probably explains the low scores in general for GFR1p-Groth&Aasted. The fairly good scores for GFR1p-Stake in children with CKD 2–3 at 3 h in contrast to the low scores for those with CKD 1 and CKD 4–5 (Fig. 2; Table 4) are probably due to the fact that the Stake-formula was developed in a cohort of 100 children mainly with CKD 2–3 and with a two-point3h,4h iohexol-GFR as reference method [13].

Both the Fleming and the Jacobsson formulas have distribution volume and time-point adaption included in the respective formulas. This gives a lower vulnerability in terms of time-point variability for blood sampling, as long as the true sampling time is used in the formula. The GFR1p3h-Fleming scored significantly better (P < 0.05) than all other formulas on their recommended time points, except for GFR1p5h-Jacobsson-mod.. (P = 0.08) (Table 3) in the cohort as a whole, and it was not significantly different from GFR2p-JBM in the subgroups CKD 1, CKD 2 and CKD 3. The subgroup analysis also showed that age and body size did not significantly influence the scores of GFR1p-Fleming. Importantly, when a child is expected to have CKD 4–5, our study shows that a single-point methodology with blood sampling up to 5 h is not recommended and that at least two blood samples should be collected (Table 5). Calculation of the eGFR [1], despite its limitations, can be helpful in making the decision to take more than one blood-sample or not, i.e. with 30 ml/min/1.73 m2 as the cutoff value.

Iohexol has been increasingly used as a marker for GFR measurements in recent decades. It is a nonradioactive substance, safe, inexpensive, has low inter-laboratory variation and is stable and easy to use [4, 31, 32]. Although the GFR1p-Fleming formula was originally developed using a radioactive marker in adults and children [21], our iohexol study has shown that this formula gives an accurate mGFR determination in children with CKD 1–3.These findings are of great clinical value. For the follow-up of children with cancer treated with nephrotoxic substances, as well as for children with renal and urologic diseases and mild and moderate kidney dysfunction, it is clearly beneficial to reduce the number of blood draws from two to three to a single sample. The risk of outliers is an issue in all tests, and in a single-point procedure it is necessary to redo the test if a result is surprising, whereas using a multi-point GFR procedure it is possible to remove the outlier based on examination of the elimination curve.

A limitation of this study is the lack of inulin-based gold standard analyses, but the continuous intravenous infusion and timed urine collections necessary in inulin clearance is cumbersome, and inulin is nowadays difficult to obtain. In addition, the number of patients in our study was limited to 96 children, which reduces the power of subgroup analysis. The last time point of iohexol measurement at 5 h may limit the value of the study in patients with severely reduced kidney function. However, the validity of our study is strengthened by our comparisons of a high number of blood samples at different time points and with multiple formulas.

Conclusion

Determination of GFR in children at all ages with CKD stage 1–3 based on iohexol plasma clearance and single-point sampling at 3 h analyzed with the Fleming formula achieved the same level of performance as the two-point method. All other tested single-point formulas had a considerably lower performance. When the GFR is lower than 30 mL/min/1.73 m2, a procedure with at least two blood-samples is recommended for mGFR.