Background

The recommended treatment option for renal calculi and staghorn calculi is percutaneous nephrolithotomy (PCNL) according to the guidelines of the European Association of Urology (EAU) [1]. PCNL has increasingly been used over the past few decades and may continue in the future [2, 3]. However, PCNL outcomes among the authors are different, because of the vast heterogeneity in the methods for clinical and academic characterization of nephrolithiasis besides the evaluation of surgical outcomes. So assessing the preoperative factors that affect SFR and complications is critical.

The Guy’s stone score, the Clinical Research Office of the Endourological Society(CROES) nomogram and the S.T.O.N.E.(stone size, tract length, obstruction, number of involved calices and essence) stone score are seen as predictors of stone-free status (SFS) and complications after PCNL [4,5,6]. The widespread use of a standardized stone scoring system is very precious for counseling patient, clinical decision, and assessment of outcomes, in addition to improving academic reporting [7]. However, no universally accepted stone scoring system for predicting SFR and complications after PCNL exists. Comparison of the SSSs in different clinical studies indicated some advantages as well as disadvantages of one nomogram to another for different variables. Hence, we performed a systematic review of the literature with a meta-analysis of the available published literature to compare the feasibility and accuracy of three SSSs in predicting PCNL outcomes concerning SFR and complications.

Methods

Study selection

According to the Cochrane Handbook recommendations, a systematic review of published literature was performed [8]. To identify all studies published up to Dec 31, 2018, which assessed the feasibility and accuracy of three SSSs. The following MESH search headings were used: “comparative studies”, “Guy”, “CROES”, “S.T.O.N.E”, “percutaneous nephrolithotomy”, “stone free rate”, and “complication”.

Inclusion and exclusion criteria

All studies included in this meta-analysis satisfied the following requirements: (a) compare the two or three SSSs, (b) report the outcomes of two or three SSSs, (c) document the surgery as PCNL, (d) document indications for PCNL with renal stones. Studies were excluded if: (a) the article did not meet the inclusion criteria, (b) no outcomes were mentioned or the parameters were impossible to analyze the three SSSs from the published findings.

Data extraction and outcomes of interest

Two of the authors (JKH and SF) extracted data from the included studies including: author identification, country, publication years, study design, age, and the number of patients. All disagreements about eligibility were resolved by consensus through author discussions. The outcomes, including SFR and, overall complications, were extracted to compare between three SSSs. Overall complications were graded based on the Clavien-Dindo system [9].

Study quality assessment

In accordance with the criteria of Centre for Evidence-Based Medicine in Oxford, we evaluated the level of evidence (LOE) of the included ten studies. Furthermore, Jaded Score was applied to evaluate the methodological quality of RCTs [10]. while the Newcastle-Ottawa Scale (NOS) assessed the methodological quality of non-RCTs observational studies [11]. Besides, JKH and ZJG evaluated the quality of the articles and discrepancies were rechecked and resolved by discussion.

Statistical analysis

All analyses were conducted by Review Manager 5.3 (Cochrane Collaboration, Oxford, UK). Continuous and dichotomous variables were analyzed by weighted mean differences (WMDs) and odds ratios (ORs). All analysis results were reported with 95% CIs. I2 and X2 statistics were applied to evaluate the quantity of heterogeneity, and when I2>50%, the evidence was considered to have substantial heterogeneity, the random-effects (RE) model would be applied, otherwise, the fixed effects (FE) model was applied. Egger’s test and funnel plot evaluated the publication bias. Sensitivity analyses estimated the influence of studies with a high risk of bias on the overall effect.

Results

Characteristics of eligible studies

Ten studies [12,13,14,15,16,17,18,19,20,21] conformed to the inclusion criteria of this meta-analysis and were there included in the analysis of three SSSs (Fig. 1). The demographic and clinical characteristics of the included literature were shown in Table 1.

Fig. 1
figure 1

PRISMA diagram. The search strategy and number of studies identified for inclusion in this meta-analysis

Table 1 Characteristics of included studies

Quality of the studies and level of evidence(Table 1)

In this meta-analysis, the NOS quality assessment method of the observational studies [11], and the US Preventive Services Task Force grading system were applied to evaluate the quality of all studies [10]. Included studies were all level 3b. Also, the demographic variables of the three SSSs were extracted from included articles (Table 1).

Description of included studies and preoperative characteristics of the patients (Table 2)

Patients were categorized as stone free(SF) and no-stone free(NSF), and the pooled data of included studies [13, 14, 16, 17, 19, 20] showed that stone free patients had significantly lower proportion of male(OR = 1.48, 95% CI: 1.18 to 1.86, P = 0.0007) (Fig. 2), lower stone burden (WMD = -504.28, 95% CI: − 673.88 to − 334.67, P < 0.0001) (Fig. 2), fewer No of involved calyces (OR = -1.23, 95% CI: − 1.94 to − 0.52, P = 0.0007) (Fig. 2) and lower proportion of staghorn stone (OR = 0.33, 95% CI: 0.26 to 0.43, P < 0.0001) (Fig. 2). And stone free patients had significantly lower score of Guy score (WMD = -0.64, 95% CI: − 0.90 to − 0.37, P < 0.0001) (Fig. 2), lower score of S.T.O.N.E. score (WMD = -1.23, 95% CI: − 1.78 to − 0.67, P < 0.0001) (Fig. 2) and higher score of CROES nomogram (WMD = 29.48, 95% CI: 9.84 to 49.12, P = 0.003) (Fig. 2) (Table 2). However, there were no remarkable difference in terms of age, BMI, laterality and stone density between SF and NSF patients (Table 2, Additional file 1: Figure S1).

Table 2 Overall analysis of demographic and clinical characteristics compared stone-free with not stone-free after PCNL
Fig. 2
figure 2

Forest plot and meta-analysis of preoperative characteristics between the stone-free patients and not stone-free patients

Outcomes of perioperative variables (Table 3)

With respect to perioperative variables, pooled data of 6 studies [13, 14, 16, 17, 20, 21] involving 1074 participants demonstrated that SF patients were associated with shorter operative time than NSF patients (WMD:-19.85 min; 95% CI: − 25.52 to − 14.18; P<0.0001)(Fig. 3). SF patients were also associated with shorter length of hospital stay (LOS) (WMD: − 0.53 days; 95% CI: − 0.88 to − 0.17; P = 0.003) (Fig. 3) and lower transfusion rate (OR: 0.22; 95% CI: 0.08 to 0.55; P = 0.001) (Fig. 3) than NSF patients, respectively. However, there were no statistically difference between SF and NSF patients in term of No of tract (OR = -0.03; 95% CI: − 0.14 to 0.08; P = 0.62) (Fig. 3), mean tract length (WMD = 0.77; 95% CI: − 2.06 to 3.61; P = 0.59) (Fig. 3) and change in Hb level (WMD = − 0.21 g/mL; 95% CI: − 0.55 to 0.13; P = 0.22) (Fig. 3)(Table 3).

Table 3 Overall analysis of perioperative outcomes comparing stone free with not stone free after PCNL
Fig. 3
figure 3

Forest plot and meta-analysis of outcomes of perioperative variables between stone-free patients and not stone-free patients

Outcomes of three scoring systems for predicting SFR after PCNL (Table 4)

Pooled data of 7 studies [5, 12,13,14, 16, 17, 20] reported an accuracy of three SSSs in predicting post-PCNL SFR, the forest plot indicated that there was no statistical difference between Guy score and CROES nomogram with respect to area under curves(AUC) of prediction of SFR(WMD = 0.09; 95% CI:-0.12 to 0.29; P = 0.41) (Table 4, Fig. 4). There was also no significant difference between S.T.O.N.E. score and Guy score with respect to area under curves (AUC) of prediction of SFR (WMD = 0.00; 95% CI:-0.03 to 0.04; P = 0.92) (Table 4, Fig. 4) and no significant difference between CROES nomogram and S.T.O.N.E. score with respect to area under curves (AUC) of prediction of SFR (WMD = 0.00; 95% CI: − 0.05 to 0.04; P = 0.94) (Table 4, Fig. 4).

Table 4 Overall analysis of three stone-scoring systems for predicting SFR after PCNL
Fig. 4
figure 4

Forest plot and meta-analysis of outcomes of three scoring systems for predicting SFR after PCNL. SFR = stone free rate

Outcomes of three scoring systems for predicting complications after PCNL (Table 5)

Pooled data of reported that the three SSSs in predicting post-PCNL complications, the forest plot indicated that Guy score was the only stone scoring system for predicting post-PCNL complications (WMD = -0.29, 95% CI: − 0.57 to − 0.02, P = 0.03) (Table 5, Fig. 5). No association between the two other scoring systems (CROES nomogram and S.T.O.N.E. score) and post-PCNL complications (Table 5, Fig. 5).

Table 5 Overall analysis of three scoring systems for predicting complications after PCNL
Fig. 5
figure 5

Forest plot and meta-analysis of outcomes of three scoring systems for predicting complications after PCNL

Discussion

Widely applicable and straightforward tools will highly improve patient counseling, clinical decision making, assessment of operation outcomes and academic study after PCNL for renal stones [22, 23]. These can allow reliable and accurate comparisons of treatment safety and efficacy, and facilitate the meaningful comparison of clinical studies [24]. We considered that the commonest and validated SSSs (Guy score, CROES nomogram, and S.T.O.N.E. score) could be predictive of SFR and complications after PCNL.

Both similar and divergent variables between the three SSSs has to be known. The Guy stone score consists of four grades based on stone burden and patient anatomy [7, 25]. The CROES nomogram is highly generalizable based on global data and uniquely grades risk across a continuous scale rather than dividing stones of varying complexity into discrete groups [7, 25]. The S.T.O.N.E. stone score stratifies patients into low-, moderate-, and high-risk groups, and it is more useful for decision making [7, 25]. The three SSSs included different parameters, however, the stone location, stone count and staghorn calculi were pivotal variables in the three SSSs [7].

On the other hand, Guy stone score included renal anatomy but not stone burden, and this was a difference from CROES nomogram and S.T.O.N.E. score. The CROES nomogram included prior treatment as well as operation volume, and these variables had remarkable relationships to the SFR. However, CROES nomogram lacked imageology information on hydronephrosis and calyceal abnormalities. The S.T.O.N.E. score comprised stone size, tract length, obstruction, number of involved calices and essence, and it had greater feasibility and accuracy than any of the individual variables alone. In our meta-analysis, the results showed preoperative variables gender, stone burden, number of involved calyces and staghorn calculi were remarkably correlating with SFR after PCNL for kidney calculi.

However, there is still no widely accepted stone scoring system for the prediction of outcomes after PCNL, and contradictions between different authors exist concerning the prediction of outcomes by the SSSs. Some experts reported that three SSSs were efficacy and equally predictive of SFR by estimating and comparing the three SSSs in 246 patients after PCNL [17]. Tailly et al. reported that three SSSs have similar predictive accuracy of SFS by comparing the three SSSs in 586 patients after PCNL, but no association between three SSSs and complications [20]. Moreover, Bozkurt et al. reported that both the Guy and CROES nomogram had a remarkable relationship with SFS and complications [12]. Noureldin et al. also reported that the Guy score and S.T.O.N.E. score were equally associated with SFS, but there were no significant correlations with complications [18]. However, when other experts evaluated and compared three SSSs in predicting post-PCNL SFR and complications, the results showed that only S.T.O.N.E. score was a predictor of SFS after PCNL for renal stones [13]. Kocaaslan et al. reported that the CROES nomogram was well correlated with the success of PCNL in cases with anatomical abnormalities, and that the Guy score and S.T.O.N.E. score failed to predict the SFS and complications after PCNL [16]. Choi et al. compared the predictability and accuracy of the Guy score, CROES nomogram and S.T.O.N.E. score, the results showed that only Guy score could predict SFS and complications after PCNL [14]. Sfoungaristos et al. also compared three SSSs, and the results indicated that S.T.O.N.E. score was the only predictor of the SFR for post-PCNL, and the three SSSs were not associated with complications [19]. In our meta-analysis, the results indicated that the three SSSs were remarkably associated with SFS and equally predictive of SFR, but only Guy score was a predictor of complications.

However, some critical limitations exist in the three SSSs. Firstly, stone burden and density as the important parameters dide not reflect in Guy’s score. Moreover, it is also failed to describe procedure difficulties as well as clinical variability. Secondly, the CROES nomogram is also did not reflect stone density and lacked important variables affecting the outcomes, including imaging information on hydronephrosis and pelvicalyceal abnormalities [5, 26]. Moreover, the CROES nomogram was complex in the clinical applications [17]. Thirdly, the limitations of S.T.O.N.E. score were validated with a small cohort which may limit its widespread usage [4, 7].

Similarly, several limitations existed while analyzing and interpreting results in our meta-analysis. Firstly, to identify prognostic factors, we acknowledge that other variables, such as surgeon experience and advanced surgical instruments, may need further investigation. These factors in the future may need to be incorporated in multicenter and larger samples clinical applications. Secondly, there existed heterogeneities of studies, some studies in this meta-analysis had the risk of selection bias. Lastly, all patients were evaluated for SFR after PCNL by KUB but not by CT, which may have overstated the SFR. Therefore, a need to develop more accurate and practical SSS to assess the relationship between the SSS and SFR, complications.In conjunction, our meta-analysis thus provides some up to date conclusions for the advantages and disadvantages of three SSSs in predicting of SFS and complications.

Conclusions

The Guy score, CROES nomogram and S.T.O.N.E. score were equally accurate predictive of SFR in patients undergoing PCNL, but the Guy score is the only SSS for predicting complications.