Introduction

Renal cell carcinoma (RCC) stands as the predominant solid lesion affecting the kidney, with clear cell RCC (ccRCC) constituting its most prevalent histological subtype, accounting for approximately 70%–80% of all subtypes [1]. Partial nephrectomy (PN) is endorsed by a majority of global urological guidelines for T1-ccRCC, as it offers not only comparable tumor control but also superior preservation of renal function compared to radical nephrectomy (RN) [2, 3]. Yet, the question remains unresolved as to whether PN is non-inferior to RN in patients with T1-ccRCC at higher risk of recurrence.

Nuclear grade, commonly used to measure ccRCC malignancy, has demonstrated a robust association with the risk of recurrence. Present study suggested that the patients with high nuclear grades treated with PN deserves more attention and might need adjuvant management because of the tendency of recurrence [4,5,6]. In fact, we found that the patients with high nuclear grades T1-ccRCC treated with PN have poorer prognosis, which is not consistent with similar published researches. This discrepancy led us to discover that prior studies comparing the efficacy of PN and RN did not account for the possible impact of unbalanced nuclear grades. Specifically, the sample size of high nuclear grades (especially G4) was typically much smaller than that of the low grades [7,8,9,10]. If the results are not separately analyzed, the large sample size of low nuclear grade group may obscure the true results of high nuclear grade group. It is important to separately reconsider the oncologic outcomes of the two types of surgery in terms of the different nuclear grades.

To discern whether the oncological outcomes of PN are not inferior to RN in patients with high nuclear grade, we employed a large cohort to evaluate and confirm the distinct efficacy of RN and PN in managing pT1-ccRCC with various nuclear grades. Our findings aim to guide surgeons in selecting the most appropriate surgical options for patients with high nuclear grade T1-ccRCC, thereby enhancing the oncological outcomes for these patients.

Patients and methods

Patient selection

After Institutional Review Board approval (ID:202,201,007), we identified patients who underwent PN or RN at three urological centers in China for sporadic, unilateral, pT1, N0, and M0 ccRCC without positive surgical margins and neoadjuvant therapy between 2010 and 2019 (including 1,164 males and 550 females). Patients who underwent robot-assisted surgery were not included in this study. All the surgeons possess more than 5 years of experience in performing RN/PN. The clinical tumor (cT) stage and pathological tumor (pT) stage was determined according to the American Joint Committee on Cancer Cancer Staging Manual (8th edition) [11].

Patient features

The clinical features were sex, age at surgery, smoking status, lumbago, hematuresis, laterality, Eastern Cooperative Oncology Group (ECOG) performance status, age-adjusted Charlson Comorbidity Index (aCCI), baseline cardiovascular disease (defined using the myocardial infarction, congestive heart failure, and peripheral vascular disease components of the Charlson score), body mass index (BMI), preoperative estimated glomerular filtration rate (eGFR, mL/min/1.73 m2), and surgical approach (open versus laparoscopic). The radiographic features were cT-stage, hemorrhage, necrosis, calcification, and cystic formation. Enhanced computed tomography was used to assess the radiographic features. The pathologic features were rereviewed by 2 genitourinary pathologists, included tumor size, pT-stage and WHO/ISUP grade.

Outcomes

The main outcomes were local ipsilateral recurrence, distant metastases, and all-cause mortality (ACM). Local ipsilateral recurrence did not include the ipsilateral adrenal gland and distant metastases included the contralateral kidney. The follow-up duration was calculated from the date of treatment to the date of the main outcome or the date of the last follow-up.

Statistical analysis

The patient features were summarized as frequencies and percentages for categorical variables, and medians and interquartile ranges (IQRs) for continuous not normally distributed variables. To compare patient features between two groups, the Wilcoxon rank-sum test was used for not normally distributed variables, and the Fisher exact test and the chi-square test were used for categorical variables.

Due to the limited sample size in G4 subset, we have refrained from utilizing the PS matching. In the four subsets cohort we used overlap weighting (OW) [12] to balance the confounding factors caused by the nonrandomized design of this study. OW is a recently developed balancing weighting scheme which has been demonstrated to optimize the precision of the estimated association between the treatment and outcomes among a large class of propensity score (PS) weighting methods, including inverse probability treatment weight [13]. We used weight (Wi) according to the following formula: Wi = PS for RN and Wi = 1-PS for PN, where PS is the propensity score. The PS representing the probability of treating with RN was estimated using a logistic regression model with RN as the exposure, and all patient features, except nuclear grade, as the covariates (nuclear grade was used as a variable for subgroups) [14].

A total of 1,714 patients met the inclusion and exclusion criteria, whereas 23 patients who underwent RN and 16 patients who underwent PN were missing data for at least one of the patient characteristics studied. Therefore, a total of 1,675 patients formed the cohort for PS analysis, including 774 (46%) RN patients and 901 (54%) PN patients. The outcome differences were assessed between RN and PN in the four nuclear grade subsets. The probabilities of local ipsilateral recurrence-free, distant metastases-free, and overall survival were estimated using the Kaplan–Meier method, and the associations of treatment and outcome were examined using univariable and multivariable Cox proportional hazards regression models. Multivariable Cox regression was performed for variables with p < 0.05 on univariable Cox regression and summarized as hazard ratios (HRs) and 95% confidence intervals (CIs). Statistical analyses were performed using SAS version 9.4 and R version 4.2.2. All statistical tests were two-sided with significance set at p < 0.05.

Results

Patient features

The patient features are described in Table 1. There were 1,714 patients in the overall cohort, which included 797 (46%) patients who underwent RN and 917 (54%) patients who underwent PN (Fig. 1). The patients who underwent RN were significantly older with a lower preoperative eGFR, more symptoms (lumbago), larger tumors, and higher tumor stages than patients who underwent PN. All features were well balanced between RN and PN groups after PS adjustment (Table 2). The median follow-up time of the 1675 patients included in the PS analysis was 49 (IQR 26, 82) mo. During this time, 48 patients developed local ipsilateral recurrence, 60 developed distant metastases, and 52 died.

Table 1 Comparisons of patient features in the overall cohort (N = 1,714)
Fig. 1
figure 1

Study flow chart ccRCC = clear cell renal cell carcinoma; RN = radical nephrectomy; PN = partial nephrectomy; PS = propensity score

Table 2 Comparisons of patient features in the pseudo overall cohort after OW

We divided the overall cohort into four subsets, G1–G4, based on the different nuclear grades. As presented in Supplementary Tables 14, all patient features were balanced in the subsets after reweighting by OW, except preoperative eGFR in the G1 subset; tumor size in the G2 subset; and tumor size, cT stage, pT stage, and surgical approach in the G3 subset. The remaining imbalanced features were further adjusted by regression in survival analysis.

Subset analysis

Survival analysis of the G1 subset

The five-year local ipsilateral recurrence-free survival rates, five-year distant metastases-free survival rates, and five-year overall survival rates for patients who underwent RN and PN were 99.5% and 99.0%, 99.5% and 99.5%, and 99.5% and 99.2%, respectively (Figs. 2, 3 and 4). After applying the multivariable Cox proportional hazards model and further adjusting by the preoperative eGFR, no statistical differences in the risk of local ipsilateral recurrence, distant metastases, and ACM between PN and RN were observed (Table 3).

Fig. 2
figure 2

Local recurrence-free curves for RN versus PN after OW. A Local recurrence-free curve for RN versus PN in G1 subset after OW (p = 0.518). B Local recurrence-free curve for RN versus PN in G2 subset after OW (p < 0.05). C Local recurrence-free curve for RN versus PN in G3 subset after OW (p < 0.05). D Local recurrence-free curve for RN versus PN in G4 subset after OW (p < 0.05). The shaded area denotes the 95% confidence interval for the predicted probability. OW = overlap weighting; PN = partial nephrectomy; RN = radical nephrectomy

Fig. 3
figure 3

Distant metastases-free curves for RN versus PN after OW. A Distant metastases-free curve for RN versus PN in G1 subset after OW (p = 0.788). B Distant metastases-free curve for RN versus PN in G2 subset after OW (p = 0.699). C Distant metastases-free curve for RN versus PN in G3 subset after OW (p < 0.05). D Distant metastases-free curve for RN versus PN in G4 subset after OW (p < 0.05). The shaded area denotes the 95% confidence interval for the predicted probability. OW = overlap weighting; PN = partial nephrectomy; RN = radical nephrectomy

Table 3 Associations of nephrectomy type with oncologic outcomes in the four subsets cohort after further adjusting for the imbalanced features

Survival analysis of the G2 subset

The five-year local ipsilateral recurrence-free survival rates, five-year distant metastases-free survival rates, and five-year overall survival rates for patients who underwent RN and PN were 99.3% and 92.3%, 98.1% and 94.7%, and 98.2% and 94.8%, respectively (Figs. 2, 3 and 4). After applying the multivariable Cox proportional hazards model and further adjusting by tumor size, RN associated with a significantly reduced risk of local ipsilateral recurrence (HR = 0.148, 95% CI 0.046–0.474; p < 0.05) compared with PN. No statistical differences in the risk of distant metastases and ACM between PN and RN were observed (Table 3).

Fig. 4
figure 4

Overall survival curves for RN versus PN after OW. A Overall survival curve for RN versus PN in G1 subset after OW (p = 0.663). B Overall survival curve for RN versus PN in G2 subset after OW (p = 0.246). C Overall survival curve for RN versus PN in G3 subset after OW (p = 0.642). D Overall survival curve for RN versus PN in G4 subset after OW (p = 0.439). The shaded area denotes the 95% confidence interval for the predicted probability. OW = overlap weighting; PN = partial nephrectomy; RN = radical nephrectomy

Survival analysis of the G3 subset

The five-year local ipsilateral recurrence-free survival rates, five-year distant metastases-free survival rates, and five-year overall survival rates for patients who underwent RN and PN were 98.0% and 80.1%, 94.4% and 88.7%, and 98.2% and 99.3%, respectively (Figs. 2, 3 and 4). After applying the multivariable Cox proportional hazards model and further adjusting by tumor size, cT stage, pT stage, and surgical approach, RN associated with a significantly reduced risk of local ipsilateral recurrence (HR = 0.097, 95% CI 0.021–0.455; p < 0.05) compared with PN. No statistical differences in the risk of distant metastases and ACM between PN and RN were observed (Table 3).

Survival analysis of the G4 subset

The five-year local ipsilateral recurrence-free survival rates, five-year distant metastases-free survival rates, and five-year overall survival rates for patients who underwent RN and PN were 95.9% and 59.7%, 88.7% and 55.6%, and 63.1% and 47.7%, respectively (Figs. 2, 3 and 4). Compared with PN, RN associated with a significantly reduced risk of local ipsilateral recurrence (HR = 0.091, 95% CI 0.011–0.736; p < 0.05) and distant metastases (HR = 0.071, 95% CI 0.016–0.325; p < 0.05). No statistical difference in ACM between PN and RN was identified (Table 3).

Discussion

In subset analysis, we observed that RN was linked to a decreased risk of local recurrence in G2 (HR = 0.148), G3 (HR = 0.097), and G4 (HR = 0.091) subsets, with increased five-year local recurrence-free survival rates of 7.0%, 17.9%, and 36.2%, respectively. Although the oncologic outcomes of PN and RN in the treatment of pT1-ccRCC have been widely studied [1, 7, 8, 15,16,17,18,19], this equivalence is better reflected in overall survival (OS) and cancer specific survival (CSS). A randomized trial reported a higher rate of local recurrence for PN (6/268) compared to RN (1/273) [7], but the study did not separately examine whether the association between treatment and local recurrence was statistically significant. While there were studies supporting our results, a recent study by Gershman et al. revealed that, compared with PN, RN significantly associated with a marked reduction in local recurrence (HR = 0.19, 95% CI 0.11–0.35) [8]. Since the importance of nuclear grade in the prognosis of highly malignant renal tumors has been established [5, 20,21,22], assigning a greater complete resection range to patients with high nuclear grade can reduce the risk of postoperative recurrence, which supports clinical plausibility.

Surprisingly, the distant metastases-free curve showed statistical significance after OW of the data in the G3 subset (P < 0.05), but PN did not increase the risk of distant metastases after multivariate Cox regression modeling (P > 0.05). Given that the KM method is a nonparametric test and the Cox regression model is a semi-parametric test and can remove the impact of confounding factors, we believed that the results of the multivariate Cox regression model were more reliable.

Our results on the relationship between PN and distant metastases were different from those of previous studies [7, 8, 23,24,25]. We observed a significant association between RN and distant metastases in the G4 subset (HR = 0.094), with the five-year distant metastases-free survival rate increasing by 33.1%. In previous studies, which compared PN and RN for the treatment of T1 RCC, the number of patients with G4 RCC tended to be much smaller than that of the low grades. For example, the number of patients with G4 RCC treated with PN in the EORTC 30904 trial [7], as well as the studies of Gershman et al. [8], Simone et al. [22], Minervini et al. [23], and Antonelli et al. [24] were 1/268, 10/1175, 5/434, 1/332, and 13/1266 (G4/all grade), respectively. If the results are not analyzed separately, the substantial sample size of low nuclear grade groups may obscure the accurate results of the G4 group. Furthermore, in the RCC disease-free survival (DFS) model established by Correa, G1–G3 subsets associated with a significantly reduced risk of disease progression compared with the G4 subset (AF = 0.47, 95% CI 0.36, 0.62) [26]. This implied that G4 ccRCC was markedly more malignant than other nuclear grades, potentially impacting the efficacy of the operation. Therefore, we contend that, particularly for G4 patients, our study's results, which analyzed the G4 subset separately, are more reliable than previous studies with limited sample sizes for the G4 group. Finally, similar to local recurrence, assigning a wider resection range to G4 patients can reduce the risk of postoperative distant metastases also supports clinical plausibility.

No significant difference in ACM between PN and RN was observed, which is consistent with the results of previous studies [7, 8, 19, 22, 23]. Although some studies have reported an association between PN and the reduced risk of ACM [10, 18], this contradiction may be explained by the fact that RN associated with a lower postoperative eGFR [7, 8, 17, 18], and worsening renal function can increase in the incidence of complications, such as cardiovascular events [27], which can affect ACM. Of course, this may also be due to selection bias. Surprisingly, we did not observe a difference in ACM in the G4 subset, which had a large difference in the distant metastases-free survival rate. This was likely due to the short follow-up time (median follow-up time, 34 [IQR 26, 54] mo).

As for the laparoscopic and open groups. After applying the OW to balance baseline features, within the G4 subset, there was no significant correlation between the surgical approach (open vs. laparoscopic) and local recurrence (HR = 0.902, p = 1.108), distant metastasis (HR = 0.716, p = 0.570), or overall survival (HR = 0.580, p = 0.380). Similar results were observed in the other three nuclear grade subsets. This aligns with previous findings indicating that the outcomes of laparoscopic surgery are comparable to those of open surgery [28, 29].

Our study enrolled patients with pT1-stage ccRCC. If we wanted to extend these findings to patients with cT1-stage ccRCC, then we must consider that cT1 can be upstaged to pT3-4 compared with pT1. Shah reported that in patients upstaged to T3a from T1, PN was associated with a shorter recurrence-free survival compared to RN (HR = 2.04, 95% CI 1.12–3.68, p = 0.019) [30]. Thus, we provided a lower limit for the increased risk of recurrence associated with PN compared to RN for the cT1 stage.

There are several limitations in this study. Firstly, this was not a randomized trial, although we used OW to control the confounding factors brought about by non-randomness. However, PS method could not adjust for the patient features that were not measured and included in the PS model, so the possibility of residual confounding exists [14]. Secondly, our follow-up time was short (median follow-up time, 49 [IQR 26, 82] mo), especially in the G4 subset, which may have affected the estimation of the five-year rates. Thirdly, we conducted a multicenter study to expand the sample size, and a larger G4 ccRCC sample size was obtained compared with the other studies mentioned in the discussion [7, 8, 22,23,24]. However, the sample size of the G4 subset was still small. Then, owing to the belated adoption of robot-assisted surgery in the participating institutions, the count of patients undergoing robot-assisted PN/RN was limited. Additionally, patients in the participating institutions who had undergone robot-assisted surgery lacked an adequate follow-up period, prompting the exclusion of robot-assisted surgery as a covariate in our study. Finally, the cause of death was not available for some patients. Thus, to ensure the preciseness of the study, we used ACM instead of cancer-specific mortality and other-cause mortality as the outcome of the study.

Despite the above limitations, to our knowledge, this is the first study to compare the oncologic outcomes of PN and RN in four nuclear grade subsets. In comparison to previous studies with significant variations in nuclear grade sample sizes, our subset analysis enabled a more reliable comparison of the efficacy of RN and PN within high nuclear grade groups. Our study, for the first time, revealed that the risk of local ipsilateral recurrence and distant metastases is higher with PN than RN in the high nuclear grade (especially G4) group. Therefore, it is of great clinical significance to reassess the oncological equivalence of PN and RN in different nuclear grades. We will explore methods to enhance the preoperative determination of nuclear grade accuracy, aiming to offer more tailored surgical plans for patients with high nuclear grades.

Conclusions

Implications and key lessons learnt

Our results indicated that, in comparison to PN, RN was associated with a decreased risk of local ipsilateral recurrence in the G2, G3, and G4 subsets, a lowered risk of distant metastases in the G4 subset, and no significant difference in ACM among patients with pT1-ccRCC. Different from previous studies, our findings substantiate that opting for RN, as opposed to PN, is more advantageous for local recurrence-free survival and distant metastases-free survival in patients with high nuclear grade (especially G4) pT1-ccRCC. We recommend placing a heightened emphasis on enhancing preoperative nuclear grade assessment, as it can significantly influence the choice of surgical plan.

Limitations

The main limitation of this study comes from the non-random design. An additional randomized controlled trial with a larger sample size of high nuclear grade ccRCC is warranted, as the need for high-quality prospective data remains.