Introduction

According to a 2020 report, bladder cancer (BC) is the tenth most common malignancy overall and the sixth most common malignancy in men worldwide (Sung et al. 2021). Bladder urothelial carcinoma (BUC) constitutes over 90% of BC cases (Hsieh et al. 2023). Moreover, ~ 75% of BC patients, in whom BC is confined to the mucosa or submucosa, are defined as having non-muscle‑invasive bladder cancer (NMIBC) (2017); these patients are typically treated with transurethral resection of the bladder tumor (TURBT), followed by intravesical chemotherapy or Bacillus Calmette-Guérin vaccine administration (Babjuk et al. 2022). However, even after these treatments, recurrence or progression occur in nearly 40% of high-risk or very high-risk patients (Kamat et al. 2018). Although muscle-invasive bladder cancer (MIBC) accounts for only a small proportion of BC cases, it is associated with a higher risk of cancer-specific mortality in relation to NMIBC (Burger et al. 2013).

Radical cystectomy (RC) plus pelvic lymph node (LN) dissection (LND) and urinary diversion are the standard treatment options for MIBC patients and some patients with high-grade or very high-grade NMIBC (Lenis et al. 2020a, b). However, the development of lymph node metastasis (LNM) in BUC patients is one of the most valuable indicators of poor prognosis and tumor invasiveness (May et al. 2011; Shariat et al. 2012). The 5-year overall survival rate of BUC patients with LNM following RC treatment alone is only 19%. The combination of RC with neoadjuvant or adjuvant chemotherapy raises the 5-year overall survival rate to 31% and 26%, respectively, which is still inadequately low (Galsky et al. 2016). It was reported that nearly 8% of NMIBC patients and 25% of MIBC patients develop LNM (Lenis et al. 2020a, b). Nevertheless, BUC patients with LNM can still achieve prolonged survival with appropriate treatment measures before they develop distant metastases (Darwish et al. 2020). LND typically involves the removal of nodal tissue up to the common iliac bifurcation. Although one prospective randomized trial in BC patients found no significant survival advantage in extended LND, a trend towards a survival benefit was observed (Gschwend et al. 2019). Several reviews also suggested that extended LND prolonged survival in a subset of BUC patients with LNM (Bruins et al. 2014, Ghodoussipour and Daneshmand 2019). However, the complication rates of RC, classed according to the Clavien-Dindo Classification system, ranged from 50 to 87.5% (for grades 1–4) and the severe complication rate ranged from 30 to 42% (for > grade 3) (Cicione et al. 2020; Demaegd et al. 2020; Furrer et al. 2019; Haas et al. 2021). Some selected MIBC patients were reported to gain quality-adjusted life years after receiving bladder-sparing tri-modality therapy (Royce et al. 2019). Besides, RC may be viewed as an overtreatment for some high-grade NMIBC patients with a low risk of progression and metastasis (Klaassen et al. 2018). Therefore, it is vital to preoperatively predict LNM in clinical decision.

Several studies have extensively researched the role of nomograms in predicting LNM in BC to aid clinical decision making. While some of these studies used radiomic (Wu et al. 2017, 2018a, b) or genomic data (Lu et al. 2019, Wu et al. 2018a, b) to develop a preoperative nomogram, the difficulty of collecting suitable information restricts the clinical use of such a nomogram. Meanwhile, two other nomograms were designed to predict non-organ-confined BUC prognosis by including clinical and pathological information from BUC patients; however, their accuracy was limited by a selection bias introduced as a result of a small sample size and the use of data from a single institution (Green et al. 2013; Xie et al. 2012). A large-scale multicenter nomogram was constructed using demographic and pathological data from the Surveillance, Epidemiology, and End Results (SEER) database (Tian et al. 2021). However, the area under the receiver operating characteristic (ROC) curve (AUC) for this nomogram in the training cohort was only 0.69, indicating its low accuracy at predicting the development of LNM in BC patients after RC. Hence, in the present study, we aimed to develop and validate a preoperative nomogram for predicting LNM in BUC using patient demographic information, pathologic characteristics from TURBT specimens, imaging data, and laboratory measurements.

Materials and methods

Patients and patient characteristics

This retrospective study included patients with histologically confirmed BUC, who underwent RC and bilateral lymphadenectomy at the Affiliated Hospital of Qingdao University (between January 2016 and April 2022) and Qingdao Campus of Qilu Hospital of Shandong University (between January 2014 and December 2022). The exclusion criteria were as follows: (a) age < 18 years; (b) patients who did not undergo TURBT before RC or those without muscle tissue in their TURBT specimen; (c) patients with distant metastasis; (d) patients with incomplete imaging examination data before RC; (e) patients with tumors that originated at sites other than the bladder; (f) patients with incomplete laboratory measurements, collected within 1 month before RC; (g) patients with severe inflammation or immune system diseases; (h) patients receiving preoperative radiotherapy; (i) patients with severe or end-stage chronic kidney disease; and (j) patients with RC specimens which were not pathologically confirmed as BUC.

The following patient characteristics were recorded: (a) demographics, including age, sex, and body mass index; (b) pathologic TURBT characteristics before RC, including tumor grade, papillary tumor presence, urothelial variants, muscle invasion, and infiltration; (c) imaging characteristics, including hydronephrosis, extravesical invasion, LNM on imaging, and tumor size; and (d) laboratory measurements, including neutrophil count, monocyte count, basophil count, eosinophil count, lymphocyte count, erythrocyte count, platelet count, hemoglobin, fibrinogen, urea nitrogen, creatinine, and albumin. If patients underwent several rounds of TURBT, the pathological characteristics of the highest tumor grade or cancer stage were recorded.

The neutrophil to lymphocyte ratio (NLR), platelet to lymphocyte ratio (PLR), monocyte to lymphocyte ratio (MLR), and neutrophil to platelet ratio (NPR) were calculated from the respective cell counts. The systemic immune-inflammation index (SII) was calculated by multiplying the platelet count by the neutrophil count and dividing this value by the lymphocyte count. Pre-RC laboratory measurements were collected in cases when the latest TURBT was performed over 1 month before RC; otherwise, these measurements were collected before TURBT to reduce the impact of surgery on the results.

Independent risk factors for LNM and nomogram construction

Patients from the Affiliated Hospital of Qingdao University were assigned to the primary cohort, while those from Qingdao Campus of Qilu Hospital of Shandong University were assigned to the external validation cohort. Univariate and multivariate logistic regression analyses were performed to explore the independent preoperative risk factors for LNM in BUC. The significant risk factors identified in the univariate analysis of the primary cohort data were included in the multivariate logistic regression model. The nomogram of LNM in BUC was then established based on the significant risk factors identified in multivariate logistic regression model.

Internal and external validation of the nomogram

Next, the nomogram-associated ROC curve was plotted, and AUC was determined. Discrimination was assessed using Harrell’s C-index. Bootstrap resampling validation (with 1000 bootstrap resamples) was used to calculate a corrected C-index. The Hosmer–Lemeshow goodness-of-fit test and calibration curve were used to assess the calibration of the nomogram. Decision curve analysis (DCA) was performed to demonstrate net benefit for each risk threshold probability, as well as the clinical application value of the nomogram. In addition, the clinical impact curve was plotted to demonstrate the potential benefit of using the nomogram in clinical practice.

External validation was performed by applying the established internal nomogram model to the validation cohort. We therefore compared the ROC curves, AUC, Harrell’s C-indexes, corrected C-indexes, calibration curves, DCA results, and the clinical impact curves of the primary and validation cohorts to verify the stability of the nomogram.

Statistical analyses

Frequencies and proportions were used to describe categorical variables, means and standard deviations were used for continuous variables with a normal distribution, and medians and interquartile ranges were used for continuous variables with an abnormal distribution. Student's t test, Mann–Whitney U test, and Chi-squared test were used in the univariate analysis of continuous variables with a normal distribution, continuous variables with an abnormal distribution, and categorical variables, respectively. Forward step-wise selection was applied in the multivariate logistic regression analysis. The nomogram, ROC curve, calibration curve, DCA, and clinical impact curve were constructed/performed using the R software packages rms, pROC, and rmda. Statistical analyses were performed using SPSS (version 24.0) and R software (version 4.1.0). A bilateral P value < 0.05 was considered as a measure of statistical significance.

Results

Patient selection and characterization

A total of 641 potentially eligible BUC patients were identified from the Affiliated Hospital of Qingdao University cohort, while 250 patients were identified from the Qingdao Campus of Qilu Hospital of Shandong University cohort. After selection, we enrolled 522 BUC patients (85 patients with LNM and 437 patients without) in the primary cohort and 215 BUC patients (35 patients with LNM and 180 patients without) in the validation cohort (Fig. 1).

Fig. 1
figure 1

Flowchart showing the patient selection process for the primary and external validation cohorts. TURBT transurethral resection of the bladder tumor, RC radical cystectomy

Patient characteristics and the results from the univariate analyses of both cohorts are presented in Table 1. The rate of LNM was not significantly different between the primary and validation cohorts (χ2 test, P = 0.999). The univariate analysis of data from BUC patients with or without LNM in the primary cohort showed that those with LNM were significantly older (P = 0.017) and had the following characteristics: higher grade tumors (P < 0.001), which were not of papillary nature (P < 0.001); more extensive tumor infiltration (P < 0.001); more possibility of hydronephrosis (P < 0.001) and extravesical invasion (P < 0.001); more evidence of LNM on imaging (P < 0.001); larger tumors (≥ 4 cm) (P < 0.001); higher neutrophil count (P = 0.007); lower erythrocyte count (P = 0.027); higher platelet count (P = 0.034); lower hemoglobin (P = 0.011); higher fibrinogen (P < 0.001); higher serum creatinine (P = 0.004); lower albumin (P = 0.012); higher NLR (P = 0.001); higher PLR (P = 0.021); higher MLR (P = 0.003); and higher SII (P = 0.001).

Table 1 Baseline characteristics of the cohort by lymph node status

Independent risk factors for LNM and nomogram construction

Parameters that were identified as significantly different between BUC patients with LNM and those without in the univariate analysis were next included in the binary multivariate logistic regression analysis. We found that high tumor grade (odds ratio [OR] = 8.400, 95% confidence interval [CI] 1.814–38.904, P = 0.007), infiltration (OR = 1.878, 95%CI: 1.001–3.522, P = 0.050), extravesical invasion (OR = 2.743, 95% CI 1.548–4.861, P = 0.001), presence of LNM on imaging (OR = 3.823, 95% CI 2.065–7.078, P < 0.001), larger tumor size (≥ 4 cm) (OR = 2.469, 95%CI: 1.427–4.272, P = 0.001), and higher serum creatinine levels (OR = 1.008, 95% CI 1.000–1.017, P = 0.043) were independent preoperative risk factors for LNM in BUC (Table 2). We pooled these independent risk factors to establish the prediction model and presented the model as a nomogram (Fig. 2).

Table 2 Logistic regression analysis of the risk factors for lymph nodes metastasis (Forward LR)
Fig. 2
figure 2

A nomogram for preoperatively predicting LNM in BUC patients, according to tumor grade, tumor infiltration, extravesical invasion, LNM on imaging, tumor size, and creatinine levels. LNM lymph node metastasis, BUC bladder urothelial carcinoma

Internal validation of the nomogram

The AUC for the nomogram in the primary cohort was 0.817 (95% CI 0.767–0.866) (Fig. 3a), indicating that the nomogram effectively predicted LNM risk in BUC patients. According to the Hosmer–Lemeshow goodness-of-fit test, the nomogram had a P = 0.285, indicating a good logistic regression model fit. The corrected C-index was 0.805 after 1000 bootstrap resampling, demonstrating little change from the primary C-index value (0.817). The calibration curve (Fig. 4a) closely resembled the standard curve, demonstrating that the nomogram had a good level of reproducibility and reliability.

Fig. 3
figure 3

The ROC curves for preoperatively predicting LNM in patients with BUC in the primary (a) and external validation (b) cohorts. ROC receiver operating characteristic, LNM lymph node metastasis, BUC bladder urothelial carcinoma

Fig. 4
figure 4

Nomogram calibration curves in the primary (a) and external validation (b) cohorts (bootstrap method using 1000 repetitions)

External validation of the nomogram

After applying the established internal nomogram model to the validation cohort, the AUC was 0.825 (95% CI 0.752–0.897) (Fig. 3b), demonstrating that the nomogram exhibited effective predictive accuracy in the external validation cohort. According to the Hosmer–Lemeshow goodness-of-fit test, the nomogram had a P = 0.480 in the validation cohort, indicating a good logistic regression model fit. The corrected C-index was 0.799 after 1000 bootstrap resampling, indicating little change from the C-index value (0.825) of the external validation cohort. In addition, the calibration curve (Fig. 4b) closely resembled the standard curve, demonstrating that the nomogram also performed well in the external validation cohort.

Clinical application value of the nomogram

Results of the DCA for predicting LNM in BUC, showed that the model offered a clinical benefit in the primary cohort at a threshold of between 0.05 and 0.50 (Fig. 5a). Moreover, DCA demonstrated that the nomogram was helpful in predicting LNM in BUC patients when the threshold probability was 0.05–0.45 in the external validation cohort (Fig. 5b). The clinical impact curve showed that the nomogram had a good clinical application value in differentiating BUC patients with LNM and those without in both the primary (Fig. 6a) and validation (Fig. 6b) cohorts.

Fig. 5
figure 5

Nomogram decision curves in the primary (a) and external validation (b) cohorts; the x-axis indicates the threshold probability, while the y-axis indicates the net benefit. The grey line indicates all patients with LNM and the blue line indicates all patients without LNM. LNM lymph node metastasis

Fig. 6
figure 6

Nomogram clinical impact curves in the primary (a) and external validation (b) cohorts. The red curve indicates the number of patients who were classified as having LNM at each threshold probability according to the nomogram, while the blue curve indicates the number of patients with true LNM at each threshold probability

Discussion

LNs are the most common site of bladder cancer metastasis. However, patients with pathological N1–3 tumor were reported to have a significantly shorter 5-year overall survival than those with pN0, indicating that LNM in BUC is associated with a poor prognosis (Karl et al. 2009; Zehnder et al. 2014). Because numerous studies have recognized LNM as the most valuable independent prognostic marker for survival outcomes in BUC, the sensitive and accurate identification of LNM in BUC is of crucial importance (Kawada and Taketo 2011). Results from previous LNM mapping studies have revealed the presence of malignant LNs outside the region of standard LND in a group of patients with LNM (Dorin et al. 2011; Wang et al. 2014). Because extended LND is more challenging to perform and is associated with more perioperative complications than standard LND, it should be used only when strictly necessary in clinical practice. The preoperative prediction of LNM would enable clinicians to choose between extended or standard LND and bladder-sparing treatments or RC, for the benefit of BUC patients.

Computed tomography (CT) and magnetic resonance imaging (MRI) are the imaging methods most commonly used to evaluate whether abdominal and pelvic LNs contain evidence of tumor involvement. However, both CT and MRI have difficulty in identifying normal-sized or minimally-enlarged metastatic modes, thus, limiting the accuracy of LNM identification (Witjes et al. 2021). Several studies have demonstrated that conventional cross-sectional imaging techniques, including CT and MRI, have high specificity but low sensitivity when it comes to identifying LNM. For example, Brunocilla et al. reported that the specificity of enhanced-CT in identifying LNM in BUC was 89%, but the sensitivity was only 14% (Brunocilla et al. 2014). One recent study also found that there was only 64.9% agreement between the N stage identified using conventional cross-sectional imaging and the pathologic N stage; again, high specificity (84%) and low sensitivity (30%) was observed (Lonati et al. 2022). Data from our primary (specificity = 90.6%; sensitivity = 36.5%) and external validation (specificity = 95.0%; sensitivity = 25.7%) cohorts echoed these findings. Although the most advanced imaging techniques, such as positron emission tomography (PET)-CT, had a higher sensitivity and a similar specificity to CT and MRI, they still could not accurately predict LNM in BUC patients (Brunocilla et al. 2014; Einerhand et al. 2020; Ha et al. 2018). Besides, the high cost of PET-CT has limited its widespread use in the preoperative examination of BUC patients. Thus, the aim of the present study was to develop an accurate nomogram to predict LNM in BUC, by including common preoperative imaging parameters and other readily accessible clinical data. Our univariate analyses revealed that BUC patients with pathologically confirmed LNM had significantly more possibility of hydronephrosis, extravesical invasion, LNM on imaging, and enlarged tumors. The multivariate logistic regression analysis then showed that extravesical invasion, LNM on imaging, and tumor size were independent risk factors for LNM in BUC. A study using data from the SEER database also identified tumor size as an independent predictor of LNM in BC (Tian et al. 2021). Moreover, LNM (detected using imaging) was incorporated into two other nomograms, which were based on radiomics signatures, as a significant LNM risk factor (Wu et al. 2017, 2018a, b). Although T stage was reported as independent predictor of LNM in BC by several groups (Shariat et al. 2012; Tian et al. 2021; Schuettfort et al. 2022), the staging accuracy of CT, as the most common radiologic imaging tool in BC, was low. Therefore, in the present study, we selected the “presence or absence of extravesical invasion” parameter instead of “T stage”, and eventually identified it as an independent predictor of LNM in BUC.

To obtain an accurate histopathological diagnosis and reliable staging data, TURBT is typically recommended before RC; moreover, the resection specimen should contain bladder muscle tissue (Witjes et al. 2021). The univariate analysis showed that BUC patients with LNM had significantly higher grade tumors and more infiltration, with little papillary tumor presence, compared with LNM-negative BUC patients. Meanwhile, the multivariate logistic regression analysis identified tumor grade and infiltration as the most significant pathological risk factors for LNM in BUC. Tian et al. and Kim et al. also used TURBT pathological data to identify tumor grade as a predictor of LNM in BC (Tian et al. 2021; Kim et al. 2015). Although the risk of LNM was reported to be significantly higher in MIBC patients than in NMIBC patients (Karakiewicz et al. 2006; Tian et al. 2021), we discovered no statistically significant difference in the extent of muscle invasion between these groups in our study. This might be because the staging accuracy of TURBT pathological examination is low. This is evidenced by the fact that 25–51% patients who were diagnosed with NMIBC following TURBT were upstaged to MIBC at RC (Shariat et al. 2007; Svatek et al. 2011; Turker et al. 2012). Thus, the reason for identifying infiltration as an independent risk factor for LNM in BUC might be because it is accurately reported following TURBT. As for urothelial variants, one recent published study indicated that non-muscle invasive BUC patients with variant histology had significantly worse disease-free survival and cancer-specific survival (Lopez-Beltran et al. 2022). We explored the relationship between urothelial variants and LNM, and we discovered no statistically significant difference between patients with and without urothelial variants. One article demonstrated that BUC patients with urothelial variants had better or worse prognosis compared with pure BUC depending on different specific variants (Claps et al. 2023). Therefore, the influence of each variant in LNM should be further explored in the future. In the present study, we chose to include the papillary status of BUC tumors as a parameter because tumors with a papillary component are associated with slower BC progression (Beijert et al. 2023). Our univariate analysis of both BUC cohorts showed that tumors with a papillary component were significantly associated with reduced possibility of LNM; however, it was not an independent predictor in the final model. Because immunohistochemistry is not a mandatory tool in BC diagnosis (Comperat et al. 2021), the presence of lymphovascular invasion (LVI) was not accurately reported for many of the TURBT specimens included in our study. Thus, we did not include incomplete LVI variable, even if it was reported to be associated with a worse BUC prognosis and increased LNM risk (Mari et al. 2019; Martin-Doyle et al. 2015).

The diagnostic and prognostic value of preoperative inflammatory biomarkers and other laboratory measurements in BUC has been previously reported. For instance, Tang et al. found that preoperative NLR, SII, and derived NLR were significantly different between BUC patients with LNM and those without (Tang et al. 2020). D'Andrea et al. also demonstrated that MLR and NLR were associated with LNM in BUC (D'Andrea et al. 2017). Other measurements, such as hemoglobin and PLR, are also linked to poor prognosis or LN status in BUC (Pang et al. 2016; Sejima et al. 2014; Viers et al. 2014). In the present study, we used preoperative laboratory measurements and inflammatory biomarkers, including NLR, PLR, MLR, NPR, and SII, to predict LNM in BUC. Two previously published studies identified MLR, fibrinogen, and NLR as preoperative predictors of LNM in BUC and included them in nomograms (Ou et al. 2020; Schuettfort et al. 2022). Although most laboratory measurements (including MLR, fibrinogen, and NLR) included in our study were significantly different between BUC patients with LNM and those without in the univariate analysis, serum creatine was the only variable capable of independently predicting LNM in BUC before RC in the multivariate analysis. Besides, a recent multicenter study indicated that low albumin-to-fibrinogen ratio was also associated with LNM at time of RC (Claps et al. 2021). Although both albumin and fibrinogen were significant in our univariate analysis, they were not significantly different between patients with and without LNM in multivariate logistic regression analysis. The opposite conclusion indicated the controversy of these markers, which should be further analyzed in research with larger sample size.

Thus, the final nomogram to preoperatively predict LNM before RC in patients with BUC was based on tumor grade, infiltration, extravesical invasion, LNM on imaging, tumor size, and serum creatinine levels. To ensure the clinical relevance of our nomogram, we included variables which were representative and readily available from the pathological analysis of TURBT specimens, imaging data, and laboratory measurements. The nomogram had an AUC of 0.817 (95% CI 0.767–0.866) in the primary cohort and an AUC of 0.825 (95% CI 0.752–0.897) in the external validation cohort, suggesting that it exhibited a good level of predictive accuracy. The calibration curves of the primary and validation cohorts indicated that the model performed consistently well during internal validation and external validation. Moreover, the DCA results and clinical impact curves of both cohorts showed that the nomogram was highly clinically applicable.

Our study had some limitations. First, the retrospective study design can lead to inaccurate data selection and the introduction of other potential confounders. Second, although the internal and external cohort validation results were consistent and stable, the sample size was small, especially for the external validation cohort. Third, because we were limited by the quality of the pathological data derived from TURBT samples, we did not collect LVI or carcinoma in situ data for inclusion in the nomogram. The limited sample size also restricted us to analyze the influence of each variant of urothelial variants in LNM in BUC patients. Finally, some new model constructing algorithms, such as machine learning, have been used to predict the risk of LNM in prostate cancer and renal cell carcinoma (Li et al. 2022; Sabbagh et al. 2023; Zhang et al. 2023). Our model was developed using traditional univariate and multivariate logistic regression analyses. Further research is needed to determine the comparative efficacy of conventional and modern models in preoperatively predicting LNM in BUC.

Conclusion

Here, we used pathologic information from TURBT specimens, imaging data, and laboratory measurements to develop a nomogram, which comprised the tumor grade, infiltration, extravesical invasion, LNM on imaging, tumor size, and serum creatinine parameters to preoperatively predict LNM in BUC. This nomogram displayed high levels of accuracy, reliability, and clinical applicability following internal and external validation.