Introduction

Neuroblastoma (NB) is a childhood malignancy that originates from the developing sympathetic nervous system (Maris 2010; Tolbert and Matthay 2018; Vo et al. 2014; Irwin and Park 2015). The most common primary sites of NB are the adrenal medulla and the sympathetic ganglia (Maris 2010; Tolbert and Matthay 2018; Vo et al. 2014; Irwin and Park 2015). As the most common extracranial solid tumor in childhood, NB accounts for approximately 6–10% of all pediatric malignancies, with an annual incidence of around 1 case per 10,000 children under the age of 15 in the United States (Lu et al. 2021; Yao et al. 2017). The treatment of neuroblastoma follows a multidisciplinary model, determined by a multitude of factors including age at diagnosis, tumor stage, and tumor biology. In recent years, the ongoing enhancement of conventional treatment techniques like surgery, chemotherapy, and radiotherapy, combined with the advancement of emerging therapies such as immunotherapy, has substantially improved the prognosis for children with neuroblastoma.

Despite these advancements, the prognosis for patients with neuroblastoma still demonstrates substantial variability due to the considerable tumor heterogeneity. Patients with low- and intermediate-risk neuroblastoma exhibit an overall survival (OS) rate exceeding 90%, while those with high-risk neuroblastoma face a dismal prognosis, with survival rates as low as 50% (Baker et al. 2010; Rubie et al. 2011; Strother et al. 2012; Pinto et al. 2015; Morgenstern et al. 2018). The prognosis of neuroblastoma is widely recognized as highly reliant on the tumor stage, with two primary staging systems in use: one centered on post-surgical staging (the International Neuroblastoma Staging System, INSS), and the other emphasizing risk classification prior to treatment (the International Neuroblastoma Risk Group Staging System, INRGSS) (Brodeur et al. 1993; McCarville 2011; Monclair et al. 2009). However, current tumor stage approaches fail to provide precise personalized prognostic models for predicting OS in patients with neuroblastoma. Consequently, there is an urgent need to assess the prognosis and risk stratification of patients with neuroblastoma early on.

In recent years, nomograms have demonstrated superiority over traditional TNM staging system and have been extensively used for individualized estimation of prognosis in various malignancies (Liang et al. 2015; Zhou et al. 2018; Sharouni et al. 2021). Within the realm of neuroblastoma prognosis, nomograms exhibit immense potential in offering a personalized approach to OS and risk stratification, generating a visually interpretable probability of a specific outcome. The objective of this study was to develop an accurate and reliable predictive nomogram for estimating OS and providing individualized risk assessment for neuroblastoma patients using the Surveillance, Epidemiology, and End Results (SEER) database.

Methods

Patients and methods

Data source

The patient data were extracted from the SEER database of the National Cancer Institute (NCI). The SEER database, administered by the NCI, serves as the authoritative source of information that provides updated data on cancer incidence and patient survival rates from population-based cancer registries, covering approximately 48.0% of the U.S. population.

Patients selection

Patients with neuroblastoma between 2004 and 2015 were selected from the SEER database according to the International Classification of Diseases for Oncology 3rd Edition (ICD-O-3). Inclusion criteria were: (1) age equal to or below 18 years, and (2) diagnosed with neuroblastoma or ganglioneuroblastoma (GNB). Exclusion criteria included: (1) survival time less than 1 month or unknown, (2) deaths attributed to causes other than neuroblastoma or ganglioneuroblastoma or unknown cause of death, (3) uncertainty surrounding whether cancer-directed surgery was performed, and (4) uncertainty regarding whether a surgical procedure other than at the primary site was performed. The selection criteria and screening process are depicted in Fig. 1.

Fig. 1
figure 1

Flowchart illustrating the inclusion and exclusion of patients

Clinical variables and outcomes

The collected patient information included age, race, sex, histology, primary site, tumor number, tumor size, first malignant primary indicator, tumor grade, distant metastases, tumor stage, surgery, scope of regional lymph node surgery, regional nodes, surgical procedure of other sites, chemotherapy, and radiotherapy. Racial categories were classified as White and others (including Asian or Pacific Islander, American Indian/Alaska Native, Black, or Unknown). The primary site was determined in several locations including the adrenal gland, retroperitoneum, and others. The tumor stage was classified into four types based on SEER Combined Summary Stage: localized, regional, distant, and unknown/unstaged. Distant metastases occurred in organs such as the bone, brain, liver, and lung. The optimal cutoff value for tumor size was determined by the X-Tile software, and then categories as 0–62 mm, 63–87 mm, 88–989 mm, and unknown. Tumor grade was stratified as Grade I (well-differentiated), Grade II (moderately differentiated), Grade III (poorly differentiated), Grade IV (undifferentiated), or unknown. OS served as the primary endpoint, defined as the duration (in months) from the date of diagnosis to death or the last follow-up.

Statistical analysis

Analyses were performed using SPSS 26.0 (IBM, Chicago, IL, USA) and R software (version 4.2.2). Statistical significance was defined as P < 0.05 using two-sided tests. The measurement data was described using the median and interquartile range (IQR). The enumeration data, described as the number of cases or percentage, was analyzed using the chi-square test. A total of 2185 patients were assigned to training and testing groups in a 7:3 ratio. The training group was designated for constructing the nomogram and internal validation, while the testing group was used for model validation.

Prognostic nomogram construction

Univariate and multivariate Cox regression analyses were carried out to identify independent prognostic factors. Significant factors (P < 0.05) were selected for nomogram construction. Variables were represented as line segments with varying lengths according to weight, with scores ranging from 0 to 100. Total scores predicted 1, 3, and 5-year OS.

Prognostic nomogram validation

The concordance index (C-index) was used to measure the accuracy of model predictions. A value above 0.7 indicates that the predictive model has excellent discriminative ability. The receiver operating characteristic (ROC) curve was employed to evaluate the performance of classification models. An area under the ROC curve (AUC) above 0.7 signifies that the model possesses excellent discriminative ability. The calibration curve was used to verify the accuracy of probability predictions. A curve close to the 45-degree diagonal line indicates that the predicted probabilities are consistent with the actual observed probabilities. Finally, decision curve analysis (DCA) was utilized to appraise the clinical utility of a predictive model at different thresholds.

Risk stratification based on nomogram

Patients were categorized into three groups according to their total scores on the nomogram using X-tile software: the low-risk group (total score ≤ 140), the intermediate-risk group (140 < total score < 223), and the high-risk group (total score ≥ 223). The differences in survival among these risk categories were compared using Kaplan–Meier curves and log-rank tests.

Dynamic nomogram construction

A web-based dynamic nomogram was constructed using the open source R Shiny Server, which allows clinicians to conveniently assess patient prognosis using the nomogram.

Results

Patient characteristics

A total of 2,185 patients were diagnosed with neuroblastoma or ganglioneuroblastoma between 2004 and 2015. Of these, 1529 were assigned to the training group and 656 to the testing group. The demographic and clinical features of the patients are outlined in Table 1. No significant differences were observed between the training and testing groups (P > 0.05). Generally, the median age of children was 1 year (IQR: 0–3), comprising 1146 males and 1039 females. The primary tumor site was primarily the adrenal gland (45.9%), with 10.3% in the retroperitoneum. Bone, liver, lung, and brain metastases were present in 16.3%, 6.0%, 2.4%, and 1.8% of the patients, respectively. Moreover, 78.8% of patients underwent surgery, 66.7% received chemotherapy, and 24.4% underwent radiotherapy.

Table 1 Demographics and clinical characteristics of patients with neuroblastoma in the training and testing groups

Nomogram construction

We initially identified variables strongly associated with outcomes (P < 0.05) through univariate analysis in the training group. These variables comprised 14 factors, including age, histology, metastases (bone, brain, liver, lung), regional nodes, primary tumor site, surgical procedure at other sites, tumor stage, tumor size, chemotherapy, radiation, and regional lymph node surgery, which significantly impacted OS (Table 2). These factors were incorporated into a multivariate Cox analysis for OS. Age, chemotherapy, brain metastases, primary site, tumor stage, and tumor size were identified as independent risk factors (Table 2). These factors were then used to construct a nomogram predicting 1-, 3-, and 5-year OS of NB patients (Fig. 2). The sum of scores for individual factors in the nomogram provides an estimate of patient prognosis.

Table 2 Univariate and multivariate Cox regression analyses of overall survival in patients with neuroblastoma in the training group
Fig. 2
figure 2

Nomogram predicting 1-, 3-, and 5-year overall survival (OS) rates for patients with neuroblastoma

Nomograms validation

The accuracy and applicability of the nomogram were assessed using both the training and testing groups for internal and external validation. The C-index was first used for estimation, resulting in values of 0.790 (95% CI 0.768–0.812) and 0.781 (95% CI 0.750–0.812) for the training and testing groups, respectively. In contrast, the C-index values for the conventional tumor stage were 0.718 (95% CI 0.698–0.738) and 0.724 (95% CI 0.695–0.754) in the training and testing groups, respectively. In the ROC curve analysis, the AUC values for predicting 1-, 3-, and 5-year OS were 0.764 (95% CI 0.724–0.804), 0.812 (95% CI 0.784–0.840), and 0.829 (95% CI 0.803–0.855) in the training group, and 0.730 (95% CI 0.670—0.791), 0.783 (95% CI 0.743–0.823), and 0.810 (95% CI 0.773–0.848) in the testing group, respectively (Fig. 3A, B). Comparatively, the AUC results for the conventional tumor stage were 0.718 (95% CI 0.682–0.753), 0.732 (95% CI 0.708–0.758), and 0.750 (95% CI 0.725–0.774) in the training group, and 0.713 (95% CI 0.656–0.770), 0.736 (95% CI 0.701–0.771), and 0.746 (95% CI 0.711–0.782) in the testing group for predicting 1-, 3-, and 5-year OS, respectively (Fig. 3C, D). The calibration plots also demonstrated satisfactory concordance between nomogram-predicted risk and observed risk for 1-, 3-, and 5-year OS in both the training and testing groups (Fig. 4A, B). Furthermore, the decision curves revealed that the nomogram displayed positive clinical utility in predicting OS at 1-, 3-, and 5-year intervals in both the training group (Fig. 5A–C) and the testing group (Fig. 5D–F). Overall, these results demonstrated the exceptional discriminative ability of the constructed model.

Fig. 3
figure 3

Receiver operating characteristic (ROC) curves of the nomogram and combined summary stage. A ROC curves for 1-, 3-, and 5-year OS using the nomogram model in the training group. B ROC curves for 1-, 3-, and 5-year OS using the nomogram model in the testing group. C ROC curves for 1-, 3-, and 5-year OS using the combined summary stage in the training group. D ROC curves for 1-, 3-, and 5-year OS using the combined summary stage in the testing group.

Fig. 4
figure 4

Calibration plots of the nomogram. A Calibration plots of 1-, 3-, and 5-year OS in the training group. B Calibration plots of 1-, 3-, and 5-year OS in the testing group

Fig. 5
figure 5

Decision curve analysis (DCA) of the nomogram. A–C DCA curves of 1-, 3-, and 5-year OS in the training group. D–F DCA curves of 1-, 3-, and 5-year OS in the testing group. The green line signifies the assumption that no patients have died, while the red line represents the supposition that all patients have died

Nomogram-Based risk stratification system

The overall prognostic score for each patient was calculated based on variables within the nomogram. Two optimal cutoff values were identified at 140 and 223 scores using X-tile software. Utilizing these thresholds, patients were stratified into low, intermediate, and high-risk groups, consisting of 1100 patients (50.3%, total score ≤ 140), 831 patients (38.0%, 140 < total score < 223), and 254 patients (11.6%, total score ≥ 223), respectively. The OS rates for the low-risk group were 98.7%, 97.4%, and 97.0% at 1, 3, and 5 years, respectively. In the intermediate-risk group, the OS rates were marginally lower at 90.0%, 77.1%, and 70.4% at 1, 3, and 5 years, respectively. However, patients in the high-risk group exhibited a substantially worse prognosis, with OS rates of merely 86.7%, 61.6%, and 46.9% at 1, 3, and 5 years, respectively (Fig. 6A).

Fig. 6
figure 6

Establishment of a risk stratification system through the optimal cut-off value of the risk score. A Kaplan–Meier survival analysis comparing different risk groups based on nomogram scores. B, C Subgroup analysis of the primary tumor site (B) and surgery (C) according to the new risk stratification. The survival time is expressed in months

Subgroup analysis based on the new risk stratification

To underscore the benefits of risk stratification, we conducted an analysis of primary tumor sites and surgical outcomes across the different risk groups. Intriguingly, we observed a poorer prognosis in the retroperitoneum for the intermediate-risk group, whereas a worse prognosis was noted in the adrenal gland for the high-risk group (Fig. 6B). Moreover, we found that surgical intervention significantly enhanced the prognosis for the high-risk group (Fig. 6C).

Web-based nomogram

We developed a web-based nomogram to predict patient outcomes (≤ 18 years) diagnosed with neuroblastoma. This accessible tool empowers physicians and patients alike to individually and visually appraise survival probability of each patient by selecting common clinical variables (https://neuroblastomanomogram.shinyapps.io/DynNomapp/).

Discussion

The prognosis of NB is known to vary considerably based on a multitude of clinical and biological factors. However, specific biological markers may not be feasible in regions where medical resources are limited. In this study, we analyzed data from 2185 pediatric patients diagnosed with neuroblastoma between 2004 and 2015, utilizing the SEER database. We successfully identified age, chemotherapy, brain metastases, primary site, tumor stage, and tumor size as independent risk factors significantly impacting overall survival. Based on these factors, we developed a nomogram that accurately predicts 1-, 3-, and 5-year OS rates, outperforming conventional tumor staging methods in both internal and external validations. Furthermore, we established a risk classification system, derived from the nomogram model, that effectively stratifies patients into low, intermediate, and high-risk groups, facilitating early prognosis assessment. Collectively, the independent factors constituting the nomogram can be readily obtained through standard clinical practice, enhancing their broad applicability.

The prognosis of neuroblastoma patients is influenced by various key risk factors, among which the age at diagnosis plays a crucial role (Sokol et al. 2020). For stage 3 and 4 MYCN non-amplified tumors, patients under 18 months of age exhibit better event-free survival than those 18 months or older (Sokol et al. 2020). In line with this, our results also demonstrate that older patients have worse OS rates. Moreover, distant metastases is also a significant predictor of neuroblastoma patient outcomes, with a 5-year survival rate of only 19.9% for patients with brain metastases (Hu et al. 2019; Coughlan et al. 2017). Our results further corroborate that brain metastases is an independent risk factor. The primary tumor site in neuroblastoma also affects numerous aspects, including clinical and biological characteristics, event-free survival, and overall survival, with tumors in the adrenal gland associated with poorer outcomes (Vo et al. 2014). Interestingly, our findings reveal that the poorest prognosis was identified in the retroperitoneum for the intermediate-risk group, and in the adrenal gland for the high-risk group. This discrepancy may be due to differing tumor behavior and biological characteristics in these locations, although further investigation is needed to fully understand the underlying mechanisms. Furthermore, the tumor stage at diagnosis and tumor size are crucial prognostic factors for neuroblastoma patients (Brodeur and Maris 2002; Wang et al. 2022). According to our results, patients with distant tumors have a worse prognosis than those with localized and regional tumors. Additionally, larger tumors are correlated with a worse prognosis. Collectively, these independent risk factors constitute a predictive model with potential clinical utility.

The treatment strategies for neuroblastoma are multifaceted, encompassing surgery, chemotherapy, radiotherapy, retinoic acid, immunotherapy, and other supplemental treatments. Surgery plays an indispensable role in the treatment of neuroblastoma; however, it comes with its own set of challenges and potential risks, such as vascular damage or bleeding (Simon et al. 2013). Our findings demonstrate that, among high-risk patients, surgical intervention significantly improves survival outcomes. Hence, the utility of surgery may be underestimated in these high-risk patients. We recommend for the consideration of surgical intervention, wherever possible and safe, for high-risk neuroblastoma patients. In contrast to surgery, our study identified a significant association between chemotherapy and unfavorable outcomes. This could potentially be attributable to the patients receiving high-intensity chemotherapy, who were already categorized as high-risk, or it could be due to deaths related to the treatment itself.

Our study admittedly has several limitations. Firstly, the SEER database lacks some crucial prognostic variables, including MYCN amplification status, DNA ploidy, and the INSS stage. However, given that the variables included in the nomogram are readily available and easy to generalize, the nomogram predictive model based on the SEER database remains a valuable tool. Secondly, our study is retrospective, which could introduce selection bias. Therefore, further prospective clinical data are required to verify the accuracy and validity of our results.

In summary, we developed a pragmatic nomogram and risk stratification system that outperforms traditional tumor staging methods in predicting the overall survival of neuroblastoma patients. The incorporation of easily accessible clinical risk factors significantly bolsters the clinical applicability and utility of the model.