Introduction

Gastric cancer (GC) is the fifth most common tumor and the third leading cause of cancer-related mortality worldwide, accounting for over 1,000,000 new cases and 783,000 deaths worldwide in 2020 [1]. In Italy, 14,500 new cases of GC and 8700 GC-related deaths have been estimated to be in 2020 [2].

Gastrectomy with adequate lymphadenectomy represents the gold standard for treatment of resectable disease. Although surgery offers the best chances of curative treatment for GC, recurrences occur in 20–50% of patients after surgery [3]. In fact, recurrence typically occurs within 3 years of surgery and is associated with a poor prognosis [3,4,5]. Traditionally, depth of tumor invasion, nodal metastasis, lymphovascular invasion and Lauren’s classification are the main risk factors associated with recurrence [3,4,5,6].

Accurate staging systems are crucial to assess prognosis and recommend chemotherapy or close surveillance. However, the TNM staging system has been criticized for grouping patients within the same stage despite these often have different prognosis [7, 8]. The N stage, which stratifies patients according to the number of metastatic lymph nodes retrieved at surgery, was reported to be an important prognostic factor affecting survival. For this reason, a patient undergoing an inadequate nodal dissection may be under-staged, and subsequent survival predictions would be inaccurate [9, 10]. Beyond TNM stage, GC survival is affected by several other factors, such as demographic factors (e.g. age, gender, genetic predisposition), comorbidities, tumor characteristics (e.g. size, location, Lauren type), as well as the role of chemotherapy and post-operative complications [11, 12]. Therefore, a prognostic system reporting purely data on tumor depth, nodal status and the presence of metastasis will be incomplete and deficient of several elements affecting prognosis [13, 14].

More recently, efforts have increasingly focused on developing improved prognostic systems to offer a more accurate prediction of long-term prognosis and risk of recurrence [15, 16]. In particular, nomograms have been increasingly adopted within the oncological field for a variety of tumors. Nomograms are graphical representations of complicated algorithms that are able to estimate survival of an individual patient by combining information on demographics and tumor characteristics with data on depth of tumor invasion, nodal status and metastasis provided by the TNM staging system. Nomograms can be helpful both in the pre-operative setting to estimate the risk of lymph node metastasis, as well as in the post-operative setting to estimate overall survival (OS) and recurrence risk [8]. While several nomograms are available for GC, most of these nomograms were based exclusively on data from United States or Asia [7, 11, 16,17,18]. To do, GC nomogram based on European data is lacking, and we sought to develop a novel nomogram based on a 10-year single-institution experience in GC management.

The aim of the current study was to give an individual prognostication of recurrence risk and survival using these graphical algorithms.

Methods

Data collection

All the patients who underwent curative-intent resection for GC from January 2010 to May 2020 were retrospectively collected.

Inclusion criteria were primary histologically proven gastric cancer and curative-intent surgery (R0-R1). Gastroesophageal Siewert type III tumors were included in the study, while types I and II were excluded. Patients who underwent surgery with a palliative intent (R2) or urgent/emergent surgery were also excluded. Data on demographics (i.e. age, gender, BMI, comorbidity, familiar history of gastric cancer, Helicobacter pylori infection), symptoms (bleeding, obstructive symptoms) and pre-operative work-up (endoscopy, CT scan, endoscopic ultrasound) were collected.

GC resection was performed using an open approach and grouped according to the type of resection (total, distal, proximal gastrectomy, extended total gastrectomy, esophagogastric resection, pancreaticoduodenectomy, remnant gastrectomy). Remnant gastrectomy was performed in patients with a previous gastric resection for benign disease or gastric remnant novel tumors. Surgical procedures other than gastric resection, such as esophagogastric resection, pancreaticoduodenectomy or multivisceral resection, were performed to achieve negative resection margins at final pathology. Data regarding chemotherapy regimens and radiation therapy, both perioperative and adjuvant, were also recorded. Post-operative complications occurring within 30 days from surgery were classified according to Clavien–Dindo classification, and categorized into minor complications (i.e. Clavien–Dindo 0–2) and major complications, (i.e. Clavien–Dindo 3–5) [19].

Lymphadenectomy was classified as D1 or D2 according to Japanese Gastric Cancer Association (JGCA 5th ed.) guidelines [20]. A D2 lymphadenectomy was performed when a locally advanced disease or nodal metastasis were clinically and/or intra-operatively suspected. Lymph node status was recorded considering the presence or absence of metastatic nodes at any nodal station (from n.1 to n.12.), the total number of nodes retrieved in the specimen, the total number of metastatic nodes, and the lymph nodes ratio (LNR).

Data concerning histopathological examination were recorded including margin status (R0: no cancer at resection margins, R1: microscopical residual cancer), tumor histological subtype according to the WHO classification, tumor size, lymphovascular invasion, Lauren’s classification, Ming classification and grading. The 8th edition of AJCC/UICC TNM was used for cancer staging [21]. Data prior to 2017 were updated to the 8th edition of TNM staging system.

Follow-up was obtained matching clinical visits recorded in electronic archives of Surgical and Oncological divisions. Follow-up time was calculated from the date of surgery to the date of last contact. Disease-free survival (DFS) was defined as the time from surgery to the first documented recurrence. Overall survival (OS) was defined as the time from operation to death or last follow-up. The pattern of recurrence was categorized as local (i.e. recurrence involving anastomosis or gastric remnant), nodal (including both loco-regional and distant nodes) or distant (i.e. peritoneum, liver, lung, bone or multiple sites).

Statistical analysis

Descriptive statistics were reported as absolute number percentages for categorical variables, while continuous variables were expressed as median values with interquartile ranges (IQR). OS and DFS were calculated from the date of surgery to the date of the event (local or distant recurrence, death, or the last follow-up), and were evaluated using Kaplan–Meier method. Variables associated with OS and DFS were recorded and analysed according to univariable and multivariable Cox models. Results were reported as Hazard Ratio (HR), 95% CI and p value. A p value < 0.05 was considered statistically significant.

The primary end-point of the study was to create nomograms to predict OS and DFS. Nomograms predicting OS and DFS were built according to variables resulting from multivariable Cox models. The performance of the two models was internally cross-validated via bootstrap resampling procedure with 10,000 replicates to quantify any overfitting. Discrimination ability was calculated using the Harrell’s Concordance Index [22], which is a proxy of the concordance between predicted and observed outcomes. C-index values within 0.7–0.8 indicated a good discrimination, while values > 0.8 indicated an excellent discrimination.

All the analyses were performed using R software (version 4.0.3) [23] with the packages survival and rms [24].

Results

Patients’ demographic and clinical characteristics

Overall, 168 patients were collected (Table 1). Median patient age was 71 years (IQR 62–77), and 57.1% (n = 96) of patients were male. Median BMI was 24.1 kg/m2 (IQR 22.4–28.4). Familial history of gastric cancer was present in 9.6% (n = 16) of patients. On EGD, most tumors were located in the antrum (n = 87; 51.8%) or gastric body (n = 41; 24.4%), while fewer were located in the fundus (n = 14; 8.3%) or at the gastroesophageal junction (Siewert III) (n = 16; 9.5%). A minority of patients received perioperative chemotherapy (n = 19; 11.0%), while 93 patients (55.4%) received adjuvant chemotherapy following resection.

Table 1 Patients’ demographic, clinical, and treatment characteristics

Distal gastrectomy was performed in 76 patients (45.2%), total gastrectomy in 66 (39.3%). The remaining patients underwent an extended total gastrectomy (n = 12; 7.1%), a proximal gastrectomy (n = 1; 0.6%) and a resection of the gastric remnant (n = 7; 4.2%), pancreaticoduodenectomy (n = 1; 0.6%), esophagogastric resection (n = 5; 3.0%). Multivisceral resections were performed in 17 (10.1%) cases. Splenectomy was the most common procedure, occurring in 12 (7.1%) patients, followed by distal pancreatectomy in 6 (3.6%) patients, and colon/bowel resection in 6 (3.6%) patients. D1 lymphadenectomy was performed in 44 (26.2%) patients, while D2 lymphadenectomy in 124 (73.8%) patients.

Among patients who underwent remnant gastrectomy, 5 patients had a previous surgery for benign disease (i.e. gastric and duodenal ulcers), while 2 patients had a history of tumor. Specifically, one patient underwent distal gastrectomy for GIST in 1999 with a negative follow-up until 2015, the other had a distal gastrectomy for a T2N2 GC followed by 6 cycles of adjuvant chemotherapy in 2009, with a negative follow-up until December 2018. Considering the long time between these two malignancies, the second neoplasm was considered a novel tumor instead of a local recurrence.

Median length of hospital stay was 11 days (IQR 10–13). Post-operative morbidity occurred in 41% of patients (n = 69). Pulmonary complications (n = 33; 19.6%) were the most frequent, followed by deep intraabdominal collections (n = 23; 13.7%) and post-operative bleeding requiring blood transfusions (n = 18; 10.7%). Anastomotic leakage occurred in 4.8% of patients (n = 8). Cardiovascular, thromboembolic complications, and duodenal stump leak and bowel perforation occurred in less than 5% of patients. A minority of patients (n = 12; 7.1%) needed a radiological procedure (i.e. Clavien–Dindo grade 3a) or re-intervention (i.e. Clavien–Dindo 3b) to treat post-operative complications. A total of 5 (3.0%) patients developed single or multi-organ dysfunction (i.e. Clavien–Dindo grade 4a and 4b). Post-operative mortality rate was 0.6% (n = 1).

Pathological analysis and long-term outcomes

On histopathological examination (Table 2), 72 (48.6%) patients had a diffuse type tumor, while the remaining were either intestinal (n = 67; 45.3%) or mixed (n = 9; 6.1%) type. Median tumor size was 40 mm (IQR 30–60). The majority of patients had poorly differentiated or undifferentiated tumors (n = 94; 65.3%), and most were classified as tubular (n = 68; 40.5%) or poorly cohesive (including signet ring cell, n = 56; 33.3%) subtypes. Pathological locally advanced stage was found in 49 (29.2%), 51 (30.4%), and 11 (6.5%) in pT3, pT4a and pT4b, respectively. Overall, most patients had stage III disease (n = 79; 47.0%); whereas 43 (25.6%) and 45 (26.8%) of patients had stage I and II disease, respectively.

Table 2 Histopathological characteristics

Microscopically, tumor infiltration (i.e. R1) was found in 10 (5.9%) patients, since they had a positive resection margins (gastric, esophageal or duodenal). Vascular and lymphatic invasion was reported in 41.6% (n = 67) and 69.4% (n = 109) patients, respectively. The median of the total number of nodes examined was 31.5 (IQR 21–43). Lymph node metastasis was found in 101 (60.1%) patients, resulting in an N1 stage in 28 (16.7%) patients, N2 stage in 19 (11.3%), N3a in 25 (14.9%), and N3b in 29 (17.3%). Median LNR was 0.06 (IQR 0–0.280).

The median follow-up for our cohort was 20.1 months (IQR 8.2–49.5). During follow-up, 60 (35.7%) patients experienced a recurrence, and 73 (43.5%) patients died. Overall, 3 (1.7%) patients had local recurrence, 10 (5.9%) patients had a nodal recurrence, and 47 (28.0%) had distant recurrence. The sites of distant recurrence were peritoneum in 30 patients, liver in 8, lung in 1, and multiple in 8.

The 1-, 3- and 5-year DFS was 75.3% (95% CI 68.6–82.7), 60.2% (95% CI 52.4–69.3) and 51.6% (95% CI 42.8–62.3), respectively. The 1-, 3- and 5-year OS was 85.7% (95% CI 80.4–91.5), 58.7% (95% CI 50.9–67.7) and 44.2% (95% CI 36.0–54.5), respectively (Fig. 1).

Fig. 1
figure 1

Kaplan–Meier curves demonstrating overall survival a and disease-free survival b for patients following resection for primary gastric cancer

Model specification and predictors of overall and disease-free survival

Cox regression analysis results are reported in Tables 3 and 4. At univariable analysis, pre-operative CEA, preoperative clinical locally advance disease, margin status (R1), number of total lymph nodes dissected, lymph nodes metastases, LNR, Lauren’s classification, vascular and lymphatic invasion, T and N stage, TNM stage, thromboembolic complications, and neoadjuvant and adjuvant chemotherapy were significantly associated with DFS. At multivariable analysis, Lauren’s mixed type compared with the diffuse type (HR 3.01, 95% CI 1.22–7.42, p = 0.017), and LNR (HR 2.13, 95% CI 1.63–2.79, p < 0.001) were independently predictor of DFS.

Table 3 Variables associated with Disease-Free Survival (DFS) according to the Cox proportional hazards regression model
Table 4 Variables associated with Overall Survival (OS) according to the Cox proportional hazards regression model

Age, tumor location, clinical locally advanced stage, T stage, N stage, lymphatic and vascular invasion, LNR, margin status (R1), length of stay, post-operative complications, post-operative bleeding, and anastomotic leakage were significantly associated with OS. At multivariable analysis, age (HR 1.63, 95% CI 1.02–2.62, p = 0.041), tumor size (HR 0.50, 95% CI 0.33–0.75, p < 0.001), T stage (T1 vs T4: HR 0.21, 95% CI 0.06–0.73, p = 0.014; T2 vs T4: HR 0.31, 95% CI 0.10–0.96, p = 0.042; T3 vs T4: HR 0.43, 95% CI 0.22–0.85, p = 0.016), N stage (N1 vs N0: HR 4.53, 95% CI 1.84–11.16, p = 0.001; N2 vs N0: HR 5.38, 95% CI 2.09–13.86, p < 0.001; N3a vs N0: HR 3.15, 95% CI 1.31–7.56, p = 0.010; N3b vs N0: HR 9.16, 95% CI 3.45–24.36, p < 0.001), tumor location (body vs antrum/pylorus: HR 2.37, 95% CI 1.14–4.93, p = 0.020; fundus/cardia vs antrum/pylorus: HR 3.95, 95% CI 1.97–7.93, p < 0.001), and post-operative complications (HR 3.09, 95% CI 1.69–5.64, p < 0.001) were independent predictors of OS.

Nomograms to predict DFS (Fig. 2) and OS (Fig. 3) were developed according to variables included in the multivariable Cox regression models. LNR, age and tumor size were used as continuous variables and possible non-linear effects on log HR were explored using restricted cubic splines with 3 knots (Fig. 4). A weighted score was given to each parameter composing the nomograms. The sum of scores was incorporated into an algorithm predicting an individualized OS and DFS. Survival plots for OS and DFS completed the models. For example, a 70-year-old patient with diffuse type GC and LNR = 0.14 would have an estimated DFS of 80% at 12 months of follow-up. This can be deducted by summing the points scored by the patient for every single item included in the nomograms in Fig. 2 and Fig. 3. On the other hand, a 70-year-old patient with a 35 mm tumor located in gastric antrum, T3N2M0, without post-operative morbidity would have a predicted 24 months OS of approximately 70%. On internal validation, both nomograms demonstrated a good discrimination with a Harell’s C-index of 0.77 for OS and 0.71 for DFS.

Fig. 2
figure 2

Nomogram predicting disease-free survival

Fig. 3
figure 3

Nomogram predicting overall survival

Fig. 4
figure 4

Transformation of continuous variables in univariable analysis using restricted cubic splines relating to age and tumor size

Discussion

In addition to the TNM staging system, other factors have been associated with survival after curative-intent surgery for GC. To this point, the TNM system is internationally adopted standard for cancer staging, and GC patients within the same pathological TNM stage often have different survival [7, 11]. In turn, a more accurate prediction of long-term oncological outcomes would be achieved by including other relevant variables, using prediction model such as nomograms. In contrast to TNM staging system, nomograms provide an individual estimation of survival, rather than a stratification in risk groups [14]. Individualized prediction of survival can be useful for several aspects of clinical practice, such as informing patients and families, and recommending a close follow-up or specific treatment in high-risk cases.

In the current study, we developed nomograms based on a single European institution experience of 10 years of treatment of GC. To date, ours is the first Italian nomogram predicting OS for GC in all pathological stages. A previous tool was developed to predict OS for advanced GC in the second line setting, after failure of first line chemotherapy [25]. To our knowledge, this is also the first nomogram that included the effect of post-operative complications on risk of long-term survival and recurrence. The impact of post-operative complications has been documented previously with an observed decreased 5-year OS among individual who experienced post-operative morbidity following curative-intent gastrectomy for GC [12, 26, 27]. Of note, morbidity following surgery for GC is fairly common as Powell et al. reported a post-operative morbidity of 34% within 30 days from surgery [27]. According to these results, in the current study, post-operative morbidity occurred in 41% of patients, and on multivariable analysis, post-operative morbidity was an independent predictor of OS (HR 3.09 1.69–5.64).

Kattan et al. from Memorial Sloan Kettering Cancer Center (MSKCC) first introduced a prognostic model alternative to AJCC/UICC system for GC. The MSKCC nomogram was built on a Western cohort of patients and considered 8 variables associated with disease specific survival (DSS) after curative-intent surgery for GC (age, gender, tumor site and size, Lauren’s classification, depth of tumor invasion, number of metastatic nodes and number of total nodes removed). Although just age, tumor site, depth of invasion and number of metastatic nodes were associated with outcomes, the predictive ability of the MSKCC nomogram was superior to that of the AJCC/UICC system with a C-index of 0.80 versus 0.77 for the TNM (p value < 0.001) [14]. However, in external validation in Eastern countries using a Korean cohort of patients, the MSKCC nomogram performed worse than the AJCC/UICC system [16]. To overcome these limitations, Song et al. developed a nomogram with similar variables based on Korean patients, which also demonstrated a better performance to predict OS compared with AJCC/UICC system [16]. To date, several other nomograms have been developed based on Korean, Chinese, Japanese and American datasets. These models share similar variables, with some differences regarding the inclusion of some variables, such as Lauren’s classification, tumor size, LNR and CA 19-9 [5, 11, 15, 17, 18].

Recent literature has repeatedly stressed the importance of proposing simplified prognostic models, which can be easily adapted to different facilities all over the world. Regarding GC, Zheng et al. published a simplified nomogram that was validated into three different patient cohorts (American, Italian and Chinese). This nomogram included common variables from previous nomograms that had been associated with OS (e.g. age, sex, depth of invasion and number of metastatic lymph nodes) [7, 16, 18]. Notably, the nomogram proposed in the current study incorporated almost all variables included in the simplified nomogram proposed by Zheng et al. On multivariable analysis, we noted that age, T stage, number of metastatic nodes as well as tumor size, tumor location and post-operative complications were independent predictors of OS. Unlike other neoplasms in which tumor size is part of AJCC/UICC staging system, GC tumor diameter was not included in TNM system, although several authors have reported its clinical relevance [28,29,30]. While the specific cut-off for tumor size has varied, several previous reports have confirmed the prognostic role of tumor diameter on overall survival [29,30,31].

The majority of nomograms have focused on the accurate prediction of OS or DSS. In contrast, only a few nomograms have been developed to predict recurrence risk after gastrectomy for GC. Our data suggest that LNR and Lauren’s classification are associated with recurrence risk after curative-intent surgery for GC. Lai et al. proposed a nomogram to predict DFS for early GC [32]. However, this nomogram had limited usefulness in Western countries where early GC is less common. In 2005, an Italian multicenter study proposed a scoring system incorporating age, N stage, depth of invasion, tumor location and extent of lymphadenectomy to predict recurrence [33]. Subsequently Kim et al. proposed a similar nomogram to predict DFS in which N stage and the extent of lymphadenectomy were replaced by LNR [17]. Recent studies found that LNR was a more accurate predictor of OS rather than N stage [34, 35]. Kim et al. reported that LNR could also be a useful tool to select which patients might benefit of adjuvant therapy after resection [36]. Furthermore, a recent nomogram developed by Ma et al. showed that LNR > 0.335 was associated with early tumor recurrence [37].

In the current study, mixed-type GC had a higher recurrence risk than diffuse al GC (HR 3.01, 95% CI 1.22–7.42). The prognostic role of Lauren’s classification has also been recently reported by Chen et al. who observed a better OS and a lower recurrence risk in patients with intestinal type GC than those with diffuse type GC [38]. Furthermore, Lee et al. reported different patterns of recurrence between intestinal and diffuse/mixed GC. Distant metastases were frequently observed in intestinal tumors, whereas peritoneal recurrence was more common in diffuse/mixed GC [39]. A few studies analysed the role of mixed-type GC, finding more aggressive behaviour and higher risk of nodal metastasis versus diffuse and intestinal type GC [40, 41].

The current study had several limitations. First, the study was based on a retrospective collected data using information from a single Italian institution experience. The cohort of patients was relatively small when compared with similar Asian or American studies, and it covered patients treated in 10 years. GC incidence is decreasing worldwide, especially due to Helicobacter Pylori eradication, and in Italy between 2008 and 2016 the incidence decreased up to 2% [2]. Second, all the cohort underwent a traditional open gastric resection and only a minority received perioperative treatment. When performing minimally invasive gastrectomy, our nomograms may be useful since no difference in terms of long-term oncological outcomes was reported when comparing open and a minimally invasive approach [42, 43]. Furthermore, considering the wide diffusion of the multimodal treatment, our study lacks including patients treated with pre- or perioperative treatment. In this setting, the proposed nomograms may be useful mostly in patients treated with upfront resection. Third, the nomograms proposed included mainly post-operative features, and it would not be applicable to the pre-operative setting. Last, the nomograms also require external validation, to evaluate the performance on different cohorts of patients.

Conclusion

In the current study, we proposed two different nomograms including clinically relevant variables associated with DFS and OS after curative-intent surgery for GC. On internal validation, both nomograms demonstrated a good discrimination. To our knowledge, these are the first nomograms predicting OS and DFS for GC in Italy for all pathological stages. The proposed nomograms may be useful to guide clinician decision-making, as well help identify patients with high recurrence risk or with a poor estimated survival.