Introduction

In recent decades, there has been brilliant development in gastric cancer diagnosis and treatment, as evidenced by the growing proportion of early-stage gastric cancer and improved survival rates1. Randomized controlled trials and observational research have proven that endoscopic screening enables the detection of more early-stage patients, offering a substantial potential for reducing the burden of gastric cancer2,3. The percentage of T1 cancers has constantly expanded from 30.4% in 1995 to 61% in 2014, according to a nationwide survey in South Korea4. Current guidelines for gastric cancer therapy have shown that combined modality therapy (CMT), such as laparoscopic gastrectomy, can drastically amplify survival in gastric cancer patients, and postoperative chemoradiotherapy (CRT) or perioperative chemotherapy (CT) are the desired procedures for treating localized gastric cancer5,6. Nevertheless, since most gastric adenocarcinoma has the characteristics of chromosomal instability and high level of copy number changes, which makes it immune exclusion, the treatments for these immunologically cold gastric cancers include ICI-chemotherapy, such as nivolumab and oxaliplatin-based chemotherapy7, and ICI-targeted therapy, such as ICIs combined HER2-targed treatments, and even ICI-targeted-chemotherapy combinations8. This advancement is anticipated to further enhance the survival rate of patients. However, there may be a stronger focus on exploring the therapeutic benefits of agents for survival rather than prioritizing the research on the new and inevitable adverse events10, as well as the currently unknown long-term effects. Patients with prolonged survival times and a higher proportion of survivors face an increased risk of developing secondary malignancies (SPM)11. The presence of SPM poses a challenge as patients with SPMs generally present lower survival rates than those without it12. To date, there is limited information available regarding the role of pathology and treatment options in SPM models, and additional efforts are required to acquire a more comprehensive understanding of the promising yet under-researched subject of GC management.

SPM are late complications arising after exposure to genotoxins13. It is no longer feasible to expect which sufferers will develop SPM, nor are there recognized mechanisms explaining the pathogenesis of different SPM histologies. The incidence of SPM is increasing, mostly since the growing population of younger cancer survivors in recent decades. For SPM in gastric carcinoma, 235 secondary malignancies (2.32%) were reported amongst 10,138 patients who underwent gastrectomy14. The ratio is elevated to that of 2018 SPMs among 33,705 (5.99%) patients in the study by Li et al.15. The chance of SPM is influenced by factors such as the patient’s age, cancer stage at diagnosis, and cure modality16. However, for a significant number of patients with an early remedy of stomach cancer, the occurrence of secondary malignant tumors has not been well-documented. Knowledge of which elements are the hazard of growing therapy-induced secondary malignancies could guide noninvasive monitoring for secondary malignancy after early-stage GC therapy.

In this study, we retrospectively analyzed and sorted out the clinicopathological points and unbiased prognostic factors of patients with SPMs. Based on the recognized prognostic elements by Cox proportional risk model and competitive-risk model, the nomogram had been built to predict the incidence of SPM. Overall, our findings may single out the comprehensive scientific elements of patients with SPM and offer more detailed insight into the risk factors for SPM.

Methods

Data sources

Data analyzed in this study have been received from the Surveillance, Epidemiology, and End Results (SEER) application of the United States National Cancer Institute (https://seer.cancer.gov/), which provides information on cancer incidence, prevalence, mortality, and evidence-based scientific findings across various US states covering a period of decades17. Patient demographic and clinical data released in 2010–2015 were downloaded using the SEER*Stat software program (version 8.3.9; National Cancer Institute, Bethesda, MD, USA). A total of 159,897 patients with a pathologically confirmed diagnosis of gastric cancer were initially included for analysis. Primary gastric cancer was defined according to the criteria established by the International Agency for Research on Cancer18. According to the SEER database coding manual, all patients with Histologic Type, coded by the International Classification of Diseases for Oncology (ICD-O-3) of 8140, 8211, 8260, 8480 and 8490 gastric adenocarcinoma, and TNM classification confined to T1N0M0, were included. Patients with unknown information on survival time, surgery, radiation, and region nodes were excluded. In addition, patients whose secondary tumor interval equal to zero were also excluded. Finally, 3289 patients with secondary malignant tumors were enrolled in the primary cohort (Supplementary Fig. S1).

Variable declaration

This study compared different clinicopathologic factors between the second malignancies (SPM) and non-SPM groups using Pearson’s chi-squared test for categorical variables. Secondary malignancy was defined as the subsequent first primary cancer, occurring at least two months after the first cancer diagnosis19. The SEER database manual provides detailed definitions of all variables used in this study. Continuous variables such as age, period of case inclusion, and involvement of regional lymph nodes were converted into categorical variables. The main endpoint for evaluation was the occurrence of SPM. The Kaplan–Meier curve was used to show differences in the incidence of secondary malignancy between different groups. Hazard ratios have been estimated by fitting a Cox proportional hazards model (proportional hazards have been examined and showed no significant, see the Supplementary Table S1). Univariate and multivariate Cox regression analyses were used to discover the prognostic value of these factors.

The fine-gray proportional hazards modeling

We used a competing-risk model to extend the proportional hazards model and estimate the risk of SPM according to the Fine-Gray model. Briefly, we regarded SPM as the result of interest, whereas death events were considered competing for risk events, and an affected patient alive was considered a censored event. The cumulative hazard of the outcome variables was calculated using the cumulative hazard function, and the cumulative incidence function (CIF) for the covariates was plotted20. Exploratory one-way subgroup analysis was performed using the Gray model to analyze the effect of chemotherapy on patient prognosis in different subgroups of each covariate, and the likelihood ratio test (LRT) was used to analyze the interaction between chemotherapy and other covariates.

Competing-risk nomogram construction and evaluation

Eligible patients were randomized into training and testing sets at a ratio of 1:1. A clinical predictive model based on a COX regression model was constructed in the training set. The Stepwise method was applied to filter variables by “regplot” R package. The independent predictive factors have been used to construct the nomogram for predicting SPM by using the “rms” package in R.

The nomogram was evaluated in the training set. Model performance for predicting the SPM consequences was evaluated by calculating the Harrell concordance index (C-index) and the area under the curve (AUC) of the receiver operating characteristic curve (ROC)21. Calibration analysis was used to visualize the accurate evaluation of the model22, while the brier score was used to evaluate the discrimination and calibration degree. Additionally, the decision curve analysis (DCA) was used to consider the clinical benefit23.

Statistical analysis

The data were analyzed and visualized using R software version 4.1.2. The following R packages were also utilized: survminer, mstate, riskRegression, rms, pec, cmprsk, and foreign. All tests were 2-sided with a P value less than 0.05 considered statistically significant.

Ethics approval

All authors have utilized and bought permission to use the database from SEER’s authentic website. The ethics committee requires no assessment and content material to use the SEER database.

Results

Demographic and clinical characteristics of patients

Clinical and demographic details of the patients are summarized in Table 1. In the GC cohort, a total of 3134 patients were stratified into the NSPM group, and 155 into the SPM group. Among the variables, we found that diagnosis during different periods (P = 0.005), different AJCC T stage (P = 0.012), histologic type (P = 0.037) of tumor and surgery methods (P = 0.003) were significantly different between the two groups based on the Fisher exact test (P < 0.05). The general variables of the patients in the NSPM and SPM groups were further analyzed by competitive risk model (Table 2), revealing significant differences in almost every variable (P < 0.01) except for gender (P = 0.345). Among the cases of SPM, stomach cancer (15.48%), lung and bronchus cancer (10.97%), and prostate cancer (9.68%) exhibited the highest incidence rates (Fig. 1A). The frequency of secondary tumors in each cancer species is shown in Fig. 1B, with stomach cancer (24 cases), lung and bronchus cancer, and prostate cancer being the three cancers with the highest frequency. The Kaplan–Meier curve for the SPM was significantly different in histologic types (P < 0.05), regional nodes (P = 0.013), marital status (P = 0.034), and tumor primary site (P = 0.013) (Supplementary Fig. S2).

Table 1 Demographics and clinicopathologic characteristics of non-secondary primary malignancies and the secondary primary malignancies group in the GC patients.
Table 2 Analysis of prognostic factors for occurrence of secondary malignant tumour in GC patients by competitive hazard model.
Figure 1
figure 1

Frequency (percentage) of secondary malignancies by site of involvement. (A) Incidence of secondary malignancies divided by diseases location. (B) Frequency of secondary malignancies in various organs.

Univariable and multivariable cox regression analyses in patients with SPM

SPM were analyzed by the univariate and multivariate Cox proportional hazards models (Fig. 2). In the Univariate Cox regression analysis, two general variables, namely fewer regional nodes and histologic type other than SRCC, were found to be individually associated with improved SPM. Subsequently, the regional nodes and histologic type of SPM were further examined in the multivariate Cox regression analysis. Patients with a histologic type other than SRCC of gastric cancer were found to have an increased risk of SPM compared to those with SRCC (HR 2.262, 95% CI 1.146–4.465, P = 0.019), after considering regional nodes involvement.

Figure 2
figure 2

The forest plot of univariate and multivariate Cox proportional hazards regression analyses for SPM in the gastric carcinoma.

Sensitivity analyses and cumulative incidence of a SPM

The Forest plot (Fig. 3A) confirmed the subdistribution hazard ratios estimated from the Fine and Gray competing risk regression model. In the univariate analysis of SPM, T1NOS, fewer regional nodes, surgical treatment, and histologic type other than AC and SRCC were individually associated with an increased risk of SPM. In the multivariate analysis, after multivariate adjustment, both other histologic type (HR 2.318, 95% CI 1.193–4.504, P = 0.013) and surgery methods such as biopsy/local excision (HR 2.3, 95% CI 1.291–4.095, P = 0.005) and subtotal and total resection (HR 1.947, 95% CI 1.028–3.687, P = 0.041) were identified as independent risk factors for SPM. It is noteworthy that patients who received chemotherapy had an estimated 50% amplify risk of SPM (HR 1.527; 95% CI 1.006–2.316; P = 0.047) (Fig. 3B). CIF curves for all variables are shown in Supplementary Fig. S3. The cumulative incidences of SPM and death grouped by different variables are presented (Fig. 4). The results showed that the CIF curve of the covariates, and by the univariate Gray’s test, cumulative incidences of SPM were significantly different in patients with different age, race, marital status, grade, regional node, AJCC stage, chemotherapy, radiotherapy and primary site significantly correlated with the cumulative risk of secondary malignancy (P < 0.001), while surgery method, AJCC T1 substage and histologic type were associated with both the cumulative risk of SPM and death ascompetitive events(P < 0.05). In COX regression analysis, the AUC values suggested that the diagnosis period (0.73), regional nodes (0.63) and surgical methods (0.6) could predict SPM well at 72 months (Supplementary Table S2). However, the AUC values for the diagnosis period (0.26) and regional nodes (0.37) predict SPM poorly in the competitive risk model (Supplementary Table S3).

Figure 3
figure 3

The forest plot of competing risk univariate and multivariate analyses for SPM in the gastric carcinoma.

Figure 4
figure 4

Univariate subgroup analysis of impact of chemotherapy on SPM in GC.

Univariate subgroup analysis

In the univariate subgroup analysis (Fig. 4), chemotherapy status showed a significant association with SPM in the age 18–65 years subgroup, with a sHR (95% CI) of4.21 (3.039, 5.833), P < 0.001. In the age 66–75 years subgroup, chemotherapy was also significantly associated with secondary malignancy, with a sHR (95% CI) of 2.508 (1.848, 3.405), P < 0.001; the P value of the interaction test was 0.018. In the 76–100 year subgroup, the sHR (95% CI) for chemotherapy was1.538 (1.24, 1.909), with a P < 0.001; the P value of the interaction test was less than 0.05. This suggests that there is a significant interaction between chemotherapy and age. In addition, being widowed (sHR 1.401, 95% CI 1.047–1.875, P = 0.003), having an AJCC IB stage (sHR 1.312, 95% CI 0.94–1.832, P = 0.01), and undergoing surgical biopsy/local excision treatment (sHR 6.222, 95% CI 2.455–15.766, P = 0.001) were positively correlated with SPMs. On the other hand, radiotherapy (sHR 0.614, 95% CI 0.453–0.834, P < 0.001) was negatively correlated with SPMs.

Competing-risk nomogram construction and validation

A predictive nomogram based on the Cox regression model was set up to predict the probabilities of SPM at 24, 36, and 72-month,incorporatinghistology type, chemotherapy, and regional nodes (Fig. 5A). A high score on the nomogram indicated a higher risk of SPM and a lower risk of death. The performance of the nomogram was evaluated using ROC curves in the training and testing sets (Fig. 5B and C). The AUC (95% CI) for the 24-, 36-, and 72-month survival rates were0.633 (0.575–0.691), 0.633 (0.577–0.686), and 0.701 (0.639–0.764) in the training set, and 0.632 (0.534–0.729), 0.658 (0.581–0.735), and 0.734 (0.646–0.823) in the testing set. Calibration analysis (Fig. 5D and E) was performed to compare the predicted SPM probabilities from the nomogram with the observed probabilities at 24, 36, and 72-months in both the training and testing set. The nomogram calibration plot indicates that the nomogram was well calibrated, with the similar predicted probabilities in the training and testing sets compared to an ideal model. Using a decision curve analysis (DCA), the nomogram for 24-, 48- 72-month SPM have exhibited a wide range of thresholds probabilities for net clinical benefit (Fig. 5F–K). The C-index and Brier score for the 72 months cutoff were 0.685 and 0.09 in the training set, and were confirmed to be 0.734 and 0.08 in the testing set, respectively (Supplementary Table S4). These results demonstrate that the nomogram exhibits good discriminative ability and accurate predictive performance.

Figure 5
figure 5

Competing-risk nomogram predicting 24-, 48- and 72-month cumulative incidence probabilities of SPMs for GC patients based on training cohort. The nomogram is used by adding up the points identified on the points scale for each variable. The total points projected on the bottom scales indicate the probabilities of 24-, 48- and 72-month SPM (A). ROC calibration plots (B,C) and calibration curves (D,E) of SPM at 24-, 48- and 72-months in the training set and validation set. Decision curve analysis of the model for predicting 24-, 48- and 72-month SPM in the training set (FH) and test set (IK). ACs, adenocarcinomas; SRCC, signet ring cell carcinoma.

Discussion

No relevant previous studies have focused on the risk factors of malignant tumours secondary to specifically early gastric cancer24,25, our analysis includes many characteristics, such as demographic characteristics and AJCC stages. This research has potential applications and research prospects in helping clinical doctors and researchers to better understand the mechanisms and risk factors related to SPM, thus guiding early diagnosis and treatment. We conducted a COX analysis of related factors and obtained some new findings that did not cover in previous studies, such as the discovery of marital status influencing gastric cancer SPM over time, the increased incidence of SPM due to divorce or separation, and the discovery that surgery may increase the risk of SPM, indicating the relative risk of specific surgical methods (subtotal/total resection and biopsy/local excision relative to the risk of no surgery and other surgeries). In addition, based on the confirmation that chemotherapy is an independent risk factor for SPM, we identify that increased age, widowed, AJCC IB stage, and surgical biopsy/local excision treatment positively correlated to SPMs, whereas radiotherapy was negatively correlated to SPMs. Through the above analysis, we can assist in identifying high-risk patients who are susceptible to SPM and provide them with personalized treatment plans to improve their prognosis and survival.

The incidence of SPM in gastric cancer patients has been observed increase in several pieces of research. In a large hospital-based study of patients who underwent abdominopelvic CT scans and were thus at high risk of intra-abdominal SPM in patients with early gastric cancer26. The incidence of extra-gastric recurrence was ranged from 1.4 to 2.2% herein slightly higher among patients after surgical resection of EGC, than that after endoscope resection27. In our study, 155 of 3289 (4.7%) GC patients developed SPM, with an incidence comparable to that of other similar studies. The improvement of therapeutic level has extended the life of GC patients. This special patient group has a range of potential health issues; many cancer survivors are at risk of creating a second primary malignancy. SPM is portends a poor prognosis with the median survival time is usually around one year28. Therefore, it is particularly important to rationally standardize the future treatment regimens, clear the excessive risk elements for the occurrence of SPMs, and boost centralized intervention and clinical interventions.

Studies that have found risk factors for SPM of GC include male, old age (over 70 years), with another basic diseases including COPD, diabetes, cirrhosis. In addition, chemotherapy and radiotherapy are independent risk factors, but surgery is a protective factor29. Some recent studies on the effects of patient characteristics have yielded conflicting results. For example, young adults have a higher risk of SPM than older survivors and being male is not a risk factors30. According to KM analysis in our study, AC and SRCC produce more SPM than other types of gastric cancer. The main anatomical parts of GC, such as curvature, produce lower SPM than other sites, and with the increase in lymph nodes region and divorce/separation also increases SPM. Unlike previous findings, in our Multivariate cox regression analysis, SPM was affected only by histological types of GC. If considering death as a competitive factor, the histological types and therapy methods, including chemotherapy and surgical, are independent risk factors of SPM. Biopsy/local excision is associated with a low risk of SPM, whereas subtotal and total resection with high risk. Therefore, It is important to closely monitor GC patients and establish follow-up programs to detect SPMs29. In the present study, we report that patients in the early AJCC stage, T1 stage and no lymph node infiltration, without irradiation and chemotherapy, had an increased risk of developing SPMs than in those who in the distant stage and received radio- and chemotherapy. One possible explanation is that patients in the early stage generally have a better prognosis and longer follow-up time, increasing the likelihood of developing SPMs. In contrast, patients in the advanced stage may have limited follow-up time and could be lost to follow-up, resulting in a lower chance of developing SPMs31. Further investigations through randomized controlled trials are required to clarify the impact of radio- and chemotherapy.

Some studies have shown that use of chemotherapy not associated with incidence of SPM32, which is inconsistent with our study. The cause may be linked to the dose of the chemotherapy drug used33 or whether the statistical results came from a randomized controlled trial34. Clinicians may be aware of tips from our results to enable them give patient optimal adjuvant therapy but not overzealous chemotherapy. While the SEER database lacks detailed data, it is worth noting that chemotherapy may still carry a higher risk of certain SPMs compared to radiotherapy, albeit to a lesser extent35. However, it is important to highlight that limited data exist thus far regarding the specific risk of chemotherapy-associated SPMs36. Furthermore, it is expected that the use of ICIs and immune-based combinations is destined to grow in gastric cancer patients in the future. While ICIs monotherapy and immune-based combinations are generally regarded as having an acceptable safety profile, it is important to note that ICIs can cause a distinct set of treatment-related adverse events. These adverse events, commonly referred to as immune-related adverse events (irAEs)9, there is a pressing need for additional studies to comprehensively understand the implications of these events. Unlike previous studies that examined the general probability of developing SPMs among cancer survivors, we have established a competing-risk nomogram37 to identify GC survivors who are at a high risk of developing SPMs. This nomogram allows us to estimate the likelihood of SPMs based onpatients’ characteristics. To assess the accuracy and reliability of our model predictions, we utillized measures such as AUC values, C indexes, calibration curves, and DCA test line plots. The results indicate that our nomogram performed well and can be generalized without limitations based on race, gender, or marital status.

Currently, there is a knowledge gap in this research field, primarily in the following areas. First, there is a lack of research on the mechanisms of SPM, which requires further exploration in basic research. Second, the identification and validation of high-risk factors for SPM require larger-scale clinical studies and validation38. To address these knowledge gaps, researchers should actively conduct molecular biology and clinical oncology, explore the mechanisms of SPM through interdisciplinary collaborations, and conduct large-scale clinical cohort studies. Meanwhile, we should still acknowledge some limitations in this study. First, the SEER dataset lacks data on doses of radiotherapy, specifics on chemotherapy regimens and genetic mutations, and we have no access to patient-specific lifestyle data, including smoking and alcohol intake, as this information was not included in the database. Second, the low rate of SPM limited statistical power for the analysis, specifically for the stratified analysis. Third, we could not validate the constructed model using other cohorts due to the disease’s rarity. Finally, as this study utilized a retrospective methodology, it is necessary to conduct additional prospective studies to determine if the results can be generalized for predicting SPMs before clinical application. Furthermore, since the number of participants in this study was not adequate to enable multivariate analyses to produce more accurate and general conclusions, more careful evaluation in a large sample size will be required in the future.

This study presented a comprehensive analysis of patient characteristics and predictors of subsequent SPMs based on a large population. These findings suggested that in specific histologic types of GC, the risk of SPM increases when areas of lymph node invasion are missed after local surgical resection and concurrent chemotherapy. Over the next five years, we believe that the development of this field will primarily focus on the following aspects. Firstly, in order to enhance early screening and prevention, further research should be conducted on the mechanism and prediction model of SPM. The assessment of SPM risk in patients with GC should also take into consideration crucial factors like inflammation, inflammation-driven gastric cancer and tumour microenvironment (TME)39, and the nutritional status of patients. The collection of peripheral blood-based biomarkers over an extended period can serve as an indirect indicator of patient immune status and effective measure to evaluate SPM in the future. Examples of these biomarkers include the neutrophil–lymphocyte ratio (NLR), the platelet-lymphocyte ratio (PLR), the pan-immune-inflammation value, the albumin levels and the Royal Marsden Hospital (RMH) score. Recent studies have demonstrated that these biomarkers can serve as valuable prognostic tools, not only for patient selection in clinical trials but also for patients in real-world clinical settings40,41. Secondly, exploring treatment strategies for SPM, including surgical method selection, radiotherapy and chemotherapy regimens, and the use of targeted drugs and ICIs as adjuvant therapy, to improve treatment outcomes and survival rates for patients. Lastly, integrating new technologies such as artificial intelligence and big data, research on personalized treatment plans based on clinical characteristics and genetic information. In conclusion, future research endeavors will continue to enhance the prediction and intervention strategies of SPM, ensuring its optimum application in clinical practice and decision-making for the treatment of GC.