Introduction

Most people with colorectal caner (CRC) die from the disease, and it is the second leading cause of death in the United States [1]. There are approximately 1.8 million new cases of CRC and 900,000 related deaths each year [2]. Metastasis is a critical element in cancer-related deaths [3, 4]. Nearly half of patients with CRC will develop metastasis [5]. Approximately 21% of all CRC patients are diagnosed with stage IV disease. The areas that are most prone to metastasis are the liver and the lung [6]. The prognosis of CRC is related to the American Joint Committee on Cancer/International Union Against Cancer (AJCC/UICC) tumour-node-metastasis (TNM) staging system [7]. However, some literature highlights that, due to patient heterogeneity, the understanding of the prognosis of patients after lung metastasis treatment is beyond the scope of AJCC staging [8]. We need a more accurate prognostic system.

The lung is the second most common site of CRC metastasis, after the liver, accounting for approximately 10–15% of metastasis [9]. The median survival time for CRC patients with lung metastasis is 17.7 months (range, 5.9–31.2) [10]. We are also interested in the prediction of prognosis for those who have already developed lung metastasis. Prognosis evaluation experiments show that the 5-year overall survival (OS) rate of patients with lung metastasis is much better than that of patients with liver metastasis and brain metastasis (16.70, 15.99 and 5.51%, respectively) [11].

Some articles suggest that the site of the primary tumour also affects prognosis. That is, because of a more aggressive phenotype, right colon cancer has a worse prognosis [12,13,14]. In addition, there are some clinical differences between patients with left colon tumours and rectal cancers [15]. An article pointed out that R0 resection, preoperative carcinoembryonic antigen level, lymph node involvement rate and number of lesions are important factors affecting the prognosis of CRC patients with lung metastasis [16]. The nomogram is an intuitive display of a statistical prediction model that produces numerical probabilities of clinical events [17]. Nomograms are widely used to predict the survival of tumour patients.

Due to the lack of large-scale retrospective studies describing the clinical characteristics of CRC patients with lung metastasis, no prediction of cancer-specific survival (CSS) has been reported, and limited information is available to analyse the prognosis of CRC patients with lung-only metastasis. Therefore, we used information from the Surveillance, Epidemiology, and End Results (SEER) database to analyse the incidence, risk factors, and prognostic factors of CRC lung metastasis. This analysis was performed separately for CRC patients who developed lung metastasis and those who had only lung metastasis. Additionally, we reviewed the data of 70 CRC patients who were diagnosed with lung metastasis and were admitted to our hospital and analysed their clinical characteristics, treatment methods and efficacy.

Materials and methods

Data extraction

The SEER database is a comprehensive cancer statistics database that contains 17 population-based cancer registries that account for 28% of the US population and record information on individuals with malignant tumours in the United States. We first submitted the data consent form to the SEER administration and then collected the relevant data using SEER*Stat version 8.3.5. The information presented in this study is based on the most recent follow-up (May 30, 2020) available in the SEER database. Informed consent was not necessary in our investigation since the data gathered from the SEER database were anonymized prior to release.

Data arrangement

New CRC cases were collected from the SEER database between January 2010 and December 2015. Based on these data, we chose patients using the procedure depicted in Fig. 1. The eligibility criteria were as follows: (1) no history of other tumours before CRC diagnosis; (2) a positive pathologic diagnosis; and (3) follow-up time of more than 1 year. The exclusion criteria were as follows: (1) unclear lung metastatic status; (2) previous tumour diagnosis; (3) no comprehensive tumour stage information; (4) unknown race; and (5) unknown survival time or survival time not coded. We gathered information from each patient record, including race, sex, age, original tumour site, degree of tumour differentiation, tumour size, AJCC stage, T stage, N stage, and the occurrence of distant metastasis affecting bone, brain, liver, or lung at the time of first CRC diagnosis. In addition, the patients’ survival statistics were obtained. All patients were classified into two groups: those with lung metastasis and those with lung-only metastasis but no metastasis in other organs. The two groups were then randomly divided into two cohorts (training cohorts and validation cohorts) and a 7:3 population was obtained. To further evaluate the performance of the predictive model, the CRC data from the SEER database for the period 2016–2019 was used as the external validation set (Supplemental Table 1).

Fig. 1
figure 1

The flow diagram of the selection process for the study

Patient data from our hospital

The study included 70 CRC patients with lung metastasis who underwent resection at Shanghai East Hospital between May 2016 and March 2019. The following criteria were used to determine inclusion: stage III-IV CRC; lung metastasis discovered after CRC resection in our institution; CT utilized to confirm lung metastasis; and more than 2 years of follow-up. Sex, age, site, T and N stage, survival months, organ metastasis, lymph node involvement, classification of lung metastasis, interval between operation and lung metastasis, treatment after lung metastasis, interval between lung metastasis and death or last follow-up, carcinoembryonic antigen (CEA) level (normal range, 0–5 ng/mL), and status were collected from each record. Due to the limited sample size, the cause of death for 70 patients at our hospital could not be determined. The 2016 ESMO handbook defines oligometastasis as “the presence of metastasis at up to two or three locations and five or more lesions.” This study followed the Declaration of Helsinki and the ethical principles of Shanghai East Hospital.

Statistical analysis

For the description of baseline attributes, descriptive statistics are utilized. The x2-test was performed to analyse lung metastasis incidence between subgroups for categorical variables to identify risk factors for CRC patients with lung metastasis, and variables with statistical significance were chosen for multiple logistic regression. The Kaplan–Meier technique and log-rank test were used in the training cohort to determine factors that impacted OS; competitive risk analysis was utilized to identify significant variables that influenced CSS; and a multivariate analysis was performed using a Cox regression model. The concordance index (C-index) was used to examine the internal validity of the nomogram, which was calculated by comparing the nomogram-predicted probability with the observed probability. The nomogram was then verified further by comparing the nomogram-predicted likelihood of patients in the validation cohort to their actual survival. SPSS 24.0 (IBM Corp, Armonk, NY, USA) and the ggplot and rms packages in R 3.4.3 (https://www.r-project.org/) were used for statistical analyses. All statistical tests were two-tailed, with p < 0.05 deemed statistically significant.

Results

Demographic and clinical characteristics

As shown in Table 1, 129,170 CRC patients diagnosed between 2010 and 2015 were included. Out of 6977 patients who developed lung metastasis, 1065 had lung-only metastasis. The median age of the 6977 patients was 65 years, with pathological grade II accounting for 65.5%, and the lesions were mostly located in the right-sided colon (40.9%). The median age of the 1065 patients with only lung metastasis was 66 years. The majority of the population (58.8%) had stage II CRC, and the lesions were mostly located in the rectum (40.5%).

Table 1 Clinical and demographic characteristics of the patient cohort

Risk of factors for developing LM among CRC patients

As illustrated in Supplemental Table 2, age (p < 0.001); race (p < 0.001); sex (p < 0.001); grade (p < 0.001); primary tumour site (p < 0.001); T stage (p < 0.001); N stage (p < 0.001); the presence of liver (p < 0.001), brain (p < 0.001), or bone (p < 0.001) metastasis at initial diagnosis; marital status (p = 0.004) and insurance status (p < 0.001) were independent prognostic factors that affected the risk of lung metastasis.

Survival analysis

Table 2 shows that 6278 of the 6977 patients were on the training cohort list. The median survival time in the training cohort was 12 months (range, 11.4–12.6 months), with a 1-year OS of 49.2% ± 0.6% and a 3-year OS of 15.9% ± 0.5%. Another 745 of the 1065 CRC patients with isolated lung metastasis were enrolled as the training set. The median survival time in the training set was 24 months (range, 11.8–26.1 months), with 1-year and 3-year OS rates of 68.5 ± 1.4 and 36.1% ± 1.6%, respectively. Kaplan–Meier survival analysis was applied to compare the difference in OS between the two groups of CRC patients with lung metastasis and lung-only metastasis (Fig. 2).

Table 2 1- and 3-year medium survival rates and univariate analysis of patients with lung metastasis in colorectal cancer
Fig. 2
figure 2

Overall Kaplan-Meier survival curves for CRC patients with lung metastasis and lung-only metastasis

In the colorectal cancer lung metastasis group, the Kaplan–Meier analysis results are delineated in Fig. 3, which demonstrates that age (p < 0.001); race (p = 0.027); tumour grade (p < 0.001); T stage (p < 0.001); N stage (p < 0.001); M stage (p < 0.001); bone metastasis (p < 0.001); brain metastasis (p < 0.001); liver metastasis (p < 0.001); marital status (p < 0.001); insurance status (p < 0.001); chemotherapy (p < 0.001); grade (p < 0.001); radiation therapy (p < 0.001); surgery (p < 0.001); and primary tumour site (p < 0.001) were significant prognostic factors. Moreover, competing risk models were constructed, and the CSS curves of the patients were parsed (Supplemental Fig. 1).

Fig. 3
figure 3

1 Overall Kaplan-Meier survival curves for CRC patients with lung metastasis in training cohort according to A Age, B Bone metastasis, C Liver metastasis, D Brain metastasis, E Marital status, F Chemotherapy, G Insurance status, H Grade. 2 Overall Kaplan-Meier survival curves for CRC patients with lung metastasis in training cohort according to I T stage, J N stage, K Race, L Radiotherapy, M Site, N Surgery

As shown in Table 3, several variables were independently associated with OS, as follows: age (p < 0.001); pathological grade (p < 0.001); T stage (p < 0.001); N stage (p < 0.001); bone metastasis (p < 0.001); brain metastasis (p < 0.001); liver metastasis (p < 0.001); primary tumour site (p < 0.001) and surgery (p < 0.001). Additionally, the survival outcomes among CRC patients with isolated lung metastasis were as follows. The results of the Kaplan–Meier analysis are presented in Fig. 4. In Table 4, we screened the following independent prognostic factors affecting the prognosis of CRC patients with only lung metastasis: age (p < 0.001); T stage (p < 0.001); marital status (p < 0.001); chemotherapy (p < 0.001) and surgery (p < 0.001).

Table 3 Multiple COX regression results of OS and Non-cancer-specific survival (NCCSS) in CRC patients with lung metastasis
Fig. 4
figure 4

1 Overall Kaplan-Meier survival curves for CRC patients with lung-only metastasis in training cohort according to A Age, B Gender, C Marital status, D Race, E Chemotherapy, F Radiotherapy, G Insurance status, H Surgery. 2 Overall Kaplan-Meier survival curves for CRC patients with lung-only metastasis in training cohort according to I Grade, J T stage, K Site, L N stage

Table 4 Univariate and multivariate Cox regression analysis results of OS in patients with lung-only metastasis

Construction and validation of the nomogram

As shown in Fig. 5, the final nomogram was developed for all colorectal cancer lung metastasis cases, depicting the 1- and 3-year OS by weighting the score of each variable. The accuracy of the nomogram was validated internally and externally using the identification and calibration method, and the calculated C-index was 0.67 for each. Further external validation was performed by applying data from SEER data for the years 2016–2019, involving a total of 2365 cases, and the C-index was 0.71(Supplemental Fig. 2). The nomogram was created incorporating six variables for CRC patients with lung metastasis only, as shown in Fig. 6. The nomogram was validated internally and externally, and the C-index of the resulting ROC curve was 0.81 for each. Further external validation was performed involving a total of 565 cases with the C-index of 0.76 for both, as shown in Supplemental Fig. 2.

Fig. 5
figure 5

A nomogram for prediction of 1- and 3-year OS rates of CRC patients with lung metastasis (A); Calibration curve of the nomogram predicting 1- and 3-year OS rates of CRC patients with lung metastasis in training cohort (B); Calibration curve of the nomogram predicting 1- and 3-year OS rates of CRC patients with lung metastasis in the validation cohort (C)

Fig. 6
figure 6

A nomogram for prediction of 1- and 3-year OS rates of CRC patients with lung-only metastasis (A); Calibration curve of the nomogram predicting 1- and 3-year OS rates of CRC patients with lung-only metastasis in training cohort (B); Calibration curve of the nomogram predicting 1- and 3-year OS rates of CRC patients with lung-only metastasis in the validation cohort (C)

Clinical characteristics of patients with lung metastasis in our centre

Supplemental Table 3 presents detailed clinical data from 70 CRC patients with lung metastasis who underwent radical surgery at our centre. The mean CEA concentration (ng/mL) was 34 (range: 1–220). The median time between the start of therapy and the onset of lung metastasis was 14 months (range 1 to 106 months). From diagnosis to lung metastasis, the median period was 17 months. Within 2 years of surgery, metastasis developed in 47 instances. Twenty-three of the 70 individuals had only one pulmonary metastasis, whereas the remainder had numerous pulmonary metastasis. Thirteen of the 70 patients died during the follow-up period. The median time between pulmonary metastasis and death or final follow-up was comparable in patients with metastasis to multiple locations and individuals with only pulmonary metastasis (28 months vs. 29 months). In terms of treatment regimens, surgical resection was chosen for 8 patients, radiotherapy was chosen for 47 patients, chemotherapy was chosen for 54 patients. As illustrated in Fig. 7, data from this group of patients were entered into the existing nomogram that predicts OS as an external verification set to validate the 3-year OS rate and the C-index was 0.69.

Fig. 7
figure 7

Calibration curve of the nomogram predicting 3-year OS rates of patients with lung metastasis in colorectal cancer in our hospital

Discussion

This study assessed the independent prognostic factors affecting survival in 6278 CRC patients with lung metastasis and 1065 patients with only lung metastasis from the SEER database records of CRC diagnosed from 2010 to 2015. Two nomograms were separately established, which performed well in predicting survival. The data of 70 hospitalized patients diagnosed between May 2016 and March 2019 were collected and analysed using the nomogram. Data from patients with colorectal cancer lung metastasis and lung-only metastasis diagnosed between 2016 and 2019 selected from the SEER database were used as external validation cohorts.

Previous studies have predicted the 1- and 3-year total survival in patients with lung metastasis from CRC, but the variables included in these equations are different due to a lack of univariate analysis [18]. Furthermore, there is an important problem that has not received attention, which is that patients with lung metastasis from colorectal cancer are prone to complicated metastasis from other sites, such as the liver, brain, and bone. If only colorectal cancer pulmonary metastasis is analysed without excluding the combined metastasis of other organs, it will affect the judgement of survival of people with colorectal cancer pulmonary metastasis. In our study, of the 6278 patients with lung metastasis from CRC, liver metastasis accounted for 71.9%, bone metastasis accounted for 9.7% and brain metastasis accounted for 2.7%. To further explore the factors independently affecting the prognosis of patients with pulmonary metastasis, we examined 1065 patients with only pulmonary metastasis to fill this gap. In addition, most articles have focused on analysing OS and CSS, but 6.2% of patients who died did not die of cancer. We focused on this population and performed survival analyses.

In clinical practice, any organ metastasis (such as brain or bone metastasis) corresponds with a relatively poor prognosis [19]. According to previous reports, liver metastasis is a useful independent parameter for predicting the survival of patients with stage IV CRC [20]. According to the results of our study, it was clearly observed that the occurrence of liver, brain, and bone metastasis were all independent prognostic factors. In the nomogram, any organ metastasis led to a higher total score and lower long-term OS.

Different primary sites of tumours determine the different biological behaviours of tumours. In our study, the right colon was the most common site of colon cancer (40.9%), while rectal cancer accounted for 22.6% of cases. In terms of lung metastasis, the incidence of lung metastasis and the incidence of lung-only metastasis in the right colon compared to rectal cancer was 31.1% vs. 31 and 28.5% vs. 40.5%, respectively, which indicates a higher risk of lung metastasis in rectal cancer. This result is similar to findings from other studies [21]. Previous studies have shown shorter disease-free survival in patients with rectal cancer due to direct spread to the systemic circulation through the haemorrhoidal vein [22, 23]. Regarding the prognosis of patients, the results showed that the median survival times for patients with lung metastasis and lung-only metastasis in the rectal, left-sided colon, and right-sided colon were 15 months vs. 13 months vs. 8.0 months and 31 months vs. 27 months vs. 19 months, respectively. Patients with metastatic rectal cancer showed better survival outcomes than patients with metastatic colon cancer, and the conclusion are also obtained in a previous study [24]. Primary CRC tumours located in the rectum often lead to lung metastasis, but their OS is better than that of CRC tumours located in other parts of the colon, and in turn, the prognosis is better in metastatic left-sided colon than in right-sided colon. There is increasing evidence that right colon cancer has a more aggressive phenotype and leads to a worse prognosis than other cancers [12, 13]. The clinical difference between patients with right- and left-sided colon cancers has been observed, and the patients with right-sided colon cancer were significantly older, mostly females, and had a higher incidence of complications [14]. This may explain the poor survival of patients with right-sided colon cancer.

In terms of treatment, surgery is also a key factor affecting the prognosis of cancer. The choice of primary tumour surgery remains controversial. In fact, more than two-thirds of elderly patients with stage IV CRC are known to have undergone primary tumour surgery [25]. The reason for this is that primary tumours may promote the development of metastasis and have serious complications that can significantly reduce the survival time of patients [26,27,28]. Moreover, resection of the primary tumour may contribute to the recovery of autoimmunity [27]. Some articles have confirmed that primary tumour surgery is associated with an increase in OS [29]. However, considering that complications from surgery delay the overall treatment plan [30] and that only a small proportion of people develop severe primary tumour-related complications [31, 32], it has been suggested that palliative resection does not prolong survival [33]. In our study, surgical resection was proved beneficial for survival. For CRC patients with lung metastasis, the 1- and 3-year OS rates were 61.7% vs. 41.7 and 25.7% vs. 9.7%, respectively. For those with and without primary tumour resection, the rates were 79.4% vs. 55.3 and 50.1% vs. 17.9%, respectively, among patients with lung metastasis only. In addition, evidence from population-based cancer survival analyses and clinical trial reviews suggests that chemotherapy improves survival in patients with CRC, particularly in stage IV patients [34]; similarly, in the current study, chemotherapy was shown to be beneficial to OS in both patient groups (either patients with lung metastasis from CRC or patients with lung metastasis only). The results also revealed that chemotherapy reduced the risk of cancer-specific death in patients with lung metastasis from colorectal cancer (Supplemental Fig. 1–1), and chemotherapy was also effective in reducing the risk of noncancer-specific death in this population (Table 3).

The data for 70 patients with colorectal cancer lung metastasis in our hospital were evaluated. The incidence of lung metastasis 1 year after primary resection was 40.7%, and the probability of lung metastasis after 2 years was 20.0%, which suggests that we should identify lung metastasis as soon as possible, especially for patients with high-risk factors, and regular CT reviews should be conducted. In comparison with a previous study, Facciorusso A. et al. [35]. defined the features of advanced colorectal adenomas associated with recurrence and classified the population according to their clinical characteristics in terms of the risk of recurrence to assist doctors in generating more individualized surveillance recommendations. In our study, we identified that age, race, grade, T stage, N stage, metastasis to other organs (including the liver, brain and/or bone), insurance status, and site of tumour growth were associated with the risk of lung metastasis from CRC. However, there are no risk classes to guide the clinical identification of specific high-risk patients among CRC patients, so further research is warranted in the future.

Our study has clear limitations. First, the manuscript lacks of granularity in the SEER database as it relates to the therapy sequencing and timing, patient comorbidities, and timing of clinical progression. Numerous data were missing in the SEER registry. Second, the differences in race, treatment methods and some baseline data could lead to bias when using data from 70 patients in China to verify the nomogram established from the SEER database in the US. Third, in addition to clinical baseline data, many biomarkers are associated with prognosis in CRC patients, such as RAS, BRAF, and MMR/MSI [36,37,38]. Several inflammatory biomarkers have been evaluated in patients with metastatic colorectal cancer. There is an article suggesting that the lymphocyte to monocyte ratio can be used to predict survival of colorectal liver metastasis [39]. In future research, more emphasis will be placed on conducting a prospective multicentre research project for enrolling large sample cases and complementing data related to molecular markers and inflammatory biomarkers to explore independent prognostic factors affecting Chinese patients with colorectal cancer lung metastasis.

Conclusion

In summary, we developed two nomograms to predict OS in CRC patients with lung metastasis and lung metastasis only. It is recommended to use nomograms as a useful tool to predict prognosis at different time points in CRC patients with lung metastasis and to help clinicians choose appropriate treatment options for different populations. In addition, advanced age, nonwhite race, female sex, poor differentiation, T4 stage, lymph node metastasis, liver metastasis, brain metastasis, bone metastasis, being unmarried, and rectal cancer are associated with an increased risk of lung metastasis.