Introduction

Lung cancer is currently the most common and deadliest malignancy in the world, and the number of new diagnoses and deaths of lung cancer patients in the United States is expected to be 235,760 and 131,880, respectively, in 2021, according to cancer data published in the United States1. Lung cancer can be divided into two categories: non-small cell lung cancer (NSCLC) and small cell lung cancer, and NSCLC account for approximately 85% of lung cancer cases, due to clinical manifestations and signs are not obvious, and about 40%-55% of these patients are advanced at the time of diagnosis2. The common sites of metastasis in advanced NSCLC are bone, brain, adrenal gland, and liver3. For advanced patients presenting with metastases, their median survival is 8–12 months, while in patients with metastases at common sites, the median Overall survival(OS) is only 3 months once liver metastases are diagnosed4,5. It has also been demonstrated that liver metastasis is one of the independent poor prognostic factors for patients with NSCLC, and therefore patients with liver metastases from NSCLC have the worst prognosis6. TNM staging is now widely accepted as a tool to predict patient prognosis7. However, the patient's age, gender, marital status, pathological features, distant metastatic sites and numbers, and treatment methods all have an impact on their prognosis8.Therefore, it is still difficult to accurately predict the prognosis of metastatic lung cancer using TNM staging.

Compared with brain metastases and bone metastases, the prognosis of patients with liver metastases from non-small cell lung cancer has been less studied, and there is no effective model to predict the prognosis of NSCLC patients with liver metastases. The nomogram is a simple statistical tool that can meet our needs for a comprehensive clinical model. Nomograms enable rapid calculations as well as higher accuracy and easier to understand prognosis compared to traditional TNM staging to assist physicians in clinical decision making9. Therefore, through this study, we aimed to integrate the clinical characteristics of patients in the Surveillance, Epidemiology, and End Results database (SEER) database between 2010 and 2015 and quantify the impact of each risk to create a nomogram, which can make the results of the prediction model more readable through visual graphs and can more accurately determine the OS of NSCLC patients with liver metastases, helping clinicians to be able to better develop treatment strategies.

Methods

Patient selection

We screened patients diagnosed with NSCLC combined with liver metastases between 2010 and 2015 in the SEER database. Because patient information in the SEER database is publicly available and free of charge, institutional review board approval was not required for this study. Inclusion criteria were: (1) patients aged ≥ 18 years, (2) patients whose only primary site tumor was pathologically diagnosed as NSCLC, (3) those with complete information, including race, primary site, grade, marital status, surgery, chemotherapy, radiation therapy, and insurance status. We finally screened 1963 NSCLC patients with liver metastases for inclusion in this study.

Variable definitions

We extracted factors from the patient data that might be associated with prognosis, including age, gender, race, laterality, primary site, histological type, grade, T stage, N stage, surgery, radiotherapy, chemotherapy, bone metastases, brain metastases, lung metastases, marital status, and insurance status. Age was changed from a continuous variable to a categorical variable by X-tile software and divided into < 55, 55–66, and > 66. Similarly, the tumor size is divided into < 42, 42–71 and > 71. The tumor site was divided into the main bronchus, upper lobe, middle lobe, lower lobe, and overlapping sites based on anatomy. According to the 8th edition of the American Joint Committee on cancer guidelines, T was divided into T1, T2, T3, and T4, and similarly, N was divided into N1, N2, and N3. The primary endpoint of this study was overall survival (OS), defined as the time interval from the date of diagnosis to the date of patient death.

Statistical analysis

We randomly divided patients into a training group (n = 1375) and a validation group (n = 588) in a ratio of 7:3. The variables associated with prognosis in NSCLC with liver metastases were identified by univariate Cox regression analysis of prognosis-related indicators. Subsequently, variables with P values < 0.05 in the univariate Cox regression analysis were subjected to multivariate Cox regression analysis to obtain independent prognostic factors for NSCLC with liver metastases, and multiple predictors were integrated to assign different scores using the degree of contribution of different factors to OS. Finally, a nomogram model for predicting OS in NSCLC patients with liver metastases was established by converting the scores to a functional relationship with OS using R software. The model performance is divided into two main aspects, discrimination and calibration, which we have validated in the training and validation groups, respectively. We used calibration curves to measure the agreement between predicted and actual outcomes. The discriminant of the model was measured by calculating the receiver operating characteristic curves (ROC) curves area under the curve (AUC), which took values in the range of 0.5–1.0. Finally, the clinical application value of the model was evaluated by decision curve analysis (DCA). The random grouping, nomogram, calibration curves, ROC, and DCA were composed by R language software (version 4.0.3).The cox proportional hazards regression analyses were performed by the R software “survival” and “MASS” packages. The Kaplan–Meier survival curve was produced using the “tidyverse”and “survminer” package.The nomogram and calibration plots were produced using the “rms” “foreign”and “survival” package. The DCA was performed using the function “stdca.R”.The ROC was performed using the “survival”package. Bilateral P values < 0.05 were considered statistically significant.

Results

Baseline characteristics of NSCLC patients with liver metastases

A total of 199,564 patients with confirmed NSCLC were collected in the SEER database between 2010 and 2015, and 1963 eligible cases were finally obtained, of which 1375 were randomly assigned to the training group and 588 were randomly assigned to the validation group. Table 1 summarizes the demographic and clinical characteristics of the NSCLC patients with liver metastases. In the training group, 733 (53.3%) patients were > 66 years old, 1073 (78%) were white, 804 (58.4%) were male, and 1330 (96.7%) patients had insurance. For tumor characteristics, 54.6% of them were adenocarcinoma, 59.2% were upper lobe, 68.4% were histologic grade III, and 50.2% were N2. Among the modalities of follow-up treatment received, the majority (97%) of patients did not receive surgical treatment, 55.7% received chemotherapy, and 42.9% received radiation therapy. Patient details are shown in Table 1.

Table 1 Demographic and clinical characteristics of patients.

Development and validation of a prognostic nomogram for NSCLC patients with liver metastases

After univariate Cox regression analysis, a total of 14 factors were significantly associated with the prognosis of NSCLC patients with liver metastases (P < 0.05), including age, race, gender, histological type, grade, T stage, N stage, brain metastases, bone metastases, lung metastases, surgery, chemotherapy, insurance status, and tumor size (Table 2). These factors were then subjected to multivariate Cox regression analysis, and finally, 11 factors were identified as independent prognostic factors for NSCLC with liver metastases including age, race, gender, grade, T stage, N stage, brain metastases, bone metastases, surgery, chemotherapy, and tumor size. We established a prognostic nomogram for NSCLC patients with liver metastases based on the independent prognostic factors selected in the multivariate Cox regression analysis (Fig. 1), from which it is clear that the T stage has the greatest impact on the prognosis of NSCLC patients with liver metastases, followed by tumor size. The area under the curve of the clinical prognostic model predicting overall survival at 6, 9, and 12 months was 0.793, 0.787, and 0.784 in the training group and 0.767, 0.771, and 0.773 in the validation group, respectively (Fig. 2). In addition, we further compared the difference of AUC value between nomogram and all independent prognostic factors and the results showed that the AUC value of the nomogram was higher than the AUC of all independent factors at 6, 9, and 12 months, both in the training group and the validation group (Fig. 3). The calibration curves are shown in Fig. 4. The curves are all close to 45 degrees, indicating that the prognostic nomogram has a good calibration performance. Also, the DCA showed that the prognostic nomogram has strong clinical utility (Fig. 5). We calculated the total score for all patients and then used X-tile software to cutoff. The results showed that the best cut-off values for OS were 105 and 167. Based on the cut-off values, patients were divided into the high-risk group (> 167), medium-risk group (106–166), and low-risk group (< 106). By depicting the Kaplan–Meier survival curve, we can find that as the risk increases, the prognosis of the patients will become worse. (Fig. 6).

Table 2 Univariate and multivariate Cox regression analysis in NSCLC patients with liver metastases.
Figure 1
figure 1

A prognostic nomogram for NSCLC patients with liver metastases.

Figure 2
figure 2

ROC curves for NSCLC patients with liver metastases. (a) ROC curves of 6-, 9-, and 12-months in the training group, (b) ROC curves of 6-, 9-, and 12-months in the validation group. ROC: Receiver operating characteristic.

Figure 3
figure 3

The ROC curves of nomogram and all independent predictors at 6- (a), 9- (b), and 12-months (c) in the training group and at 6- (d), 9- (e), and 12-months (f) in the validation group.

Figure 4
figure 4

The calibration curves of the nomogram for the 6-, 9-, and 12-months OS prediction of the training group (ac) and validation group (df). The x axis represents the nomogram- predicted survival rates, whereas the y axis represents the actual survival rates.

Figure 5
figure 5

DCA of the nomogram for the survival prediction of NSCLC patients with liver metastases.6-months (a), 9-months (b), 12-months (c) survival benefit in the training group. 6-months (d), 9-months (e), 12-months (f) survival benefit in the validation group. The green curve represents the assumption that all patients will expire, while the blue curve indicates the net benefit of the nomogram model by summing the true positives and subtracting the false positives.

Figure 6
figure 6

Kaplan–Meier survival analysis of the signature for both the training group and the validation group. Patients with a higher risk score demonstrated a worse OS than those with a low risk score in the training group (a c d) and validation group (b, e, f), which suggests the strong predictive ability for NSCLC patients with liver metastases survival outcome.

Discussion

The incidence of liver metastases in lung cancer ranges 2.9–4.1% and is 20–30% in patients with NSCLC, which makes the prognosis worse10. The risk of death in patients with NSCLC in the presence of liver metastases is 1.53–2.41 times higher than that of distant metastases from other sites8,11. However, there is no relevant model that can personalize the prognosis of patients with liver metastases from NSCLC. Therefore, we created a prognostic nomogram for NSCLC patients with liver metastases by analyzing the basic information of patients in the SEER database and achieved a more precise to determine the prognosis of patients by corresponding each variable to the nomogram graph. The study confirmed that race, age, gender, grade, T stage, N stage, tumor size, presence of distant metastases from remaining sites (e.g., brain, bone, etc.), surgery, and chemotherapy were all associated with prognosis.

The results of this study showed that females had a better prognosis compared to males in NSCLC patients with liver metastases, which may be related to the stimulating effect of androgens on the growth of lung cancer12. The 8th edition of the AJCC guidelines states that regardless of the patient's T or N stage, as long as distant metastases are present, their stage is stage IV. A study has shown that T and N stages cannot be ignored even in stage IV patients and that T and N stages indicate to a certain extent the malignancy of tumor cells and determine the treatment of NSCLC patients13. The study showed that the more advanced the T and N stages and the larger the tumor size, the worse the prognosis of the patients, probably because as the tumor size increases, the sensitivity of the tumor cells to treatments such as radiotherapy decreases14. Similarly, the higher the histological grade of the patient, the more aggressive the tumor tissue is, and accordingly the shorter the survival of the patients15. In patients with NSCLC, adenocarcinoma, compared to squamous carcinoma and other histological types, is more likely to develop liver metastases16. However, the histologic type is not one of the poor prognostic factors for NSCLC patients with liver metastases, which may be due to different chemotherapy regimens for patients with different histologic types. Among patients with single-site metastases from NSCLC, liver metastases have a relatively shorter median survival compared to brain or bone metastases. Patients with multisite metastases had a worse prognosis than single-site metastases, and among patients with multisite metastases, OS was reduced by 1 month or more in patients with liver metastases than in patients without liver metastases17.

The current standard treatment for NSCLC patients with liver metastases is systemic treatment, while for liver metastases, local treatments such as surgery, radiotherapy, radiofrequency ablation, and interventional therapy are available 18. Hepatectomy may be a treatment option when the patient’s liver metastasis is single lesions19. Sun et al. demonstrated that for NSCLC patients with only three metastases or less and single metastases, surgery can improve OS, as well as reduce the tumor burden of patients and reduce or eliminate tumor-induced complications, which can improve patient prognosis to some extent20,21. Although surgery significantly improves the prognosis, the vast majority of patients are lost to surgery at diagnosis, and only 3% of patients in our study underwent surgical treatment22. According to the previous opinion, patients with liver metastases from NSCLC have a relatively low response rate to chemotherapy, which may be since some patients with liver metastases have liver dysfunction and cannot tolerate chemotherapy23. A study by katsunori et al. showed a median survival of 6 months for patients receiving chemotherapy, which improved overall survival by 4 months compared to patients receiving only symptomatic supportive therapy16. Similarly, our study confirms that chemotherapy improves the prognosis of patients with liver metastases from NSCLC. Consistent with previous studies radiotherapy did not show an advantage in multivariate Cox regression analysis, suggesting that radiotherapy does not improve OS in patients with liver metastases from NSCLC, but radiotherapy can still be used as one of the palliative treatments to relieve pain, reduce complications and improve patients' quality of life7.

There are some limitations to our study. First, targeted therapies (TKIs) and immunotherapy are currently available for some populations of non-small cell lung cancer, but specific information on these treatments is not available in the SEER database, so we were unable to analyze whether these two treatments modalities could improve patient prognosis. Secondly, the study was a retrospective analysis, and details of the treatment modalities such as the surgical approach, chemotherapy regimen, and the site and dose of radiotherapy were not available. Third, the SEER database only collects data from the United States, and further research is needed to determine whether this nomogram is generalizable to other countries and ethnic groups.

Conclusion

In brief, we established a prognostic nomogram for NSCLC patients with liver metastases by analyzing the basic information in the SEER database, which contains 11 variables including race, age, gender, grade, T stage, N stage, tumor size, bone metastases, brain metastases, surgery, and chemotherapy. Clinicians can use the prognostic nomogram to accurately assess the OS of patients, identify high-risk patients, and provide a reference for optimizing treatment plans.