Background

Thymic epithelial tumors common occur in the anterior mediastinum and can be divided into thymoma and thymic carcinoma according to histology [1, 2]. The Masaoka-Koga staging system, which is based on the progression of the primary tumor and the degree of involvement of the surrounding organs, has been widely accepted for thymoma and thymic carcinoma [3,4,5]. However, Yanagiya et al. found that age and histological type were significant prognostic factors in their cohort, which were not observed or reported in the Masaoka-Koga staging system [6]. Similarly, the results of the study published by Fukui et al. revealed that the new classification showed a better prognostic effect for thymic tumors than the Masaoka-Koga classification [7, 8]. Moreover, compared with the staging systems for most other malignant tumors, the Masaoka-Koga system does not include the effect of lymph node or distant organ metastasis on prognosis as finely as the TNM staging.

At the same time, an increasing number of studies have used clinical factors such as history of hypertension, diabetes, [9] smoking, [10] and body mass index (BMI) [11] and hematological indicators including hemoglobin (Hb) [12], neutrophil-to-lymphocyte ratio (NLR) [13, 14], albumin (ALB) [15], and other such indicators to analyze the prognosis of various tumors. However, only few studies have comprehensively analyzed a combination of the two types of indicators to establish a prognostic model for patients with thymic epithelial tumors after thymectomy. Currently, nomograms have been developed for most cancer types [16,17,18]. Compared with the traditional staging system for many cancers, the use of a nomogram has advantages in terms of prognostic prediction. Therefore, it has been proposed as an alternative method for cancer staging [19,20,21].

Therefore, in this study, we aimed to use both preoperative hematological indicators and clinical factors to construct a prognostic predictive nomogram for patients with thymic epithelial tumors after thymectomy for a comprehensive evaluation. In addition, the nomogram score was used for risk stratification to identify high-risk patients.

Materials and methods

Study population

This study was approved by the Medical Ethics Committee of Sun Yat-sen University Cancer Center (SYSUCC; Approval No. B2020–353-01) and complies with the Declaration of Helsinki.

This study retrospectively analyzed 156 patients who underwent R0 resection of thymus epithelial tumor at SYSUCC between May 2004 and August 2015. Most patients were included in the training group (n = 116), and the remaining patients were included in the verification group (n = 40). The inclusion criteria were as follows: (1) patients older than 18 years; (2) patients who underwent thymectomy at our center; (3) presence of histopathologically confirmed thymic epithelial tumors; (4) related laboratory examinations (blood routine, biochemical routine and so on) were completed within 7 days before the operation. 5) the scope of surgical resection was R0 resection. The exclusion criteria and the screening process are shown in Fig. 1.

Fig. 1
figure 1

Flow chart of patient screening

Clinical data collection

Data were collected for the following clinical variables: hematological indicators (obtained within 1 week before the operation), neutrophil count (NE), lymphocyte count (LY), platelet count, albumin levels, globulin levels as well as patients’ sex, age, smoking history, drinking history (Drinking alcohol every day, and the specific amount of drinking is not limited and described), family history of tumor, underlying disease (hypertension and/or diabetes), tumor size, histological subtype, myasthenia gravis symptoms, tumor capsule status (complete or incomplete), great vessel infiltration, Masaoka-Koga staging, and T staging. In addition, T staging and Masaoka staging were obtained by combining imaging data with intraoperative records and postoperative pathological information, we staged all patients according to the eighth edition of the TNM staging system and the modified Masaoka-Koga staging system.

Follow-up

We followed up patients regularly. In the first 2 years, all patients were followed up every 6–12 months, every 12 months from the third to the fifth year, and then an annual follow-up was continued. The last follow-up date was August 22, 2020. The methods of follow-up were telephone and outpatient. The primary observational endpoint was RFS. RFS was defined as the length of time after primary treatment for a cancer ends that the patient survives without any signs or symptoms of that cancer.

Statistical analysis

Statistical analysis was performed using SPSS 25.0 (IBM, Chicago, Illinois, USA) and R software (version 4.0.3; https://www.r-project.org/). In addition to age and tumor size, each component was converted to binary according to the best cutoff value (using X-tile software; http://www.tissuearray.org/rimmlab) according to the best cutoff value defined by the minimum P value method variables. This method has also been applied in other studies [22]. In the training cohort, the Cox regression model was used to analyze risk factors through univariate and multivariate analyses. Univariate analysis was performed to determine important risk factors for RFS. Variables with P values of less than 0.05 were further included in the multivariate Cox proportional hazard regression model. In the final multivariate analysis, P values of less than 0.05 were considered independent prognostic factors. Then, based on the results of the multivariate Cox analysis, we constructed a nomogram showing three-year and five-year relapse-free survival rates.

The R statistical software packages “rms,” “survival,” “foreign,” “survivalROC,” and “rmda” were used to calculate the C index; to generate the calibration curve, receiver operating characteristic (ROC) curves, decision curve analysis (DCA) curve, and Kaplan-Meier (KM) curve; and to construct a nomogram. The nomogram was used to calculate the prognostic risk score for each patient; X-tile was used to divide the patient’s score into different risk levels (low risk, medium risk, and high risk) and to show their stratification effect through the KM curve [23]. The C-index, DCA curve, and ROC curve were used to evaluate the predictive ability of the nomogram. All statistical tests were two-sided, and P values of less than 0.05 were considered statistically significant.

Results

Basic characteristics

A total of 156 patients participated in the study. Among them, 116 patients (Approximate 70%) were randomly assigned to the training group to build a nomogram, and the remaining 40 patients (Approximate 30%) were assigned to the verification group. Table 1 shows the data of the clinicopathological characteristics of the 156 patients. The three-year and five-year relapse-free survival rates were 0.932 and 0.905, respectively. We found that 129 patients (81.1%) achieved a five-year relapse-free survival time in all patients. These clinicopathological factors did not differ significantly between the training and validation cohorts.

Table 1 Patient, tumor, and treatment-related characteristics of thymic tumor (n = 156)

Univariable and multivariable analyses in the training cohort

According to the results of univariate Cox regression analysis, there were nine variables related to RFS: WHO histologic types, T stage, Tumor capsule status, Invasion of great vessels, ALB, Neutrophils (NE), NLR, PLR and systemic immune-inflammation Index (SII) (Table 2). In the multivariate Cox regression analysis, four parameters were defined as independent prognostic factors of RFS: T stage (T1 vs. T2–3, hazard ratio, HR = 7.518, 95% confidence interval, CI [1.355–41.718], ALB (HR = 0.157, 95% CI [0.035–0.697]), WHO histologic types (A-AB vs B1-B3, HR = 0.379, 95%CI [0.070–2.040], A-AB vs C, HR = 5.892, 95% CI [0.660–52.582]), and NLR (HR = 15.426, 95% CI [1.759–135.300]) (Table 2).

Table 2 Univariate and multivariate analysis results in Training cohort(n = 116)

Establishment of the nomogram

According to the results of the multivariate Cox regression analysis, T stage, ALB, WHO histologic types, and NLR were defined as independent prognostic factors, and these factors were integrated to form a nomogram (Fig. 2). In the training cohort, the C index was 0.902 (95% CI: 0.843–0.961). Internal calibration curves for the three- and five-year RFS closely matched those of the baseline in the training cohort (Fig. 3Aand B).

Fig. 2
figure 2

Nomogram predicting 3- and 5- relapse-free survival after thymectomy for thymic epithelial tumors patients

Fig. 3
figure 3

The calibration curves for predicting patient relapse-free survival at (A, C) 3-y and (B, D) 5-y in the training and validation cohorts

Verification of the nomogram

To better verify the actual predictive power of the nomogram, the above results were verified using the verification group data, showing that the C index was 0.785 (95% CI: 0.614–0.957), and the five-year and three-year external calibration curves met those of the standard baseline (Fig. 3C and D). We also used the ROC curve to verify the nomogram performance (Fig. 4). The AUC values of the training and validation groups at 3 and 5 years were both greater than 0.65 and by comparing the AUC values of the two groups, the nomogram model were showing well accuracy of the nomogram in predicting RFS.

Fig. 4
figure 4

Receiver operating characteristic curve analysis for the sensitivity and specificity of the nomogram system to predict 3-y relapse-free survival (A, C) and 5-y relapse-free survival (B, D) in training and validation cohorts

Decision curve analysis

Decision Curve Analysis (DCA) is a novel method for evaluating prognostic strategies that can evaluate the predictive power of prognostic models. Figure 5 shows the DCA curve of nomogram in the training and validation cohorts. The DCA of the nomogram has a high net benefit, which indicates that the nomogram can have well clinical application.

Fig. 5
figure 5

Decision curve analysis of the training cohort (A, B) and validation cohort (C, D) for 3- and 5- years relapse-free survival

Risk stratification of OS

Based on the nomogram scores, patients were divided into low-risk (0–100 points), medium-risk (101–179 points), and high-risk (180 points or higher) subgroups. In the training cohort, there were 77 patients in the low-risk group, 26 patients in the intermediate-risk group, and 13 patients in the high-risk group. In the validation cohort, 29 patients were included in the low-risk group, 7 patients were included in the medium-risk group, and 4 patients were included in the high-risk group. There were significant differences in the incidence of RFS among the subgroups, and the relapse-free survival rate of the high-risk subgroup was lower than that of the other groups (P < 0.05) (Fig. 6).

Fig. 6
figure 6

The relapse-free survival analysis of patients after risk-stratification (A for training cohort; B for validation cohort)

Discussion

In this study, we obtained results from 156 patients at SYSUCC, and ALB, NLR, T stage, and WHO histologic types were found to be independent prognostic factors of RFS. We developed a nomogram that can effectively predict the RFS rate of patients with thymic epithelial tumors at 3 and 5 years.

Slow disease progression and good prognosis have largely limited research on thymic epithelial tumors. At the same time, in the era of precision medicine, it is very important to analyze patient information as comprehensively as possible to screen factors affecting the prognosis for treatment decision-making. At present, the Masaoka-Koga staging system is still the gold standard for predicting the prognosis of thymic epithelial tumors. However, similar to the results of Fukui et al., [7] our results show that the T staging is better than the Masaoka-Koga staging in predicting prognosis. Therefore, the nomogram developed in this study was mainly included T stage. Based on this, the nomogram along with the patient’s various clinical and hematological indicators was used to construct a prognostic model. The nomogram has the advantage of a multi-dimensional comprehensive prediction of prognosis [24]. Some studies have reported nomograms related to the prognosis of thymic epithelial tumors. However, they considered only the clinicopathological characteristics or treatment methods of patients and do not consider relevant hematological indicators. At the same time, many studies on hematological indicators have shown that Hb, NE, LY, platelet (PLT), NLR, ALB, GLB, ALB/GLB, platelet-lymphocyte ratio (PLR), NLR, and SII [12, 13, 15, 25] have potential to become prognostic hematological indicators for various tumors including thymic epithelial tumors.

Our nomogram is composed of several factors that affect prognosis, which are commonly used in clinical practice. The nomogram showed that higher WHO pathological stage was related to poor RFS in patients with thymic epithelial tumors. This finding is consistent with the findings of other studies [26]. In terms of T staging, lower T staging can result in a more satisfactory RFS. In the studies of other researchers, the patient’s preoperative clinical staging was a factor affecting the prognosis of thymic epithelial tumors [27]. At the same time, we found that elevated NLR was associated with poor prognosis in patients with thymic epithelial tumors. This is the same as other people’s research results [6, 28]. In addition, a higher preoperative serum ALB level can often result in a more satisfactory prognosis, which is consistent with the findings of other studies reporting better prognosis in patients with thymic epithelial tumors with higher ALB levels [15].

In predicting the prognosis of certain cancers, nomograms have been developed and proven to be more accurate than traditional staging systems [29, 30]. Therefore because thymectomy is an effective treatment for thymic epithelial tumors, [26, 31,32,33] we constructed a prognostic nomogram for patients with thymic epithelial tumors after surgery. The nomogram performs well in predicting the survival rate. Its prediction is supported by the C index (0.902 and 0.785 for the training and verification cohorts, respectively), and the calibration curve was consistent with that of the baseline. The nomogram has high accuracy in predicting survival, and the DCA curve also showed that the nomogram had well predictive ability.

To the best of our knowledge, this is a relatively new attempt to develop a prognostic RFS nomogram for patients with thymic epithelial tumors by combining hematological and clinical indicators. Although this nomogram did not include many hematological indicators in the end, it combines clinical indicators, hematological indicators and other important clinical information to achieve the ultimate goal of integrating multi-dimensional data to jointly predict the prognosis of thymic epithelial tumors.

This study has several limitations. First, this was a retrospective study. Moreover, this was only a single-center study that included a small number of patients. Further research including more number of cases is still needed to verify our results. Second, tumor markers (CEA, SCC, AFP, etc.) and other potentially valuable hematological indicators were not included in this study. Third, the dynamic changes in hematological indicators considered in this study were not followed up after the operation.

Conclusions

In summary, by combining hematological and clinical indicators, we established and validated a nomogram for predicting the relapse-free survival of patients with thymic epithelial tumors. This convenient nomogram had well performance to distinguish the prognosis and risk of patients. Our findings suggest that it may be a potentially easy-to-use tool for physicians and can aid in postoperative personalized prognosis assessment and early identification of high-risk patients. Although the nomogram appears useful for prognostication and identifying high-risk patients, further prospective studies are needed to validate the nomogram and confirm the contribution of each prognostic factor.