Introduction

Nasopharyngeal Carcinoma (NPC), a malignant tumor of the head and neck originating from nasopharyngeal epithelium, exhibits distinct epidemiological features, with the highest incidence observed in Southern China and Southeast Asia [1]. In 2020, the global incidence of NPC accounted for approximately 133,000 new cases and 80,000 fatalities, with China representing nearly half of these cases [2]. Presently, prognosis remains dismal for locally advanced stages, particularly for stage IVA NPC, with a 5-year OS rate lingering around 65% [3]. 19–29% of NPC patients develop distant metastasis post-treatment, a figure that escalates in stage IVA [4]. Distant metastasis has emerged as a predominant cause of treatment failure in NPC [5, 6].

Despite strides in immunotherapy and platinum-based chemotherapy for metastatic NPC, the prognosis for patients with metastases remains bleak, with a median OS of 20–29 months [7, 8]. Consequently, stratifying the risk of distant metastasis is vital, especially in stage IVA NPC, to facilitate targeted interventions.

The TNM staging system is pivotal in prognostication and treatment guidance for cancer patients, yet the role of pre-treatment haematological indicators is increasingly recognized. Plasma EBV DNA has emerged as a reliable biomarker for NPC, instrumental in diagnosis, treatment planning, risk stratification, prognostic evaluation, and condition monitoring [9, 10]. Pre-treatment plasma EBV DNA levels have been identified as predictors of distant metastasis in NPC [11].

The Prognostic Nutritional Index (PNI) serves as a straightforward and practical indicator for predicting the nutritional status of cancer patients [12]. Meanwhile, NLR represents inflammatory markers [13,14,15]. Recently, emerging indicators such as PNI and NLR have garnered significant attention in prognosticating the outcomes of nasopharyngeal carcinoma (NPC) patients. Moreover, their effectiveness has been validated by previous research endeavors [16].

However, comprehensive research on tumor burden, nutrition, and inflammation prognostic markers for stage IVA NPC, as well as the establishment of an effective distant metastasis prediction model, are still lacking. Additionally, many patients undergoing concurrent chemoradiotherapy (CCRT) may experience severe acute toxic reactions, leading to reduced compliance with subsequent adjuvant chemotherapy (AC). Of note, S-1, an oral fluoropyrimidine, has shown potential in improving overall survival and metastasis-free survival in previous studies [17, 18]. Therefore, this study aims to establish effective prognostic indicators from the perspective of reducing toxicity and achieving personalized treatment, identifying high-risk patients most likely to benefit from adjuvant S-1 therapy.

Methods

Study population

This retrospective study analyzed patients diagnosed with stage IVA NPC at Fujian Cancer Hospital from July 2016 to December 2019. Inclusion criteria were: (1) histological diagnosis of squamous cell carcinoma; (2) stage IVA as per the 8th edition UICC/AJCC classification; (3) treatment with IMRT; (4) complete baseline clinical and laboratory data; and (5) comprehensive follow-up records. Exclusion criteria included: (1) distant metastasis at diagnosis; (2) prior anti-tumor treatment; and (3) comorbidities significantly affecting complete blood count or biochemistry, such as aplastic anemia, myelofibrosis, acute or chronic hepatitis, cirrhosis, etc. A total of 834 patients met these criteria. The study adhered to the Helsinki Declaration, was approved by our institutional ethics committee, and all patients provided written informed consent before treatment.

Treatment protocol

All patients received standardized treatment as per NPC guidelines, including induction chemotherapy (IC), IMRT, concurrent chemotherapy (CC), adjuvant chemotherapy (AC), and targeted therapy.IC: Comprised of platinum-based drugs combined with taxanes, doxorubicin, 5-fluorouracil, or gemcitabine, administered intravenously every three weeks for 1–7 cycles.CC: Single-drug platinum-based chemotherapy intravenously every three weeks for 1–3 cycles.AC: Oral maintenance therapy with Teysuno (S-1) or capecitabine, administered every four weeks for at least two cycles.Targeted Therapy: Nimotuzumab (NTZ) or Endostar (E), or their combination, primarily during IC and/or radiotherapy. NTZ was given intravenously at 200 mg/week for 3–21 cycles, and E was administered at 7.5 mg/m2 on days 1–14, every three weeks for 2–8 cycles. Specific treatment details are available in Additional file 1.

Follow-up and study endpoints

Post-treatment, patients were followed every three months for the first two years, every six months for years 2–5, and annually thereafter until death. Routine physical examinations, nasopharyngeal endoscopy, nasopharyngeal and neck MRI, abdominal ultrasound, chest CT, plasma EBV DNA level measurements, and other hematologic markers were monitored. PET-CT was considered when necessary. The primary endpoint was distant metastasis-free survival (DMFS); the secondary endpoint was OS. DMFS was defined as the time from diagnosis to the first occurrence of distant metastasis, death from any cause, or the last follow-up, whichever occurred first. OS was measured from the date of diagnosis to death from any cause or the last follow-up.

Statistical analysis

Statistical analysis was performed using R language (version 4.2.2). Using SPSS 26 software, patients were stratified based on the presence or absence of distant metastasis. Subsequently, a 2:1 randomized stratified sampling approach was employed to divide the patients into training and validation sets. This process is graphically represented in Fig. 1. Maximally Selected Rank Statistics’ optimum cutoff value was used to dichotomize the candidate continuous variables. Variables with a P-value < 0.05 in the baseline table entered the multivariate logistic regression [19]to assess independent prognostic factors for distant metastasis. We developed a predictive model based on multivariate analysis and clinically relevant factors, assigning scores to each factor in the model based on its association with the risk of distant metastasis in nasopharyngeal carcinoma patients. Patients were then stratified into high-risk and low-risk groups accordingly. Utilizing the tree model and the “partykit” package, we can accurately stratify nomogram scores into precise subgroups. Decision curve analysis (DCA) was performed to determine the clinical utility of the nomogram, quantifying net benefits at different probability thresholds in the training cohort. Survival outcomes were analyzed using the Kaplan-Meier method, with survival curves compared using the log-rank test. In addition, Inverse Probability of Treatment Weighting (IPTW) was utilized to balance baseline characteristics, checking for standardized mean differences (SMD) < 0.1. All statistical tests were two-sided, and a P-value < 0.05 was considered statistically significant.

Fig. 1
figure 1

The flowchart of the enrolled patients

Results

Clinical characteristics and survival outcomes

We analyzed the clinical profiles of 557 patients in the training cohort and 277 in the validation cohort (Table 1). Overall, they exhibited comparable characteristics. The median age for both groups was 48 years. The median follow-up time for the entire cohort was 53.8 months (95% CI 52.9–55). Among the study population, 150 stage IVA patients developed distant metastases: 90 had single-organ involvement, and 60 had multiple metastases, primarily in the bone, lung, and liver. Notably, 18% of patients in each cohort experienced distant metastases (100 in the training cohort and 50 in the validation cohort). In the validation set, the 3-year OS rate was 82.5% (95% CI = 0.767–0.877), while the 3-year DMFS rate was 95.9% (95% CI = 0.924–0.995). For the training set, the 3-year OS rate was 87.7% (95% CI = 0.850–0.905), and the 3-year DMFS rate was 84.3% (95% CI = 0.813–0.874). The overall 3-year OS rate was 88.0% (95% CI = 0.858–0.902), and the overall 3-year DMFS rate was 84.1% (95% CI = 0.816–0.867).

Table 1 Clinical characteristics of patients in training cohort and validation cohort

Development and validation of the nomogram

In the training set, both univariate and multivariate Cox regression models are presented in Table 2. The results of the multivariate Cox regression model indicate correlations between PNI, NRI, N stage, EBV DNA. Based on these four independent prognostic factors, a nomogram model was developed, illustrated in Fig. 2.The area under the receiver operating characteristic (ROC) curve (AUC) for the model demonstrated good predictive accuracy, with an AUC of 0.718 (95% CI = 0.650–0.785) in the training cohort (Fig. 2A) and 0.729 (95% CI = 0.635–0.823) in the validation cohort (Fig. 2B). Calibration curves for both the training and validation cohorts (Fig. 2D and G) closely aligned with the diagonal line, indicating a high concordance between the model’s predictions and the actual outcomes for patients. The AUC values for all independent factors are presented in Fig. 2B and E, showing that in both the training and validation cohorts, the nomogram achieved the highest AUC. We constructed decision curve analysis (DCA) curves for the nomogram and each independent factor, as depicted in Fig. 2C and F. The net benefit of the nomogram surpassed that of individual independent factors significantly.

Table 2 Univariate and multivariate cox proportional hazards models of distant metastasis
Fig. 2
figure 2

Nomogram Construction for DMFS in IVA NPC (A). AUC of the nomogram at 5-year in the training set (B) and validation set (E). Decision curves analysis of the nomogram at 5 years in the training set (C) and validation set (F).Calibration plot of the nomogram at 1-, 3-, and 5-year in the training set (D) and validation set (G). AUC, area under curve; pre EBV DNA: pre-treatment EBV DNA; NLR: Neutrophil to Lymphocyte Ratio; DMdistant metastasis; PNI: prognostic nutritional index

Risk stratification

Utilizing the nomogram, risk scores for each patient with stage IVA NPC were calculated. The optimal cut-off value, determined from the training cohort using a tree model, was identified as 149.056 (Additional file 2). Patients were subsequently stratified into high-risk and low-risk groups based on this threshold. In the training cohort, the 3-year OS rates were 90.8%(95%CI = 0.874–0.943)and 84.8%(95%CI = 0.807–0.891), respectively (P = 0.025, Fig. 3A). The 3-year DMFS rates for the low and high-risk subgroups were 93.5%(95%CI = 0.905–0.965)and 75.5%(95%CI = 0.707–0.808), respectively (P < 0.001, Fig. 3D).In the validation cohort, the 3-year OS rates were 95.7% and 81.0%, respectively (P = 0.008, Fig. 3B).The 3-year DMFS rates for the low and high-risk subgroups were 95.0%(95%CI = 0.912–0.990) and 74.7%(95%CI = 0.681–0.820), respectively (P < 0.001, Fig. 3E). Across the entire study population, the 3-year OS rates were 92.4(95%CI = 0.898–0.95)and 84.0%(95%CI = 0.806–0.875), respectively (P < 0.001, Fig. 3C).The 3-year DMFS rates for the low and high-risk groups were 94%(95CI%=0.916–0.964)and 75.3%(95CI%=0.713–0.795), respectively (P < 0.001, Fig. 3F). These results indicate that the DMFS and OS in the low-risk group were significantly better than those in the high-risk group, further validating the discriminative ability of the model.

Fig. 3
figure 3

Kaplan-Meier survival curves of different risk groups. OS (A) and DMFS (D) of the training cohort, OS (B) and DMFS (E) of the validation cohort, OS (C) and DMFS (F) of the all. (low-risk group: risk score<149.056; high-risk group: risk score ≥ 149.056)

Subgroup analysis of high-risk group

Further analysis of various risk strata revealed distinct outcomes for patients in the high-risk group, as detailed in Table 3; Fig. 4. Within this cohort, patients receiving IC combined with IC + CCRT + S-1 demonstrated significantly improved outcomes compared to those undergoing standard IC + CCRT. Specifically, the 3-year DMFS rates were 88.6% (95% CI = 0.796–0.986) for IC + CCRT + S-1 versus 69.4% (95% CI = 0.605–0.795) for IC + CCRT (P = 0.008, Fig. 4A). Similarly, the 3-year OS rates were 95.4% (95% CI = 0.895-1) for IC + CCRT + S-1 compared to 80.6% (95% CI = 0.73–0.891) for IC + CCRT (P = 0.001, Fig. 4B). These findings highlight the superiority of the IC + CCRT + S-1 treatment regimen in terms of 3-year DMFS and OS compared to the standard IC + CCRT protocol. Utilizing IPTW to match baseline characteristics (Additional file 3) further validated the clinical significance of S-1 for high-risk individuals. The results revealed that both OS (p = 0.001 after IPTW, Fig. 4C) and DMFS (p = 0.008 after IPTW, Fig. 4D) benefited from IC + CCRT + S-1 treatment in the high-risk group, irrespective of IPTW adjustment.

Table 3 Baseline characteristics of patients in high-risk group
Fig. 4
figure 4

Kaplan-Meier survival curve in high-risk group, (A) DMFS before IPTW, (B) OS before IPTW, (C) DMFS after IPTW, (D) OS before IPTW. (IC + CCRT: induction chemotherapy plus concurrent chemoradiotherapy; IC + CCRT + S-1: induction chemotherapy plus concurrent chemoradiotherapy plus S-1 maintenance therapy)

Subgroup analysis of low-risk group

For the low-risk group, baseline characteristics were delineated in Table 4. We conducted a comparative analysis of Disease-Free Survival (DMFS) and Overall Survival (OS) between IC + CCRT and IC + CCRT + S-1, revealing respective 3-year DMFS rates of 91.8% (95% CI = 0.870–0.968) and 94.0% (95% CI = 0.862-1) (P = 0.456, Fig. 5A), and 3-year OS rates of 90.5% (95% CI = 0.856–0.958) and 94.6% (95% CI = 0.876-1) (P = 0.231, Fig. 5B). Additionally, we employed Inverse Probability of Treatment Weighting (IPTW) to balance baseline characteristics across the population for further confirmation of our findings. Appendix Table 2 illustrates baseline characteristics before and after IPTW adjustment. Notably, regardless of IPTW application, IC + CCRT demonstrated non-inferior 3-year DMFS and OS compared to IC + CCRT + S-1(Fig. 5C-D). Therefore, in the low-risk population, a prudent consideration of treatment intensity reduction, while maintaining therapeutic efficacy, could enhance patient tolerance and mitigate toxicities.

Table 4 Baseline characteristics of patients in low-risk group
Fig. 5
figure 5

Kaplan-Meier survival curve in low-risk group, (A) DMFS before IPTW, (B) OS before IPTW, (C) DMFS after IPTW, (D) OS before IPTW. (IC + CCRT: induction chemotherapy plus concurrent chemoradiotherapy; IC + CCRT + S-1: induction chemotherapy plus concurrent chemoradiotherapy plus S-1 maintenance therapy)

Discussion

In our study, we conducted a retrospective analysis and found that pretreatment plasma EBV DNA, NLR, PNI and N stage are independent risk factors for distant metastasis in stage IVA nasopharyngeal carcinoma (NPC). We developed a nomogram model integrating these four factors to predict the risk of distant metastasis in 834 patients diagnosed with stage IVA NPC.

Notably, T stage did not emerge as a risk factor for distant metastasis, while an increased N stage correlated with a heightened risk of distant metastasis in stage IVA NPC [20]. This study further supports the hypothesis that distant metastasis in NPC typically initiates with lymph node dissemination rather than originating from the primary tumor. Tumor cells exhibit distinct metabolic patterns from normal cells, relying predominantly on glycolysis for energy metabolism even in oxygen-sufficient environments [21].

A low PNI is indicative of a decline in both peripheral blood lymphocyte count and albumin levels, strongly hinting at suboptimal nutritional status and a weakened immune system in patients. A study conducted within the same institution has firmly established the predictive value of PNI in NPC patients [22]. Additionally, a retrospective analysis on newly diagnosed metastatic nasopharyngeal carcinoma patients has further bolstered the argument that PNI serves as a superior nutritional predictor in comparison to BMI [23]. Nevertheless, another investigation revealed that PNI did not emerge as an independent predictive factor when evaluated alongside NRI, CRP, ALP, and LDH [24]. This inconsistency could be attributed to the distinct patient populations targeted by each study.

Elevated levels of plasma EBV DNA strongly correlate with an augmented risk of distant metastasis in regions with a high prevalence of NPC [25] and serve as reliable prognostic indicators. Varied studies demonstrate significant disparities in the optimal cut-off values of pretreatment EBV DNA for survival prediction. Leung et al.’s investigation [26] stratified patients with early-stage NPC into high-risk (similar to stage III survival outcomes) and low-risk (similar to stage I survival outcomes) subgroups, with a designated EBV DNA cut-off value of 4000 copies/mL. Similarly, Lin et al.’s study [27], analyzing 99 patients with locally advanced NPC, revealed that pre-treatment plasma EBV DNA concentrations exceeding 1500 copies/mL were associated with significantly worse OS and locoregional recurrence-free survival (LRFS). In our present study, plasma EBV DNA emerged as an independent predictor of distant metastasis in stage IVA NPC, with a specific cut-off value of 10,100 copies/mL.

The inflammatory response of the organism profoundly influences tumorigenesis, development, metastasis, and prognosis. The Neutrophil-to-Lymphocyte Ratio (NLR), as an inflammation indicator, mirrors the tumor immune microenvironment, closely correlating with the prognosis of various malignant tumors [28,29,30,31,32]. NLR stands as a crucial marker for prognostic prediction in NPC patients and aids TNM staging for effective prognostic assessment. A high NLR level suggests heightened tumor aggressiveness and a predilection for distant metastasis, resulting in an unfavorable prognosis. Stage IVA NPC, characterized by unsatisfactory outcomes despite standard treatment modalities, has been classified in previous studies into three types via the TNM staging system: type A (predominantly nasopharyngeal primary foci), type D (predominantly regional lymph node metastasis), and type AD (both). In this study, the multidimensional assessment of the risk of distant metastasis in stage IVA NPC involved combining clinically important factors to achieve precise treatment.

Simultaneous radiotherapy followed by oral maintenance chemotherapy emerges as an elective therapeutic option for patients with locally advanced NPC. In this study, we observed that IC + CCRT + S-1 significantly enhanced DMFS and OS compared to IC + CCRT in the high-risk scoring group. Conversely, in the low-risk scoring group, IC + CCRT + S-1 did not yield superior DMFS and OS outcomes. This implies the necessity of maintenance therapy for stage IVA NPC in the high-risk scoring group, while low-risk patients may benefit from close clinical observation to avert over-treatment, thereby offering a valuable reference for the precise treatment of stage IVA NPC. Our study aligns with the objectives and conclusions of previous research, which aimed to identify high-risk groups that would benefit from adjuvant chemotherapy, with the goal of achieving personalized and less toxic treatment. However, our study distinguishes itself by incorporating a broader range of easily accessible clinical variables (e.g., PNI, NLR, EBV DNA) to enhance predictive accuracy, focusing specifically on stage IVA patients, and employing distinct endpoints, namely the occurrence of distant metastasis [18].

Inevitably, several limitations persist in retrospective studies. Firstly, inherent selectivity bias is unavoidable due to the retrospective nature of the study. Additionally, the availability of information on patient-specific characteristics is constrained by limitations in data sources. Secondly, the study was confined to data from a single center, necessitating validation with a diverse cohort from different centers to confirm the accuracy of the findings. Thirdly, potential bias may arise from variations in the choice of treatment regimen among different clinicians. Fourthly, we cannot guarantee the complete similarity in the distribution of relevant features between the training and validation sets. Finally, the applicability of the nomogram model to younger patients (< 18 years old) and regions with lower NPC incidence demands further validation. In terms of the specific limitations mentioned in the study, for patients with stage IVA/B NPC, the study indicates that the TPC induction regimen is more effective than the PF induction regimen, supporting the TPC regimen as the new standard regimen for induction chemotherapy in NPC [33]. The induction regimens for patients in our study were primarily GP and TP. Due to potential biases arising from individual heterogeneity, we did not include more personalized induction chemotherapy regimens. In future studies, we look forward to further promoting the TPC regimen’s clinical application and gradually improving and refining our predictive model.

Conclusions

Pre EBV DNA, NLR, PNI and N stage emerged as independent risk factors for DMFS in stage IVA NPC. In the context of IMRT, the nomogram developed in this study exhibits commendable accuracy and discriminative prowess in predicting distant metastasis for stage IVA NPC. Importantly, our nomogram can guide the utility of S-1 adjuvant chemotherapy, offering crucial guidance for selecting optimal treatment modalities for patients at high and low risk of distant metastasis in stage IVA NPC.