Introduction

Nasopharyngeal carcinoma is a specific head and neck cancer with unique geographical and ethnic distribution. Approximately 133,354 new cases occurred in 2020, with the highest incidence in southern China [1]. Radiotherapy is the main treatment modality for early-stage nasopharyngeal carcinoma, while concurrent chemoradiotherapy with or without induction chemotherapy is recommended for locoregionally advanced nasopharyngeal carcinoma.

Conventional work-ups, including head and neck magnetic resonance imaging (MRI), chest X-ray or computed tomography, abdominal sonography or computed tomography, and bone scans, are recommended for tumor, node and metastasis (TNM) staging. For patients with bilateral enlarged lymph nodes or palpable lymph nodes below the cricoid cartilage, [18F]-Fluorodeoxyglucose (FDG) positron emission tomography and computed tomography (PET/CT) is highly recommended because of the high risk of occult distant metastasis [2, 3]. For N0-1 patients with Epstein-Barr virus (EBV) DNA less than 4000 copies/mL, a prior study [3] insisted on a low risk of distant metastasis and the comparable value of conventional work-up versus PET/CT for initial staging and finally did not recommend PET/CT due to its cost. Additionally, a recent study confirmed that there was no survival benefit of adding PET/CT to conventional work-up for stage I–II nasopharyngeal carcinoma [4]. We suppose that precise detection of metastatic cervical lymph nodes and correct N-stage perhaps more affects the prognosis of these patients, instead of focusing on the value of detecting occult distant metastasis. Stage T3 nasopharyngeal carcinoma without distant metastasis is the most typical representation. For example, patients with T3N0M0 can achieve comparable overall survival by intensity-modulated radiotherapy alone as patients with stage II disease [5], whereas the risk of distant metastasis is as high as 18% at 3 years after radical chemoradiotherapy for T3N2-3M0 nasopharyngeal carcinoma, and induction chemotherapy followed by concurrent chemoradiotherapy is strongly recommended for these patients [6,7,8]. Therefore, we aimed to determine whether PET/CT can influence the prognosis of nasopharyngeal carcinoma by providing an accurate diagnosis of metastatic lymph nodes.

Currently, the optimal treatment mode for the subgroup of T3N1M0 remains the most controversial [9, 10]. T3N1M0 patients have locoregionally advanced disease, but clinical trials that justified the benefit of induction chemotherapy [8] did not include this type of patient. A retrospective study found no survival benefit of additional induction chemotherapy for these patients [9], whereas male patients staged with T3N1M0 and EBV DNA higher than 2000 copies/mL were the target population, as suggested by another study [10]. Although EBV DNA showed prognostic value, it made little sense in clinical practice due to the lack of a unified test standard, robust serum level, and accepted cut-off value. Considering the generality of PET/CT and MRI in different hospitals, we aimed to identify an approach to individualized treatment by developing a radiologic score in a cohort of T3N1M0 nasopharyngeal carcinomas.

Methods

Patients and study design

Cohort A included 460 cervical lymph nodes from 336 patients who underwent node fine-needle aspiration biopsy guided by ultrasonography and PET/CT and MRI examinations before treatment to test the performance of PET/CT in diagnosing metastatic lymph nodes. Cohort B consisted of 1093 T3N1M0 nasopharyngeal carcinoma patients who received both PET/CT and head and neck MRI, while cohort C included 1377 T3N1M0 patients who underwent MRI alone. Cohort B and cohort C were compared to find the survival benefit of adding PET/CT to MRI. Specifically, 838 patients in cohort B who received concurrent chemoradiotherapy with or without induction chemotherapy were identified as cohort D to analyze the benefit of induction chemotherapy. The flowchart is presented in Supplementary Fig. 1. All patients were restaged based on the 8th edition American Joint Committee on Cancer/Union for International Cancer Control (AJCC/UICC) staging system. This study was approved by the Sun Yat-sen University Cancer Center Institutional Review Board (No. B2021-059–01).

Imaging analysis

The PET/CT and MRI protocols are described in the Supplementary Methods. MRI images were read by two experienced radiologists, and PET/CT was read by two experienced nuclear physicians who were blinded to the MRI results. Any differences were resolved by consensus. Metastatic lymph nodes were diagnosed according to the radiologic criteria [11]: (1) retropharyngeal lymph nodes with a minimal axial diameter of 5 mm or greater and cervical lymph nodes with a minimal axial diameter of 10 mm or greater; (2) minimal axial diameter of 8 mm for clusters of 3 or more lymph nodes; and (3) lymph nodes with necrosis or extranodal extension. Similar to a previous study [12], radiologic extranodal extension was categorized into 4 grades: grade 0, no extranodal extension; grade 1, invasion to surrounding fat; grade 2, coalescent nodes; grade 3, infiltrating adjacent structures. As reported [13], the diagnostic criteria for nodal necrosis based on MRI included (1) focal area of low signal intensity on T1-weighted images with or without enhanced edges and (2) focal area of high signal intensity on T2-weighted images. On PET/CT, lymph nodes were considered positive when [18F]-FDG uptake increased significantly compared with the background [14]. In the final decision-making, the PET/CT results were supplemented with the MRI findings (Fig. 1).

Fig. 1
figure 1

Cervical lymph nodes in PET/CT (left) and contrast-enhanced T1-weighted MRI (right), a PET/CT correctly diagnosed positive lymph nodes, while MRI diagnosed negative lymph nodes by mistake (b); both PET/CT (c) and MRI (d) correctly detected metastatic lymph nodes. Abbreviations: MRI, magnetic resonance imaging; PET/CT, [18F]-fluorodeoxyglucose positron emission tomography with computed tomography

Treatment and follow-up

All patients received intensity-modulated radiotherapy. The prescribed doses were 66–72 Gy to gross tumor and lymph nodes. The treatment modality included concurrent chemoradiotherapy with or without induction chemotherapy, radiotherapy alone, and induction chemotherapy plus radiotherapy. Induction chemotherapy regimens included docetaxel plus cisplatin and 5-fluorouracil, cisplatin plus 5-fluorouracil, docetaxel plus cisplatin, and gemcitabine plus cisplatin every 3 weeks for two to four cycles. For concurrent chemotherapy, weekly cisplatin or a 3-weekly cisplatin regimen was administered. After treatment, follow-up examinations were conducted at least every 3 months during the first 2 years and then every 6 to 12 months thereafter. Examinations including EBV DNA testing, nasopharyngoscopy, head and neck MRI, chest X-ray or computed tomography, and abdominal sonography or computed tomography were performed regularly. PET/CT was recommended if distant metastasis or locoregional recurrence was still uncertain by regular examination during the follow-up period. Recurrence or metastasis was confirmed by biopsy if possible.

Statistical analysis

Failure-free survival (FFS, time from diagnosis to treatment failure or death) was defined as the primary endpoint, and the secondary endpoints were distant metastasis-free survival (DMFS, time from diagnosis to distant metastasis or death), locoregional relapse-free survival (LRRFS, time from diagnosis to locoregional recurrence or death), and overall survival (OS, time from diagnosis to death).

Categorical variables were compared by the chi-square test. The cut-off values of the continuous variables were determined by receiver operating characteristic curve (ROC) analysis. In cohort A, McNemar’s paired-sample test or chi-square test was used to compare PET/CT and MRI in terms of sensitivity, specificity, positive predictive value, and negative predictive value. In cohort B, Kaplan–Meier survival curves stratified by MRI-based N stage and PET/CT-based T stage were evaluated by the log-rank test. The propensity score matching (PSM) method was applied to balance confounders between cohort B and cohort C at a ratio of 1:1. To confirm the benefit of PET/CT in N0-1 patients with EBV DNA less than 4000 copies/mL, the cut-off value of EBV DNA was 4000 copies/mL in cohort B and cohort C [3]. The survival rates of MRI and PET/CT plus MRI were compared by the Kaplan–Meier method. Univariate and multivariate Cox regression analyses in the PSM cohort were performed to determine independent factors. In cohort D, the PSM method was also performed to balance confounders between patients receiving concurrent chemotherapy with or without induction chemotherapy at a ratio of 1:1. The cut-off value was 2000 copies/mL in cohort D to compare with a model reported in a previous study [10]. Univariate and multivariate Cox regression analyses in PSM cohort D were conducted to select independent factors for risk stratification. The survival curves of the patients receiving concurrent chemotherapy with or without induction chemotherapy were also compared. The model was evaluated by the concordance index (C-index) and compared with a previous model [10] by computing the p value using the “survcomp” package in R. Statistical analysis was conducted by R 4.0.1 (http://www.r-project.org/) and SPSS 26.0. A two-sided p < 0.05 was considered to be statistically significant.

Results

Advantage of PET/CT vs. MRI in diagnosing cervical lymph nodes

In cohort A, 58.0% of patients (195/336) were diagnosed with stage III, and patients staged N1 accounted for 77.1% (259/336) (Supplementary Table 1). Among the 460 biopsied cervical lymph nodes from 336 patients, 269 (58.5%) and 191 (41.5%) lymph nodes were pathologically positive and negative, respectively. Among them, 96.7% (260/269) of positive and 75.9% (145/191) of negative lymph nodes were correctly detected by PET/CT, while only 88.5% (238/269) of positive and 70.7% (135/191) of negative lymph nodes were correctly diagnosed by MRI. PET/CT was significantly more sensitive than MRI for detecting cervical lymph node metastasis (p < 0.001). Regarding specificity, no significant difference was observed between the two imaging methods (75.9% vs. 70.7%, p = 0.174). The negative predictive value, positive predictive value, and accuracy of PET/CT and MRI were 94.2% vs. 81.3%, 85.0% vs. 81.0%, and 88.0% vs. 81.1%, respectively (Table 1). The area under the curve (AUC) of PET/CT was higher than that of MRI (0.863 vs. 0.796, p < 0.05). Notably, 14.4% (66/460) of lymph nodes had discrepancies between the two imaging tests. Among them, PET/CT showed true positives in 22 lymph nodes that were mistakenly diagnosed as negative lymph nodes by MRI and true negatives in 27 lymph nodes misdiagnosed as positive lymph nodes by MRI. Nonetheless, 17 lymph nodes were wrongly diagnosed as positive lymph nodes by PET/CT according to histopathology.

Table 1 Performance of PET/CT versus MRI for diagnosing 460 biopsied cervical lymph nodes of 336 patients

To evaluate whether the staging is altered by PET/CT or MRI and subsequently affects the prognosis, cohort B included 1093 patients who received pretreatment PET/CT and MRI. Their median age was 45 (range, 12–79) years, men accounted for 71.9%, and 819 (74.9%) patients had EBV DNA less than 4000 copies/mL (Table 2). With a median follow-up time of 50 (range, 1–118) months, 48 (4.4%) patients died, 142 (13.9%) patients suffered from treatment failure, and 62 (5.7%) patients had distant metastasis. The 5-year OS, FFS, DMFS, and LRRFS rates were 96.0%, 85.6%, 93.6%, and 92.9%, respectively.

Table 2 Baseline characteristics of patients with T3N1M0 in the primary cohort B, cohort C, and PSM cohort

Based on PET/CT alone, 664 of 1093 patients in cohort B were staged as T3N1M0. All of the patients were consistently staged with T3 by MRI, whereas 3.2% (20/664), 83.1% (526/664), 13.4% (85/664), and 5.2% (33/664) of patients were diagnosed with N0, N1, N2, and N3 by MRI alone. However, remarkably, no significant differences in OS, FFS, LRRFS, or DMFS were observed among these N0, N1, N2, and N3 patients staged by MRI (p = 0.68, p = 0.68, p = 0.61, and p = 0.96, respectively; Supplementary Fig. 2).

Based on the MRI criteria alone, 599 of 1093 patients in cohort B were diagnosed with T3N1M0. All of these patients were also staged as N1 by PET/CT. Nonetheless, 12.2% (73/599) and 87.8% (526/599) of patients were classified as T2 and T3 retrospectively by PET/CT alone. The survival rates were not significantly different between T2 and T3 patients staged by PET/CT (p = 0.72 for OS, p = 0.85 for FFS, p = 0.93 for LRRFS, and p = 0.65 for DMFS; Supplementary Fig. 3).

Prolonged survival rates of patients staged by PET/CT vs. MRI

To determine whether the advantage of PET/CT in diagnosis can contribute to survival differences, cohort C, in which patients underwent MRI only, was compared with cohort B. As shown in Table 2, the baseline characteristics between PET/CT plus MRI and MRI alone were compared. However, the results showed that there was an imbalance in age, lymph node location, and albumin between the two groups (p = 0.021, p = 0.014, and p < 0.001, respectively). After PSM at a ratio of 1:1, no imbalanced variables were observed between the two groups. Of 1908 patients who were included in the PSM cohort, 485 (25.4%) patients had EBV DNA greater than 4000 copies/mL. With a median follow-up period of 52 (1–151) months, 132 (6.9%) patients died, 165 (6.5%) patients developed distant metastasis, 211 (11.1%) patients suffered from locoregional relapse, and 93 (4.9%) patients had nodal recurrences. The 5-year OS, FFS, DMFS, and LRRFS rates were 93.0%, 78.6%, 90.9%, and 87.2%, respectively.

In the survival analysis, patients who underwent both PET/CT and MRI had better OS than those who underwent MRI alone (5-year OS, 95.7% vs. 90.4%, p < 0.001). In terms of FFS, DMFS, and LRRFS, patients receiving both PET/CT and MRI also had better outcomes than those receiving MRI alone (5-year FFS, 85.7% vs. 71.7%, p < 0.001; 5-year DMFS, 93.9% vs. 87.9%, p < 0.001; and 5-year LRRFS, 93% vs. 81.4%, p < 0.001; Fig. 2). The univariate analysis is presented in Supplementary Table 2. As shown in Supplementary Table 3, multivariate analysis indicated that the application of PET/CT was an independent favorable prognostic factor for OS (hazard ratio [HR] = 0.5, 95% confidence interval [CI] 0.3–0.7, p < 0.001), FFS (HR = 0.5, 95% CI 0.4–0.6, p < 0.001), DMFS (HR = 0.4, 95% CI 0.3–0.6, p < 0.001), and LRRFS (HR = 0.4, 95% CI 0.3–0.5, p < 0.001).

Fig. 2
figure 2

Survival curves comparing PET/CT + MRI with MRI alone in the PSM cohort: OS (a), FFS (b), LRRFS (c), and DMFS (d). Abbreviations: DMFS, distant metastasis-free survival; FFS, failure-free survival; LRRFS, locoregional relapse-free survival; MRI, magnetic resonance imaging; OS, overall survival; PET/CT, [18F]-fluorodeoxyglucose positron emission tomography with computed tomography; PSM, propensity scoring matching

Subgroup analysis was conducted for patients with EBV DNA less than 4000 copies/mL in the PSM cohort. In this subgroup, 1423 patients were eligible, among which 712 patients received both PET/CT and MRI. As shown in Supplementary Fig. 4, patients undergoing both PET/CT and MRI had a survival benefit for OS (5-year OS, 96.5% vs. 91.4%, p = 0.0012), FFS (5-year FFS, 86.1% vs. 75.4%, p < 0.001), DMFS (5-year DMFS, 93.7% vs. 90.9%, p < 0.001), and LRRFS (5-year LRRFS, 92.9% vs. 84.2%, p = 0.004) compared with those undergoing MRI alone. The univariate analysis is detailed in Supplementary Table 4. In the multivariable analysis (see Supplementary Table 5), the addition of PET/CT was also an independent factor for OS (HR = 0.5, 95% CI 0.3–0.8, p = 0.001), FFS (HR = 0.5, 95% CI 0.4–0.7, p < 0.001), DMFS (HR = 0.6, 95% CI 0.4–0.8, p = 0.004), and LRRFS (HR = 0.5, 95% CI 0.3–0.6, p < 0.001).

Guiding individualized induction chemotherapy

In cohort B, 838 patients who received concurrent chemoradiotherapy with or without induction chemotherapy were selected for cohort D. However, there were significant differences in baseline characteristics between the two treatment modalities (see Table 3). After PSM at a 1:1 ratio, 698 patients were included in this now well-balanced cohort. In PSM cohort D, the median age was 46 years old (range, 13–73), 132 (18.9%) patients had lymph node necrosis, and patients with grade 0, 1, 2, and 3 radiologic extranodal extension accounted for 48.1% (336/698), 18.8% (131/698), 20.9% (146/698), and 12.2% (85/698), respectively. With a median follow-up period of 50 (1–118) months, 117, 50, 45, and 35 patients had treatment failure, locoregional relapse, distant metastasis, and death, respectively. The 5-year FFS, LRRFS, DMFS, and OS rates were 82.0%, 92.6%, 92.8%, and 95.0%, respectively. Interestingly, univariate analysis (Supplementary Table 6) and multivariable analysis indicated that an SUVmax-N higher than 9.35, together with nodal necrosis and extranodal extension infiltrating adjacent structures, had prognostic significance for FFS (p < 0.001, p = 0.002, and p = 0.002, respectively, Supplementary Table 7). The radiologic score was thus developed based on the number of the three factors. Patients with higher radiologic scores had lower FFS (p < 0.001, Supplementary Fig. 5). Thus, patients with one or more risk factors were classified into the high-risk group (radiologic score > 0, n = 454), while patients with no risk factors were stratified into the low-risk group (radiologic score = 0, n = 244). The survival curves showed that patients in the high-risk group had lower FFS, DMFS, LRRFS, and OS than those in the lower-risk group (all p < 0.05, Supplementary Fig. 6). The radiologic score model had a higher C-index than the model with sex and EBV DNA (0.72 [95% CI: 0.65–0.78] vs. 0.56 [95% CI: 0.49–0.63], p < 0.001).

Table 3 Baseline characteristics of patients in the primary cohort D and PSM cohort D

For all of the patients, induction chemotherapy showed no survival benefit (p = 0.78, Fig. 3). However, in the high-risk group stratified by radiologic score, patients receiving induction chemotherapy plus concurrent chemoradiotherapy had a higher 5-year FFS than those receiving concurrent chemoradiotherapy alone (82.2% vs. 71.5%; p = 0.006, Fig. 3). After adjusting for covariates, multivariate analysis also confirmed that the addition of induction chemotherapy was an independent risk factor for FFS (HR: 0.5, 95% CI: 0.4–0.8, p = 0.003; Supplementary Table 8 and Supplementary Table 9). In contrast, no survival difference was observed between the two treatment modes in the low-risk group (p = 0.074, Fig. 3). The same conclusion was also reached for DMFS and RRFS. The detailed results for DMFS, LRRFS, and OS are shown in the supplementary Tables and Figures.

Fig. 3
figure 3

Kaplan–Meier FFS curves comparing IC + CCRT and CCRT alone in the whole PSM cohort D (a), low-risk group (b), and high-risk group (c) and DMFS curves in the whole PSM cohort D (d), low-risk group (e), and high-risk group (f). Abbreviations: CCRT, concurrent chemoradiotherapy; DMFS, distant metastasis-free survival; FFS, failure-free survival; IC, induction chemotherapy; PSM, propensity scoring matching

Discussion

In this large cohort study, PET/CT was proven to be more accurate than MRI for diagnosing cervical lymph nodes as confirmed by histopathology. Accordingly, as accurate N staging more precisely revealed the true prognosis of patients, PET/CT plus MRI could identify T3N1M0 patients who had better survival outcomes than MRI-staged T3N1M0 patients, even if their EBV DNA was less than 4000 copies/mL. PET/CT-based SUVmax of lymph nodes together with nodal necrosis and extranodal extension involving adjacent structures could be used to build a radiologic score model and identify high-risk T3N1M0 patients who can benefit from the addition of induction chemotherapy.

In fact, this is not the first report of the advantage of PET/CT over MRI for diagnosing lymph nodes in patients with nasopharyngeal carcinoma. However, different from previous studies [4, 15], the 460 lymph nodes included in our study were pathologically confirmed instead of by clinical follow-up. Certainly, the mistakes and biased of mapping biopsied lymph nodes on PET/CT and MRI images cannot be absolutely avoided, although we included solitary lymph nodes, the largest lymph node, and other nodes that could be definitely located without any uncertainty according to the ultrasonic reports and graphs with detailed characteristics of the biopsied node, such as the exact size, level, extranodal extension, and distance from the skin, muscles, vein, artery, or other landmark structures. In addition, the findings of PET/CT being superior to MRI were also consistent with the results of studies in head and neck cancer [16]. Although the sensitivity of PET/CT we found in nasopharyngeal carcinoma (96.7%) was slightly higher than that of head and neck cancer (90.0%), the specificity of PET/CT was only 75.9% herein, lower than that of head and neck cancer (94.0%) [17] but still better than MRI (70.8%). Perhaps a new deep learning algorithm might be a good assistant to further improve its diagnostic performance. Notably, 18.6% (118/633) of PET/CT-diagnosed T3N1M0 cases were upstaged to T3N2-3M0 by MRI, while only 3.2% (20/633) of patients were downstaged to T3N0M0 by MRI, without any discrepancy in T3 staging. This also indicated the strong potential of overdiagnosis by MRI. The similar survival rates (Supplementary Fig. 2) across the misdiagnosed T3N0-3M0 patients by MRI but staged T3N1M0 by PET/CT again supported the higher possibility of getting closer to the true prognosis if staged by PET/CT in terms of treatment outcomes. Certainly, MRI-staged T3N1M0 patients were divided into T2N1M0 and T3N1M0 by PET/CT, but no survival differences were observed between T2 and T3; as no gold standard to confirm the T stage of either type of examination equipment was used, we failed to draw a firm conclusion for PET/CT versus MRI for the T stage of nasopharyngeal carcinoma. Reviewing previous studies [15, 18], MRI seemed to be more accurate than PET/CT for diagnosing the involvement of local structures. Therefore, a combination of PET/CT and MRI may be the best recommendation for diagnosing and staging treatment-naïve nasopharyngeal carcinoma.

To test whether the diagnostic advantages of PET/CT can benefit treatment outcomes, we directly compared two cohorts of T3N1M0 patients staged by PET/CT plus MRI or MRI alone. The significantly higher survival rates of patients with PET/CT plus MRI, regardless of the EBV DNA load, supported our supposition. Obviously, PET/CT as an examination test cannot directly alter the final treatment outcomes by itself, but several prospective studies reported that adding PET/CT to conventional work-ups could provide additional information and may change management approaches in 15.7–33.8% of head and neck cancer patients [16, 19, 20]. Given the retrospective design of our study, the magnitude of actual changes in treatment choices before and after the application of PET/CT in these patients was not available. Although PET/CT is recommended at primary staging for patients with initially unknown primary cancer, enlarged lymph nodes at multiple or lower levels, locoregionally advanced cancer, or ambiguous lymph nodes or metastases by conventional imaging [3, 21], patient requests and doctor preferences should also be considered in clinical practice. Certainly, the potential bias related to financial constraints is hard to eliminate in retrospective studies, even in randomized clinical trials. Patients with good financial support tend to receive more advanced chemotherapy regimens, better supportive treatment, and more regular surveillance. On the other hand, we aimed to investigate whether PET/CT could predict the survival rate of patients and accordingly identify high-risk patients to receive more intensive treatment. In a cohort of PET/CT- and MRI-staged T3N1M0 patients, induction chemotherapy showed no survival benefit for all patients, as observed in prior studies [9]. Previously reported EBV DNA load- and sex-guided risk stratification [10] did not work here, possibly because of its poor specificity and generalization. In contrast, SUVmax-N, nodal necrosis, and extranodal extension involved with adjacent structures remained highly prognostic. In fact, these were previously reported [121322], which indicated the potential for good generalization.

The radiologic score model based on the three characteristics showed a significantly (p < 0.001) higher C-index (0.72) than the model based on EBV DNA and sex [10] (C-index = 0.56) in the risk stratification. In addition, the radiologic score model selected high-risk patients who could benefit from the addition of induction chemotherapy. As shown in our study, the 5-year FFS rate of the high-risk T3N1M0 patients was similar to those of more advanced patients included in clinical trials of induction chemotherapy [23]. Thus, it is not unreasonable that the high-risk T3N1M0 patients had significantly improved survival outcomes when receiving induction chemotherapy followed by concurrent chemoradiotherapy.

Based on 460 biopsied cervical lymph nodes, we firmly concluded there is an advantage of PET/CT for diagnosing lymph nodes. In a cohort of T3N1M0 patients, the interference of covariate factors including nodal size, nodal level, nodal laterality, and T stage was completely excluded; we proved the survival benefit was due to the accurate diagnosis by PET/CT and found the way to guide individualized induction chemotherapy for T3N1M0 patients by applying a radiologic score model based on PET/CT and MRI.