Advanced stage head and neck cancer (HNC) is known for generally unfavorable outcome with only ~ 40–50 % 3-year overall survival [1, 2, 3]. Clinical T4 stage includes a wide range of tumor volumes. The lack of further nonsurgical subgrouping of cT4 stage makes intercenter comparison of outcome results in irradiated cT4 patient cohorts difficult. The estimation of operability (cT4a versus cT4b) is sometimes quite dependent of a surgeon’s individual opinion and experience. In addition, the in- or exclusion of very advanced cT4 any NM0 into curatively aimed treatment regimens remains quite subjective.

The aim of this analysis was to further stratify cT4 stage squamous cell HNC disease using volumetric staging. This was performed with the help of a formerly prospectively tested and published volumetric scoring system [4, 5, 6, 7]. Using this scoring system, we previously demonstrated that volumetric staging was superior compared to the standard TN/AJCC systems regarding predictive power of disease control and survival of our irradiated cohorts.

Included in the presented analysis were all cT4 stage primary squamous cell cancer (SCC) HNC patients referred for definitive radiation.

Methods

Between January 2002 and January 2013, a total of 201 cT4 stage SCC HNC patients were referred to our department. All were treated with curative intent with modulated radiotherapy ± chemotherapy. All patients were retrospectively stratified using a prospectively evaluated volumetric staging system. T4 lymphoepithelial nasopharynx tumors (n = 13) and paranasal tumors (n = 8) were excluded. The used staging system is based on three cut-offs (15/70/130 ml, see also previous publications [4, 5, 6, 7]) to stratify the total gross tumor volumes (tGTV: primary and nodal tumor volume), allowing a subdivision of cT4 stages into 4 prognostic subgroups [1–15 ml (n = 15), 16–70 ml (n = 108), 71–130 ml (n = 62), > 130 ml (n = 16)]. Overall survival (OS), disease-free survival (DFS), locoregional control (LRC), and distant metastasis-free survival (DMFS) rates were calculated using Kaplan–Meier curves. Demographic data and tumor characteristics are listed in Tab. 1.

Tab. 1 Patient and tumor characteristics

All patients underwent modulated radiation therapy using simultaneously integrated boost techniques [SIB-IMRT/SIB-volumetric modulated arc therapy (SIB-VMAT)]. In 84 %, concomitant cisplatin chemotherapy (40 mg/m2/radiation week) or cetuximab (loading dose 400 mg/m2, followed by concomitant doses of 2250 mg/m2/radiation week) was administered. In 36 patients with very advanced disease of questionably curable stage, TPF (docetaxel, cisplatin, 5-fluorouracil)-based induction chemotherapy was given as a decision aid to add or not curatively intended radiation. The remaining 16 % of patients were treated with radiation only because of age or substantial comorbidity.

All GTVs were contoured or reviewed by at least one of the authors on all relevant axial computerized images without using interpolation; in most cases the contouring was also reviewed by a third staff physician. In addition, the wide volumetric ranges (cut-offs 15/70/130 ml) render the system quite robust with respect to interindividual contouring differences. Volumetric three-dimensional measurements (cm3) of contoured structures were calculated by the Varian Treatment Planning System volume algorithm (Eclipse® External Beam Planning System, Version 7.3.10 and PRO 8.9, AAA 8.9, Varian Medical Systems). A detailed description of the applied SIB modulated techniques and contouring of gross tumor volume (GTV) and planning target volumes (PTVs) has formerly been published [7]. In several patients with very large GTVs, dose compromises were performed delivering 66–68 Gy to the boost volume, while the 70 Gy dose volume was limited to the GTV.

Statistical analysis

Statistical calculations were performed using the statistics program implemented in StatView® (version 4.5; SAS Institute, Cary, NC, USA). Univariate analyses were performed with a Cox proportional hazards regression model in StatView®. Actuarial survival data were calculated using Kaplan–Meier curves and log-rank tests implemented in StatView®. P values < 0.05 were considered statistically significant.

Results

Outcome prediction by volumetric scoring

Between January 2002 and January 2013, a total of 201 cT4 stage SCC HNC patients were curatively treated at our department. The mean/median follow-up was 31/23 months (range 1–116 months). In all, 67 % of all patients were alive at last follow-up, and 49 % had no signs of disease. Of the 33 % of patients who had died, 24 % died due to disease-related reasons. The 3-year OS, DFS, LRC, and DMFS rates of the entire cohort were 63, 44, 48, and 77 %, respectively.

Volumetric staging revealed its potential to prognostically statistically significantly divide the cT4 cohort into 4 volume subgroups (V1/2/3/4): OS: 90 %/72 %/58 %/18 %; DFS: 83 %/50 %/39 %/10 %; LRC: 81 %/53 %/47 %/15 %; DMFS: 93 %/90 %/70 %/41 %, all p < 0.0001, (Tab. 2, Fig. 1).

Tab. 2 Outcome according to volume subgroups (V1-4, using cut-off values of 15/70/130 ml)
Fig. 1
figure 1

Overall survival rates of 201 cT4 patients staged by total tumor volume (ml)

Additional parameters with potential impact on disease control and OAS

The following parameters were tested in univariate analysis:

  • histopathological grading (grade 2 versus 3, no grade 1 cases), not significant,

  • age (>/< 70 years), not significant,

  • cT4a versus cT4b: in 63 % of the cases this differentiation was not indicated; most of the remaining cases were scored as cT4a (therefore statistically not evaluable),

  • nodal status (cN0 vs N1 vs N2a vs N2b vs N2c vs N3; cN0 vs N1–2b vs N2c vs N3; cN0 vs cN1–2 vs cN3), not significant,

  • systemic therapy: as the sample sizes of the subgroup with versus without systemic therapy was unbalanced (84 % vs 16 %—not the same patients with respect to substantial comorbidity and age), and systemic therapy was not homogeneous, no reliable information can be drawn from this analysis, which, however, showed a significant difference in favor of the combined modality subgroup (p = 0.2; OS 65 % vs 50 % at 3 years).

Treatment tolerance

With respect to treatment tolerance, the following findings in 117 locoregionally controlled patients were stated as based on the last clinical visit: 16/117 patients experienced any late term grade 3/4 side effects (LENT-SOMA, 14 %). Only 6/16 patients (38 %; 3 % of all patients) suffered from persistent late term sequelae (1 × xerostomia G3, 1 × loss of taste G3, 1 × chondronecrosis, 1 × dysphagia G3, 2 × feeding tube dependence).

Discussion

Aim of this work was to assess the potential of volumetric stratification of our cT4 SCC HNC cohort into different prognostic subgroups. We found volumetric stratification highly statistically significant in predicting outcome for different volume subgroups in the assessed cT4 HNC cohort. The volumetric system itself is considered robust with respect to interobserver GTV contouring, as its cut offs values differ markedly (15 ml/70 ml/130 ml) [4, 7]. The potential benefit of the assessed stratification lays in its more precise prediction of disease control in irradiated cT4 patient cohorts, and therefore more accurate characterization of cT4 cohorts for intercenter comparison purposes.

A weakness of this study is its retrospective stratification approach, which however applied a prospectively tested staging system [4, 5, 6, 7]. In addition, the assessed cohort includes different unbalanced tumor sites as well as unbalanced volume subgroups (Tab. 1).

To our knowledge there are no similar comparable volumetric staging analyses published. Most published volumetric focused outcome analyses were based on dichotomizing the GTV (i.e., using just one cut-off), (Tab. 3, [4, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35]). Four [17, 18, 20, 25] of the 31 listed reports were based on two or three cut-off values, our own system included. All but two analyses showed significant difference in outcome between larger vs smaller tumor volumes. Been et al. [34] failed to demonstrate statistical significance between pGTV and locoregional outcome, perhaps due to not considering the nodal tumor volume which may significantly impact locoregional outcome. Mendenhall et al. [8] found no outcome difference in tumors of the hypopharynx/base of tongue/posterior tonsillar pillar when using a cut off value of 6 ml. This cut-off may have been too low.

Tab. 3 Literature on head and neck cancer (HNC) outcome prediction based on volumetric classifications

The data presented here are derived from a cohort treated with IMRT techniques, with previous careful staging (in most cases using PET-CT) [36, 37].

Conclusion

Volumetric staging was shown to allow for highly statistically significantly stratification of cT4 stage SCC HNC into different prognostic subgroups, offering the option of better comparability of irradiated advanced stage HNC cohorts.