Introduction

Head and neck squamous cell carcinoma (HNSCC) is the sixth most common malignancy accounting for approximately 6% of all cancers and for an estimated 1–2% of all cancer-related deaths [1]. Radiotherapy (RT), alone or combined with chemotherapy (ChemoRT), is the mainstay non-surgical treatment despite often being hampered by a non-negligible rate of radiation-induced dysphagia (RID) [2]. A recent study [3] showed RID as having the highest priority among a cluster of quality of life (QoL) RT-related symptoms in a heterogeneous HNSCC population, as also reported by several previous studies [4,5,6]. For most people, the possibility to enjoy eating helps to achieve a good QoL, whereas labored deglutition, prolonged eating times, and the limited range of foods that can be swallowed can lead to disruption of relationships and social isolation [7].

The scientific community recognizes that clinicians underestimate the consequences of RID and now encourages the use of patient-reported outcomes (PROs) questionnaires [8]. In this regard, the M.D. Anderson Dysphagia Inventory (MDADI) is the most widely recognized PRO metric to assess deglutition-related quality of life (QoL), properly aimed at investigating patients’ perception of deglutition disorders and its related QoL [9].

To date, several cross-sectional studies have reported MDADI outcomes in a heterogeneous HNSCC survivor population but only few studies have investigated the longitudinal changes over time and they have sometimes not included a pre-treatment evaluation [10,11,12,13,14,15]. Indeed, the latter is of primary importance to detect possible pre-treatment cancer-related dysphagia and to properly interpret post-treatment RT-related dysphagia. This often represents the sum of both factors, combined with the patient’s ability to compensate spontaneously or with targeted therapy [8].

On the other hand, the advancement of intensity-modulated radiotherapy (IMRT) technologies has shown promising results in terms of better deglutition outcomes [16]. The normal swallowing is a very complex process simultaneously involving multiple muscles, such as the base of tongue for the posterior propulsion of the bolus through the pharynx, the pharyngeal constrictors for the cranio-caudal peristalsis of the bolus, and the cricopharyngeal muscle whose opening promotes the passage of the bolus into the upper digestive tract. Also, the suprahyoid musculature contraction for the upward and forward motion of the hyolarynx complex, together with the glottic and supraglottic closure represents the main mechanism to protect the lower airways from possible inhalation. Therefore, if the reduction of radiation dose to parotids and several swallowing organs at risk (SWOARs-sparing IMRT) by generating highly conformal and steep gradient dose distributions around these organs might decrease the percentage of radiation-related deglutition disorders is yet to be elucidated [17].

In this regard, there is a small amount of published data showing a significant reduction of severe deglutition disorders when comparing SWOARs-sparing with standard parotid-sparing IMRT, although outcomes focused on diet texture and feeding tube placement alone [18]. This study aims to report longitudinal patient-reported swallowing outcomes for SWOARs-sparing Chemo-IMRT as part of a multicenter study.

Materials and Methods

Study Protocol

The study was approved by the Ethics Committee of the Coordinating Center (Protocol number 42380) and that of each participating center; all patients enrolled in the study accepted the protocol and signed a study-specific informed consent form. The protocol is registered in ClinicalTrials.gov (ID. NCT03448341).

Briefly, patients underwent evaluation by means of MDADI questionnaire at baseline and at 6 and 12 months after treatment.

Briefly, the eligibility criteria included all patients affected with nasopharynx or oropharynx cancers (Stage III–IVA according to 7th TNM Stage Edition), with histologically proven diagnosis of undifferentiated nasopharyngeal-type carcinoma or squamous cell carcinoma, Eastern Cooperative Oncology Group Performance Status (ECOG PS) 0–2, and age < 80 years. Exclusion criteria were as follows: a different tumor site from naso or oropharynx, a different histology from undifferentiated nasopharyngeal type or squamous cell carcinoma, ECOG Performance Status ≥ 3, Stage IVB and C, prior induction chemotherapy or HN treatment (surgery and/or RT), and diagnosis of comorbid conditions potentially compromising basic deglutition function (e.g., demyelinating or degenerative diseases; connective tissue diseases; history of excessive use of benzodiazepines or similar, gastroesophageal reflux resistant at medical therapy and age > 80 years). Patients who underwent salvage surgery at the primary tumor site and those who experienced primary recurrence or metastatic disease within the timeframe of the study were dropped out of the study. Differently, patients who underwent salvage surgery for nodal tumor persistence or recurrence after treatment were not excluded.

Radiotherapy

Details regarding RT treatment planning criteria are reported in the Electronic Supplementary Material File. Briefly, all patients required bilateral neck irradiation and underwent tridimensional image-guided RT (IGRT) and IMRT. The high- and low-risk clinical target volumes (HR-CTVs and LR-CTVs) were defined together with an optional intermediate risk (IR-CTVs), according to HNSG-AIRO guidelines [19]. Corresponding Planning Target Volumes (PTVs) were automatically created by uniform expansions of 0.3 cm. The prescribed radiation doses were 66 Gy, 60 Gy, and 54 Gy given over 30 daily fractions to HR-PTVs, IR-PTVs, and LR-PTVs, respectively. According to the computed tomography (CT)-based delineation guidelines of the swallowing organs at risk (SWOARs) by Christianen et al. [20], 8 different SWOARs (superior, medium, and inferior constrictor muscle; base of tongue; supraglottic and glottic larynx; cricopharyngeal muscle; cervical esophagus) were defined to optimize the IMRT plan for a dose reduction to these structures (SWOARs-sparing IMRT). One radiation oncologist for each participating center was identified for the delineation of SWOARs, assisted by a dedicated radiologist if necessary. The IMRT plans set target prescription goals and spinal cord and brainstem maximum dose (Dmax) as the highest priority, whereas SWOARs constraints were set as secondary.

Chemotherapy was given weekly using Cisplatin 40 mg/m2 for a maximum of 6 cycles or 100 mg/m2 every 3 weeks during the 6-week RT course for a maximum of 3 cycles.

Evaluation of Dysphagia

All the enrolled patients filled in the MDADI questionnaire at baseline and at 6 and 12 months after ChemoRT. Briefly, the MDADI consists of a self-reported questionnaire with 20 questions and 4 subscales about deglutition-related QoL: Global assessment, single-item MDADI-G; Functional, 5-item MDADI-F; Physical, 8-item MDADI-P; and Emotional, 6-item MDADI-E. Each question includes a 5-point response scale. A composite score (MDADI-C) based on 19 items (excluding the G-item) was applied to evaluate deglutition-specific QoL. All subscale MDADI scores (F, P, and E) and C scores were normalized to values ranging from 20 (extremely low functioning) to 100 (high functioning), with higher scores representing better QoL [9]. In addition, the validated Italian version of MDADI was used [21]. A MDADI score of at least 80 represented an “optimal” deglutition-related QoL, a score between 60 and 80 represented an “adequate” deglutition-related QoL, and a score less than 60 represented a “poor” deglutition-related QoL; this classification based on the MDADI score was used for the intent of our analysis as a post hoc stratification.

A 10-point difference in the MDADI composite score was considered a minimal clinically important difference (MCID) in deglutition-related QoL [22]. Only the patients who filled in all the time interval questionnaires (baseline, 6, and 12 months) were considered for the analysis.

Statistical Analysis

Categorical data were described by absolute and relative frequencies and continuous data were described by mean and standard deviation. To compare the MDADI score among the three temporal measures, ANOVA for repeated measures stratified for MDADI value at baseline (< 80; ≥ 80) was used and multiple comparisons were performed by the Bonferroni method. Significance was fixed at 0.05. All analyses were carried out by SPSS v.27 technology.

Results

Patients

Between August 2015 and May 2020, among a total of 103 eligible patients, 63 (61.2%) were enrolled from 6 participating centers. Among them, 10 dropped out of the study of which 6 discontinued control visits, 2 died due to tumor recurrence, and 2 died due to SARS Cov-2 infection within 5 months from the end of treatment. Of the 53 remaining evaluable patients, 6 missed to fill in all the questionnaires. Thus, a total of 47 patients were definitively considered for the present final analysis as we decided to consider only those who filled in all the time interval (baseline, 6, and 12 months) questionnaires (Fig. 1). Patients and tumor characteristics are shown in Table 1.

Fig. 1
figure 1

The CONSORT diagram of the study

Table 1 Patients and tumor characteristics

Average and range doses to the SPCM, MPCM, IPCM, BOT, SL, GL, CPM, and CE were 58.2 Gy (59.9–32.8 Gy), 55.45 Gy (70.1–31.5 Gy), 44 Gy (62.9–20.3 Gy), 58.6 Gy (70.5-35 Gy), 49.2 Gy (67.1–30 Gy), 39.6 Gy (68–9.6 Gy), 38.1 Gy (54.2–16.7 Gy), and 29 Gy (49.2–11.2 Gy).

Among the 42 oropharyngeal patients, 23 (55%) had tumor localized within the base of tongue, 16 (38%) within the tonsil, 2 (5%) within the soft palate, and 1 (2%) within the posterior pharyngeal wall. Also, 28/42 (67%) were N2 of which 2 (7%) were N2a, 19 (68%) were N2b, and 7 (25%) were N2c. A total of 23/47 (49%) patients underwent triweekly Cisplatin, whereas 24/47 (51%) underwent weekly Cisplatin. Overall, 33/47 (70%) patients completed the planned cycles of CT, whereas 14/47 (30%) did not. Then, 2/47 (4%) patients required feeding tube positioning before RT which was removed at 3 and 9 months after the end of treatment, respectively.

Among the 47 analyzed patients, 5 (10.5%) died of whom 3 (6%) due to tumor recurrence and 2 (4%) to non-cancer-related causes. Overall, 6/47 (13%) patients experienced regional recurrence of which only 1 (2%) died due to nodal recurrence (not amenable for salvage neck dissection), whereas the others underwent salvage neck surgery. Among these patients, only 1 developed concurrent lung metastases and was addressed to systemic treatment.

MDADI Score Changes

Among the 47 evaluable patients, 26 (55%) were classified as “optimal,” 14 (30%) as “adequate,” and 7 (15%) as “poor” according to the MDADI-C-score at baseline. 25 (53%) were classified as “optimal,” 17 (36%) as “adequate,” and 5 (11%) as “poor” for MDADI-C score at 6 months. Finally, 26 (55.3%) were classified as “optimal,” 18 (38.3%) as “adequate,” and 3 (6.4%) as “poor” for MDADI-C score at 12 months. The mean value and standard deviations (SD) of the MDADI-C-score for the entire patient population were 80.52 (SD 17.65) at baseline, 79.50 (SD 13.00) at 6 months, and 81.20 (SD 13.65) at 12 months, respectively (Table 2).

Table 2 MDADI composite score variations for the overall patient population

An overall statistically significant worsening of MDADI-C (p = 0.001) and all subscale scores (p < 0.05) were reported for the “optimal” group as well as a statistically significant improvement of MDADI-C (p = 0.009) and all subscale scores (p < 0.05) were reported for the “adequate/poor” group from before to after treatment. The overall MDADI-C and all subscale scores trajectory for both groups of patients are shown in Fig. 2.

Fig. 2
figure 2

M.D. Anderson Dysphagia Inventory (MDADI) scores trend for the “optimal” and “adequate/poor” group

In the “optimal” group, mean baseline MDADI-C scores were 93.60, dropping to 81.00 at 6 months (p = 0.001) and slightly rising to 85.50 at 12 months (p > 0.05) after treatment. On the contrary, in the “adequate/poor” group, the mean baseline MDADI-C score was 64.30 rising to 77.50 at 6 months (p = 0.006) and stabilizing at 76.00 (p > 0.05) after 12 months. An analogous trend was reported for all subscale scores for both the “optimal” and the “adequate/poor” groups. The MDADI-C score remained statistically significant (p = 0.013) but not clinically worsened (93.6 vs 85.5) in the “optimal” group comparing baseline scores to those recorded after ChemoRT. Indeed, the MDADI-C score was statistically significant (p = 0.033) and clinically improved (64.3 vs 76.0) in the “adequate/poor” group from before to after ChemoRT. Table 3 shows all other multiple comparisons between scores at different time points.

Table 3 Comparison of MDADI scores at different time points

MDADI-C Group Shifts

The trend of the MDADI-C group shifts for the “optimal,” “adequate,” and “poor” groups of patients at different times is shown in Fig. 3.

Fig. 3
figure 3

M.D. Anderson Dysphagia Inventory composite (MDADI-C) group shifts for the “optimal,” “adequate,” and poor” group at 6 and 12 months

Specifically, in the “optimal” group, 9/26 patients (35%) experienced a downshift of the C-score from baseline to 6 months. Of these, 6 (67%) decreased to “adequate” and 3 (33%) to “poor,” respectively. Again, in the “optimal” group, 7/26 patients (27%) experienced a downshift of the C score from baseline to 12 months of which 5 patients (71%) maintained “adequate” function, 1 (9.5%) worsened from “optimal” to “adequate,” and 1 (9.5%) maintained a “poor” function from 6 to 12 months.

Among the “adequate” group, 5/14 patients (35,7%) experienced an upshift to “optimal” and 1/14 patient (7%) a downshift to “poor” from baseline to 6 months for the C-score. Equally, 5/14 patients (35,7%) experienced an upshift to “optimal” and 1/14 patients (7%) a downshift to “poor” from baseline to 12 months for the C score. Of these 6 patients, 3 (50%) maintained an “optimal” function and 2 (33%) improved from “adequate” to “optimal” function from 6 to 12 months, whereas 1 patient (17%) downshifted from “adequate” to “poor” function from 6 to 12 months.

Among the “poor” group, 3/7 patients (43%) upshifted from “poor” to “optimal” function and 3/7 patients (43%) from “poor” to “adequate” function for C-score from baseline to 6 months. Indeed, 2/7 patients (28.5%) upshifted to “optimal” and 4/7 (57%) upshifted to “adequate” function for C-score from baseline to 12 months, respectively. Of these, 1 (14%) maintained an “optimal” function, 1 (14%) upshifted from “adequate” to “optimal,” 3 (43%) maintained an “adequate” function, and 1 (14%) downshifted from “optimal” to “adequate” from 6 to 12 months.

Association Between ∆MDADI Score and Base of Tongue Cancer

No significant association for any covariate was found for ∆MDADI-C both in the “optimal” and “adequate/poor” groups from baseline to 6 months.

Indeed, a significant association was found in the “optimal” group for ∆MDADI-C score from baseline to 12 months for patients affected by base of tongue cancer (p = 0.031). Later, in the “optimal” group the presence of base of tongue cancer showed a statistically significant association with ∆MDADI of all subscale scores, except for F-item, from baseline to 12 months, as shown in Table 4.

Table 4 Association between base of tongue cancer and MDADI score variations from baseline to 12 months after treatment

Discussion

In this paper, we report the primary results of a multicenter still ongoing prospective trial endorsed by the Head and Neck Study Group (HNSG) of the Italian Association of Radiotherapy and Oncology (AIRO), aimed at investigating patient-reported deglutition outcomes after a SWOARs-sparing-based ChemoRT. Specifically, we report data regarding a primary endpoint for which the pre-planned number of patients has been achieved.

Overall, our study seems to show successful deglutition outcomes reporting a mean MDADI-C score of 79 and 81 at 6 and 12 months after treatment, regardless of the deglutition-related QoL at baseline. Hence, most patients seem to maintain an acceptable deglutition-related QoL suffering at worst from moderate disabilities (considered “adequate”), although severe ones (“poor”) were reported in a low but non-negligible percentage of patients. In fact, an MDADI-C score less than 60 (“poor”) was reported in 11% and 6.4% of patients at 6 and 12 months, respectively. This probably means that, after recovering from acute RT-related side effects, only a few patients perceive severe deglutition disorders. Thus, at first glance, our results on the overall patient population suggest an optimal preservation of deglutition-related QoL, reporting similar MDADI-C values between baseline and after treatment scores.

Actually, at a post hoc analysis a completely opposite trend was observed for all MDADI scores for patients with normal ability (“optimal”) compared with those with a certain degree of disability (“adequate” and “poor”) at baseline. Interestingly, “optimal” patients underwent a statistically significant worsening of all MDADI scores at 6 months followed by a slight non-statistically significant improvement at 12 months, according to most of the literature data on this issue [23,24,25]. On the contrary, “adequate/poor” patients experienced a statistically significant improvement of their function at 6 months followed by a subsequent stabilization at 12 months. Therefore, our findings seem to reveal a predominantly beneficial impact of ChemoRT on “cancer-related dysphagia.” This is probably due to the downsizing of the primary tumor mass that apparently overcomes its sequelae on the deglutition structures (“treatment-related dysphagia”). To our knowledge, this is the first study specifically reporting a beneficial impact of ChemoRT in patients with a certain degree of pre-treatment deglutition disorders and, in our opinion, the IMRT plan optimization to reduce dose to the SWOARs could play a decisive role in balancing the subtle equilibrium of these two opposite RT-induced adverse consequences.

Notwithstanding the fascinating clinical implication of our findings, the lack of robustness in the categorization of MDADI scores for the 3 different groups (“optimal,” “adequate,” “poor”) represents the main weakness of our study. In fact, despite this classification has already been reported in the previous major experience by Goepfert et al. [13] in order to facilitate the interpretation of clinical results, a real validation of the anchor points for this classification is yet to be done by the current literature. Anyway, despite the above-mentioned limitation, we honestly think that our findings might have interesting implications in clinical practice and are worth to be considered by clinicians. Moreover, a further major concern of our study was a proper interpretation of the longitudinal ∆MDADI scores in the two groups of patients to avoid misleading reports between statistical significance and clinical meaningfulness variations at different time points. In the study by Hutcheson et al. [22], a 10-point difference was defined as MCID to consider when using the MDADI as an outcome measure in clinical studies. Despite this, as declared by the authors themselves, the methodology used was designed to identify clinically relevant differences “between groups” rather than “within subjects.” As such, a general consensus on MCID for longitudinal ∆MDADI scores is yet to be defined in the scientific community. Thus, we decided to consider a “cut-off” value of 10-point difference as it is the only reference in the literature so far. In this regard, in the “adequate/poor” group, an overall increase of the MDADI-C score greater than 10 points (64.3 vs 76) was reported from before to after treatment suggesting a clinical meaningfulness other than a statistically significant improvement of deglutition-related QoL after RT. Besides, only 14% of patients maintained a poor C-score from baseline to both 6 and 12 months, whereas as much as 43% and 29% experienced an upshift to optimal function at 6 and 12 months, respectively. Therefore, a clinically relevant beneficial impact of ChemoRT on “cancer-related dysphagia” seems to emerge from our study. On the contrary, in the “optimal” group, an overall decrease of the MDADI-C score less than 10 points (93.6 vs 85.5) was reported from before to after treatment suggesting a statistically but not clinically meaningful worsening of the deglutition-related QoL. Besides, as much as 65% and 83% of patients maintained an optimal function for C-score at 6 and 12 months, whereas only 11% and 4% experienced a downshift from “optimal” to “poor” function for C score at 6 and 12 months, respectively. However, a clinical validation of the MCID cut-off value for longitudinal ∆MDADI scores (pre- vs post-treatment) is warranted for a proper interpretation of studies using the MDADI-C score as a primary clinical endpoint. However, we do recommend caution in the interpretation of our results that are limited by the use of a PRO metric, such as MDADI questionnaire, to measure the impact of a function-preserving radiation technique, such as SWOARs-sparing IMRT. In fact, at the moment our primary findings must be considered only as an “indirect” measurement of deglutition outcome that would need to be complemented by objective instrumental measures by means of Fiberoptic Evaluation of Swallowing (FEES) and/or Videofluoroscopy (VFS), specifically aimed to assess the deglutition function.

To date, most important experiences on this issue [13, 25, 26] are concordant in reporting a significant worsening of MDADI-C score early after treatment (mostly at 3 months) followed by a slow improvement at 6 and 12 months without returning to the baseline values, indicating a gradual partial recovery of deglutition-related QoL. The comparison of these data with ours might be troubled due to the opposite trend of MDADI-C scores between the two groups (“optimal” vs “adequate/poor”) that cancel the impact of treatment showing no significant differences in MDADI scores from before to after treatment. Indeed, we can strongly claim that in our study, the trend reported in the most experiences has been observed in the “optimal” MDADI group. Noteworthy, a recent experience by Grant et al. [27] using the more advanced intensity-modulated proton therapy (IMPT) technique, that significantly reduces dose to surrounding normal tissue compared with photon-based radiation, showed a rapid recovery of deglutition-related QoL at 10 weeks after treatment followed by a further steady improvement through 2 years achieving similar values of MDADI-C and subscale scores to the baseline.

Last but not least, patients affected with base of tongue cancer having a baseline “optimal” function seem to be most likely to suffer from RID. We observed a statistically significant association between the presence of this tumor subsite and variations of all but one (F-score) MDADI scale and subscale scores from baseline to 12 months after RT. It is noteworthy that, in a recent mono-institutional experience [28], the base of tongue has been identified as one of the most crucial SWOARs whose damage has been ranked of highest priority in the occurrence of post-deglutition inhalation. In this regard, a possible explanation of our findings might be the reduction of the posterior propulsive driving force of the base of tongue, due to the pre-treatment tumor infiltration of its musculature as well as to post-treatment fibrotic radiation damage, causing patients to perceive deglutition disorders. According to the most current literature [29,30,31,32,33], to address these patients to a continuous and early rehabilitation before, during, and early after treatment, together with the effort to recover as much as possible total oral nutrition (hopefully with semisolid or solid consistencies) might help reduce this self-reported deglutition disorders.

Conclusion

Despite the lack of a MCID-validated cut-off for longitudinal ∆MDADI scores, our results seem to suggest a possible doubly clinical benefit of SWOARs-sparing IMRT both on the improvement of “Cancer-Related Dysphagia” as well as on the prevention of “Treatment-Related Dysphagia.” Of course, as PROs metrics such as MDADI questionnaire have shown to underestimate deglutition disorders compared to instrumental assessments [34], it will be interesting to correlate these findings with more objective measures, such as Fiberoptic Endoscopic Evaluation of Swallowing and Videofluoroscopy, that is within the additional endpoints of our study.