Background

Swallowing is a complex series of events that involves the seamless coordination of multiple muscles and nerves to bring food from the oral cavity to the stomach. Swallowing dysfunction is a known long-term toxicity of post-operative radiotherapy (RT), definitive RT, and chemoradiation therapy (CRT) used in the treatment of H&N cancer. Swallowing dysfunction has a major impact on the quality of life experienced by H&N cancer survivors and requires close examination [1, 2].

Treatment-related swallowing dysfunction results from damage to structures involved in different phases of the swallowing cycle. These include the salivary glands, tongue and oropharyngeal muscles, laryngeal-hyoid range of motion, and the upper esophagus [3]. Proper laryngeal anatomy and movement is particularly crucial to normal swallowing physiology and prevention of aspiration. Toxicity is related to RT factors such as total dose, fraction size, radiated volume, inter-fraction time interval, treatment technique, and tissue-dose compensation [4,5,6,7]. It is also impacted by the addition of chemotherapy, tumor site and stage, patient age, as well as tobacco use [8,9,10,11].

A dose-volume relationship has been shown to exist between various H&N organs and risk of swallowing dysfunction. These include the larynx, pharyngeal constrictors, and upper esophagus. Various studies have shown that dose reduction to these organs-at-risk (OARs) can help decrease the risk of long-term swallowing dysfunction [12,13,14,15,16,17,18,19,20,21]. In recent years, several contouring guidelines have been released in order to facilitate standard and accurate delineation of laryngeal OARs [22, 23].

Overall, more information is needed to determine which laryngeal substructures, when irradiated, are most associated with swallowing dysfunction. Given this, we designed a prospective study to examine the dose-volume relationship between 12 larynx substructures and long-term aspiration risk. This assessment required accurate knowledge of each patient’s baseline swallowing function and then consistent, interval post-RT follow-up to monitor clinical progress or decline over time.

Methods

Eligibility & Data Collection

Forty-nine patients with stage III/IV loco-regionally advanced H&N squamous cell cancer were prospectively enrolled in this IRB-approved, federally funded study from 2006 to 2011. All patients received intensity modulated radiotherapy (IMRT)-based CRT.

Patients were scheduled for videofluorography (VFG) prior to CRT and at 3, 6, 9, 12, and 24 months post-CRT. During each study the patient received an oral barium bolus, and a total of 22 temporal components of swallowing function [24] were evaluated. The VFG studies were conducted in the lateral plane according to the procedure outlined by Logemann [25] and recorded at 30 frames per second. Each swallow study was assessed for presence or absence of aspiration before, during, and after the pharyngeal swallow. Aspiration was defined as being present if any portion of the barium bolus entered the airway below the level of the true vocal folds. Aspiration may have occurred at multiple times for a swallow. Each individual patient was then classified as having aspirated if they were assessed as having aspiration present at any point (before, during, or after the pharyngeal swallow) on at least one of the multiple swallows in the VFG evaluation.

Two research technicians shared the analysis of the swallow studies for this study. Research technicians were initially trained in their accuracy until their interobserver and intraobserver reliability was at least 0.85. Once the study commenced, 10% of the analyzed swallows were chosen at random for reanalysis by both research technicians to determine inter-judge and intra-judge reliability. Lin’s Concordance Coefficient was used to quantify reliability. Average inter- and intra-observer reliability with Lin’s Concordance Coefficient were .989 and .992, respectively. Since a number of these measures play an important role in aspiration, we correlated DVH data to aspiration only.

The following 12 larynx substructures were retroactively contoured in each patient per guidelines outlined by Choi et al. [22]: thyroid cartilage, cricoid cartilage, total epiglottis, suprahyoid epiglottis, infrahyoid epiglottis, total larynx, supraglottic larynx, subglottic larynx, glottic larynx, arytenoids, aryepiglottic (AE) folds, and glossoepiglottic fold. Of these 49 patients, 13 were excluded due to not having a VFG evaluation at 12 months of follow-up (10 oropharyngeal, 3 unknown primary), 4 were excluded due to prevalent aspiration at baseline (3 oropharyngeal, 1 hypopharyngeal), and 3 were excluded due to having primary laryngeal cancer. In total, 29 patients were included in our analysis.

Treatment technique

The details of our sequential treatment technique have been previously reported [26]. Briefly, planning target volumes (PTVs) were generated from clinical target volumes (CTVs) corresponding to areas of high, intermediate, and low-risk disease. These areas received 66–75 Gy, 56–60 Gy, and 50–56 Gy, respectively. Treatment was delivered without the use of image-guided radiotherapy (IGRT). In this paper our reported mean larynx doses are towards the higher side because during the initial years of this study, we did not utilize very strict mean dose parameters. Additionally, our larynx contours are more generous and comprehensive than what is traditionally defined as laryngeal tissue. In more recent years we restricted the mean larynx dose to < 45 Gy.

Statistical methods

Patient demographics, tumor characteristics, and treatment modalities were compared between aspiration groups using Fisher’s exact test, except for age and total dose where the independent sample t-test was used. Mean dose to each of the 12 structures was compared between aspiration groups using logistic regression, which resulted in a receiver operating characteristic (ROC) curve. The area under the ROC curve (AUC) indicated the strength of the relationship between dose and aspiration. Multivariate analysis was performed using logistic regression.

For each substructure, a dose cutpoint was determined which maximized the sum of sensitivity and specificity. Sensitivity was defined as the percentage of aspirators whose dose to that substructure was above the cutpoint. Specificity was defined as the percentage of non-aspirators whose dose to that substructure was below the cutpoint. The positive likelihood ratio (+LR), which is sensitivity/(100-specificity), is the percentage of aspirators above the cutpoint divided by the percentage of non-aspirators above the cutpoint. Positive predictive value (PPV) was the percentage of patients above the cutpoint who aspirated. Negative predictive value (NPV) was the percentage of patients below the cutpoint who did not aspirate. Relative risk (RR) was the percentage of patients above the cutpoint who aspirated, divided by the percentage of patients below the cutpoint who aspirated. Pearson correlation coefficients were calculated between doses to pairs of structures. Inter-correlation of doses was illustrated using least squares regression and a 95% prediction interval.

Results

Patient demographics, tumor characteristics, and treatment modalities are outlined in Table 1. There was no significant difference on these attributes between aspirators and non-aspirators. Among the final 29 patients analyzed, the median age was 54 (range 30–73). The majority of patients were male (79.3%) and non-smokers (79.3%). Tumor sites included oropharynx (22 patients), unknown primary (6 patients), and hypopharynx (1 patient). Most patients were stage IVA at the time of diagnosis (26), followed by stage III (2), and stage IVB (1). p16 status was available for 15 patients, and 14 of these were p16 positive on pathologic review. None of the patients included in this study had tumor directly infiltrating the larynx. The rate of aspiration at 1 year was 34% (10 aspirators vs. 19 non-aspirators).

Table 1 Patient demographics, tumor characteristics and treatment modalities

Table 2 outlines the results of our univariate and multivariate analysis. For each patient, the mean dose for each of the 12 larynx substructures was determined. We calculated the mean and standard deviation of these mean doses among aspirators and non-aspirators. After generating ROC curves using logistic regression, the AUC was used to determine the strength of the relationship between substructure dose and aspiration at 12 months. The AUC was highest for AE folds (Fig. 1). Univariate analysis identified multiple laryngeal substructures with significant association between RT dose and aspiration, however on multivariate analysis, only dose to the AE folds was significant (p = 0.025).

Table 2 Mean dose (cGy) by aspiration status for each laryngeal substructure
Fig. 1
figure 1

Receiver Operating Characteristic (ROC) curve relating mean dose to aryepiglottic folds with aspiration at 12 months

Table 3 highlights the mean dose cutpoint for each larynx substructure that maximized the sum of sensitivity and specificity for aspiration at 1 year. For example, a mean dose of 6500 cGy or higher to the AE folds was associated with an increased risk of developing aspiration at one year (+LR 2.81, PPV 60%, NPV 92.9%, RR 8.4).

Table 3 Dose cutpoint, sensitivity, specificity and positive likelihood ratio, positive predictive value (PPV), negative predictive value (NPV) and relative risk (RR)

Discussion

This manuscript reports the results of a prospective study examining the relationship between laryngeal substructure dose and aspiration risk at 1 year among 29 patients treated definitively with CRT for locally advanced H&N cancer. All patients underwent baseline and post-treatment swallowing evaluation with VFG for at least 1-year. The rate of aspiration at 1 year was 34% (10/29 patients). We found that dose to the AE folds was most strongly correlated with 1-year aspiration risk, and this held true on multivariate analysis. Furthermore, a mean dose cutpoint of 6500 cGy to this structure was associated with an increased risk of developing aspiration at this time point. This is logical from a functional standpoint as the AE folds provide central airway protection by adducting during swallowing to prevent aspiration.

While treatment intensification for H&N cancer has improved cancer-specific outcomes, it has also increased the incidence of dysphagia and aspiration complications [27,28,29]. However, highly conformal dose distributions, attainable through the implementation of IMRT and on board cone beam computed tomography imaging, can be used to spare dose to critical OARs and reduce adverse toxicity [14, 30,31,32,33]. There is hope that normal tissue sparing could be further improved with the implementation of proton-based radiotherapy in appropriately selected patients [34]. Swallowing function can also be preserved and improved with timely identification of patients at risk of aspiration/dysphagia and initiation of prophylactic swallowing exercises [35, 36].

Dose to the AE folds and its impact on patient swallowing/nutrition has been identified before. A study by Dornfeld, et al. examined the outcomes of 27 patients treated with definitive IMRT for H&N cancer who were disease free for at least 1 year post-treatment [17]. The authors found that higher doses to the AE folds, false vocal cords, and lateral pharyngeal walls were correlated with a more restricted patient diet. Higher dose to the AE folds was also correlated with greater weight loss at 1 year.

When interpreting the results of this paper, several limitations should be noted. One is our relatively small sample size, especially after some patients were excluded for presence of baseline aspiration and insufficient follow-up. Additionally, no quality of life data was collected. Finally, there was heterogeneity in primary target PTV coverage and larynx doses.

The strength of this paper lies in the prospective and objective collection of its data points. Our identification of AE folds as a laryngeal substructure of interest could help streamline the radiation planning and evaluation process because, logistically, it is not practical to contour 12 laryngeal substructures for every patient. Our findings of association between mean dose to the larynx and aspiration is similar to published results by other authors [12, 13, 17]. However, our approach of using standardized methods to contour and define the larynx makes our dose-volume relationship findings unique.

Conclusions

Overall, these are hypothesis-generating data that require further research and validation in a larger number of patients to assess the importance of dose to laryngeal substructures and future aspiration risk. Implementation of modern radiotherapy techniques, such as either IMRT or proton therapy, could be used to further reduce mean radiation doses to the larynx and critical OARs within the larynx, such as the AE folds and reduce the risk for aspiration.