Background

Hypopharyngeal neoplasm, as a term used for malignancies of a subsite of the upper aerodigestive tract, is difficult to catch in its earliest stages. Meanwhile it is reported to have the highest mortality rates among kinds of head and neck cancers [1]. Laryngeal neoplasms are mostly squamous cell carcinomas, which reflect they originate from the skin of the larynx. Its 5 years survival rates in the United States are 60% [2]. So far multiple treatments for hypopharyngeal or laryngeal neoplasms have been developed and compared, which include surgery, radiation therapy (RT), chemoradiation therapy (CRT), or kinds of combination of treatments aforementioned [3]. One of the main targets of treating patients with locoregional advanced laryngeal cancer is to maintain larynx function. Yet surgery, though providing satisfying local control, might impair life quality of patients. Thus the application of surgery alone in treating laryngeal neoplasms and hypopharyngeal neoplasms has been reduced while chemotherapy (CT) and RT have been more widely used in clinical practice these days [2]. However, RT and CT being applied separately or combined have triggered worries regarding a decrease in overall survival rate (OSR) since they are reckoned to have caused acute toxicity and disrupted laryngeal function [4].

Many meta-analyses related to head and neck cancer, hypopharyngeal neoplasm, and laryngeal neoplasms have been conducted to evaluate the efficacy of CT, RT, and CRT. Whereas, there were several deficiencies in prior studies such as small sample sizes [5] or a lack of sufficient prospective studies [5]. Besides, the nature of conventional meta-analysis prevents it from performing more comprehensive analysis of multiple treatments. It is known that network meta-analysis (NMA) could combine both direct and indirect evidence, thus considered to be of high clinical value. Based on the facts mentioned above, NMA might be the most adequate way to compare different regimens of hypopharyngeal neoplasms and laryngeal neoplasms.

Notably, most of existing NMAs comparing treatments for head and neck cancers failed to pay attention to certain specific types of cancers, which are hypopharyngeal and laryngeal neoplasms in our case. Hence it is necessary to conduct a systematic review in order to get more comprehensive results regarding regimens for hypopharyngeal and laryngeal neoplasms.

Methods

Research reporting format

This NMA was conducted according to the preferred reporting items for systematic reviews and meta-analyses (PRISMA) guidelines.

Search strategy

PubMed and Embase were systematically retrieved for eligible trials. Subject terms and key words including “hypopharyngeal neoplasms”, “laryngeal neoplasms”, “radiotherapy”, “chemoradiotherapy”, “transoral laser microsurgery”, “radiotherapy combined with surgery”, “induction chemotherapy radiotherapy”, “current chemotherapy radiotherapy”, and “clinical controlled trials” were searched with Boolean operators OR and AND. Titles and abstracts of all studies obtained after searching were reviewed in turn to screen their qualifications.

Inclusion and exclusion criteria

Included studies were restricted by the following criteria: (1) the study was clinical controlled trial; (2) patients were diagnosed with hypopharyngeal neoplasms or laryngeal neoplasms; (3) therapies included at least one of the six following treatments: surgery, RT, transoral laser microsurgery (TLM), radiotherapy combined with surgery (RT + S), induction chemotherapy radiotherapy (ICRT), and current chemotherapy radiotherapy (CCRT); (4) at least one of the following relevant outcomes was provided: 3/5-year overall survival (3/5-OS), 3/5-year disease free survival (3/5-DFS) and 5-year overall survival rate (5-OSR). Conference abstracts, reviews, duplicates and articles lacking sufficient information or relevant outcomes were excluded. Two experienced reviewers performed an assessment independently to examine eligibility. Any possible disagreements between them would be resolved by consensus and still remaining disagreements would be referred to a third reviewer.

Outcome measurement and data extraction

Baseline characteristics of patients and basic information including first author, year of publication, blind test, etiology, follow-up duration, intervention, and sample size of each treatment were extracted by two reviewers from eligible trials. Discrepancies were solved by discussion or a third reviewer. Five outcomes were selected in order to assess the efficacy of surgery and other five regimens.

Statistical analysis

This NMA is conducted by R software (Version 3.1.3) and STATA (Version 13.0). A random-effect Bayesian model was utilized considering the existence of heterogeneity. Network plots demonstrated all efficacy outcomes of six regimens. Hazard ratios (HR) with 95% credible intervals (CrIs) were calculated to perform a NMA, combining both direct and indirect evidence. The results of NMA were shown in forest plots and heat plots were utilized in our NMA to evaluate the inconsistency, in which colors were associated with the change in inconsistency between direct and indirect evidence (shown in the row) after detaching the effect of certain evidence (shown in the column). In heat plots, cold colors indicated an increase of consistency while warm colors indicated a decrease of consistency. In addition, relative efficacy of six therapies was ranked based on surface under the cumulative ranking curve (SUCRA). Higher probabilities revealed in SUCRA signified more desirable interventions.

Subgroup analysis

A subgroup analysis for locally advanced hypopharyngeal neoplasms or laryngeal neoplasms patients were conducted for the existence of clinical heterogeneity between studies. The subgroup analysis involved 17 studies reported patients with larynx stage III-IV and hypopharynx stage II-IV only.

Results

Literature search

A total of 4737 potential relevant publications were identified by literature search. Four thousand five hundred eighty-three remained for screening after duplicates were deleted. Four thousand three hundred forty-eight articles were excluded due to irrelevant titles and abstracts. Among the 235 full-text studies being assessed, 207 papers were removed on account of insufficient data, single arm studies, and irrelevant outcomes. Finally, 28 trials met the inclusion criteria and were retrieved for data extraction [2, 3, 6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31].

Table 1 and Additional file 1: Table S1 summarized basic information of included trials and patients characteristics. Among 28 trials with 9109 patients, most were two-arm trials comparing one kind of treatment with surgery or a surgery combined with other regimens. Among the included studies, there were seven trials designed to be randomized controlled trials while 21 were reported as retrospective cohort studies. The networks of comparisons between different treatments with corresponding sample sizes were summarized in Fig. 1. Each circle indicated a certain intervention while the size of each circle denoted its sample size. The thickness of lines linking two circles signified the amount of literatures comparing the corresponding two therapies.

Table 1 Basic characteristics of included studies
Fig. 1
figure 1

The network of comparisons of 3-OS, 5-OS, 3-DFS, and 5-DFS. (The width of the lines is proportional to the number of trials comparing each pair of treatments; the area of circles represents the cumulative number of patients for each intervention.) S, surgery; RT, radiotherapy; TLM, transoral laser microsurgery; RT + S, surgery combined with radiotherapy; ICRT, Induction chemotherapy radiotherapy; CCRT, current chemotherapy radiotherapy

Outcomes

In this NMA, 3-OS, 5-OS, 3-DFS, 5-DFS, and 5-OSR were calculated as primary outcomes, the results of which were demonstrated in Table 2 and Fig. 2.

Table 2 Network meta-analysis results with hazard ratio (HR) for 3-OS, 5-OS, 3-DFS, and 5-DFS, and odds ratio (OR) for 5-OSR
Fig. 2
figure 2

Forest plots for four endpoints. Hazard ratio (HRs) and 95% credible interval (CrIs) indicate the relative efficacy under the corresponding endpoint. S, surgery; RT, radiotherapy; TLM, transoral laser microsurgery; RT + S, surgery combined with radiotherapy; ICRT, Induction chemotherapy radiotherapy; CCRT, current chemotherapy radiotherapy

In terms of 3-OS, surgery alone performed significantly better than RT and CCRT (RT: HR 1.25, 95% CrI 1.00–1.57; CCRT: HR 1.23, 95% CrI 1.00–1.53). Meanwhile, RT combined with surgery turned out to be significantly superior to RT, ICRT, and CCRT (RT: HR 1.31, 95% CrI 1.06–1.61; ICRT: HR 1.22, 95% CrI 1.02–1.47; CCRT: HR 1.28, 95% CrI 1.06–1.55). With regard to 5-OS, RT combined with surgery and surgery alone both had significant advantage over RT (HR 1.25, 95% CrI 1.01–1.55; HR 1.27, 95% CrI 1.01–1.59, respectively). Apart from that, there existed no other significant difference among comparisons of 6 regimens.

The results of 3-DFS were in favor of CCRT, since CCRT proved to be more efficacious than RT with statistical significance (HR 1.79, 95% CrI 1.06–3.04). Yet no other significant difference occurred regarding 3-DFS. Notably, there also existed no significant difference among comparisons in 5-DFS and 5-OSR.

Relative ranking analysis

A relative ranking of the 6 therapies could be obtained on the basis of SUCRA results (Table 3). RT combined with surgery and surgery alone demonstrated desirable performance. RT combined with surgery and surgery alone ranked first and second respectively in both 3-OS (RT + S: 0.878; S: 0.763) and 5-OS (RT + S: 0.930; S: 0.773). Besides, surgery ranked highest in 5-ORS (0.716) while RT combined with surgery rank the third place in 3-DFS (0.717).

Table 3 Surface under the cumulative ranking curve (SUCRA) for the treatments under five endpoints

Moreover, ICRT and CCRT proved to be advantageous regarding DFS. ICRT was the optimal therapy in 3-DFS (0.846) and suboptimal in 5-DFS (0.639) while CCRT turned out to be next-best in 3-DFS (0.798) and the best in 5-DFS (0.933).

Inconsistency assessment

As shown in heat plots (Fig. 3), there is no evidence of inconsistency existing in the direct and indirect evidence in 3-OS, 5-OS, 3-DFS, and 5-DFS. Yet there was a sign of inconsistency regarding 5-OSR according to the results of heat plot.

Fig. 3
figure 3

The results of consistency analysis by heat plot for each outcome. The size of the gray squares indicates the contribution of the direct evidence (shown in the column) to the network evidence (shown in the row). The colors are associated with the change in inconsistency between direct and indirect evidence (shown in the row) and the warmer color represents the stronger inconsistency. S, surgery; RT, radiotherapy; TLM, transoral laser microsurgery; RT + S, surgery combined with radiotherapy; ICRT, Induction chemotherapy radiotherapy; CCRT, current chemotherapy radiotherapy

Subgroup analysis

Among 28 included studies, 7 papers included only early-stage patients, 4 papers included both early-stage and locally advanced stages patients and 17 papers only included locally advanced stages patients (larynx stage III-IV and hypopharynx stage II-IV). For the treatments related to 7 papers included only early-stage patients were limited and the data were not sufficient enough to draw reliable conclusion, we decided to add a subgroup analysis of 17 papers only included locally advanced stages patients (larynx stage III-IV and hypopharynx stage II-IV). The network meta-analysis results were shown in Additional file 2: Table S2 and SUCRA results were shown in Additional file 3: Table S3. Surgery, RT, RT + S, ICRT, and CCRT were involved in subgroup analysis. In term of 3-OS and 5-OS, RT + S showed a relatively lower HR and better performance when compared with others without statistically significance; in term of 3-DFS and 5-DFS, CCRT and surgery alone had similar performance. In term of 5-OSR, RT + S and surgery alone performed significantly better than RT (HR 0.39, 95% CrI 0.17–0.9; HR 0.31, 95% CrI 0.13–0.74). The SUCRA results shown in Additional file 3: Table S3 that RT + S was the probably best treatments as for 3-OS and 5-OS (SUCRA = 0.840, 0.808). CCRT may be the best treatments in increasing 3-DFS (0.695) while ICRT may be the best treatments in increasing 5-DFS (0.771). Surgery alone ranked the highest in 5-OSR (0.337).

Discussion

Our NMA focused on comparative efficacies of six therapies of hypopharyngeal neoplasms and laryngeal neoplasms. Results of NMA indicated that surgery alone were significantly better than RT and CCRT according to 1-OS outcome. RT + S were significantly superior to RT, ICRT, and CCRT in terms of 3-OS. Meanwhile RT + S and surgery alone had significant advantage over RT in 5-OS. CCRT proved to be better than RT in 3-DFS. With regards to relative ranking analysis, RT + S performed desirably in both OS and DFS while CCRT and ICRT were advantageous in DFS.

RT is utilized as the primary treatment for early laryngeal cancer in northern Europe and North America for its being an acknowledged treatment for selected laryngeal carcinomas that educes beneficial oncologic results [32]. For instance, RT could provide a possibility of curing un-resectable tumor. When compared with surgery, RT might provide less morbidity regarding subclinical disease and advantageous organ preservation [33].

Yet RT performed poorly in both OS and DFS according to our results, which is contrary to the results of a prior NMA claiming that RT was associated with higher rates of OS and DFS than CT, CRT, and laryngectomy [32]. However, our results that CCRT and ICRT took advantageous over RT turned out to be consistent with paper conducted by Zackrisson et al., which concluded that CRT was better than RT [34]. Besides, TLM has been used in laryngeal malignant lesions since 1972. As pointed out in prior NMAs, there are controversies in clinical literature regarding oncologic outcomes of TLM and RT for T1 glottis carcinoma, and plenty of comparisons between TLM and RT have been made. RT was inferior to TLM for T1 glottic carcinoma in OS according to Mo et al. [35], Zhong et al. [36], and Huang et al. [5]. Whereas Abdurehim et al. thought this question remained unanswered in systematic review since no significant difference was shown between RT and TLM [37], which is also consistent with our results. More random-controlled and prospective trials should be conducted, based on which a more clinically reliable conclusion would be reached in future NMAs.

Furthermore, CCRT and ICRT demonstrated no significant difference on survival control in our analysis. Similar results were shown in Ma et al. paper which claimed that ICRT had no significant benefit compared to CCRT regarding OS or locoreginal control [38]. Yet Ma also claimed that ICRT generated desirable outcomes in larynx preservation [39]. More outcomes should be explored with regard to the effectiveness of ICRT and CCRT. The subgroup analysis of 17 studies reported patients with larynx stage III-IV and hypopharynx stage II-IV also showed similar results in term of 3-OS, 5-OS, 3-DFS, 5-DFS except that the results were all not statistically significant. More clinical researches will be required in the future.

There still exist several limitations in our NMA despite that we have conducted our work as meticulous as possible. First of all, there might be several deficiencies caused by the quality and non-unified standards of the included trials. For instance, patients’ or surgeon’s preference in these trials might lead to selection bias inevitably. Besides, a great proportion of the included trials were non-randomized or retrospective trials, which could increase heterogeneity of our NMA. Notably, toxicity of treatments analyzed in our NMA has not been investigated thoroughly because of a lack of data and comparable outcomes. It should not be ignored that a lack of full consideration on dose of RT and type of TLM might undermine the clinical value of our results to some extent. Secondly, we failed to take more combination therapies into consideration. Several trials and conventional meta-analyses have discovered an increase in efficacy of combination therapies such as chemotherapy combined with CCRT on survival control for some types of head and neck cancers [40,41,42]. Yet a comprehensive analysis regarding various combination treatments is hard to conduct on account of insufficient data in our included studies. Thirdly, sample sizes for some specific comparisons were not big enough. For instance, there were only 23 subjects treated by TLM in 3-DFS and 5-DFS. More prospective trials should be performed to reach more reliable conclusions and further studies focusing on combination therapies should be carried out. Fourthly, due to limited studies and insufficient data on early-stage patients, our analysis could not make a subgroup analysis of these patients. Therefore, future studies are needed to assess the impact of various treatments in the management of early-stage glottic larynx patients in order to provide more credible medical guideline.

Conclusions

RT combined with surgery turned out to be optimal therapy of all the outcomes while the efficacy of RT was relatively poorer in the treatment of patients with larynx stage III-IV and hypopharynx stage II-IV. Also, the good performance of CCRT and ICRT in terms of DFS made them as secondary recommended therapies. There is no significant difference between surgery and TLM. The comparison of more related therapies will be better understood once results of more high-quality clinical trials conducted.