Background

Postoperative nausea and/or vomiting can be defined as nausea and/or vomiting within 24 h of surgery [1, 2]. Between 20 % and 65 % of patients undergoing surgery experience postoperative nausea and/or vomiting [2, 3] and the anesthetic agents administered during the procedure have been identified as a contributing factor. Nausea and vomiting are associated with decreased quality of life and patient satisfaction [4, 5]. Vomiting can also cause complications such as aspiration pneumonia [6] and a longer hospital stay [7].

Serotonin (5-HT3) receptor antagonists reduce nausea and vomiting by inhibiting vagal nerves in the central nervous system and intestinal mucosa [8]. These agents are recommended by clinical practice guidelines for patients undergoing surgery and at risk for nausea and/or vomiting [9, 10].

We were commissioned by Health Canada to conduct a systematic review and network meta-analysis to assess the comparative efficacy of 5-HT3 receptor antagonists.

Methods

Protocol

A protocol based on the Preferred Reporting Items for Systematic reviews and Meta-Analysis for Protocols (PRISMA-P) guidelines was developed [11]. We revised our protocol using feedback from the research team and the research users, including Health Canada, a department of the federal government, who posed the original query. The final protocol was registered with PROSPERO (CRD42013003564) and published in an open-access journal [12]. As described in our protocol [12], our initial objective was to include data for patients undergoing surgery and chemotherapy in the overall analysis for both safety and efficacy outcomes. However, due to the extensive number of studies that met the inclusion criteria, we subdivided the analysis and presentation of results into separate papers for chemotherapy and surgery, as well as for efficacy and safety outcomes [13]. This paper focuses on the efficacy of 5-HT3 receptor antagonists for patients undergoing surgery. Our methods are described briefly below.

Eligibility criteria

We included studies involving patients of any age undergoing any type of surgery and who were given a 5-HT3 receptor antagonist for nausea and/or vomiting (Additional file 1: Appendix 1). Randomized controlled trials (RCTs), quasi-RCTs, non-RCTs, interrupted time series, controlled before–after studies, and observational (cohort) studies were eligible for inclusion. We limited our systematic review to trials published in English due to resource constraints, and excluded studies that were identified as fraudulent or were retracted [14]. The primary outcome was the number of patients who vomited, and secondary outcomes were the number of patients with nausea and the number of patients with both postoperative nausea and vomiting (PONV).

Information sources

Information sources included electronic databases (MEDLINE, EMBASE, and the Cochrane Central Register of Controlled Trials from inception until January 11, 2013), trial protocol registries, and conference proceedings.

Study selection and data collection

Two reviewers independently screened the literature search results and potentially relevant full-text articles. The same process was followed for data abstraction and methodological quality/risk of bias appraisal. We contacted authors as necessary, for example to obtain additional information.

Appraisal of methodological quality and risk of bias

We used the Cochrane Effective Practice and Organization of Care (EPOC) risk-of-bias tool to assess risk of bias for experimental and quasi-experimental studies [15], and the Newcastle–Ottawa Scale (NOS) [16] for cohort studies.

Synthesis of included studies

A random-effects pairwise meta-analysis on the odds ratio (OR) scale was performed to combine studies addressing the same clinical outcome and treatment comparison. We decided to apply a random-effects model, as we expected methodological and clinical heterogeneity across the included studies that compared the same pairs of interventions. For studies with dichotomous outcomes where zero events were reported in one treatment arm, we added 0.5 to all cells. Between-study heterogeneity (τ²) was estimated using the restricted maximum likelihood (REML) method [17], and quantified using the I² statistic [18]. All pairwise meta-analyses were conducted in R 3.1.2 [19] using the metafor package [20].
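The pooling steps described above can be illustrated with a minimal Python sketch. It is not the authors' code (they used R and metafor), and it makes two assumptions that should be read as such: the between-study variance is estimated with the closed-form DerSimonian-Laird estimator rather than REML, purely to keep the example short, and the 0.5 correction is applied whenever any cell of the 2x2 table is zero. The function names and the input layout are hypothetical.

```python
import math

def log_odds_ratio(events_t, n_t, events_c, n_c):
    """Return (log OR, variance) for one study's 2x2 table."""
    a, b = events_t, n_t - events_t      # treatment arm: events, non-events
    c, d = events_c, n_c - events_c      # control arm: events, non-events
    # Continuity correction: add 0.5 to all cells when any cell is zero
    # (assumption: this also covers arms in which every patient had the event).
    if 0 in (a, b, c, d):
        a, b, c, d = a + 0.5, b + 0.5, c + 0.5, d + 0.5
    return math.log((a * d) / (b * c)), 1 / a + 1 / b + 1 / c + 1 / d

def random_effects(studies):
    """Pool studies given as (events_t, n_t, events_c, n_c) tuples.

    Uses the DerSimonian-Laird estimator of tau^2 as a simple stand-in
    for REML, which requires iterative fitting.
    """
    effects = [log_odds_ratio(*s) for s in studies]
    y = [est for est, _ in effects]
    v = [var for _, var in effects]
    w = [1 / vi for vi in v]
    k = len(y)
    fixed = sum(wi * yi for wi, yi in zip(w, y)) / sum(w)
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, y))   # Cochran's Q
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - (k - 1)) / c) if k > 1 and c > 0 else 0.0
    i2 = max(0.0, (q - (k - 1)) / q) * 100 if q > 0 else 0.0  # I^2 in percent
    w_re = [1 / (vi + tau2) for vi in v]
    mu = sum(wi * yi for wi, yi in zip(w_re, y)) / sum(w_re)
    se = math.sqrt(1 / sum(w_re))
    return {
        "or": math.exp(mu),
        "ci": (math.exp(mu - 1.96 * se), math.exp(mu + 1.96 * se)),
        "tau2": tau2,
        "i2": i2,
    }
```

Calling `random_effects([(10, 100, 20, 100), (0, 50, 5, 50)])` on these made-up counts returns the pooled OR, its 95 % CI, and the heterogeneity estimates; a REML fit would generally give a somewhat different τ².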

For each connected network, we conducted a random-effects network meta-analysis to make inferences on the comparative efficacy of the 5-HT3 receptor antagonists [21]. Treatment nodes were selected by the clinicians and statisticians on the research team. If a study compared different doses of a particular intervention, we included only the recommended dose in the analysis [9, 10, 22–30].

Prior to conducting a network meta-analysis, we evaluated the transitivity assumption by examining the comparability of the distributions of potential treatment-effect modifiers across comparisons [31]. These included age (children versus adults), timing of administration (all time points versus during surgery), and risk of bias (all versus removing high risk of bias for randomization, allocation concealment, and blinding of outcome assessor). We evaluated transitivity in each network by visually comparing the mode of the categorical potential effect modifiers across treatment comparisons [32]. We also assessed statistical inconsistency between different sources of evidence in the network using a global χ² test derived from the design-by-treatment interaction model [21]. In the presence of statistically significant inconsistency, we applied the loop-specific approach [33, 34] to locally assess the network and identify the treatment comparisons responsible for inconsistency. In the network meta-analysis and design-by-treatment interaction models, we assumed common within-network heterogeneity, whereas in the loop-specific method we assumed common within-loop heterogeneity. We assumed common heterogeneity across treatment comparisons since the included treatments are of the same nature and it was clinically reasonable to share a common heterogeneity parameter. In all approaches, we estimated the magnitude of between-study heterogeneity using the REML method [17]. Important heterogeneity and/or inconsistency would have been explored using network meta-regression analyses adjusting for potential effect modifiers. For each outcome, we carried out subgroup analyses using time of administration of antiemetics (all time periods versus during surgery) and age (all ages versus children), and sensitivity analyses excluding studies with high risk of incomplete outcome data bias. Although our primary analyses were restricted to RCTs only, as a secondary analysis, we included quasi-RCTs and non-RCTs to examine the robustness of the network meta-analysis results.
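As an illustration of the loop-specific idea, the following Python sketch compares a direct estimate with the indirect estimate obtained around a closed loop of treatments. It is a simplified sketch under stated assumptions, not the authors' Stata routine: the inputs are taken to be summary log ORs and standard errors already pooled with a heterogeneity parameter common to the loop, and the function name and input layout are hypothetical.

```python
import math

def loop_inconsistency(direct, indirect_path):
    """Inconsistency in one closed loop of evidence.

    direct:        (log OR, SE) for A versus C from head-to-head studies.
    indirect_path: list of (log OR, SE) whose sum gives A versus C
                   indirectly, e.g. [A versus B, B versus C].
    Returns (inconsistency factor, its SE, two-sided P value).
    """
    d_direct, se_direct = direct
    d_indirect = sum(d for d, _ in indirect_path)
    var_indirect = sum(se ** 2 for _, se in indirect_path)
    inconsistency = d_direct - d_indirect
    se_inc = math.sqrt(se_direct ** 2 + var_indirect)
    z = inconsistency / se_inc
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal test
    return inconsistency, se_inc, p_value
```

A loop would be flagged when the inconsistency factor is large relative to its standard error; the numbers passed in are whatever the pairwise syntheses produced for that loop.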

We present the network meta-analysis summary of treatment effects along with their 95 % confidence interval (CI) and 95 % predictive interval (PrI). The PrI captures both the uncertainty around the summary treatment effect and between-study variance, and shows the interval within which the treatment effect is expected to lie when a future study is conducted [35, 36]. To visually assess the presence of reporting bias (including publication bias and small-study effects), we used the comparison-adjusted funnel plot [32]. We also ranked the effectiveness of the 5-HT3 receptor antagonists using the surface under the cumulative ranking (SUCRA) curve [37]. Network meta-analyses were conducted in Stata 13.0 [38] using the mvmeta command [39].
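To make the ranking measure concrete, the sketch below computes a SUCRA value from a treatment's rank probabilities, using the usual definition as the average of the cumulative ranking probabilities over the first a − 1 ranks (a being the number of treatments). The rank probabilities themselves would come from the fitted network model; the function name and input format here are illustrative assumptions.

```python
def sucra(rank_probs):
    """SUCRA for one treatment.

    rank_probs[r] is the probability that the treatment holds rank r + 1,
    where rank 1 is best. Returns a value between 0 (always worst)
    and 1 (always best).
    """
    n_treatments = len(rank_probs)
    if n_treatments < 2:
        raise ValueError("SUCRA needs at least two treatments")
    cumulative = 0.0
    total = 0.0
    # Sum the cumulative probabilities over ranks 1 .. a - 1
    for p in rank_probs[:-1]:
        cumulative += p
        total += cumulative
    return total / (n_treatments - 1)
```

A treatment certain to rank first has a SUCRA of 1, one certain to rank last has 0, and one equally likely to take any rank has 0.5; the percentages reported in the Results are SUCRA values on this scale multiplied by 100.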

Results

Literature search

The literature search yielded 7,608 citations in total that met the search criteria, of which 450 full text articles met eligibility criteria for inclusion (444 primary publications and six companion reports reporting on nine studies, Fig. 1; Additional file 1: Appendix 2). Five unpublished conference abstracts were included in the review [40–44]. We excluded 77 studies because we suspected or confirmed that their results were fraudulent [14] and 535 studies from previous reviews that did not fulfill our eligibility criteria (reasons for exclusion presented in Additional file 1: Appendix 3).

Fig. 1 Study flow. Details the flow of information through the different phases of the review, mapping out the number of records identified, included and excluded, and the reasons for their exclusion

Study and patient characteristics

The majority of the included studies had an RCT design (97 %) with a short duration of follow-up of 12 to 24 h (72 %). Most studies were published between 1995 and 2013 (94 %), and were mainly conducted in Asia (39 %), North America (27 %), or Europe (24 %) (Table 1, Additional file 1: Appendix 4).

Table 1 Study characteristics

The 5-HT3 receptor antagonists we examined were ondansetron (0.1–48 mg/day; 76 %), granisetron (0.1–3 mg/day; 13 %), tropisetron (0.1–5 mg/day; 8 %), dolasetron (12.5–200 mg/day; 7 %), palonosetron (0.025–0.25 mg/day; 3 %), and ramosetron (0.1–0.6 mg/day; 2 %) (Table 1, Additional file 1: Appendix 5). We also included studies comparing combinations of 5-HT3 drugs administered concomitantly with other antiemetics, e.g., dexamethasone (2–20 mg/day; 16 %), butyrophenone (3 %), and benzamide (1 %).

Overall, 286 studies with dichotomous outcome data were included in our analyses. Studies with continuous outcome data and studies investigating the same 5-HT3 treatment in different doses were not included in the analysis.

The median study size was 118 (interquartile range, 75–180) patients. Most patients were women (72 %) and adults (59 %), with an American Society of Anesthesiologists (ASA) physical status [45] of I or II (60 %), and the most common type of surgery was obstetrical and gynecological (30 %) (Table 2, Additional file 1: Appendix 6). The included studies often did not report patients’ history of PONV (56 %). Similarly, a history of motion sickness was reported in only 33 % of the studies; comorbidities were rarely reported (5 %).

Table 2 Patient characteristics

Methodological quality and risk of bias

Most of the included RCTs and quasi-RCTs had an unclear or high risk of bias on the following items: allocation concealment (59 %), baseline outcome characteristics (89 %), incomplete outcome data (60 %), and selective outcome reporting bias (97 %). When assessing potential for funding bias, we considered a study at a high or unclear risk of bias when it was funded by private industry or when an author on the publication was employed by the company sponsoring the study, which occurred in 92 % of the RCTs and quasi-RCTs (Additional file 1: Appendices 7 and 8). Of the nine observational studies included in the analysis, eight used a somewhat representative sample, two did not describe ascertainment of exposure, all failed to demonstrate that the outcome was not present at the start of the study, six did not control for confounders, three did not describe the assessment of outcome, and all neglected to report follow-up (Additional file 1: Appendix 9). Visual inspection of the comparison-adjusted funnel plots showed no evidence of small-study effects or publication bias (Additional file 1: Appendix 10).

Vomiting

The network meta-analysis for vomiting included 238 RCTs with a total of 12,781 patients. The network geometry and included drugs can be found in Fig. 2a, whereas the statistically significant results are available in Table 3 and the overall results in Additional file 1: Appendix 11. The following treatment comparisons were statistically significant using both the CIs and PrIs: all agents (except for palonosetron plus dexamethasone and granisetron plus droperidol intravenous (IV)) versus placebo, ondansetron plus droperidol IV versus ondansetron, granisetron plus dexamethasone versus ondansetron, ondansetron plus dexamethasone versus dolasetron, ondansetron plus droperidol IV versus dolasetron, granisetron plus dexamethasone versus dolasetron, palonosetron plus dexamethasone versus ondansetron plus dexamethasone, palonosetron plus dexamethasone versus ondansetron plus droperidol IV, and palonosetron plus dexamethasone versus granisetron plus dexamethasone (Fig. 3). According to the SUCRA (Additional file 1: Appendix 12), the most effective agents for vomiting were ondansetron plus droperidol IV (85 % probability) and granisetron plus dexamethasone (84 % probability). The within-network heterogeneity in the network meta-analysis model was estimated at 0.15, and the evaluation of the network inconsistency using the design-by-treatment interaction model suggested that there was no evidence of statistical inconsistency (χ² = 49.27, degrees of freedom = 44, P = 0.271, heterogeneity variance = 0.15).

Fig. 2 Network geometry. Network meta-analysis diagrams for vomiting, nausea, and PONV. Nodes are weighted according to the number of patients included in the corresponding treatments, and edges are weighted according to the number of studies included in the respective comparisons

Table 3 Statistically significant results of network meta-analysis for all time periods of drug administration

Fig. 3 Network meta-analysis results for vomiting. All treatments are compared to placebo. The black horizontal lines represent the 95 % confidence intervals (CI) of the summary treatment effects and red horizontal lines the 95 % predictive intervals (PrI). Results are presented on the odds ratio scale

In order to account for the treatment effect modifier ‘age’ (Additional file 1: Appendix 13), a subgroup analysis was conducted for 46 RCTs involving a total of 1,830 children (Table 3, Additional file 1: Appendix 11). The following treatment comparisons were statistically significant for vomiting: ondansetron versus placebo, granisetron versus placebo, dolasetron versus placebo, tropisetron versus placebo, ondansetron plus dexamethasone versus placebo, ondansetron plus droperidol IV versus placebo, ondansetron plus metoclopramide IV versus placebo, granisetron plus dexamethasone versus placebo, ondansetron plus dexamethasone versus ondansetron, ondansetron plus dexamethasone versus dolasetron, ondansetron plus dexamethasone versus granisetron, and granisetron plus dexamethasone versus dolasetron (Additional file 1: Appendix 11). According to the SUCRA, the most effective agents for vomiting in children were ondansetron plus dexamethasone (83 % probability) and granisetron plus dexamethasone (82 % probability).

In order to account for the treatment effect modifier ‘timing of administration’ (Additional file 1: Appendix 14), a subgroup analysis was conducted for 220 RCTs involving 10,811 patients when the agents were administered during surgery (Additional file 1: Appendix 15). The results were the same as for the primary analysis, except that ondansetron plus droperidol IV was statistically superior to granisetron, and ondansetron plus droperidol IV and granisetron plus dexamethasone were superior to ramosetron. According to the SUCRA for this subgroup analysis, the most effective agents for vomiting were ondansetron plus droperidol IV (88 % probability) and granisetron plus dexamethasone (84 % probability).

In order to account for the treatment effect modifier ‘risk of bias’, a sensitivity analysis was conducted in which 11 RCTs were removed because of high risk of incomplete outcome data bias (Additional file 1: Appendix 15); the same results were observed, except granisetron plus dexamethasone was associated with significantly less vomiting compared with ramosetron (OR, 0.52; 95 % CI, 0.27–0.99). In another sensitivity analysis, in which four cohort studies [46–49], two non-randomized controlled trials [50, 51], and one controlled before–after study [52] were added to the included studies, all of the results were the same, except that the differences between ondansetron plus dexamethasone and granisetron or ramosetron and between ondansetron plus metoclopramide IV and palonosetron plus dexamethasone were no longer statistically significant (Additional file 1: Appendix 15).

Nausea

The network meta-analysis for nausea included 195 RCTs with a total of 24,230 patients. The network geometry and included drugs can be found in Fig. 2b. We present the statistically significant treatment effect estimates derived through the network meta-analysis model in Table 3 and overall results in Additional file 1: Appendix 11. Using both the CIs and PrIs, the only treatment comparisons that were statistically significant for nausea were granisetron versus placebo (OR, 0.35; 95 % PrI, 0.13–0.91), ondansetron plus dexamethasone versus placebo (OR, 0.28; 95 % PrI, 0.10–0.76), palonosetron versus placebo (OR, 0.30; 95 % PrI, 0.10–0.89), ramosetron versus placebo (OR, 0.32; 95 % PrI, 0.12–0.86), ondansetron plus droperidol IV versus placebo (OR, 0.26; 95 % PrI, 0.09–0.80), granisetron plus dexamethasone versus placebo (OR, 0.20; 95 % PrI, 0.07–0.57), dolasetron plus droperidol IV versus placebo (OR, 0.19; 95 % PrI, 0.05–0.77), and granisetron plus droperidol IV versus placebo (OR, 0.21; 95 % PrI, 0.05–0.87) (Additional file 1: Appendices 11 and 16). According to the SUCRA (Additional file 1: Appendix 17), the most effective agents for nausea were granisetron plus dexamethasone (82 % probability) and dolasetron plus droperidol IV (78 % probability). The within-network heterogeneity in the network meta-analysis model was estimated at 0.24, and the design-by-treatment interaction model suggested that there was no statistically significant inconsistency (χ² = 26.65, degrees of freedom = 41, P = 0.959, heterogeneity variance = 0.27).

In order to account for the treatment effect modifier ‘age’ (Additional file 1: Appendix 13), a subgroup analysis was conducted for 11 RCTs involving 1,326 children (Table 3, Additional file 1: Appendix 11). The following treatment comparisons were statistically significant for nausea: ondansetron versus placebo, dolasetron versus placebo, ondansetron plus dexamethasone versus placebo, and granisetron plus dexamethasone versus placebo (Additional file 1: Appendix 11). According to the SUCRA, the most effective agents for nausea in children were granisetron plus dexamethasone (84 % probability) and ondansetron plus droperidol IV (81 % probability).

In order to account for the treatment effect modifier ‘timing of administration’ (Additional file 1: Appendix 14), a subgroup analysis was conducted for 175 RCTs involving 21,844 patients when the agents were administered during surgery (Additional file 1: Appendix 18). All of the results were the same as for the primary analysis, except that the difference between tropisetron and ondansetron plus dexamethasone was no longer statistically significant. According to the SUCRA for this subgroup analysis, the most effective agents for nausea were granisetron plus dexamethasone (82 % probability) and dolasetron plus droperidol IV (77 % probability).

In order to account for the treatment effect modifier ‘risk of bias’, a sensitivity analysis was conducted in which 10 RCTs were removed because of high risk of incomplete outcome data bias [53–62], and the results were unchanged from the primary analysis (Additional file 1: Appendix 18). In another sensitivity analysis, in which two cohort studies [46, 49] and two non-randomized controlled trials [50, 51] were added to the included studies, all of the results were the same, except that the differences between palonosetron and dolasetron and between dolasetron plus dexamethasone and dolasetron were no longer statistically significant (Additional file 1: Appendix 18).

Postoperative nausea and vomiting

The network meta-analysis for PONV included 125 RCTs with 16,667 patients. The network geometry and included drugs can be found in Fig. 2c, statistically significant results are presented in Table 3, and the overall results in Additional file 1: Appendix 11. According to both the CIs and PrIs, the following treatment comparisons were statistically significant for PONV: all agents versus placebo except for tropisetron, granisetron plus dexamethasone versus ondansetron, and granisetron plus dexamethasone versus tropisetron (Additional file 1: Appendix 19). According to the SUCRA, the most effective agents for PONV were granisetron plus dexamethasone (86 % probability) and ondansetron plus droperidol IV (78 % probability; Additional file 1: Appendices 11 and 20). The within-network heterogeneity in the network meta-analysis model was estimated at 0.25, and the design-by-treatment interaction model suggested that there was no statistically significant inconsistency (χ² = 26.58, degrees of freedom = 32, P = 0.737, heterogeneity variance = 0.26).

In order to account for the treatment effect modifier ‘age’ (Additional file 1: Appendix 13), a subgroup analysis was conducted for 14 RCTs involving a total of 2,394 children (Table 3, Additional file 1: Appendix 11). The following treatment comparisons were statistically significant: ondansetron versus placebo and ondansetron plus droperidol IV versus placebo. According to the SUCRA, the most effective agents for PONV in children were ondansetron plus droperidol IV (85 % probability) and ondansetron plus dexamethasone (59 % probability).

In order to account for the treatment effect modifier ‘timing of administration’ (Additional file 1: Appendix 14), a subgroup analysis was conducted for 116 RCTs involving 12,415 patients in which the agents were administered during surgery (Additional file 1: Appendix 21). All of the results were the same as for the primary analysis, except that ondansetron plus dexamethasone was statistically superior to ramosetron. According to the SUCRA for this subgroup analysis, the most effective agents for PONV were granisetron plus dexamethasone (84 % probability) and ondansetron plus droperidol IV (79 % probability).

In order to account for the treatment effect modifier ‘risk of bias’, a sensitivity analysis was conducted in which 10 RCTs were removed because of high risk of incomplete outcome data bias (Additional file 1: Appendix 21) [53–55, 58, 59, 61–65]; the results were unchanged from the primary analysis, except that the risk of PONV was significantly higher with tropisetron than with granisetron. In another sensitivity analysis, two cohort studies [46, 49] and three non-randomized controlled trials [50, 51, 66] were added to the included studies, and the same results were observed (Additional file 1: Appendix 11).

Discussion

Administration of most 5-HT3 antagonists led to significantly fewer patients experiencing nausea, vomiting, and PONV relative to placebo. However, some of the corresponding PrIs were not statistically significant, suggesting that the statistically significant treatment effects might change should a new study become available. For all age groups and across all outcomes, the most effective agents were granisetron plus dexamethasone. For adults, the most effective agents were ondansetron plus droperidol IV; and for children, the most effective agents were ondansetron plus dexamethasone.

We also conducted a systematic review and network meta-analysis on the safety of these medications [13]. Our network meta-analysis results suggested that granisetron plus dexamethasone increases the risk of arrhythmia. However, a statistically significant increase in the risk of delirium was not observed in another network meta-analysis. In a meta-analysis including three studies for ondansetron versus placebo, no statistically significant results were observed for mortality. Only two studies reported prolongation of the QT interval; meta-analysis was not feasible because the studies compared different interventions.

Our network meta-analysis results for vomiting and PONV are similar to those of a previous network meta-analysis that examined only these two outcomes [67]. The only difference was that, unlike the current study, the earlier analysis showed that granisetron was significantly better than ondansetron and dolasetron for PONV. However, we included 378 studies involving a total of 68,167 patients that were not included in the earlier review. Although we are aware of other systematic reviews and meta-analyses of 5-HT3 receptor antagonists [14, 68], the previous researchers did not conduct a network meta-analysis, and therefore the results cannot be compared. Notably, because of our comprehensive literature search and broad eligibility criteria, we included 205 studies involving a total of 43,075 patients that were not included in those previous reviews (Additional file 1: Appendix 22).

The included studies were limited by having an unclear or high risk of bias on important components, including allocation concealment, selective outcome reporting bias, and potential for funding bias. Further, this systematic review process had some inherent limitations. Slight changes to the original protocol [12] were necessary, such as preparing a separate paper for patients undergoing chemotherapy, as well as one focused on the safety of these agents for patients undergoing surgery [13]. Furthermore, it was assumed that the effects of different doses and durations of each treatment were identical, so that they could be assigned to the same treatment node. We are currently exploring these assumptions in another paper [69]. Although study designs beyond RCTs were included, the primary network meta-analysis was limited to RCTs in order to increase confidence in the results. We were also unable to present the results from the hundreds of meta-analyses conducted, as well as the raw data; these are available from the corresponding author upon request. Although the analyses were adjusted to account for the treatment effect modifiers (age, timing of administration, and risk of bias), the results might be influenced by effect modifiers that we were unaware of. However, the statistical evaluation of the transitivity assumption using the design-by-treatment interaction model suggested there was no evidence of inconsistency. Finally, 77 studies were excluded because they contained data known to be fraudulent or were retracted [14].

Conclusions

In conclusion, granisetron plus dexamethasone was often the most effective antiemetic across the effectiveness outcomes considered here, with the number needed to treat ranging from two to nine. A study that examines the administration of these agents at different dosages would provide further clarity to this important issue and our team is currently working on such an initiative [70].
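For readers who wish to relate an odds ratio to a number needed to treat, the sketch below shows one common conversion. It is an assumption for illustration only: the text does not state how the reported numbers needed to treat were derived, and the control event rate used in any calculation must be supplied by the reader. The function name is hypothetical.

```python
def nnt_from_odds_ratio(odds_ratio, control_event_rate):
    """Number needed to treat implied by an OR at a given control risk.

    The treated-group risk is obtained by applying the OR to the control
    odds; the NNT is the reciprocal of the absolute risk reduction.
    """
    cer = control_event_rate
    if not 0 < cer < 1:
        raise ValueError("control_event_rate must be strictly between 0 and 1")
    eer = odds_ratio * cer / (1 - cer + odds_ratio * cer)
    risk_reduction = cer - eer
    if risk_reduction <= 0:
        raise ValueError("no risk reduction: an NNT is not defined")
    return 1 / risk_reduction
```

For example, with a hypothetical OR of 0.5 and a control event rate of 40 %, the treated-group risk is 25 % and the implied number needed to treat is about 6.7. The same OR gives a larger NNT when the baseline risk is lower, which is why a range rather than a single value is reported.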