Introduction

Achieving and maintaining glycemic control is the primary goal in the management of type 2 diabetes mellitus (T2DM). Patients are encouraged to adopt a healthy diet and increase their exercise; however, for patients who do not achieve glycemic control following lifestyle changes, oral antidiabetic agents are typically prescribed. Therapy is initiated with metformin, and other agents are added to the treatment regimen as necessary, to achieve the desired level of glycemic control [1]. Glucagon-like peptide-1 receptor agonists (GLP-1 RAs) are a class of glucose-lowering agents used in the treatment of T2DM. The addition of a GLP-1 RA is one of the recommended injectable approaches for patients who are inadequately controlled on metformin monotherapy [1].

Exenatide is a GLP-1 RA approved for the management of glycemic control among individuals with T2DM and is currently marketed in two formulations: a solution for twice-daily injection providing either 5 or 10 µg of exenatide [exenatide twice daily (BID)] and a prolonged-release once weekly (QW) injection that provides 2 mg of exenatide (exenatide QW). The long-acting formulation contains the active ingredient of the original exenatide BID formulation dispersed in microspheres of medical-grade poly-(d,l-lactide-co-glycolide) in an aqueous formulation [2]. Exenatide BID was the first GLP-1 RA approved in the twice-daily formulation, and exenatide QW was the first GLP-1 RA approved in the once weekly setting [3]. Other GLP-1 RAs administered QW that have since been developed include albiglutide and dulaglutide, whereas liraglutide and lixisenatide are administered once daily (QD). Overall, there are now several GLP-1 RAs on the market, with different dosing, and some evidence suggestive of differences in potencies [46]. Exenatide QW is expected to be associated with greater adherence and compliance than daily dosing, which could in turn translate into increased efficacy. As such, a study that estimates the relative efficacy and tolerability of exenatide QW compared to the existing GLP-1 RAs that have been developed is needed and would be valuable to investigate.

The efficacy and tolerability of exenatide in its QW formulation has been demonstrated in randomized controlled trials (RCT) within the DURATION clinical trial program [710] and real-world evidence studies [1113]. Apart from the direct comparison of exenatide QW to exenatide BID [DURATION 1 (ClinicalTrials.gov identifier, NCT00308139) and DURATION-5 (ClinicalTrials.gov identifier, NCT00877890) trials] [79], there was one other trial [DURATION-6 trial (ClinicalTrials.gov identifier, NCT01029886)] where exenatide QW was directly compared to another GLP-1 RA [10]. In the DURATION-6 trial, the efficacy and safety of exenatide QW was compared to liraglutide 1.8 mg QD in patients with T2DM treatment with lifestyle modification and one or more oral antihyperglycemic agents [10]. Newer GLP-1 RAs (albiglutide, dulaglutide and lixisenatide) have not been compared directly to exenatide QW.

In the absence of head-to-head evidence within a clinical trial setting, the efficacy and tolerability of exenatide 2 mg QW, relative to other GLP-1 RAs, can be estimated using both direct and indirect evidence within a network meta-analysis (NMA) [14, 15]. Where limited direct evidence does exist, NMAs allow pooling direct and indirect evidence together in a single analysis. The technique is based on the assumption that, on a suitable scale, one can add and subtract the within-trial estimates of relative treatment effects to obtain indirect evidence. In this case, the difference in effect between treatments A and B equals the difference in effects between treatments A and C and B and C [14, 16].

The objective of this study was to estimate the relative clinical efficacy and tolerability of exenatide QW, compared to GLP-1 RAs approved in the US and/or Europe, for the treatment of T2DM for patients who fail to achieve glycemic control on metformin monotherapy. Efficacy outcomes considered were mean change from baseline for glycated hemoglobin (HbA1c) (%), weight (kg) and systolic blood pressure (SBP) (mmHg) and proportion of patients achieving glycemic target (HbA1c <7% and ≤7%) at 24 ± 6 weeks. Tolerability was considered by estimating the relative odds of experiencing nausea or of treatment discontinuation due to any adverse event.

Methods

Identification of Trials

A systematic literature review (SLR) was conducted to identify RCTs to inform the NMA. EMBASE, MEDLINE, and the Cochrane Central Register of Controlled Trials were searched from database inception to October 2014. PubMed was searched from August to October 2014, to identify new studies not indexed in EMBASE/MEDLINE. Conference abstracts from the European Association of the Study of Diabetes (EASD) 49th Annual Meeting (2013), and the American Diabetes Association (ADA) 73rd (2013) and 74th (2014) Scientific Sessions were searched to identify studies ahead of full publication. Full search syntax is available in the supplementary material (Table S1). All searches were limited to English language publications.

RCTs suitable for inclusion in the review were selected using strict predefined inclusion/exclusion criteria based on the population, intervention, comparators, outcomes and study design (PICOS) approach recommended by the Cochrane Collaboration [17]. Key inclusion criteria were as follows: RCTs that examined the use of GLP-1 RAs at doses approved in the US and/or Europe for the treatment of T2DM (albiglutide 30 mg QW, albiglutide 50 mg QW, dulaglutide 0.75 mg QW, dulaglutide 1.5 mg QW, exenatide 5 µg BID, exenatide 10 µg BID, exenatide 2 mg QW, liraglutide 1.2 mg QD, liraglutide 1.8 mg QD, or lixisenatide 20 µg QD), each as add-on therapy to metformin monotherapy only, for the treatment of T2DM, in adults (aged over 18 years) who had inadequately controlled glycemia following treatment with metformin monotherapy. Outcomes were to be reported at 24 ± 6 weeks. This time period was deemed long enough to capture all outcomes of interest and allow for the inclusion of the largest body of evidence. Longer duration studies which reported outcomes at 24 ± 6 weeks were also eligible for inclusion in the review. Key exclusion criteria were as follows: RCTs that specifically examined patients with renal impairment, involved monotherapy treatment only, or where change in body weight was the primary outcome.

The GLP-1 RA clinical programs differed largely in terms of background medications allowed during the trial and patients previous use of antidiabetic agents. To minimize heterogeneity of the population included in the NMA, to account for different efficacies of various antihyperglycemic agents [1820], and to ensure similarity regarding the population’s clinical spectrum of T2DM, at least 80% of patients in each treatment arm within an eligible trial must have received background therapy consisting only of metformin monotherapy during the trial; and at least 80% of patients in all treatment arms within the trial must have metformin monotherapy or diet and exercise as the pre-trial diabetes management method.

Two reviewers independently determined whether articles met inclusion criteria. Abstracts of all identified articles were first assessed by each reviewer. Full text articles were obtained for any articles where inclusion could not be determined from the abstract alone. Discrepancies between reviewers were resolved by consensus—a third reviewer adjudicated unresolved disputes; the judgment of the third reviewer was considered final. Data extraction was performed by one analyst who extracted data elements from each included RCT, and line verification of the extraction fields was performed by a second reviewer. Discrepancies between reviewers were resolved as described above.

The Cochrane Collaboration’s tool for assessing risk of bias [21] was used to assess the quality of the included RCTs. Two reviewers independently assessed the quality of included studies. A third party resolved disagreement in the manner as described for study selection.

Statistical Methods

Mean values and associated measures of variability [variance, standard deviation, standard error (SE) or confidence interval (CI)] were extracted for continuous endpoints. Where measures of variability were not reported, the SE was imputed by borrowing information from other studies included in the review, using methods recommended by the Cochrane handbook for systematic reviews of interventions [17]. Counts or proportion of events was extracted for binary endpoints. Statistical analyses, other than the NMA were conducted using the statistical package Stata 12 (StataCorp, College Station, Texas, US).

The NMA was fitted by Markov Chain Monte Carlo techniques using the statistical package WinBUGS, version 1.4.3. (Cambridge, United Kingdom) [22] Code for the NMA was based on that recommended by the National Institute for Health and Clinical Excellence Decision Support Unit [23]. Non-informative (or “vague”) prior distributions were used to ensure estimates of treatment effects were informed by the trial data rather than by the choice of prior distribution.

For each outcome, random effects (RE) and fixed effect (FE) models were run. The a priori choice of model was an RE model, based on the assumption that there is not one true effect. A FE model was selected over an RE model by making considerations of the following aspects: (1) model fit among the FE and RE models, based on the deviance information criteria (DIC), which quantifies the trade-off between the goodness of fit of the data to the model and the complexity of the model, with a lower DIC value indicating a better fit; and (2) whether the posterior distribution of the between-studies variance was updated from the prior distribution with reasonable precision (i.e., credible interval (CrI) around between-studies variance was narrow enough that results from the RE model were still informative). If the between-studies variance was not appropriately updated, particularly because of having a sparse network of evidence (i.e., model was unable to estimate this parameter with the available evidence base), the FE model was selected over the RE model, when DIC values were comparable. However, in these cases, it is encouraged to look at results from both the FE and RE model and understand that the true value is likely between these findings.

Meta-regressions were conducted to explore heterogeneity in treatment effects due to differences in baseline values. These models were not deemed robust, as there were not enough data points informing the covariate effect (i.e., comparisons against the reference treatment informed by more than one study). The results from these models are hence not presented. Mean changes in HbA1c, weight, and SBP were analyzed using the mean difference scale; proportion of subjects achieving glycemic target, proportion of subjects experiencing nausea or who discontinued due to adverse events were analyzed on an odds ratio scale. Key elements of patient population and baseline risk were summarized in boxplots, to evaluate heterogeneity in the network [16, 24].

Each model was coded, analyzed, summarized using graphical and summary tables, validated, and reported. Model convergence was checked by examining caterpillar and density plots. The NMA presents results for GLP-1 RAs included in the network; antihyperglycemic agents that provided indirect evidence were not considered agents of interest and results for those agents are not presented.

For studies that presented endpoints in graph format only, values were derived by digitizing the graph, using the DigitizeIt program, version 1.5 (DigitizeIt, Braunschweig, Germany). Endpoints reported at week 24 were preferred, but data were acceptable if reported between 18 and 30 weeks post-randomization.

To generate estimates on the absolute scale, the mean reported absolute effect across the included placebo arms within each network was calculated and used as the anchor point (liraglutide 1.2 mg QD was used for the SBP network). These anchor values were then added to the relative treatment effects estimated by the NMA.

Compliance with Ethics Guidelines

This article is based on previously conducted studies, and does not involve any new studies of human or animal subjects performed by any of the authors.

Results

Supporting Evidence

The SLR identified 662 articles of which 14 RCTs met criteria for inclusion in the NMA [2538]. A PRISMA flow chart summarizing search results and study selection is provided in the supplementary material (Figure S1 in the supplementary material). Table 1 presents study and patient characteristics from the 14 RCTs included in the NMA.

Table 1 RCT characteristics

The mean age of participants ranged from 42.7 to 58.2 years, average duration of T2DM ranged from 4.4 to 8.2 years, and baseline HbA1c ranged from 7.8% to 8.9%. Boxplots for mean age, duration of T2DM, HbA1c and weight at baseline, by treatment comparators, are presented in the supplementary material (Figure S2).

The overall quality of the included trials in the network was good. The generation of the randomization sequence and concealment of blinding was adequate, when reported; however, this was not always described by authors. Eight studies were open-label, as this design is often used for drugs with injections, and for therapeutic monitoring. Most of the studies reported similar characteristics between treatment groups at baseline. Only two studies [25, 38] noted differences in gender proportions at baseline between treatment groups, but significance was not tested.

The endpoint data for the individual trials included in the NMA is presented in the supplementary material; Table S2 (HbA1c, weight and SBP), Table S3 (proportion of patients achieving glycemic target), and Table S4 (tolerability data).

NMA

Characteristics of model fit for all outcomes included in the NMA are presented in the supplementary material (Table S5).

All 14 trials provided evidence for the network of mean change in HbA1c from baseline (Figure S3 in the supplementary material). The relative effect sizes are presented in Table 2. All regimens performed significantly better than placebo at reducing HbA1c levels; point estimates varied from −0.42% for exenatide 5 µg BID to −1.09% for exenatide QW and dulaglutide 1.5 mg QW. For the comparisons of exenatide QW against other active agents for the HbA1c endpoint, exenatide QW was significantly better than lixisenatide 20 µg QD (mean −0.59%, 95% CrI −1.15, −0.03). The remaining comparisons of exenatide QW to the other GLP-1 RAs were not statistically different for HbA1c change.

Table 2 Relative effect sizes for mean change in HbA1c

Ten trials reported the proportion of subjects achieving glycemic target and definitions described as HbA1c <7% and ≤7% were considered for analysis (Figure S4 in the supplementary material). There were no statistically significant differences observed for exenatide QW versus the other GLP-1 RAs (Table 3).

Table 3 Relative effect sizes for odds of achieving glycemic target

All 14 included RCTs provided evidence for the network of mean change in weight from baseline (Figure S5 in the supplementary material). All GLP-1 RA treatment arms were associated with a reduction in weight from baseline (absolute change). Only exenatide 10 µg BID was significantly better than placebo in weight change from baseline (Table 4). There was no statistically significant difference in weight change between exenatide QW and the other GLP-1 RAs.

Table 4 Relative effect sizes for mean change in weight (kg)

A total of five trials provided evidence for mean change in SBP from baseline, for the following regimens: exenatide QW, dulaglutide 1.5 mg QW, and liraglutide 1.2 and 1.8 mg QD (Figure S6 in the supplementary material). The Get-Goal X trial [37], which evaluated exenatide 10 µg BID vs. lixisenatide 20 µg QD had to be excluded, as these agents were disconnected from the rest of the network (i.e., no direct comparisons were available between either exenatide 10 µg BID or lixisenatide 20 µg QD versus any of the other agents). None of the trials that reported change in SBP included a placebo arm, so liraglutide 1.2 mg QD was chosen as the reference treatment. Exenatide QW demonstrated a trend towards greater SBP reduction against all other GLP-1 RAs in the network, although these comparisons did not achieve statistically significant results (Table 5).

Table 5 Relative effect sizes from the fixed effect model for mean change in systolic blood pressure (mmHg)

A total of 11 RCTs reported nausea as an adverse event, but one trial by Gallwitz et al. [35] was excluded as information on this outcome was only provided in one treatment arm (Figure S7 in the supplementary material). Exenatide QW exhibited a trend towards a lower risk of nausea compared to all GLP-1 RAs except exenatide 5 µg BID, but results were not statistically significant (point estimates of the odds ratios ranged from 0.29 vs. dulaglutide 1.5 mg QW, to 0.98 vs. lixisenatide 20 µg QD; and 1.73 for the comparison against exenatide 5 µg BID) (Table 6).

Table 6 Relative effect sizes for risk of nausea

Thirteen RCTs reported the number of subjects discontinuing treatment due to adverse events (Figure S8 in the supplementary material). The endpoint estimate for risk of discontinuation due to adverse events indicated a beneficial effect for exenatide QW compared to dulaglutide 1.5 mg QW, and liraglutide 1.2 mg and 1.8 mg QD, but none of these differences were statistically significant (Table 7).

Table 7 Relative effect sizes for risk of treatment discontinuation due to adverse events

Estimates on the absolute scale are presented in the supplementary material (Table S6).

Discussion

The position statement of the ADA and the EASD recommends the addition of GLP-1 RAs as a therapeutic option for patients with T2DM who are inadequately controlled on metformin monotherapy, with consideration of individual patient-related factors [1]. This study presents the results of an NMA that investigated the relative efficacy and tolerability of GLP-1 RAs, and in particular examined how exenatide QW, the first once weekly GLP-1 RA, compared to other existing GLP-1 RAs. Overall, the results of this study suggest that exenatide QW has a similar effectiveness and tolerability profile compared to other members of its class, for the treatment of adults with T2DM inadequately controlled on metformin monotherapy, with some evidence of improved HbA1c control over lixisenatide, and, though not significant, trends toward improvement in SBP and reduced risk of nausea against all non-exenatide GLP-1 RAs.

Few differences in efficacy between the GLP-1 RAs, as add-on to metformin, were found. Exenatide QW was significantly better than lixisenatide 20 µg QD in reducing HbA1c, and comparable to other GLP-1 RAs for change in HbA1c. Exenatide QW demonstrated a trend towards favorable results in the proportion of patients achieving glycemic target compared to both exenatide BID regimens, liraglutide 1.2 mg QD and lixisenatide 20 µg QD, although these comparisons were not statistically significant. Change in weight from baseline was comparable for all GLP-1 RAs. The significant difference between exenatide QW and lixisenatide QD in HbA1c reduction, may be partially explained by the fact that the QW dosing of exenatide provides better glycemic control than lixisenatide, a short-acting agent, which is administered once daily and therefore, does not have an effect over 24 h.

Published NMAs on GLP-1 RAs [18, 39, 40] have reported few differences between individual therapies for the outcomes examined [18, 40]. Scott et al., found no statistically significant differences for glycemic control between exenatide QW and liraglutide (1.2 and 1.8 mg) [18]. A recent NMA, presented as a poster, did report that exenatide QW was associated with a greater reduction in HbA1c compared to dulaglutide 1.5 mg QW in the add-on to metformin network, but differences were not statistically significant [39]. In the current study, however, results indicated that exenatide QW was significantly better than lixisenatide 20 µg QD in reducing HbA1c, and comparable to other GLP-1 RAs for change in HbA1c. Another NMA [40] reported that exenatide QW, exenatide 10 mg BID, and liraglutide 1.8 mg QD ranked as the top three agents in terms of weight reduction compared to traditional hypoglycemic agents. In the present NMA, overall change in weight was comparable across all GLP-1 RAs. However, it is important to note that the current study relied upon a different evidence base than previous NMAs, with study duration restricted to 24 ± 6 weeks and the inclusion of any GLP-1 RA at licensed doses.

Based on direct head-to-head trials, researchers have reported significant differences between some GLP-1 RAs. In particular, in the DURATION-1 and DURATION-5 trials, exenatide QW was found to have superior efficacy in HbA1c reduction compared to exenatide BID [8, 9]. In the DURATION-6 trial, the reduction in HbA1c was significantly greater with liraglutide 1.8 mg QD treatment compared to exenatide QW (treatment difference for exenatide minus liraglutide was 0.21%, 95% CI 0.08–0.33) [10]. Yet, these trials were not included in the present NMA as patients in these trials were on a variety of oral antihyperglycemic agents prior to study entry and continued these throughout the trial, suggesting likely differences in disease severity compared to a population largely on metformin monotherapy.

Real-world evidence of the clinical effectiveness of GLP-1 RAs has been provided by retrospective studies [1113]. For change in HbA1c from baseline, the efficacy has been reported to be comparable between liraglutide QD and exenatide QW [13], as well as liraglutide QD and exenatide BID [11]. Results of the current NMA also found no differences in HbA1c change between liraglutide QD and exenatide QW. Although HbA1c change between liraglutide QD and exenatide BID was not directly compared, point estimates were similar, particularly for liraglutide 1.2 mg QD and exenatide 10 µg BID. One study has reported liraglutide to be more efficacious than exenatide BID in reducing HbA1c [12]. However, these studies rely upon the availability of data in medical databases and thus by their study design, may inherently be subject to confounding and bias that is absent from RCTs.

Previous trials have reported that exenatide QW was associated with less frequent mild-to-moderate nausea compared to exenatide BID [8, 9, 41]. In addition, exenatide QW has been previously reported to be associated with fewer treatment discontinuations due to adverse events compared to liraglutide 1.8 mg QD [10]. These findings are in line with the design of exenatide QW, which was developed to provide better tolerability given its gradual titration [42]. However, results in the current study indicated that exenatide QW was comparable to other GLP-1 RAs in the network for both nausea and treatment discontinuation due to AEs. The lack of any significant differences in these results was likely due to the limited evidence base and the low frequency of discontinuation due to AEs in the included trials. Yet, a recent pooled analysis of eight RCTs reported a similar safety and tolerability profile for exenatide QW compared to exenatide BID and liraglutide QD [43]. Results in the current NMA indicated a higher risk of nausea and treatment discontinuation due to AEs with exenatide QW compared to exenatide 5 µg BID, although these differences were not statistically significant.

Strengths and Limitations

A major strength of this study was the comparison of exenatide QW to GLP-1 RAs (albiglutide 30 mg QW, dulaglutide 1.5 mg QW, exenatide 5 µg and 10 µg BID, liraglutide 1.2 mg and 1.8 mg QD, and lixisenatide 20 µg QD) approved for use in the US and/or Europe that has not been investigated in RCTs. In addition, the current study included an examination of several efficacy and tolerability outcomes. Another key strength is the strict inclusion criteria developed for the study. Eligible RCTs were required to include at least 80% of patients who had failed metformin monotherapy prior to study entry, and received a GLP-1 RA as add-on to metformin during the trial. To minimize heterogeneity in the network, there were several trials that did not meet these criteria; in particular, eight exenatide QW trials [810, 4448] did not meet these criteria and were therefore excluded from the NMA. In fact, this stringent inclusion/exclusion criteria may have led to the exclusion of treatments that may have formed indirect comparisons to the treatments of interest which could have influenced the results. However, these higher order indirect comparisons would also have contributed less weight to the analysis [49].

There are several potential limitations of the analysis. First, baseline characteristics such as gender and ethnicity were not extracted, and there may be potential differences in the patient population based on these characteristics. However, it is unlikely that such characteristics would result in differences in clinical outcomes examined in this study. Second, physical activity and diet are often not reported in the publications and the impact of those components of care cannot be investigated. Third, there are several methodological differences among the trials including incorporation of double-blinded design and titration schedules. Fourth, the sparse network of evidence informing the analyses led to considerable uncertainty around estimates of between-study variance for all endpoints under the RE models (14 trials informing 16 comparisons). The limited evidence base was particularly evident for SBP. Finally, aggregate, study-level summaries provided estimates for inclusion in the analysis, and are subject to ecological bias. The alternative would be to use individual patient-level data, which were not available.

Conclusion

Overall, the GLP-1 RA class is effective in controlling hyperglycemia and well-tolerated in patients with T2DM. The results of this NMA demonstrated that, exenatide QW has similar effectiveness and tolerability compared to other GLP-1 RAs, for the treatment of T2DM in adults inadequately controlled on metformin alone. The scientific rigor of the conduct of the SLR and NMA utilized in this study provides confidence in the robustness of the results.