Background

Glaucoma is the leading cause of irreversible blindness worldwide. Because conventional trabeculectomy and glaucoma medicines result in low success rates [1, 2], glaucoma drainage implants (GDIs) have been used with increasing frequency in the management of refractory glaucoma. In 1969, Molteno [3] invented the first of many glaucoma implants. The Ahmed glaucoma valve (AGV) and Baerveldt glaucoma implant (BGI) are currently two of the most commonly used implants for aqueous drainage. Both of them reduce intraocular pressure (IOP) by draining aqueous humor through a tube to a subconjunctival end plate. The AGV contains a one-way valve, which opens in response to a pressure increase in the anterior chamber, and thus helps to reduce the risk of complications, such as hypotony [4, 5]. The BGI, which has no valves, is available in three models according to different surface areas of the end plate (500 mm2, 350 mm2, and 250 mm2). A review by Patel et al. [6] concluded that the AGV has similar success rates and IOP-lowering effects as the BGI. However, a study by Budenz et al. showed that BGI implants produce greater long-term reduction in IOP [7]. Therefore, in the present study, we aimed to determine the efficacy and safety of these two devices for treating patients with glaucoma.

Methods

The study was approved by the ethics committee at Xiangya Hospital, Central South University, and was conducted in accordance with the Protocol of Helsinki.

Search strategy and trial selection

We searched PubMed, EMBASE, and Cochrane Controlled Trials Register databases (up to February, 2015) using the following search terms: “glaucoma,” “ocular hypertension,” “intraocular pressure,” “Ahmed,” and “Baerveldt.” The publication dates and languages were not limited, and we identified references of retrieved articles and reviews (Additional file 1). Screening of the articles was performed independently by two reviewers. Studies meeting the following criteria were considered eligible for our meta-analyses: (1) a study design involving comparative clinical trials, including randomized controlled clinical trials (RCTs) and non-randomized controlled clinical trials (non-RCTs); (2) eyes diagnosed with glaucoma undergoing the AGV or BGI; and (3) at least one of the following reported outcomes: success rate, number of glaucoma medicines, mean IOP, and occurrence of adverse events. Exclusion criteria were as follows: (1) case reports, reviews, animal trials, and letters to the editor; (2) studies involving surgery combined with other glaucoma surgeries; (3) studies that implanted two or more GDIs; and (4) studies involving eyes undergoing GDI replacement surgery.

Data extraction and qualitative assessment

Article quality and extracted data were assessed by two independent readers. Any disagreements were resolved by discussion. The information collected included the first author, publication year, study design, participants (number, age, and sex), GDI model, follow-up time, and baseline IOP.

Quality assessment of the RCTs was performed using Cochrane Collaboration’s tool to assess risks of bias [8], including selection bias, performance bias, detection bias, attrition bias, reporting bias, and other biases. Every bias item was associated with a level of risk (high, low, or unclear). The quality of non-RCTs was evaluated according to an assessment system for non-randomized studies reported by the Chinese Cochrane Centre [9]. The checklist of the system consisted of six items: methods of grouping, methods of blinding, inclusion of all patients, baselines, standards of diagnosis, and control of confounding factors. Because bias of selective reporting was not included in this system, we added the item in assessment. Each item was worth 0–2 points, with a maximum total of 14 points. The overall quality of evidence was evaluated using the GRADE system (performed by GRADEpro3.6, http://cebgrade.mcmaster.ca/Introduction/index.html) [10].

Statistical analysis

Data analysis was performed using Review Manager 5 software (RevMan 5, The Cochrane Collaboration, Oxford, UK). For dichotomous outcomes, odds ratios (ORs) were calculated. For continuous outcomes, the mean and SD were used to calculate weighted mean differences (WMDs). The heterogeneity of effect size was evaluated by the chi-square test. I2 statistics and P value were calculated. P >0.1 was considered as no significant heterogeneity. Results were pooled using the random-effect model in a meta-analysis. To evaluate publication bias, we performed Begg’s test [11] and inspected funnel plots. P <0.05 was considered statistically significant. A sensitivity analysis was conducted to confirm the stability of the meta-analysis results. PRISMA checklist for this meta-analysis can be obtained in Additional file 2.

Results

The study identification process is illustrated in Fig. 1. A total of 54 articles were identified by search strategies after duplicates were removed. No study reporting other outcomes was found in comparing the two interventions. Ten articles that enrolled a total of 1048 eyes (486 in the AGV group and 562 in the BGI group) were included in our meta-analysis [7, 1220]. Two of them were RCTs and the remaining studies were retrospective comparative studies. Two of the included retrospective comparative studies (Tesser et al. [16] and Chung et al. [17]) concurrently performed lens extraction (phacoemulsification or extracapsular cataract removal) with intraocular lens (IOL) implantation or secondary IOL implantation. Although we did not limit the types of glaucoma, most patients undergoing implantation were diagnosed with refractory glaucoma. The mean ages ranged from 5 months to 80 years. The male to female sex ratio ranged from 0.57 to 1.67 in the AGV group, and 0.6 to 1.88 in the BGI group. The follow-up time ranged from 8 months to 5 years. Study characteristics are listed in Table 1.

Fig. 1
figure 1

Flow diagram for the selection of included trials

Table 1 Baseline characteristics of eligible clinical trials

Qualitative assessment of these studies is summarized in Tables 2 and 3. Chung et al’s study [17] was assessed with a low quality score (score 5). Tesser et al’s study [16] had an inadequate sample size. Both of these studies concurrently performed lens-related surgeries. To eliminate potential heterogeneity, we performed a sensitivity analysis after removal of data from these two articles.

Table 2 List of biases in RCTs
Table 3 Quality assessment of non-RCTs

For studies with results available at different time points, we analyzed short-term results and long-term results separately. For analysis of short-term results, we pooled data during the mean follow-up times between 6 months to 18 months. Data at 1-year time points in long-term studies were also included. Data at final follow-ups of studies with mean follow-up times >18 months were analyzed for long-term results. Subgroup analyses were performed based on patients’ age (children and adults subgroups) and the study design (RCT and non-RCT subgroups). The boundary of age between the children subgroup and adult subgroup was 18 years.

Success rate

The definition of success rate was consistent with the original studies with one exception. Christakis et al. [12] reported three sets of results according to different IOP criteria (≤14 mmHg, 18 mmHg, or 21 mmHg). We adopted results using IOP criteria less than 21 mmHg in this article. For the rest of the studies, the crude data was pooled directly based on their original definition of success rate. Five studies (714 eyes) were included in the short-term analyses, and seven studies (835 eyes) were included in the long-term analyses. In short-term follow-up, the success rate in the AGV group was 78.6 % and that in the BGI group was 79.7 %. No significant difference was observed between the two groups (OR: 0.97; 95 % confidence interval [CI]: 0.56, 1.66; P = 0.90) (Fig. 2). Sensitivity analyses (4 studies, 682 eyes) yielded a similar result (OR: 0.87; 95 % CI: 0.56, 1.35; P = 0.53). In long-term follow-up, the success rate in the AGV group was 59.2 % and that in the BGI group was 68.4 %. Pooled results (OR, 0.73; 95 % CI: 0.54, 0.99) showed a P-value of 0.04, slightly less than the 0.05 threshold (Fig. 3). Therefore, there are some evidence indicate that the success rate for the AGV group was significantly lower than BGI group in long-tern follow-up. Moreover, sensitivity analyses (6 studies, 826 eyes) showed no significant difference between the two groups (OR: 0.74; 95 % CI: 0.55, 1.00; P = 0.05). A pooled result from three studies (632 eyes) showed that the number of reoperations for glaucoma in the AGV group was significantly higher than that for the BGI group (OR: 2.70; 95 % CI: 1.54, 4.74; P = 0.0005).

Fig. 2
figure 2

Forest plot of meta-analysis: success rates in short-term follow-up

Fig. 3
figure 3

Forest plot of meta-analysis: success rates in long-term follow-up

A summary of subgroup and sensitivity analyses is shown in Table 4. Although the BGI group showed a higher success rate in total results for long-term follow-ups than the AGV group, subgroup and sensitivity analyses did not show a significant difference between the two groups. The pooled results of the RCT and non-RCT subgroups showed no evidence of statistically significant differences between the two groups for short- and long-term follow-ups. Data from two studies (40 eyes) that focused on children were pooled in long-term follow-up. We found no significant difference in success rate was been observed (OR: 0.96; 95 % CI: 0.04, 21.88, P = 0.98) and there was high heterogeneity (I2 = 64 %, P = 0.1). The large CI suggests that this result may not be reliable. The pooled results of the adult subgroup showed that there was no significant differences in two follow-up times (Table 4). The heterogeneity test showed a lack of significant heterogeneity for total and sensitivity analyses, and RCT, Non-RCT subgroup (I2 < 50 %, P >0.1).

Table 4 Comparison of the success rate

IOP

We pooled the mean IOPs for the two groups because all articles reported the absolute IOP after the operation. Detail data of total and subgroup analyses are shown in Table 5. In short-term follow-up, the difference in the pooled mean IOP from six studies (685 eyes) for the AGV group compared with the BGI group was 2.12 mmHg (95 % CI: 0.72, 3.52), which was statistically significant (P = 0.003, Fig. 4). Significant heterogeneity was observed (I2 = 49 %, P = 0.08). Sensitivity analyses showed that the overall WMD did not substantially change, and no evidence of significant heterogeneity was observed (I2 = 0 %, P = 0.6). In long-term follow-up, the difference in the pooled mean IOP from seven studies (659 eyes) for the AGV group compared with the BGI group was 1.85 mmHg (95 % CI: 0.43, 3.28), which was statistically significant (P = 0.01, Fig. 5). However, significant heterogeneity was observed (I2 = 44 %, P = 0.1). The result of Sensitivity analyses (excluded two studies) consisted with the total group (included all eligible studies), but heterogeneity was still significant (I2 = 53 %, P = 0.06).

Table 5 Comparison of postoperative IOP
Fig. 4
figure 4

Forest plot of meta-analysis: intraocular pressures in short-term follow-up

Fig. 5
figure 5

Forest plot of meta-analysis: intraocular pressures in long-term follow-up

The pooled results from the RCT group were similar to the total group in short-and long-term follow-ups, with no statistically significant heterogeneity. No difference in IOP was observed between the BGI and AGV groups in the non-RCT subgroup in short-term follow-up (4 studies, 208 eyes, WMD: 1.68; 95 % CI:-1.27, 4.63; P = 0.26) and long-term follow-up (5 studies, 319 eyes, WMD: 2.18, 95 % CI: −0.91, 5.27; P = 0.17). Significant heterogeneity was observed in the non-RCT group (P <0.1). The results of the children subgroup analysis consisted with the total group. For short-term follow-up, adult subgroup analysis included two RCTs and one non-RCT study (total 509 eyes). There was no significant difference in IOP between the two groups (WMD: 1.44, 95 % CI: −0.76, 3.65; p = 0.20) and significant heterogeneity was observed (I2 = 74 %, P = 0.02). Adult subgroup analysis included the same studies as the RCT group in long-term follow-up.

Use of glaucoma medications

The mean number of glaucoma medications was reported by three studies (558 eyes) for short-term follow-up and seven studies (659 eyes) for long-term follow-up. Pooled differences showed that BGI implantation lowered the number of medications by a significant value of 0.29 (95 % CI: 0.07, 0.50; P = 0.009) in short-term follow-up (Fig. 6) and 0.42 (95 % CI: 0.22, 0.62; P <0.05) in long-term follow-up (Fig. 7). Sensitivity analysis and RCT subgroup analysis showed a significant difference in the mean number of glaucoma medications between the BGI and AGV groups in long-and short-term follow-up (Table 6). The random-effect model was used for pooling. One retrospective study (81 eyes) reported that medication use was not significantly different between the BGI and AGV groups. For the long-term follow-up, the pooled results of the non-RCT subgroups were consistent with the total group. No significant difference in use of glaucoma medication between the BGI and AGV groups was observed in the children subgroup (3 studies, 85 eyes). The WMD was 0.20 (95 % CI: −0.40, 0.80, P = 0.51). Adult subgroup analysis included the same studies as the RCT subgroup. The heterogeneity test showed a lack of significant heterogeneity for the total, subgroup, and sensitivity analyses.

Fig. 6
figure 6

Forest plot of meta-analysis: medications in short-term follow-up

Fig. 7
figure 7

Forest plot of meta-analysis: medications in long-term follow-up

Table 6 Using of medication in comparing AGV with BGI

Postoperative complications

A total of 971 eyes (443 in the AGV group and 528 in the BGI group) were included in analysis of complications. Because Budenz et al. reported early (≤3 months) complications [21] and late (>3 months) complications [22], the latter category was used in the pooled calculations. The definition of severe complications was the same as that in the original studies, including severe complications and devastating complications. If the studies did not report numbers of severe or devastating complications, we included the following complications for pooling: suprachoroidal hemorrhage, severe choroidal effusion (requiring correctional surgery), retinal detachment, endophthalmitis, and vitreous hemorrhage. A total of 158 eyes in the AGV group and 199 eyes in the BGI group experienced complications. Eyes in the AGV group experienced a significantly lower overall occurrence of complications than those in the BGI group (OR, 0.67; 95 % CI: 0.50, 0.90; P = 0.007) and no heterogeneity was identified (I2 = 0 %, P = 0.88). The occurrence of severe complications in the AGV group was also lower than that in the BGI group (OR: 0.57; 95 % CI: 0.36, 0.91, P = 0.02). The AGV group was characterized by a lower incidence of hypotony, but this difference was not statistically significant (6 studies, 724 eyes; OR: 0.54; 95 % CI: 0.26, 1.11; P = 0.1). There were no significant differences in hyphema, choroidal effusion, and tube complications (including tube obstruction, malposition, and erosion) between the two groups. The results of sensitivity analysis were consistent with the total groups (included all eligible studies). The incidence of complications in both groups is listed in Table 7.

Table 7 Risk of complications in comparing AGV with BGI

Begg’s test and funnel plots were used to assess publication bias in pooled effect sizes that calculated using five or more studies. Publication bias assessment showed no significant bias in success rates, IOP, and glaucoma medications in long-term follow-up, overall and severe complications, hypotony, and choroidal effusion (all P ≥ 0.05).

We used GRADEpro 3.6 software to assess the quality of evidence for each outcome in the total groups (Table 8). Because data from RCTs and non-RCTs were included in the analysis, we used the standards of an observational study to assess overall outcomes. The pooled IOP and risk of tube complications were identified significant heterogeneity; therefore we graded it as “inconsistency”. We downgraded outcomes of tube complications, IOP in short-and long-term as “very low” quality. The rest of the outcomes were graded “low” quality.

Table 8 Summary of AGV compared to BGI for glaucoma

Discussion

A total of 10 studies were included in this meta-analysis. Two of these studies were RCTs and eight were non-RCTs. The pooled results showed no statistically significant difference in success rates between the AGV and BGI groups for short-term follow-up. The success rates for the AGV group were lower than for the BGI group for long-term follow-up, but sensitivity and subgroup analyses showed a lack of stability. Nonetheless, the BGI group had better efficacy in controlling IOP than the AGV group. The pooled results from the RCT subgroup support the point that better efficacy in the BGI group, but the non-RCT subgroup showed negative results with significant heterogeneity. The BGI group required fewer glaucoma medications than the AGV group. More reoperations for glaucoma were required in the AGV group than in the BGI group. With regard to safety, the AGV was associated with a significantly lower overall frequency of adverse events and incidence of severe complications than the BGI. In subgroup analysis based on age, all of the studies that were included in children subgroup analyses were retrospective studies and sample sizes were small. More well-designed studies with a larger sample size needed to be performed in children. Publication bias and heterogeneity testing indicated that the pooled results were valid.

Although both implantations shared a similar success rate, the BGI resulted in a lower level of postoperative IOP and use of glaucoma medications than the AGV. The major success criteria, upper limit of IOP, ranged from 21 to 24 mmHg. However, the Advanced Glaucoma Intervention Study showed that an IOP target of greater than 18 mmHg may be insufficient to prevent progression of visual field defects [23]. Therefore, when setting a strict IOP target, the BGI may be more advantageous than the AGV. A larger surface area of the end plate for the Baerveldt implant (350 mm2 or 250 mm2) compared with the Ahmed valve (184 mm2) would theoretically help aqueous humor reabsorption into the circulation. Previous studies compared the efficacy of IOP control in several GDIs with different surface areas. They showed that the double-plate Molteno implant (surface area = 268 mm2) was superior to the single-plate implant [24]. The 350-mm2 Baerveldt implant was more successful than the 500-mm2 implant for overall IOP control [25]. These studies suggested that IOP control may be nonlinear relative to the surface area of the end plate. Although the AGV is equipped with a valve to reduce the occurrence of postoperative complications, the resistance to aqueous humor outflow eventually becomes counterproductive [26].

The models of the Ahmed valve and Baerveldt implant in our study were not consistent. Old polypropylene models (S2 and S3) and new silicone models (FP7) of the Ahmed valve were tested. Whether differences in biomaterial and end plate rigidity added an additional contribution to long-term IOP results was still uncertain, but the silicone model was associated with a lower incidence of complications [2730]. Our study included the 350-mm2 model and the 250-mm2 model of BGIs, and both were made of silicone. Previous studies showed that these two models shared similar success rates and occurrence of complications [15, 31]. Despite the controversial effects of characteristics of the implant, potential heterogeneity from inconsistencies in the models could weaken the pooled results.

Begg’s test and funnel plots were used to assess publication bias. We found no significant bias. However, the results of the funnel plots may not be statistically meaningful because of the lack of power for the small sample size.

To minimize heterogeneity due to the inconsistencies of follow-up times, we pooled data for two time periods. Implantations concurrent with lens-related surgeries were enrolled in this meta-analysis. Phacoemulsification and extracapsular cataract removal can reduce IOP [32, 33], especially in patients with a shallow anterior chamber. However, the effects of these procedures combined with glaucoma implantation devices were uncertain. In addition, extra surgical procedures could lead to a higher risk of adverse events. Despite this heterogeneity, we included these two articles because they provided important clinical information. Furthermore, sensitivity analysis was performed to examine the heterogeneity.

There are some limitations to our study. First, only two RCTs were included in the studies. Most studies were retrospective comparative studies that had a potential selection bias. A small sample size and incomplete baseline data also weakened the validity of the tests. Second, surgical success and complication criteria were not standardized among the included studies. Therefore, standardized assessment criteria should be established in further studies. Third, our current statistical methodology assumed that the input samples were approximately symmetric and approximately followed a Gaussian distribution. However, the values of glaucoma medications are non-negative integers, mostly in the range of 1 to 4, which are more likely to have a skewed distribution. Skewed distributions tend to have larger SD than mean. This over-generalized assumption may result in biased conclusion. Fourth, we did not analyze visual outcomes as a result of inconsistent statistical methods used in the visual results. Furthermore, we did not perform subgroup analyses for types of glaucoma and race.

When choosing a device, other factors should also be considered, for example, the experience of the surgeon, compliance during follow-up, and the goals for therapy. Moreover, additional RCTs with a longer duration and a larger sample size are required to better determine the efficacy and safety of the AGV and BGI for the treatment of glaucoma.

Conclusions

This study showed no significant difference in the success rate between the BGI and AGV groups. The BGI performed better in the control of IOP and required fewer medications than the AGV. The AGV performed better than the BGI regarding safety.