Introduction

Acromioclavicular joint (ACJ) dislocations are common injuries, particularly in athletes involved in contact sports and victims of road traffic accidents [1, 2]. The mechanism is usually a direct fall onto the shoulder with the arm in adduction [1, 2]. The ACJ is stabilized by the acromioclavicular (AC) ligament and the coracoclavicular (CC) ligaments [3]. Disruption of the CC ligaments typically results in superior and horizontal displacement of the clavicle [4].

The amount of displacement has been classified by Rockwood [5]. Type I injuries represent a sprain of the AC ligament complex; type II injuries represent rupture of the ACL ligaments with no marked elevation of the clavicle [6]. These injuries should be treated non-operatively [2, 5, 6]. For types III–VI, both the AC ligament and CC ligaments are completely ruptured, and surgical treatment is generally recommended [2, 6].

Multiple surgical techniques have been described and include temporary coracoclavicular screw fixation, coracoacromial ligament transfer, CC ligament reconstruction, acromioclavicular stabilization with double-buttons, or hook plate fixation [2]. Arthroscopically assisted acromioclavicular joint stabilization using double-button suture devices via transclavicular-transcoracoid tunnels and hook plate fixation are contemporary techniques, and both can be considered standard techniques for acute injuries [6]. The suture button is a suspensory fixation technique that can be performed either open or arthroscopic assisted, augmenting the CC ligament complex with high-strength sutures [7]. Advantages of this single-stage technique are that associated glenohumeral pathologies can be treated concomitantly, and are reportedly well-accepted by patients. However, it is technically challenging and is associated with a 20–25% prevalence of implant irritation and iatrogenic clavicle and coracoid fractures [6]. The hook plate utilizes a special modified plate which is placed beneath the inferior surface of the acromion and fixed with screws on the superior aspect of the clavicle [8]. The advantages of this technique include early functional rehabilitation and low technical requirements, but it has been associated with acromial osteolysis, acromial fractures, and subacromial impingement pain [6].

Currently, there is no consensus as to which surgical technique is preferred. A recent meta-analysis has included only four studies which were published between 2014 and 2018, and concluded that both techniques provide good clinical outcomes but note double-button suture fixation resulted in less postoperative pain [9].

The purpose of this study was, therefore, to perform an updated systematic review and meta-analysis of both randomized controlled and observational studies comparing double-button suture fixation to hook plate fixation for types III–IV acromioclavicular joint dislocation.

Methods

The updated guidelines described in the Cochrane Handbook [10] and the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines [11] were used to conduct this study.

Eligibility criteria

A comprehensive literature search for all studies comparing suture button to hook plate fixation for types III–VI acromioclavicular dislocations was performed on studies that fulfilled the inclusion and exclusion criteria. All levels I–III evidence studies were considered if they were published between January 2000 and August 2022. Level III studies were included to increase the sample size and increase the generalizability of the pooled results [12]. Several authors have demonstrated that the inclusion of LOE III studies does not cause any differences in the risk estimate of treatment effects of an intervention derived from randomized controlled trials, observational studies, or a combination of LOE I–III studies [12,13,14]. If studies compared various surgical interventions, or included a non-operative study arm, the data of interest were extracted from these studies and included in the analysis. Other inclusion criteria were defined as follows: minimum follow-up of 12 months and inclusion of at least one functional validated outcome scores (such as Constant, ASES, DASH, VAS, OSS, UCLA, or SANE). Level IV case series, abstracts, conference proceedings, and articles from non-peer-reviewed journals or unrefereed pre-publications were excluded. Although omission of gray data sources could have resulted in publication bias, it was considered that these publications would have been ineligible for inclusion.

Literature search

All publications in English and German listed in the databases of Medline, Embase, Scopus, and Google Scholar were identified and checked against the eligibility criteria. The following terms and Boolean operators were utilized: “acromioclavicular” AND/OR “dislocation” AND/OR “injury” AND/OR “Rockwood”; AND/OR “suture button” AND/OR “TightRope” AND/OR “hook plate” AND/OR “operative treatment”; AND/OR “clavicle hook plate”; AND/OR “acromioclavicular joint.” For the Medline search, the following MeSH term “acromioclavicular joint” was used in addition to the above search strategy. In addition, a manual search of the included studies and published reviews was conducted for other potentially eligible studies. Two reviewers conducted independent title and abstract screening. Disagreements between reviewers were resolved by consensus, and if no consensus was reached, they were carried forward to the full-text review.

Data extraction and quality assessment

An electronic data extraction form [Excel spreadsheet] was used to obtain the following data from each article: level of evidence, country, age, gender, length of follow-up, sample size, clinical outcome scores, and complications. Risk of bias was assessed using the Cochrane Collaboration’s Risk of Bias Tool [10] or LOE I and II studies. For LOE III studies, the ROBINS-I tool was used [15]. The GRADE system was used by two reviewers to assess the quality of the body of evidence for each outcome measure [10]. The recommendations from the Cochrane Handbook were followed, and an initial level of certainty assigned. Studies were downgraded if there was a high risk of bias, inconsistency and imprecision of the results, and indirectness of evidence. Studies were upgraded if there were large treatment effects, a dose–response, or reasons to oppose plausible residual bias and confounding effects. In addition to GRADE, the Methodological Index for Non-Randomized Studies (MINORS) and modified Coleman Methodology Score were used as a second validated instrument to assess the quality of the included surgical studies [16]. The MINORS score was categorized as per Ekhtiari et al. [17] into the following categories: < 6 very low quality, < 10 poor quality, < 14 fair quality, and > 16 good quality. Any disagreement between reviewers was resolved by consensus and/or by arbitration between the two senior authors.

Statistical analysis

Interobserver differences for study eligibility and risk of bias were measured using Cohen’s kappa coefficient. Heterogeneity of the data was assessed using χ2 and I2 statistics. Outcomes were pooled using a random effects model if the I2 statistic was > 25%, and a fixed model was used if the statistic was < 25%. Pooling of data for clinical outcomes, stability measures, and functional testing was only performed if a minimum of three studies were available. The prevalence of osteoarthritis between groups was pooled as a binary yes/no variable and analyzed by calculating the odds ratios. If standard deviations were not reported, the standard deviation was calculated using the following formula: SD = max–min/4 [10, 18]. All tests of significance were two-tailed, and an α of less than 0.05 was considered significant. Publication bias was assessed using funnel plots and Egger’s test. Funnel and forest plots, and all statistical analyses, were performed using STATA SE (Version 13.0; StataCorp, College Station, Texas, USA) for Windows and the comprehensive meta-analysis software package (CMA), version 3 (Biostat Inc., Englewood, NJ, USA).

Results

Study selection and characteristics

The initial literature search identified 137 studies for consideration. Of those, 35 studies were excluded for duplication, and the titles of the remaining 102 publications were checked for eligibility. Another 20 studies were excluded, and the full-text manuscripts of 82 studies were examined. Fifteen studies met all of the eligibility criteria and were included in the analysis (Fig. 1) [19,20,21,22,23,24,25,26,27,28,29,30,31,32,33]. Four studies were level II evidence [19,20,21,22], and the remaining eleven studies were level III evidence [23,24,25,26,27,28,29,30,31,32,33]. All 15 studies were published in English between 2013 and 2022, with a cumulative total of 682 cases. A total of 360 cases were treated with a hook plate, and 322 cases were treated with double-button fixation. The study characteristics are summarized in Table 1. Overall agreement between the two reviewers for final eligibility was excellent (kappa value 0.94, 95% CI 0.91–0.97).

Fig. 1
figure 1

PRISMA flow diagram. From the initial 137 records, 15 studies were included in the quantitative synthesis

Table 1 Summary of all included studies

Risk of bias and quality assessment

The findings of the risk of bias assessment are summarized in Tables 2 and 3.

Table 2 Risk of bias Cochrane risk of bias assessment tool version 2
Table 3 Risk of bias ROBINS Cochrane risk of bias assessment tool version 2

Risk of bias Cochrane assessment tool version 2

Three of the LOE II studies [19,20,21] were assessed as having a high risk of bias. The high risk was caused by bias from randomization. Sokkar et al. [19] have described in their methods section that they randomly divided the patient cohort, but failed to provide any detailed information. Cai et al. [20] have apparently performed a prospective randomized trial, but failed to provide any specific information in their manuscript. Pongsamakthai et al. [21] used a sealed envelope and box size techniques, but also failed to provide any detailed information regarding how randomization was executed. Stein et al. [22] used a web-based randomizer but also failed to provide further details, and their randomization protocol was, therefore, assessed as having some risk.

Risk of bias ROBINS-I assessment tool

For the eleven observational studies, the ROBINS-I tool was used to assess the risk of bias [23,24,25,26,27,28,29,30,31,32,33]. The study by Taleb et al. [28] was assessed as having a critical risk of bias, after they performed a retrospective study where group allocation was unclear. This assessment has resulted in an overall critical risk of bias. Four studies [23, 26, 30, 32] were assessed as having a serious risk of bias, with bias in the classification of interventions [23, 26, 30]; the intervention groups were not clearly defined in these manuscripts. Two studies [30, 32] were assessed as having serious risk due to missing data. Fosser et al. [30] included 120 patients initially, but only 31 patients were considered in the final analysis. Yoo et al. [32] included 38 patients initially, but only 22 were considered in the final analysis.

Publication bias was not detected. The funnel plot was symmetric, and Egger’s regression intercept (intercept − 2.91, t-value 1.18, p-level 0.257) was not significant (Fig. 2).

Fig. 2
figure 2

Publication bias: Publication bias was not detected. The funnel plot was symmetric, and Egger’s regression intercept (intercept − 2.91, t-value 1.18, p-level 0.257) was not significant

Quality assessment

The GRADE quality assessment for all included outcomes is summarized in Table 4. All outcomes were downgraded to very low quality, related to inconsistency of results due to a considerable I2, high risk of bias, and missing 95% confidence intervals.

Table 4 Quality assessment using the Cochrane GRADE system

The MINORS score (Table 5) was above 16 indicating good study quality for three LOE II studies [20, 22] and for two LOE III studies [24,25,26,27]. Fair study quality was observed in eight LOE III studies [25, 26, 28,29,30,31,32,33]. One LOE II study [19] and one LOE III study [23] were considered low quality. The modified Coleman Score (Table 6) demonstrated good study quality for one study [25], fair study quality for eight studies [20,21,22, 24, 26, 29, 30, 33], and poor quality for six studies [19, 23, 27, 28, 31, 32].

Table 5 MINORS quality assessment
Table 6 Modified Coleman Methodology Score

Clinical outcomes

The clinical outcomes for all studies are summarized in Table 1. The pooled estimate for both LOE II and LOE III studies demonstrated significant differences between the hook plate and double-button fixation, in favor of double-button fixation (SMD − 0.662, 95% CI − 1.013 to − 3.692, p = 0.0001, I2 = 74%; Fig. 3). According to Cohen, the magnitude effect is medium, suggesting that more than 69% of the results with the hook plate are below the results when using double-button fixation [34]. The pooled estimate for LOE II studies was similar to the pooled results for LOE II and III studies but was not statistically significant (SMD − 0.682 95% CI 0.183 to − 1.595, p = 0.111, I2 = 96%; Fig. 4). According to Cohen, the magnitude effect is medium, suggesting that more than 69% of the results with the hook plate are inferior to the results when using double-button fixation [34]. The failure to reach significance can possibly be explained by study heterogeneity and the lack of a statistically significant between group differences in two studies [19, 20]. The pooled estimate for LOE III studies was similar to the pooled results for LOE II and III studies and was statistically significant in favor of double-button fixation (SMD − 0.650, 95% CI − 1.049 to − 3.187, p = 0.001, I2 = 56%; Fig. 5). According to Cohen, the magnitude effect is medium, suggesting that more than 69% of the results with the hook plate are inferior to the results when using double-button fixation [34].

Fig. 3
figure 3

Forest plot comparing clinical outcomes for all studies. The pooled estimate for all studies was statistically significant (p = 0.0001) in favor of button repair

Fig. 4
figure 4

Forest plot comparing clinical outcomes for LOE II studies. The pooled estimate for all studies could not statistically significant (p = 0.11)

Fig. 5
figure 5

Forest plot comparing clinical outcomes for LOE III. The pooled estimate for all studies was statistically significant (p = 0.001) in favor of button repair

Clinical outcomes for pain

Eleven studies [19, 20, 22, 25,26,27,28,29, 31,32,33] included pain assessment scales in their study. The pooled estimate for both LOE II and LOE III studies demonstrated significant differences between the hook plate and double-button fixation in favor of double-button fixation (SMD 0.333, 95% CI 0.139–3.369, p = 0.001, I2 = 85%; Fig. 6). According to Cohen, the magnitude effect is between medium and small, suggesting that between 58 and 69% of the results with the hook plate were inferior to the results when using double-button fixation [34].

Fig. 6
figure 6

Forest plot comparing outcomes for pain for all studies. The pooled estimate for all studies was statistically significant (p = 0.001) in favor of button repair

Three LOE II studies [19, 20, 22] included pain assessment scales in their study. The pooled estimate for LOE II studies demonstrated significant differences between the hook plate and double-button fixation in favor of double-button fixation (SMD 0.481, 95% CI 0.155–0.806, p = 0.004, I2 = 59%; Fig. 7). According to Cohen, the magnitude effect is small, suggesting that only 58% of the results with the hook plate were inferior to the results when using double-button fixation [34].

Fig. 7
figure 7

Forest plot comparing outcomes for pain for LOE II studies. The pooled estimate for all studies was statistically significant (p = 0.004) in favor of button repair

Eight LOE III studies [25,26,27,28,29, 31,32,33] included pain assessment scales in their study. The pooled estimate for LOE III studies demonstrated significant differences between the hook plate and double-button fixation in favor of double-button fixation (SMD 0.317, 95% CI 0.012–2.882, p = 0.004, I2 = 0%; Fig. 8). According to Cohen, the magnitude effect is between small and medium, suggesting that between 58 and 69% of the results with the hook plate were inferior to the results of when using double-button fixation [34].

Fig. 8
figure 8

Forest plot comparing outcomes for pain for LOE III studies. The pooled estimate for all studies was statistically significant (p = 0.004) in favor of button repair

Complications

All studies reported complication rates (Table 7). The pooled estimates demonstrated overall lower complication rates in the double-button group, but the between-group differences did not reach statistical significance (odds ratio 2.220, 95% confidence intervals 0.95–5.188, p = 0.065) (Fig. 9).

Table 7 Complications
Fig. 9
figure 9

Forest plot comparing complications for all III studies. The pooled estimate for all studies was not statistically significant (p = 0.065) but favored button repair. The odd ratio indicated that complications were twice as likely in the clavicle hook plate

Discussion

The results of this meta-analysis clearly demonstrated superiority of the double-button repair compared to the clavicle hook plate. For clinical outcomes, the pooled estimates for all studies demonstrated that more than 69% of all double-button repairs had better clinical outcomes when compared to the hook plate. This result was also observed when separately pooling LOE II and LOE III studies. Similarly, patients who were treated with double-button repair had significantly less pain, but the treatment effect was lower and 58–69% of the double-button group had substantially less pain than patients treated with the hook plate. Double-button repair was also associated with a 2.2 times lower overall risk for complications.

An earlier meta-analysis included four studies and concluded that both techniques resulted in good clinical outcomes, but reported that double-button fixation had an advantage with regard to postoperative pain [9]. Compared to Pan et al. [9], our meta-analysis has included a total of 15 studies, increasing the sample size by 400%. Pan et al. [9] have included two case–control, one cohort, and one randomized controlled trial, and pooled all studies. It could be argued that the inclusion of observational studies is typically associated with high risk of bias, heterogeneity, and could be unbalanced on confounding factors. Therefore, non-randomized studies should theoretically not be included into meta-analysis. The Cochrane Handbook outlines that high risk of bias for one or more key domains is sufficient to weaken the confidence in the interpretation of the results with pooling randomized controlled trials [10]. However, Anglemyer et al. [35] compared outcomes between observational and randomized trials. In their review, they have demonstrated that there is little evidence that there would be significant effect estimate differences between observational studies and RCTs, regardless of study design and heterogeneity [35]. Abraham et al. reported that the mean difference between RCTs and non-randomized comparative studies was between 4.0 and 5.6%, and concluded that the inclusion of non-randomized studies into meta-analysis is possibly as accurate as that of RCTs alone [36]. Bun et al. investigated the effect of pooling different study types and concluded that treatment effects did not differ between observational studies and RCTs as long as the same outcomes are pooled [14]. In this meta-analysis, studies were analyzed according to the level of evidence, but also pooled into one cumulative analysis. The effect estimate did not differ between the three analyses, indicating that the true treatment effect was not influenced by the inclusion of observational studies. However, only with the analysis of LOE II studies did the findings not reach statistical significance. This could be explained by a small sample size and bias by sampling error, resulting in low power and type II error [37]. The inclusion of observational comparative studies has increased the overall sample size reducing the risk of type II error, and as a consequence resulted in significant between-group differences in favor of double-button repair. The increase in sample size has also increased the external validity and generalizability of the findings.

Unfortunately, the risk of bias within and across studies was assessed as high, and the Cochrane Handbook specifies that high risk of bias can affect the interpretation of results [10]. In this meta-analysis, the risk of bias was high for 75% of the LOE II studies and serious or critical for 45% of the included observational studies. These conclusions should, therefore, be interpreted with caution. Study quality was assessed using the MINORS scoring system and modified Coleman Methodology Score for within study quality, and the GRADE system for across studies outcome variables. The study quality with GRADE was assessed as very low for each outcome across studies. Considerable heterogeneity and imprecision of results by omitting the 95% confidence intervals were the main reasons. According to Cochrane, this reduces the confidence in the effect estimate [10]. In addition, further high-quality studies are likely to have an impact on the estimate of the treatment effect and may actually change the estimate. Within study quality using two validated instruments revealed that the majority of studies were of fair and poor, quality further reducing confidence and validity.

Despite a large number of studies describing numerous techniques to treat ACJ injuries, a widely accepted standard does not exist. Certainly, there has been a focus on more precise restoration of normal anatomy using minimally invasive techniques, taking both horizontal and vertical instability into consideration [38]. More temporary techniques for acute injuries include arthroscopic-assisted or minimally invasive coracoclavicular reconstructions with button suture configurations, combined with rigid fixation with clavicle hook plates [2, 39].

Clinical outcomes are significantly better in the double-button repair group, and despite the concerns with risk of bias, study quality, and heterogeneity, one could argue that there is a strong argument in favor of suture-button repair when surgical repair is indicated for acute acromioclavicular joint injuries. Superiority of double-button fixation is also observed with postoperative pain, and patients in the double-button group had significantly lower pain scores, although the treatment effect was lower. The possible explanation for this finding could be less soft-tissue disruption during surgery with double-button fixation in the early postoperative stage. Significantly lower pain levels were also observed more than 12 months postoperative for double-button repair, and the reasons for these differences are not clear. Complication rates are an important factor when deciding which of the two surgical techniques should be considered. Clearly, the complication rates are significantly lower for double-button repair, and in the hook plate group, complications were twice as high compared to the button repair group. Loss of reduction, recurrence, and horizontal instability were the main complications in the button group, while wound complications, infection, and osteolysis/acromial erosions were the main complications in the hook plate group. Another potential disadvantage of the hook plate is the need for plate removal.

Despite the recent dramatic increase in the number of publications on acromioclavicular injuries, controversy still exists with regard to diagnosis, classification, and optimal treatment [38, 39]. Given the lack of high-level evidence, the decision to operate or not and which technique to use is ultimately a joint decision between patient and surgeon, and must be based on individual patient characteristics, professional activities and demands, their desire to return to sports, and their willingness to adhere to the postoperative rehabilitation protocol [2].

Limitations

This meta-analysis has several limitations. The included studies had small to moderately small sample sizes with variable follow-up intervals, introducing sampling bias and heterogeneity. Subgroup analysis considering the severity of dislocation (Rockwood III–VI) was not performed, as this would have resulted in low power. Another limitation is that this meta-analysis focused on two more contemporary techniques, and it is possible that this approach has caused selection bias. In addition, non-operative treatment was not included and may have been appropriate for Rockwood type III injuries with similar outcomes. It could be argued that the inclusion of both randomized controlled trials and comparative observational studies has resulted in unbalanced trial groups, with confounding factors introducing bias. Even though the search strategy included multiple databases, it is possible that studies were missed. This could also be because the search was limited to English and German publications only. Publication bias can, therefore, not entirely be excluded; however, funnel plot and Egger’s intercept do not suggest that publication bias was introduced.

Conclusions

The results of this meta-analysis demonstrated significantly better outcomes with double-button repair for acute ACJ dislocations when compared to using a clavicle hook plate. Double-button repair is also associated with a 2.2 times lower risk of complications; however, risk of bias is high, and study quality within and between studies was low. These results must, therefore, be viewed with caution.