Introduction

Treatment modalities for humeral shaft fractures have evolved over time. Non-operative treatment has been the preferred method for decades since the healing potential of the humerus was considered very good in terms of speed and fracture healing rates, restoration of anatomy is not a prerequisite for good functional outcome, and patients are not exposed to operative risks such as iatrogenic radial nerve palsy, postoperative infections, and implant failure. However, the very good results from functional bracing as published in landmark papers in the 70’s and 80’s by, e.g., Sarmiento, could not be reproduced by others [1]. Despite the possibility of early mobilization of the shoulder and elbow joints, impairment of range of motion (ROM) of especially the shoulder joint was often reported [2, 3]. The persisting clinical need led to the development of new and improved implants for surgical treatment.

Operative treatment for humeral shaft fractures is mostly performed using intramedullary nailing (IMN) or plate osteosynthesis. An IMN is placed in the medullary cavity of the humerus and is, thus, in line with its mechanical axis. If closed reduction can be achieved, periosteal blood supply and fracture biology can be preserved. Incisions are small and require less soft tissue stripping than open reduction and plate osteosynthesis [4]. However, shoulder-related complaints, such as pain and restriction of shoulder movement due to malrotation and impingement of the proximal nail tip or locking head screw, are frequently reported [5,6,7,8]. Open reduction and plate osteosynthesis (ORPO) offers the possibility of anatomic reduction and, depending on the fracture configuration, compression of fragments, as it requires extensive soft tissue exposure [9]. A potential disadvantage is a possible higher rate of (temporary) secondary radial nerve palsy [10]. Minimally invasive plate osteosynthesis (MIPO) has the advantage of limited soft tissue dissection and avoids the need to expose the radial nerve [10].

The development of anatomical and angular locked plate systems since approximately 2002 has led to a variety of reliable surgical techniques and a shift from non-operative management toward osteosynthesis, even when no absolute indication for surgery is present [11,12,13,14,15]. Some authors recommend to use IMN, whereas recently MIPO has been proposed as the preferred treatment [6, 8, 16,17,18,19,20,21,22]. The debate on the most optimal treatment strategy of humeral shaft fractures remained inconclusive after previous reviews, which only included 6–17 published randomized controlled trials and comparative prospective cohort studies in total [8, 16,17,18,19,20,21,22,23,24,25,26].

The primary aim of the current systematic review and pooled analysis was to compare fracture healing between non-operative and operative treatment of a humeral shaft fracture. The secondary aims were to compare complications and functional outcome.

Materials and methods

This systematic literature review and pooled analysis was conducted and reported according to the standards set out in Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) [27]. Methods used for the analysis, search strategy, and inclusion criteria were specified in advance.

Search strategy

Databases of Embase, Medline ALL, Web-of-Science Core Collection, and the Cochrane Central Register of Controlled Trials (CENTRAL) were searched. Search strings were made by an experienced librarian and are shown in Table 1. The final search was done on July 30, 2021.

Table 1 Search strategy

Eligibility criteria

Studies were included if they reported primary treatment of a humeral shaft fracture in patients aged 16 years or older with functional bracing, intramedullary nailing, or plate osteosynthesis. All study designs, except case reports, meta-analyses, and reviews, were included.

Studies were excluded if they met one or more of the following exclusion criteria: (1) recurrent, pathological, or periprosthetic fractures, (2) proximal or distal metaphyseal fracture extension, (3) grade III Gustilo Anderson open fractures, (4) treatment with external fixator, (5) experimental treatment, (6) outcome of less than five patients reported, (7) less than 6 months follow-up, (8) published before the year 2000 or (9) alternative operative methods for humeral shaft fractures (e.g., Ender nails, Marchetti nails, Rushs nails, Hackethal nailing, K wires, expandable, and flexible or elastic nails). Studies that reported on patients with concomitant injuries, such as vascular injury, compartment syndrome, or ipsilateral forearm fractures, were not excluded.

Study selection

First, four reviewers (KCM, SHVB, TVDT, and CAWN) independently screened the titles and abstracts of the studies to identify eligible studies. Inconsistencies were resolved by consensus. Second, the full-text articles of the remaining eligible publications were retrieved. The corresponding authors of studies with no available full-text version were contacted once by email. Third, the full-text articles were independently reviewed by the aforementioned reviewers. Any disagreement was resolved through consensus. Furthermore, the references of the included studies were reviewed for additional studies that may have been missed.

Data collection and data items

Data were extracted from the reports independently by three reviewers (KCM, SHVB, and PAJ) using a predefined data sheet. From each study, information was extracted on: study design, publication characteristics, demographics, treatment characteristics (including type of treatment, antegrade or retrograde IMN, ORPO, or MIPO), fracture classification according to the AO/OTA classification, complications, range of motion, and functional outcome scores, including patients-reported outcome measures (PROMs).

Fracture healing (time) was defined as (time to) radiologic or clinical fracture healing. Nonunion was defined as failure to heal at 6 months post-fracture with no progress toward healing seen on the most recent radiographs. Malunion was defined as fracture healing in an abnormal position. Primary radial nerve palsy was defined as radial nerve palsy as a result of initial trauma. Secondary radial nerve palsy was defined as radial nerve palsy as a result of reposition, during non-operative treatment or surgery. Implant failure was defined as the failure of the medical implant. Intraoperative complications included any deviation from the ideal intraoperative course occurring between skin incision and skin closure. Infection was defined as clinically diagnosed infection of (surgical) wounds as a consequence (of the treatment) of the humeral shaft fracture. Shoulder dysfunction was defined as experiencing pain or limited range of motion of the shoulder. Nail protrusion was defined as migration and subsequent protrusion of the intramedullary nail. Subacromial impingement was defined as irritation of the rotator cuff muscles in the subacromial space. (Sub)cutaneous problems included bursitis, cellulitis, granuloma’s, hypertrophic scarring of the wound, and skin irritation, macerations, or abrasions due to prolonged contact with the brace.

When measurements were done at different time points, the outcomes at the 12 months follow-up were used for calculation. The extracted data were compared, and disagreements were resolved by discussion between the three reviewers. Consensus was reached by discussion.

Risk of bias assessment

The Methodological Index for Non-Randomized Studies (MINORS) instrument was used to assess methodological quality of the included publications [28]. The MINORS scale yields a maximum score of 24 for comparative cohort studies and a maximum of 16 for non-comparative cohort studies, with a higher score indicating better quality. Studies were scored for the various items by three authors (KCM, SHVB, and PAJ) independently. Any disagreement was resolved by consensus. Funnel plots, for each outcome and per treatment type separately, were used to determine the risk of publication bias.

Statistical analysis

Data were analyzed using MedCalc Statistical Software (Version 18.2.1; MedCalc Software bvba, Ostend, Belgium; http://www.medcalc.org; 2018). Binary outcomes were transformed using a double arcsine transformation to ensure normal distribution [29]. The transformed rates and 95% confidence intervals were transformed back to prevalence estimates. Forest plots were constructed with 95% confidence intervals. Heterogeneity was quantified with Cochran’s Q test and I2 statistic. For the Cochran's Q test, a p value < 0.10 was considered statistically significant. A random effects model was used if the I2 statistic was > 40%. Otherwise, a fixed-effect model was used. Pooled percentages and means were calculated for binary and continuous variables, respectively, and are reported with their 95% confidence intervals (CI). Results are reported per treatment modality or per subgroup if differences between subgroups were deemed relevant.

Results

Study selection

The search strings identified 9625 publications (Fig. 1). Duplicates were removed, resulting in 6754 unique publications. Two additional records were identified through other sources (citation searching). The remaining 6756 publications were reviewed for inclusion and exclusion criteria. A total of 192 eligible publications were identified. For 39, studies the full-text manuscripts were not available online. Of these, 13 publications had no contact details available. The remaining corresponding authors were contacted. This revealed seven full-text publications. After full-text assessment, 173 publications were included in this review and meta-analysis (Supplemental Table S1).

Fig. 1
figure 1

Flow diagram of study selection

Study characteristics

Supplemental Table S1 shows the study characteristics of all included studies. Of the 173 included studies, 23 were randomized controlled trials, 55 were prospective cohort studies, and 95 were retrospective cohort studies. A total of 79 studies were comparative studies and 94 studies were non-comparative. The included studies report on a total of 11,868 patients. Of these, 2204 were treated non-operatively with a functional brace, 3545 were treated with intramedullary nailing, and 6119 by plate osteosynthesis. The pooled mean age of the patients was 44 years in the non-operative group, 45 in the IMN group, and 41 in the plating group. The pooled mean percentage of males was 57% in the non-operative group, 62% in the IMN group, and 64% in the plate group. The pooled percentage of patients with AO type A fractures was 67% in the non-operative group, 53% in the IMN group, and 46% in the plating group. The pooled percentage of patients with AO type B fractures was 23% in the non-operative group, 34% in the IMN group, and 36% in the plating group. The pooled percentage of patients with AO type C fractures was 9% in the non-operative group, 12% in the IMN group, and 15% in the plating group.

Risk of bias assessment

The outcome of the methodological quality assessment, according to the MINORS score, is shown in Supplemental Table S2. The average score of the quality assessment for comparative studies was 20/24 (range 11–23) and 12/16 points (range 9–15) for non-comparative studies.

Fracture healing—time to union

Time to fracture healing (radiologic or clinical) was reported in 37 studies (Table 2). The pooled estimate time to fracture healing was 16 weeks (95% CI 14–18 weeks) for the non-operative group, 14 weeks (95% CI 13–15 weeks) for the IMN group, and 15 weeks (95% CI 14–16 weeks) for the plate group. An antegrade IMN approach resulted in a pooled estimate time to fracture healing of 14 weeks (95% CI 12–15 weeks) versus 12 weeks (95% CI 9–16 weeks) after a retrograde approach. Furthermore, considering plate osteosynthesis, ORPO resulted in a pooled estimate time to fracture healing of 16 weeks (95% CI 15–17 weeks) versus 14 weeks (95% CI 12–16 weeks) after MIPO. Much heterogeneity of effects was seen across studies in all treatment groups, varying from 91% in the MIPO group to 98% in the (antegrade) IMN group.

Table 2 Fracture healing of a humeral shaft fracture per treatment group

Fracture healing rate

In 160/173 (92%) studies consisting of 10,206 patients the fracture healing rate was reported (Table 2). The pooled fracture healing rate for the non-operative group was 89% (95% CI 84–92%), 94% (95% CI 92–95%) for the IMN group, and 96% (95% CI 95–97%) for the plating group. The pooled fracture healing rate was the highest in the MIPO group (98%; 95% CI 97–98%). In the non-operative group, high heterogeneity across studies was found (I2 = 87%) and seen in the funnel plot (Supplemental Fig. S1). In the IMN and plate group, the funnel plots showed comparable asymmetry and the heterogeneity was moderate (I2 = 54% and I2 = 41%, respectively; Supplemental Figure S1).

Fracture healing—nonunion

The pooled nonunion rate showed variation between the treatment groups (Table 2). In the non-operative group, 182 nonunions were reported in 1959 patients, resulting in a pooled estimate of 11% (95% CI 7–15%). In the IMN group, 156 nonunions were reported in 2787 patients, resulting in a pooled estimate of 6% (95% CI 5–7%) and in the plating group, 163 nonunions were reported in 5098 patients, resulting in a pooled estimate of 3% (95% CI 3–4%). In the plating group, an open approach resulted in more nonunions than a minimally invasive approach [4% (95% CI 3–5%) and 2% (95% CI 2–3%), respectively].

Fracture healing—malunion

Pooled malunion rates were 6% (95% CI 2–12%) in the non-operatively treated group, 3% (95% CI 1–5%) in the IMN group, and 1% (95% CI 1–2%) in the plating group (Table 2). However, malunion was often poorly defined and is expected to be reported differently across studies.

Complications—radial nerve palsy

The pooled primary radial nerve palsy rate showed no variation between the treatment groups (Table 3). Secondary radial palsy was reported in 146 studies (Table 3). The pooled secondary radial nerve palsy rate was 1% (95% CI 0–2%, 18 studies, N = 1377, 10 patients) in the non-operatively treated group, 3% (95% CI 2–3%, 58 studies, N = 2576, 66 patients) in the IMN group, 4% (95% CI 3–5%, 42 studies, N = 1292, 43 patients) in the MIPO group, and 7% (95% CI 6–9%, 82 studies, N = 4232, 275 patients) in the ORPO group.

Table 3 Complication rates of (treatment of) a humeral shaft fracture per treatment group

Complications—intraoperative complications

The pooled rate of intraoperative complications was 5% (95% CI 3–8%) in patients treated with an IMN and 1% (95% CI 0–1%) in patients treated with plate osteosynthesis (Table 3). Heterogeneity across studies was especially low in the plate group (I2 = 0%).

Complications—implant-related complications

Implant failures were reported more frequently in the IMN group (51/1034, pooled estimate of 4%; 95% CI 3–6%) than in patients in the plate group [pooled estimate of 2% (95% CI 1–2%), 40/2839 patients; Table 3]. An antegrade IMN approach resulted in less implant failures than a retrograde approach [4% (95% CI 3–6%) and 7% (95% CI 3–12%), respectively]. Implant failure did not differ between the surgical approaches in the plating group [ORPO 2% (95% CI 1–3%) and MIPO 2% (95% CI 1–4%)].

Complications—infection

The infection rate was reported in 124 studies consisting of 7986 patients, and was low in all treatment groups, especially in the non-operative [1% (95% CI 0–2%), 3/462 patients] and MIPO group [1% (95% CI 1–2%), 8/1126 patients; Table 3]. The infection rate in the IMN and ORPO group was 2% (95% CI 1–2%) and 3% (95% CI 3–4%), respectively.

Complications—shoulder dysfunction

The pooled rate of shoulder dysfunction was the highest in patients treated with an IMN (11%; 95% CI 8–15%) and the lowest in patients treated with plate osteosynthesis (6% (95% CI 4–8%); Supplemental Table S3). An antegrade IMN resulted in more shoulder dysfunction than a retrograde IMN [13% (95% CI 10–16%) and 5% (95% CI 1–15%), respectively].

Complications—nail protrusion

The pooled rate of nail protrusion was 10% (95% CI 6–14%) in patients treated with an IMN (17 studies, 61/666 patients; Supplemental Table S3).

Complications—subacromial impingement

Subacromial impingement was seen more in the antegrade IMN group than in the plate osteosynthesis group [pooled rate of 13% (95% CI 9–18%) and 2% (95% CI 1–3%), respectively; Supplemental Table S3].

Complications—(sub)cutaneous problems

The pooled rate of (sub)cutaneous problems in patients treated non-operatively was 6% (95% CI 4–9%, nine studies, 20/347 patients; Supplemental Table S3).

Range of motion

In the plating group, the pooled estimates of shoulder abduction and anteflexion were 151° (95% CI 116–186°) and 148° (95% CI 137–160°), respectively (Table 4). Anteflexion was better after MIPO than after ORPO [167° (95% CI 164–171°) and 141° (95% CI 124–158°), respectively]. In the IMN group, consisting of only 2 studies with a total of 34 patients, the pooled estimates of shoulder abduction and anteflexion were 132° (95% CI 76–189°) and 120° (95% CI 33–207°), respectively. All treatment groups showed high heterogeneity across studies, varying from 87% in the MIPO group to 100% in all other operative treatment groups.

Table 4 Range of motion after treatment of a humeral shaft fracture per treatment group

Functional outcome—DASH

The DASH score after on average 1 year (ranging from 6 to 24 months) showed variation in mean scores between the treatment groups (Table 5). For the non-operative group, the pooled estimate score was 17/100 (95% CI 3–31); for the IMN group, it was 23/100 (95% CI 17–29); and for the plating group, it was 13/100 (95% CI 8–19; Table 4). The DASH score was the highest in the antegrade IMN group (23/100; 95% CI 17–29) and the lowest in the MIPO group (7/100; 95% CI 1–13).

Table 5 Functional outcome scores after treatment of a humeral shaft fracture per treatment group

Functional outcome—Constant–Murley

The pooled estimate of the Constant–Murley score was 90/100 (95% CI 85–95) in the IMN group and 93/100 (95% CI 92–95) in the plating group (Table 5). The Constant–Murley score did not differ between the surgical approaches in the treatment groups.

Functional outcome—UCLA

The pooled estimate of the UCLA shoulder score in the IMN group was 28/35 (95% CI 22–34) and 33/35 (95% CI 32–33) in the plating group (Table 5). The UCLA shoulder score did not differ between the surgical techniques in the treatment groups.

Functional outcome—other

Little to no differences were observed in the other functional outcome scores after IMN or plating osteosynthesis (Supplemental Table S4). Heterogeneity was high (I2 > 70%) in all subgroups for all functional outcomes, most likely due to the low number of studies with available data. For the non-operatively treated patients, little to no data of functional outcome scores were available for analyses.

The Broberg–Morrey, Gill, Hospital for Special Surgery, l’Insalata, Neer Shoulder, Oxford Shoulder Score, Quick-DASH, Rommens, Simple Shoulder Test, Short Musculoskeletal Functional Assessment, and Short Form-36, as well as the Hunter criteria did not have enough data reported for analyses. The nowadays seldom used Rodriguez–Merchan criteria were analyzed but not reported.

Discussion

This systematic review compared fracture healing, complications, and functional outcome of non-operative and operative treatment for humeral shaft fractures and results suggest that although all treatment modalities result in satisfactory outcomes, operative treatment, and specifically plate osteosynthesis, should be considered the preferred treatment as it results in the most favorable fracture healing rates, least complications, and best functional outcomes.

The current systematic review reveals that the risk to develop a nonunion after non-operative treatment is much higher (11%) than after any kind of surgical stabilization (6% and 3% in the IMN and plating group, respectively). This is in line with previous systematic reviews reporting higher absolute risks of nonunion after non-operative treatment (15% and 18%) and a risk ratio of 0.49 for nonunion in the operative group compared with in the non-operative group [8, 24, 25]. A first requirement for good functional recovery is fracture stability since it relieves pain in the upper limb. Stability can be achieved by fracture union, but also by relative or absolute surgical stabilization of a fresh fracture with IMN and plate osteosynthesis, respectively. A nonunion after non-operative treatment implicates that the patient has experienced pain and loss of function for months, whereas a patient who has been operated upon immediately after his injury has been able to recover functionally despite the development of the nonunion. In the balance of shared decision-making, such numbers call for a surgical and not a non-operative treatment.

The final goal of any type of treatment should be a good functional outcome. Overall, all treatment modalities result in satisfactory functional outcomes after 1 year, indicating that a good functional outcome can be achieved irrespective of treatment. However, a slight advantage of functional recovery can be found after operative treatment with plate osteosynthesis considering the Constant–Murley, DASH, and UCLA shoulder score. This is in line with a meta-analysis of RCTs describing better functional outcomes in patients treated with plate osteosynthesis than in patients treated with IMN [26]. Less complications and rotator cuff problems might enable these patients treated with plate osteosynthesis to regain function faster. These favorable results of functional recovery may tip the scale of the scientific debate toward plate osteosynthesis as the preferred treatment.

However, speed of functional recovery and a lower risk of nonunion after a humeral shaft fracture comes at a price. Both non-operative and operative treatment generate complications. The major complication is considered a radial nerve palsy. Primary nerve palsies are caused by the trauma itself, not by the therapy given to treat the injury. Secondary radial nerve palsy occurs from fracture reduction during non-operative treatment or manipulation during surgery. Not surprisingly, the rate of radial nerve palsy after non-operative treatment is much lower—albeit not absent—than after surgery in which the nerve is exposed. Within the operative group, the current systematic review showed a higher rate of secondary radial nerve palsy in the patients treated with (open) plating. However, the rate of persistent radial nerve palsy could not be defined due to the heterogeneity in reporting, and therefore questions about permanent disability after radial nerve palsy cannot be addressed. A meta-analysis of RCTs and observational studies, comparing non-operative and operative treatment, reported no difference in permanent (primary or secondary) radial nerve palsy rate between both groups suggesting that the risk of persistent radial nerve palsy should no longer be a deterrent for operative treatment [8]. Other complications inherent to operative treatment were more frequently reported in the IMN group than in the plating group. Results of other reviews are comparable, describing lower number of complications in the plating group than in the IMN group, suggesting plating is superior to IMN [18, 21, 26].

All previous meta-analyses only included randomized control trials and comparative prospective cohort studies of 6–17 published studies in total [8, 16,17,18,19,20,21,22,23,24,25,26]. A strength of the current study is that by including many study designs, it included all relevant recent comparative and non-comparative studies, resulting in 173 included studies reporting the results of 11,868 patients. In this way, this systematic review provides information on results of all relevant aspects of each treatment option, and therefore empowers both the patient and the doctor in their respective roles in the desired shared decision-making process.

However, some limitations of this study are the low methodological quality of the included studies as reflected by the MINORS scores. The studies meeting the inclusion criteria often had small sample sizes and lacked an adequate power calculation. Unfortunately, due to the lack of homogeneous reporting of, e.g., patient characteristics and treatment regimens of functional bracing, risk factor and subgroup analyses could not be performed. Furthermore, different outcome parameters and methods of reporting the results were used. Results were frequently reported without a standard deviation and thus could not be included in the pooled analysis. Therefore, the results of this study should be interpreted with care given the large statistical and clinical heterogeneity.

In the literature, a definitive answer on the optimal treatment strategy remains as high-quality data are lacking. This causes practice variation. Furthermore, uniform reporting of outcome of treatment is needed to compare the results of different studies. For instance, in the included studies, 18 different functional outcome scores were reported. The use of different instruments makes it hard to compare results. The DASH and Constant–Murley score have been validated and are recommended as preferred instruments for future studies [195].

Conclusion

This study suggests that even though all treatment modalities result in satisfactory outcomes, operative treatment is associated with the most favorable results. Disregarding secondary radial nerve palsy, specifically plate osteosynthesis seems to result in the highest fracture healing rates, least complications, and best functional outcomes compared with the other treatment modalities.