Introduction

Rib fractures are very common in patients with thoracic trauma and nowadays still associated with significant morbidity and mortality due to the underlying injuries to the lung and heart resulting in more pulmonary complications [1,2,3,4]. Compared to multiple rib fractures, flail chest is associated with a worse outcome due to a higher incidence of respiratory compromise and concomitant injuries [5, 6].

A combination of adequate pain control, respiratory assistance, and physiotherapy is considered the gold standard in management of rib fractures [3]. Over the past decades, there has been a growing interest in rib fixation for flail chest and for multiple rib fractures, however, there is no consensus regarding the indication and patient selection for rib fixation.

In the field of (orthopedic) trauma surgery, there is increasing scientific evidence that inclusion of observational studies could add value to meta-analyses without decreasing quality of the results [7,8,9,10]. Adding observational studies result in larger sample sizes and might enable the evaluation of small treatment effects, subgroups, and infrequent outcome measures while also providing information about the generalizability of the results [11].

The aim of this systematic review and meta-analysis was (1) to present current evidence on outcome after rib fixation compared to nonoperative treatment for both flail chest and multiple rib fractures and (2) to compare effect estimates obtained from RCTs and observational studies.

Methods

This review was performed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) and the Meta-analysis of Observational Studies in Epidemiology (MOOSE) guidelines [12, 13]. A published protocol for this review does not exist. Ethical committee approval did not apply to this study.

Search strategy and eligibility criteria

A structured literature search was conducted in MEDLINE, Embase, CENTRAL and CINAHL on June 16th, 2017 for both randomized controlled trials (RCTs) and observational studies comparing operative to nonoperative treatment of traumatic rib fractures. The search was not restricted by publication date, language, or other limits. The full search syntax is provided in Appendix 1.

All obtained studies from the literature search were independently screened for eligibility based on title and abstract by two reviewers (RBB, JP). Exclusion criteria were animal studies, abstracts of conferences, case-reports, reviews, inclusion of patients younger than 18 years, and studies written in another language than English, French, Dutch or German. Disagreement regarding study selection was resolved by discussion with a third reviewer (RMH). References of included studies were manually screened and citation tracking was conducted using Web of Science to identify additional relevant studies.

Data extraction

Data were extracted by two independent reviewers (RBB, JP), using a data extraction file. Extracted data included first author, year of publication, study period, study design, country, fracture type, number of fractured ribs, number of included patients, number of patients with flail chest or multiple rib fractures (according to the definition used by the original study), age, gender, type of operative treatment, type of nonoperative treatment, duration of follow-up, loss to follow-up, Injury Severity Score (ISS), Abbreviated Injury Scale (AIS), Glasgow Coma Scale (GCS), hemothorax, pneumothorax, pulmonary contusion, type of implant in operative group, mortality during hospitalization, hospital length of stay (HLOS), intensive care unit length of stay (ILOS), duration of mechanical ventilation (DMV), incidence of pneumonia, need for tracheostomy, complications, revision surgery, and implant removal.

Outcome measures

The primary outcome measure was mortality during hospitalization. Secondary outcome measures were HLOS, ILOS, DMV, incidence of pneumonia, need for tracheostomy, complications, revision surgery, and implant removal.

Quality assessment

The Methodological Index for Non-Randomized Studies (MINORS) score was used to assess the included studies [14]. The MINORS is a critical appraisal instrument developed to assess the methodological quality of observational surgical studies. Other quality assessment tools focus on a specific study design while the MINORS is externally validated on RCTs and is therefore a suitable instrument for meta-analyses of different study designs. The MINORS score ranges from 0 to 24 and a higher score reflects better quality. Studies were independently assessed by two reviewers (RBB, JP) using the MINORS criteria and disagreement was resolved by discussion with a third reviewer (RMH). Additional details on the MINORS criteria and scoring system are set out in Appendix 2.

Statistical analysis

Statistical analyses were performed using Review Manager (RevMan, Version 5.3.5 Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration, 2014). Data were converted to a mean with standard deviation (SD) using different methods as described in the Cochrane Handbook for Systematic Reviews of Interventions [15].

Different studies based on the same patient cohort were included only once in the analysis [16, 17]. Studies reporting on specific patient subgroups were split and included separately for meta-analysis, provided sufficient information was reported; Qiu et al. distinguished between the presence or absence of a flail chest and Voggenreiter et al. made subgroups based on the presence or absence of pulmonary contusion [18, 19]. Results from both RCTs and observational studies were pooled in the primary analysis.

Meta-analysis was performed if outcome measures of two or more studies were available. For continuous outcome measures, the inverse variance weighted random effects model was used to estimate the pooled difference in the outcome measure for fixation versus no fixation, with corresponding 95% confidence interval (CI). For dichotomous outcomes, we applied the Mantel–Haenszel method and pooled results are presented as risk ratios (RR) with 95% CI. Heterogeneity between studies was assessed by visual inspection of the forest plots and by estimating statistical measure for heterogeneity, i.e., the I2 statistic. Inspection of a funnel plot of the study-specific difference in the primary outcome measure against its standard error was done to detect potential publication bias. A two-sided p value < 0.05 was considered statistically significant.

Subgroup and sensitivity analyses

In subgroup analysis, we stratified by study design and pooled effects of RCTs were compared with pooled effects of observational studies. For the analysis of study quality only studies with an arbitrarily chosen MINORS score of 16 or higher were included, similar to previously published meta-analyses in orthopedic trauma surgery studying both study designs [8, 10, 20]. To assess the impact of improvement in intensive care management over time, we performed a sensitivity analysis including only studies published in the last 5 years. Different methods were used to include studies with zero events in one or both arms of the outcome measure. To assess the sensitivity of the analyses to the choice of the method of analysis, also the crude methods, DerSimonian–Laird method with correction, the inverse variance with and without correction for zero event data, and the Peto method were applied and results were compared for consistency [21].

Results

Search

The flowchart of the literature search is presented in Fig. 1. Ultimately, 33 studies were included [16,17,18,19, 22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50]. There were three RCTs, two prospective cohort studies, 14 retrospective cohort studies, and 14 case–control studies.

Fig. 1
figure 1

Flowchart of the literature search

Patient characteristics

The studies included for meta-analysis included 5874 patients; 1255 received rib fixation and 4619 received nonoperative treatment. In the majority of the studies (n = 20), patients were surgically treated with plates (Tables 1, 2). Other surgical methods were K-wires and Judet or Adkins struts. Nonoperative treatment consisted generally of ‘best medical treatment’ and included adequate pain management, lung physiotherapy and respiratory support. The weighted average age was 52.9 years and 73% of patients were male. The weighted average of the number of rib fractures was 6.9 in the rib fixation group and 6.0 in the nonoperative group with a weighted mean ISS of 21.2 and 22.4, respectively.

Table 1 Baseline characteristics of the included studies comparing rib fixation versus nonoperative treatment of traumatic rib fractures
Table 2 Treatment characteristics of the included studies comparing operative versus nonoperative management of traumatic rib fractures

Quality assessment

The average MINORS score of the included studies was 15.4 (SD 2.7; range 9–21). The MINORS score for RCTs was 20 (SD 1.0; range 19–21) and for observational studies 14.9 (SD 2.4; range 9–21). An overview of the study-specific MINORS score is provided in Appendix 3.

Mortality

Twenty-five studies (n = 4826) reported on mortality (Online Appendix 4) [18, 19, 22, 23, 25, 27, 28, 30, 32,33,34, 36,37,38,39,40,41,42,43,44,45,46,47,48,49,50]. Rib fixation resulted in a significant reduction of mortality compared to nonoperative treatment with a risk ratio (RR) of 0.41 (95% CI 0.27, 0.61, p < 0.001, I2 = 0%) (Fig. 2). Different methods of incorporating studies in the meta-analysis with zero-event data in one or both arms yielded similar results (Online Appendix 5). When stratified by study design, RCTs showed a RR 0.57 (95% CI 0.13, 2.52, p = 0.46, I2 = 0%) vs. RR 0.40 (95% CI 0.26, 0.60, p < 0.001, I2 = 0%) in observational studies (Table 3). Figure 3 shows a funnel plot of the odds ratio and standard error of the included studies using the mortality rate; there was no important asymmetry observed.

Fig. 2
figure 2

Mortality in a systematic review of rib fractures comparing operative to nonoperative treatment

Table 3 Subgroup and sensitivity analyses of studies included in a meta-analysis of rib fractures comparing rib fixation versus nonoperative treatment for patients with a flail chest
Fig. 3
figure 3

Funnel plot of studies included in a meta-analysis reporting mortality rates after operative or nonoperative treatment of rib fractures (RR risk ratio, SE standard error)

Hospital stay length of stay

Twenty-one studies (n = 4770) reported on length of hospital stay (Online Appendix 4) [16, 17, 23, 25, 26, 31,32,33,34,35, 37,38,39,40,41,42,43,44,45, 47, 50, 51]. Rib fixation did not result in a significant reduction of HLOS compared to nonoperative treatment with a mean difference of −1.46 days (95% CI −4.31, 1.39, p = 0.32, I2 = 96%) (Online Appendix 6). When stratified by study design, the pooled mean difference of RCTs (−8.33 days; 95% CI −14.6, −2.1; p < 0.001, I2 = 46%) was greater compared to observational studies (−0.77; 95% CI −3.72, 2.18; p = 0.61, I2 = 97%) (Table 3).

ICU length of stay

Twenty-six studies (n = 4520) reported on length of ICU stay (Online Appendix 4) [16,17,18, 22,23,24,25,26, 28, 30,31,32,33, 35,36,37,38,39,40,41,42,43,44, 47, 50, 51]. Rib fixation resulted in a significant reduction of ILOS compared to nonoperative treatment with a mean difference of −2.0 (95% CI −3.61, −0.38, p = 0.02, I2 = 85%) (Online Appendix 7). When stratified by study design, RCTs showed a greater difference compared to observational studies (Table 3).

Duration of mechanical ventilation

Twenty-seven studies (n = 2063) reported on duration of mechanical ventilation (Online Appendix 4) [16,17,18,19, 22,23,24,25,26,27,28, 30,31,32, 35,36,37,38,39,40,41,42, 45,46,47, 49,50,51]. Rib fixation resulted in a significant reduction of days on mechanical ventilation compared to nonoperative treatment with a mean difference of −4.01 (95% CI −5.58, −2.45, p < 0.001, I2 = 91%) (Online Appendix 8). When stratified by study design, RCTs showed a greater difference compared to observational studies (Table 3).

Pneumonia

Twenty-five studies (n = 4485) reported on the incidence of pneumonia (Online Appendix 4) [16,17,18,19, 22, 24,25,26, 28, 30,31,32,33, 37,38,39, 41,42,43,44, 47, 50, 51]. Rib fixation resulted in a significant reduction of pneumonia compared to nonoperative treatment with a risk ratio of 0.59 (95% CI 0.42, 0.83, p = 0.002, I2 = 79%) (Online Appendix 9). When stratified by study design both subgroups showed similar results (Table 3).

Tracheostomy

Fourteen studies (n = 1541) reported on the need of tracheostomy (Online Appendix 4) [16,17,18, 22, 25, 26, 28, 30, 32, 34, 36,37,38, 45, 50]. Rib fixation resulted in a significant reduction of tracheostomies compared to nonoperative treatment with a risk ratio of 0.59 (95% CI 0.36, 0.90, p = 0.01, I2 = 72%) (Online Appendix 10). When stratified by study design both subgroups showed similar results (Table 3).

Other outcome measures

Nine studies (n = 1174) reported on implant removal; five studies reported zero events and four studies reported implant removal ranging from 1.5 to 4.9% (Online Appendix 4) [17, 26, 28, 36,37,38, 40, 45, 48]. Eleven studies reported on wound infection; five studies reported zero events and six studies reported a wound infection rate ranging from 1.7 to 25% [18, 23, 24, 26,27,28,29,30, 46]. Other short and/or long-term complications were poorly reported and described mainly respiratory complications.

Sensitivity analyses

In sensitivity analysis for study quality, results did not change significantly except for HLOS which increased in favor of rib fixation in studies with higher quality with a mean difference of –3.53 (95% CI −7.27, −0.21, p = 0.06) (Table 3). Results from studies published after 2012 did not show a reduced HLOS, ILOS, incidence of pneumonia or need for tracheostomy after rib fixation (Table 3).

Discussion

In this systematic review and meta-analysis of RCTs and observational studies, rib fixation for patients with flail chest resulted in lower mortality, shorter ILOS and DMV, lower pneumonia rate, and lower need for tracheostomy. Pooled results from RCTs and observational studies were similar for all studied outcome measures although results from RCTs showed a larger treatment effect for HLOS, ILOS, and DMV. Results from recent studies showed lower mortality and shorter DMV after rib fixation, but there were no significant differences for the other outcome measures. The implant removal rate ranged from 1.5 to 4.9%. There were not enough studies of only patients with multiple rib fractures to perform meta-analyses on rib fixation for this patient population.

This meta-analysis included a large number of studies demonstrating the potential short-term benefit of rib fixation over nonoperative treatment for flail chest. Most often the indication for rib fixation was the presence of flail chest and to a lesser extent respiratory failure or intractable pain. Even though almost all studies included patients with flail chest, in many cases it was unclear whether it was a radiological or clinical flail chest making results harder to interpret. It is important to distinguish between these subgroups as respiratory compromise as well as injury severity is thought to mark important differences and influence outcome. The heterogeneous indication and patient populations reported on in the literature mask the exact indication and patient subgroup that would benefit most from rib fixation and consequently the adaptation of rib fixation in current practice.

Very few studies are available investigating patients with multiple rib fractures without flail chest. In a retrospective study, Qiu et al. performed separate analysis on patients with multiple rib fractures without flail segment and showed good short-term results and an earlier return to ‘normal activity’ after rib fixation [18]. Another notable study on multiple rib fractures was from Khandelwal et al. who described a prospective cohort of patients with multiple rib fractures where most patients had two or three rib fractures and only two (5.3%) had a flail chest [29]. They reported a significant reduction of pain and earlier return to work after rib fixation. No other studies have reported on rib fixation compared to nonoperative treatment focused on multiple rib fractures even though this is the largest subgroup of patients seen in daily practice.

In this review, we have included both RCTs and observational studies and show similar results for all outcome measures between both designs. Concato et al., Benson et al., and Ioannides et al. have provided an empirical basis for the comparison of RCTs and observational studies and showed results from these different designs can be remarkably similar, but can be rather different as well [52,53,54]. Although, treatment effects can be similar across studies regardless of design, genuine differences in treatment effects between different patient populations may be masked by biases in observational studies. Pooling results across different design could then lead to incorrect inferences. The judgement about validity of pooling results from different designs should be made on a case-by-case basis, since for instance the potential for confounding bias is context- and research-specific. Still, within the field of (orthopedic) trauma surgery there is growing evidence showing the potential of observational studies in meta-analyses leading to more robust conclusions without decreasing quality of the results [7,8,9].

Interestingly, RCTs in this study showed a larger treatment effect for some of the outcome measures as compared to observational studies. It is thought that observational studies tend to overestimate treatment effect which is possibly the result of the surgeon introducing a selection bias by choosing the optimal patient or publication bias [55, 56]. The three RCTs available on this subject all had very strict inclusion and exclusion criteria resulting in specific patient groups where treatment effects could be demonstrated yet with limited generalizability [22, 23, 50]. In observational studies, usually with less strict inclusion and exclusion criteria, an unclear indication together with other serious concomitant injuries can result in a selection of patients including patients who would benefit more from nonoperative treatment. A wrong patient selection can reduce measured treatment effects after rib fixation which could explain differences found between RCTs and observational studies in this specific topic. Additionally, differences in timing of the surgical procedure between studies might have introduced bias in comparability as early surgical stabilization is associated with favorable outcomes [57]. However, data regarding timing of surgery were not sufficiently reported in the included studies to further explore these effects. Finally, improvement of intensive care management over time could have attributed to differences in treatment effects as shown by our sensitivity analysis. In more recent studies only mortality and DMV improved after rib fixation, but there was no difference for the other outcome measures.

This study had some limitations. First, the results may be altered by missed studies in the literature search or by publication bias. However, we performed an extensive search using multiple databases with citation and reference checking of included studies. A funnel plot of the primary outcome measure did not suggest bias due to selective publication. Therefore, we are confident that we have a representative overview of the current literature. Second, we did not distinguish between studies with both flail chest and multiple rib fractures and studies including only flail chest patients. Very few patients with multiple rib fractures were included in these studies. Therefore, we think results from these studies translate to flail chest patients and should not be excluded from analyses. Still, cautious interpretation of study results is necessary as the variety of definitions used in the included studies might have resulted in a high in-between study variability of patient samples.

More research is needed to further identify the right indication and right patient for rib fixation. As previously mentioned, RCTs in this heterogenic population are very difficult to perform and for adequate subgroup analyses sufficiently large sample sizes are needed. In the rapidly developing area of surgery, RCTs can be expensive, time consuming, and often have limitations in terms of generalizability and small sample sizes due to strict inclusion and exclusion criteria [58, 59]. Observational studies show similar results as compared to RCTs and might be an achievable first step in gathering high-quality evidence. Currently a large prospective multicenter database is created in the Netherlands including both patients with flail chest and multiple rib fractures from multiple level-1 trauma centers, aiming to answer the above questions with the use of large sample sizes and long-term follow-up [60].

Conclusion

Rib fixation significantly improves short-term outcome for patients with flail chest, although the indication and patient subgroup who would benefit most from this treatment remain unclear. There is not enough data regarding patients with multiple rib fractures without flail segment. Observational studies show similar results as compared to RCTs and might be an achievable first step in gathering high-quality evidence. Larger prospective studies are required to investigate proper indications and relevant outcome after rib fixation.