Introduction

Periprosthetic fracture of the femur (PFF) is one of the major complications occurring during or after total hip arthroplasty (THA). When these occur in the post-operative period, PFF is the third principal cause of THA failure requiring surgical revision, after aseptic loosening and recurrent dislocation [1,2,3,4], ranging from 1 to 11% in primary implants [5]. The Vancouver classification is the most widely used to guide fracture diagnosis and management, dividing PFF into two main categories: intraoperative and postoperative. These categories are further classified in A, B or C according to fracture location (Fig. 1): A types are located at the proximal metaphysis without extending into the diaphysis, and these are further subdivided into those involving the greater (AG) and lesser (AL) trochanter; B types affect the femoral diaphysis around the stem, and quality and stability of stem implant and residual proximal femoral bone stock are used to divide B group into three subcategories: in type B1 fractures the stem is well fixed, in type B2 there is a loose stem with good bone stock and in type B3 the stem is loose and the surrounding bone stock is poor. C types are located below the tip of the stem of the implant [6].

Fig. 1
figure 1

Vancouver Classification of periprosthetic implant fractures at the hip. Type A: Fracture in the trochanteric region, involving Lesser (AL) or Greater (AG) trochanter. Type B: B1 Fracture around stem or just below, with well-fixed stem, B2 Fracture around stem or just below, with loose stem but good proximal bone, B3 Fracture around stem or just below, with loose stem and poor quality or severely comminuted proximal bone. Type C: Fracture below the stem of the implant, with stability of the implant

Vancouver B2 and B3 types are the most commonly encountered in clinical practice, being reported in 75–85% of all PFF [7]. These fractures are considered “unstable” subtypes because the fracture develops around the stem itself, making the treatment particularly challenging. Current recommendations for surgical treatment of Vancouver subtypes B2 and B3 is revision with a long femoral stem to bypass the fracture site, with or without associated internal fixation [8]; this technique is associated to a lower risk of failure [9]; however, some Authors reported poor results and a higher perioperative mortality rate when stem revision procedures are performed in this patients population, suggesting that a less invasive reduction and internal fixation, when possible, might be a suitable option. At present, internal fixation is reserved for more fragile patients [10, 11] because of the higher risk of fracture non-union in the presence of a loose implant, and for the requirement of a longer rehabilitation [12]; however, when the fracture heals properly, there is no requirement for later implant revision and the patients may benefit from a surgical procedure of reduced complexity, offering similar outcomes compared to revision arthroplasty [13]. This technique might result in several benefits including shorter surgical time and reduced blood loss, and it could be particularly helpful in elderly and fragile patients in which a more demanding revision arthroplasty surgery may adversely affect the overall prognosis and patients’ survival [14].

The aim of this study is therefore to compare in a systematic literature review and meta-analysis the outcomes of stem revision (SR) procedure vs. open reduction and internal fixation (ORIF) in patients affected by Vancouver B2 or B3 periprosthetic fractures.

Materials and methods

Literature search

On April 15th 2023, the Cochrane Database of Systematic Reviews and the medical databases PubMed, Google Scholar and MEDLINE were examined to find relevant publications on the different outcomes using stem revision versus fixation in patients with periprosthetic Vancouver B2/B3 hip fractures. A patient intervention compared outcome of interest (PICO) model was used. Studies published in English between June 1984 and April 2022 were included from the databases. The search was performed using the terms “femoral”, “fracture”, “hip” and “periprosthetic”. Then, to identify other works relevant to the subject, references of pertinent publications were examined. Articles describing the different outcomes using stem revision versus osteosynthesis fixation in periprosthetic hip fracture Vancouver B2/B3 were included. However, articles were excluded if published in a language other than English, or if describing outcomes of cohorts of less than fifty patients. Two authors (MB, EV) independently evaluated and screened the titles and abstracts of the identified papers according to our inclusion and exclusion criteria, the articles deemed to be of interest were then chosen for a full text analysis. In accordance with the Preferred Reporting Item for Systematic Reviews and Meta-Analyses (PRISMA) criteria, a flowchart was made to illustrate the steps involved in include the reviewed works (Fig. 2).

Qualitative assessment and risk of bias

Two authors (BB, LT) independently evaluated the strength of the evidence and the caliber of each study using the methodological index for non-randomized studies (MINORS). It evaluates 12 criteria, the first 8 of which were expressly created to assess the caliber of research in non-randomized studies. These include a clear statement of the study purpose, the inclusion of consecutive patients, prospective data collection, and the appropriateness of the endpoint to the goal of the study, which should be unbiased. Moreover, the study follow-up should be explicitly reported and being appropriate for the goal of the study, with a loss of less than 5% of patients at follow-up. The material and methods section of the study should also include a prospective computation of the study size. Each component receives a score between 0 and 2: 0 for non-reporting, 1 for poor reporting, and 2 for satisfactory reporting. For non-comparative studies, a figure of 16 is the best available score. To assess the risk of bias within the included studies, their methodological quality was assessed by using the Grading of Recommendations, Assessment, Development, and Evaluations (GRADE) cohort checklist for assessing the quality of non-randomized studies.

Data extraction and report

The selected studies were analyzed, and the information of interest was extracted onto a database created using Microsoft Excel for Microsoft Mac (Microsoft Corporation, Redmond, Washington, USA). The following data were recorded, if available: number and age of patients, type of fracture (B2/B3), BMI, time from THA to fracture, clinical outcomes by hip functional scores, surgical time, surgical approach, type of fixation including the use of bone cement, reoperation rate, postoperative complications, blood loss, average hospital stay and length of follow-up. The extracted data were reported by descriptive statistics. Continuous variables were reported as average and range (minimum–maximum). Categorical variables were reported as frequencies and percentages.

Outcome measures

Measured outcomes included the type of surgical procedure used in ORIF and SR. Data on surgical time, in hospital stay and perioperative blood loss were retrieved from selected manuscripts. The complication rate related to each surgical procedure was indagated, including surgical and not surgical complications, aseptic and septic loosening, intraoperative fractures, wound dehiscence, dislocation, hematoma, etc. Revision rate included all reasons that led to failure and required surgical stem implant revision, including aseptic loosening, dislocation, isolated fracture or periprosthetic fracture. Moreover, functional scores pre- and post-operatively were recorded and compared, analyzing the Parker Mobility Score (PMS) and the Harris Hip Score (HHS).

Statistical analysis

Primary grouping for analysis was based on the type of surgical indication in periprosthetic Vancouver B2/B3 hip fractures. The endpoints that were evaluated included reoperation, complication, transfusion, surgical time and hospital stay. To assess the effect of stem revision vs. internal fixation on dichotomous variables, weighted risk ratios (RR) were calculated to pool study and control groups in each publication for analysis. Random effects models were used regardless of the heterogeneity assessments. Forest plots of the SMD or RR were generated with all studies. The I2 statistic was used to help assess heterogeneity. Fixed effects models were used for sensitivity analyses. R Core Team (2023). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. <https://www.R-project.org/>. software was used to perform the analyses [15, 16].

Results

Search results

As shown on the PRISMA FLOWCHART (Fig. 2) 2076 manuscripts were found at the initial search. 2047 studies were eliminated from consideration based on their title and abstract, because these were deemed unrelated to the current investigation. To ensure that the remaining 29 manuscripts met the inclusion criteria, a thorough evaluation was conducted: 15 studies satisfied the inclusion criteria after full text screening, with a total of 1629 recruited patients, 564 managed by ORIF and 1065 by SR.

Fig. 2
figure 2

Prisma flowchart

Study quality assessment

All the selected studies were retrospective case series of medium to large size (N range 51–278 patients) reporting the outcomes of SR vs. ORIF in PFF classified as Vancouver B2 and B3. All retrieved manuscripts were published between 2006 and 2023. According to the MINORS evaluating score system, on a maximum of 16 points, 1 study achieved 8 points [17], 1 study achieved 10 points [18], 2 studies achieved 12 points [19, 20], 5 studies achieved 13 points [21,22,23,24,25], 4 studies achieved 14 points [26,27,28,29], 1 study achieved 15 points [30] and 1 study reached 16 points [31].

Cohort characteristics

The studies included 1629 patients with a mean age of 77 years (range 66.8–87); average follow-up was 44.4 months (range 6–78 months). The median BMI was 25.4. Considering the study population according to Vancouver B2 or B3 type, our work featured 1128 B2 and 333 B3 fractures.

Surgical technique

Revision surgery consisted of stem revision by longer stem implant with or without the use of plate and/or cable wires, regardless of the use of bone cement; internal fixation included the use of locking compression plate (LCP), in some cases combined with Locking Attachment Plate (LAP), and limited contact dynamic compression plate (LC-DCP), with or without the use of cable wires. Surgery was performed by direct lateral approach in all the studies that reported surgical approach [21, 22, 25, 26, 28].

Complications and mortality

10 of the 15 studies reported complications, distinguishing among ORIF and SR [17, 20, 21, 25, 26, 28,29,30,31,32], reporting data on 979 patients (437 ORIF and 542 SR). More specifically, 7 of the 15 studies [17, 20, 21, 26, 28, 29, 32] reported subgroups of complication in a total of 629 patients, 288 treated with ORIF and 341 treated with SR. 23 out of 288 (7.9%) ORIF patients developed medical complications and 30/288 (10.4%) developed implant-related complications. 32 out of 341 (9.3%) SR patients developed medical complications and 42/341 (12.3%) developed implant-related complications. The most frequently reported postoperative complication was infection (14/288, 4.8% in ORIF group vs. 18/341, 5.2% in SR group), followed by aseptic loosening (13/288, 4.5% in ORIF vs. 6/341, 1.7% in SR), dislocation (4/288, 1.4% in ORIF vs. 14/341, 4.1% in SR), malunion (7/288, 2.43% in ORIF vs. 3/341, 0.9% in SR), refracture (3/288, 0.9% in ORIF vs. 6/341, 1.7% in SR) and implant rupture (1/288, 0.3% in ORIF vs. 1/341, 0.3% in SR). As regards mortality in the first postoperative year, 17/288 (5.9%) cases were reported in ORIF group and 14/341 (4.1%) in the SR group.

Hip function scores

10 of the 15 studies reported function scores [17, 18, 20,21,22, 25, 26, 28,29,30], describing a general improvement from preoperative to postoperative condition. The Parker Mobility Score (PMS) [33] was the most commonly utilized [18, 21, 25, 28, 29], followed by the Harris Hip Score (HHS) [34]. In the studies where PMS was used, patients treated with ORIF had an average of 4.2 PMS score while SR group had a 6.3 PMS score (p = 0.022). The evaluation with the post-operative HHS showed better results of SR compared to ORIF (HHS 67.5 for O vs. 74.8 for R), although not statistically significant (P = 0.148).

Meta-analysis and risk of bias

Risk of bias was assessed for each individual study (Fig. 3). Fifteen studies were amenable for being included in the meta-analysis.

Fig. 3
figure 3

Risk of bias assessment for included studies (GRADE)

The pooled random EM estimated for the reoperation (Fig. 4) was 0.87 (95% CI, 0.39–1.96; I2 = 78%) for fixation surgery, showing a lower reoperation rate in favour of ORIF procedure. In fact, out of 323 patients treated with fixation only 41 needed reoperation (12.7%), while in the SR group 84 out of 453 patients (18.5%) required re-operation.

Fig. 4
figure 4

Reoperation

As regards the overall complications (Fig. 5) the random EM was 1.01 (95% CI, 0.45–2.27; I2 = 85%) for revision surgery; however, EM value in this case suggests similar outcomes in both groups. In the 10 studies reporting data from complications, 6 showed lower rates for ORIF [17, 22,23,24, 26, 27], and 4 had a lower rate of complications when SR was performed. 78/437 (17.8%) ORIF patients developed complications compared to 117/542 (21.5%) patients in SR group.

Fig. 5
figure 5

Complications

The random EM for transfusion (Fig. 6) was 0.72 (95% CI, 0.46–1.12; I2 = 62%) in favour for fixation surgery.

Fig. 6
figure 6

Trasfusions’ rate

The evaluation for in hospital stay (in days) (Fig. 7) and surgical time (in minutes) (Fig. 8) showed better outcomes for ORIF, with a reduced time for both parameters. The ORIF group registered an average hospital stay of 12.6 days, versus 18 of the stem revision group. Regarding surgical time, ORIF required an average 129.3 min, while SR required mean 161.2 min. It was not possible to perform an analysis on cemented vs. uncemented and collar use due to lack of clarity and data within the papers.

Fig. 7
figure 7

Stay in hospital

Fig. 8
figure 8

Surgical time

Discussion

The most important finding of the current study is that in Vancouver B2/B3 periprosthetic fractures of the hip, ORIF surgery obtained better results compared to SR in several evaluated outcomes, showing a lower re-operation rate, reduced surgical time and in hospital stay, with a lower requirement for blood transfusion. Overall complications showed similar results in both procedures. In PFF patients, SR procedures showed superior outcomes in terms of fracture union rates and long-term functional results.

Analyzing recent literature, only three meta-analysis are available on the current topic; two were released in 2021 and 2022 [35, 36], and these showed that PPF management by SR was associated to a reduced risk in reoperation and revision rate. In the meta-analysis by González‑Martín [37], unclear results about the superiority of ORIF treatment over SR were reported, and the Authors supported the use of ORIF in selected patients with low functional request and multiple comorbidities.

PFF patients are typically associated with poor functional outcomes; therefore, to better understand of the clinical conditions, several measures are used to evaluate overall function before and after treatments: these scores include HHS, PMS, OHS, PROMIS and VAS pain assessment, and these provide standardized ways to assess pain levels, functional ability, and overall patient satisfaction [38, 39]. In the studies in which PMS was evaluated [18, 21, 25, 28, 29], there were better postoperatively functional outcomes for the patients treated by SR compared to ORIF; similar findings were obtained observing the post-operative HHS in the two groups [17, 20, 22, 29], Moreover, the authors reported similar outcomes also for pain, stating that SR patients could better manage postoperative pain [18, 20, 29, 30]. Most probably, these results can be attributed to the a priori patient selection in retrospective the cohorts; in fact, ORIF was reserved for patients with comorbidities and compromised general health status [18, 25], while SR was performed in younger and more healthy patients.

Overall revision rate was higher in SR group. A total of 7 manuscripts included this parameter, three showing better reoperation rate with SR [20, 23, 31] and 4 supporting ORIF [21, 26,27,28]. Powell et al. [27] et al. in his study showed a very low rate (4/152) of reoperation in ORIF patients. Conversely, Lindhal et al. [23] in a cohort of 153 SR and 21 ORIF, showed better outcomes in the revision group. These contrasting results that favour SR or ORIF are most probably related to the dedicated expertise of the institution where the surgery was performed, whether it was a trauma or an arthroplasty facility.

In the current metanalysis, the overall complication rate was similar between the two groups of treatments. In literature complications are reported after surgery for PFF in up to 48% of patients [40]. The risk of complications in PPF of the hip is a significant concern; SR and ORIF showed similar complication rates, confirming data from literature [17, 18, 29]. Therefore, the choice of treatment most probably requires balance of the patient’s overall health status, and the available expertise at the treating centre [8].

Patients affected by hip PPF, are typically affected by several comorbidities [41] and show poor bone quality and low body muscle mass [42]. Seen the fragility of PPF patients, the estimated surgical time and blood loss are of critical importance to perform a safe surgery minimizing intra and post-operative risks [18]. The analysis of the included studies suggests that ORIF for PPF may require less transfusions compared to SR. Several studies reported lower transfusion rates in ORIF patients [21, 22, 24,25,26,27,28, 30, 31]. Common sense suggests that the finding is due to the less invasive nature of ORIF procedures, which generally involve smaller surgical exposures and limited bone and soft tissue dissection. The preservation of the native femoral stem in osteosynthesis may also contribute to the reduced blood loss as compared to the more extensive exposure required in SR surgery [20]. Furthermore, ORIF surgery is typically associated to a reduced surgical time [21, 25, 26, 28] compared to SR. Martinov et al. [24] in a study of 90 patients treated by either procedures found that surgical time in SR patients almost doubles ORIF (134 vs. 225 min).

The limited blood loss and shorter surgical time of PPF patients managed by ORIF are factors determining the reduced in-hospital stay for patients with PPF managed by osteosynthesis compared to implant revision [27, 29, 31]. Zheng et al. [29] confirmed this finding; in their study, they also investigated the time in Intensive Care Unit for ORIF and SR procedures, and found an average permanence of 0.2 days in PPF patients operated by osteosynthesis compared to an average 0.7 days for SR.

It is important, when discussing about periprosthetic fractures, to understand whether the implant is cemented or not, and whether modular collars are used in the implants. These factors can influence the survival of the implants.

In fact both present advantages, cemented stems provide stability through controlled subsidence within a cement mantle, which stabilizes upon cement polymerization. Some author recommend ORIF alone for cemented stems with periprosthetic fractures if the cement–bone interface remains intact and only the cement-stem interface is disrupted by the fracture [13]. Managing fractures around cemented implants with ORIF is often more technically challenging because both the bone-cement and cement-stem interfaces contribute to implant stability. Additionally, patients with cemented implants frequently have poor bone quality and may ultimately require revision arthroplasty if ORIF alone cannot restore stem stability after a periprosthetic fracture, further complicating the case [30].

Uncemented implants are easier to revise and do not carry a risk of cement extrusion into the fracture site or interference on fracture healing, leading to non-union. Uncemented, extensive coated prostheses have been shown to perform better than cemented stems for revision in type B (B1-B3) fractures. Disadvantages include limited weight-bearing in the immediate post-operative period, stress shielding and stem subsidence [43].

Currently, there are few studies in the literature on this topic, and their results are sparse and limited, but the data present reported that there was no significant difference in the reoperation rate and survival following use of the cemented stems and use of the uncemented stems [7, 44].

Our study, like many systematic reviews and meta-analyses, has several limitations. These include the presence of selection bias, as well as the restricted availability of complete information from the included reports. Additionally, there is a possibility of language selection bias since only studies published in English were included, potentially excluding relevant studies published in other languages. It is important to acknowledge these limitations because these may impact the chance to generalize and extend our findings to the general population. However, the decision to include only studies reporting at least 50 patients gives strength to our findings because only outcomes from high volume surgical centres were included. Finally, greater surgical expertise towards revision or ORIF is a factor that can certainly influence the results. This factor is not identifiable and is not described in the papers that can be selected in the literature.

In conclusion, ORIF is a reliable alternative to SR to manage instable periprosthetic fractures around hip stems; it is associated to a reduced rate of reoperations, transfusion, surgical time and in-hospital stay, making it more suitable for patients with reduced functional requests and worse general health status. SR is associated to better functional outcomes, and it should be mainly considered in PFF patients with less comorbidities.