Background

Total hip arthroplasty (THA) is one of the most common orthopedics surgeries. It is the preferred cost-effective treatment for osteoarthritis and other end-stage hip abnormalities. Patients experience a significant improvement in joint function as well as the quality of life following THA [1]. Studies suggest a rising trend in the number of performed THAs during the last decade [2]. From 2000 to 2014, the number of annual performed THAs increased by 105% in the USA. It is also projected that by 2030, this number will increase by 71.2%, reaching 635,000 procedures per year [3]. Total hip replacement also imposes a high economic burden on healthcare systems, with US hospitals bearing a staggering cost of $ 15 billion annually [4].

Patients scheduled for bilateral THA usually undergo two different timing sets of surgeries: simultaneous or staged. Simultaneous BTHA is performed in single hospital admission and under the same anesthesia. On the other hand, staged BTHA is executed at separate intervals in two hospitalizations and under two distinct anesthesia [5]. In 1971, Charnley et al. introduced simultaneous THA for bilateral hip pathologies, a noteworthy revolution in orthopedic science [5, 6]. Since then, there has always been controversy over which method could have better outcomes.

In 2016, Shao et al. conducted a systematic review comparing simBTHA and stgBTHA. It was revealed that surgery time, deep vein thrombosis (DVT), and major systemic complications were significantly lower in simBTHA compared to stgBTHA [7]. In 2019, another systematic investigation performed by Huang et al. also demonstrated lower rates of DVT, pulmonary embolism (PE), and respiratory complications in simBTHA [8].

There is still debate concerning this critical issue, and many original studies have been conducted since the last published systematic review. Previous reviews have focused on systemic and surgical complications, blood loss, operation time, and mortality as their primary outcomes. Essential factors such as readmission, revision, hip joint function, and cost have been considered less. Thus, a thorough review of the available data is required to identify the best way to perform bilateral THAs. The forthcoming systematic review aims to make a more comprehensive and accurate comparison between simultaneous and staged BTHA with a higher sample size and additional related outcomes.

Method

The protocol of this study was registered on PROSPERO (CRD42022310240). We followed the Cochrane guidelines for meta-analysis during the process [9]. Our study phases were based on the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines [10]. The PRISMA checklist is presented in Additional file 1.

Search strategy

We searched the electronic databases MEDLINE, Web of Science (WOS), Embase, and Scopus for relevant articles in any published language; the last updating search was performed on February 15, 2022. The keywords are exhibited in Additional file 2. In addition, we explored the reference part of the articles that fulfilled our eligibility criteria. We also used the “related articles” feature in PubMed to avoid probable missing.

Eligibility criteria

PICOS categories (population, intervention, comparator, outcomes, and study design) were applied to define our inclusion criteria. We included studies only if they were executed to compare mortality, complications, costs, or other possible outcomes between simBTHA and stgBTHA. Eligible study designs were randomized controlled trials (RCTs), non-randomized clinical trials, prospective and retrospective cohort studies, and case–control investigations. We did not impose any restrictions on the length of follow-up and year of publication. Exclusion criteria were reviews, research letters, conference abstracts, non-English articles, duplicate publications, irrelevant articles, non-human models, studies comparing simBTHA to unilateral THA, and resurfacing or revision surgery.

Systemic complications were defined as cardiovascular, pulmonary, gastrointestinal, urologic, and neurologic complications, hypotension, anemia, DVT, and PE. Notably, we did not include PE in the pulmonary complications in the meantime of analysis. Local complications in our study were defined as wound infection, decubitus ulcer, hematoma, dehiscence, neurapraxia, vascular injury, accidental laceration or puncture, chronic soft tissue pain, neuroma, wound drainage, superficial infection, and ectopic ossification.

Data extraction

We imported all the studies into Rayyan online tool [11] in order to screen conveniently. After resolving duplicates, two researchers (AR, AS) completed an initial independent review to determine if the studies met the inclusion criteria hinged upon the title and abstract. Then, the two prior reviewers (AR, AS) evaluated each in the full-text screening phase. In case of any discrepancy, a third reviewer (AG) became involved and resolved it.

We prepared an electronic spreadsheet according to the Cochrane's template for data extraction of intervention reviews. Two separate reviewers fulfilled the data extraction (AR, AG). We acquired the following data from the studies: first author's name, publication year, country, study design, the sample size, mean age, gender, mean body mass index (BMI), American Society of Anesthesiology (ASA) classification, the interval between stages, duration of follow-up, primary and secondary outcomes including mortality, DVT, PE, fracture, dislocation, deep infection, any other complications, revision, readmission, operation time, blood loss, blood transfusion, length of hospital stay (LOS), hospital cost, and functional measures. Raw data were reviewed by another researcher (AS) to settle any disagreement. We also tried to contact the corresponding authors of the included articles regarding raw data or missing information. Patients with an ASA score of 1 or 2 were categorized as ‘low risk,’ and patients with an ASA score of 3 or 4 were categorized as ‘high risk’ [12].

Methodology assessment

To assess the quality of each study, we employed the Newcastle–Ottawa Scale (NOS) for observational and non-randomized investigations. Briefly, the NOS evaluates a study according to three main characteristics: selection of groups, comparability, and outcome assessment [13]. We judged the quality of included studies according to the previous classification described in a meta-analysis by Simunovic et al. [14]. Studies with a score > 6 were categorized as high quality. Those with a score of 5 or 6 were classified as medium quality. Articles scored less than 5 were assigned as a low-quality study. Concerning randomized clinical trials (RCTs), we utilized the Cochrane Collaboration tool to assess the risk of bias. Two reviewers (AR, AS) independently assessed each study's quality. Disagreements were determined by consensus or involvement of the corresponding author (SHS).

Statistical analysis

We performed meta-analysis using the Comprehensive Meta-Analysis software (Biostat, Englewood, NJ, USA, Version 3.3) if three or more studies reported a particular outcome. For dichotomous variables, odds ratios (ORs) were calculated and pooled for all investigations. Meta-analysis of dichotomous variables was committed through the Mantel–Haenszel (MH) method, with 95% confidence intervals (CI). Meta-analysis of continuous data was performed by applying the mean and standard deviation of outcome measures with 95% confidence intervals (CI). For studies that reported only data ranges without standard deviations, we calculated SDs using the formula suggested by Walter & Yao [15]. A p value less than 0.05 was considered statistically significant. We analyzed heterogeneity among the studies using the I2 test [16]. I2 > 50% with a p value < 0.05 suggested high heterogeneity. A fixed-effects model was utilized if low statistical heterogeneity among the studies was discovered (I2 < 50%). A random-effects model was used if high heterogeneity became proven. We also detected potential publication bias by using Begg’s funnel plots and the Egger test [17].

Results

Search results

After deleting duplications, we identified 5324 potentially relevant titles from the mentioned databases. Based on the titles and abstracts, 5236 publications were excluded. Full texts of 88 remaining publications were screened. Finally, in this systematic review, 38 studies, including 104,151 patients (29,551 simBTHA and 74,600 stgBTHA), were entered into the quantitative analysis. A flowchart summarizing the selection process is provided in Fig. 1.

Fig. 1
figure 1

PRISMA flowchart showing identification, screening, and inclusion of studies for review

Study characteristics

Among the 38 included studies, 2 studies [18, 19], including 348 patients, were RCTs and 36 studies were non-RCTs [20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55]. The baseline characteristics of the articles are displayed in Table 1. Studies were in the English language and were published from 1978 to 2022. The duration of follow-up was at least 3 months. The sample size of included studies ranged from 15 to 42,238. The mean age of participants was 57.6 years for simBTHA and 63.2 years for stgBTHA. The male-to-female ratio was 1:1.29. Raw data for ASA classification were reported in 14 studies [18, 19, 24, 25, 33,34,35, 37, 41, 42, 45,46,47, 49]. Regarding ASA score, 13% and 18% of patients in simBTHA and stgBTHA were considered high risk (ASA 3 or 4), respectively (Table 1).

Table 1 Main baseline characteristics of the included studies

Quality assessment

Randomization methods, outcome assessment blinding, incomplete outcome data, and selective data reporting were low risk for both RCTs. Although the allocation method was not reported in one RCT, all other included studies were observational, comprising one prospective cohort, seven registries, nineteen retrospective cohorts, and nine retrospective case controls. The risk-of-bias assessment results for both randomized and observational studies are summarized in Table 2.

Table 2 Quality assessment of the eligible studies

Mortality and complications

Pooled analysis of 11 studies on DVT (OR = 0.639, p = 0.044, Fig. 2a), 12 studies on pulmonary complications (OR = 0.533, p < 0.001, Fig. 2c), 14 studies on systemic complications (OR = 0.803, p = 0.048, Fig. 3a), and 16 studies on local complications (OR = 0.736, p < 0.00, Fig. 3b) exhibited that these complications are lower in simBTHA. However, PE, reported in 12 studies (OR = 1.925, p < 0.001, Fig. 2b), and periprosthetic fracture, reported in 13 studies (OR = 1.306, p = 0.049, Fig. 4b), were higher in simBTHA. 90-day mortality, reported in eight studies (OR = 1.101, p = 0.815, Fig. 5), periprosthetic joint infection, reported in nine studies (OR = 1.112, p = 0.508, Fig. 4a), and dislocation, reported in 14 studies (OR = 0.760, p = 0.153, Fig. 4c), were similar between the two groups (Table 3).

Fig. 2
figure 2

Forest plot of a DVT, b PE, and c pulmonary complications. M-H, Mantel–Haenszel; OR, odds ratio; 95% CI, 95% confidence interval

Fig. 3
figure 3

Forest plot of a systemic complications and b local complications. M-H, Mantel–Haenszel; OR, odds ratio; 95% CI, 95% confidence interval

Fig. 4
figure 4

Forest plot of a periprosthetic joint infection, b periprosthetic fracture, and c dislocation. M-H, Mantel–Haenszel; OR, odds ratio; 95% CI, 95% confidence interval

Fig. 5
figure 5

Forest plot of 90-day mortality. M-H, Mantel–Haenszel; OR, odds ratio; 95% CI, 95% confidence interval

Table 3 Summary of postoperative mortality and complications reported in each included study

Perioperative and postoperative relevant outcomes

The overall effect of included studies demonstrated that simBTHA was lower in terms of length of stay (MD = −4.777, p < 0.001, Fig. 6) (26 studies), operation cost (USD) (MD = −2464, p < 0.001, Fig. 7c) (11 studies), and blood loss (MD = −254.785, p < 0.001, Fig. 7a) (12 studies). Pooled data of nine studies showed that the simBTHA group experiences a mean 1.37 point improvement over the stgBTHA group in postoperative Harris Hip Score (HHS) (MD = 1.370, p = 0.006, Fig. 8a). There was no significant difference in the revision rate (OR = 1.033, p = 0.572, Fig. 9a) (ten studies), readmission rate (OR = 0.997, p = 0.980, Fig. 9b) (six studies), blood transfusion rate (MD = 0.114, p = 0.286, Fig. 7b) (12 studies), and postoperative limb length discrepancy (LLD) (MD = −0.391, p = 0.312, Fig. 8b) (seven studies) (Tables 4 and 5).

Fig. 6
figure 6

Forest plot of LOS. MD, mean difference; 95% CI, 95% confidence interval

Fig. 7
figure 7

Forest plot of a total blood loss, b blood transfusion need, and c total cost. MD, mean difference; 95% CI, 95% confidence interval

Fig. 8
figure 8

Forest plot of a postoperative HHS and b postoperative LLD. MD, mean difference; 95% CI, 95% confidence interval

Fig. 9
figure 9

Forest plot of a revision and b readmission. MD, mean difference; 95% CI, 95% confidence interval

Table 4 In-hospital important outcomes reported in each included study
Table 5 Postoperative important outcomes reported in each included study

Systematic review of heterogeneous data

Based on 12 studies [18,19,20, 24, 25, 29, 34, 37, 38, 41, 42, 54], the mean operation time was 171.4 min for simBTHA and 191.4 min for stgBTHA. Cumulative operation time for both surgeries in stgBTHA was longer than simBTHA operation time in all studies except the study by Kim et al. [42]. Although postoperative Western Ontario and McMaster Universities Arthritis Index (WOMAC) scores were reported to be similar between the two groups [35], two studies reported significantly higher scores of Oxford Hip Scores [56] or EuroQoL-5D index [42] in simBTHA compared to stgBTHA. In contrast, another study by Kamath et al. [37] stated no statistical difference between the two groups in mentioned functional outcomes. Functional recovery was faster in simBTHA, as walking without support started earlier [36] and walking capacity was better postoperatively [21, 28]. Rates of home-discharged patients for stgBTHA were higher in all studies [25, 26, 40, 41, 43, 49, 54].

For 90-day mortality, systemic complications, operation cost, LOS, blood loss, blood transfusion rate, HHS, LLD, and high heterogeneity existed between studies (I2 ranged from 59.909 to 99.729%). Begg’s funnel plots are shown in Additional file 3.

Discussion

SimBTHA has continued to attract attention since Charnley first introduced this type of orthopedic surgery. Many studies comparing simBTHA and stgBTHA have been conducted since then but, due to small sample size or other undetermined possible reasons, failed to obtain a definite conclusion. We conducted a comprehensive systematic review and meta-analysis of 38 comparative studies enrolling 104,151 patients. Findings of this updated meta-analysis generally concur and further extend that of previous reviews on the topic, providing several relevant results that have not been previously addressed.

Mortality and complications

The combined 90-day mortality rate was 0.22% for simBTHA and 1.57% for stgBTHA. Nonetheless, the 90-day mortality analysis failed to show any significant difference between the two groups. Since most included articles were retrospective studies, we should interpret the present results with caution. Previous studies have also posed no significant difference in mortality rate between the two groups [7, 32, 33, 48, 57].

Periprosthetic joint infection (PJI), as an uncommon complication of THA [58], can incur costs for the patient and healthcare system [59]. PJI can also lead to secondary surgery and even death [60]. No significant difference was observed regarding the PJI rate between the two groups. However, our results contrast with the previous review [7], which indicated a significantly higher infection rate in one-stage versus two-stage. Shao et al. [7] computed the risk in the cumulative number of superficial and deep infection cases, so their effect on subsequent procedures on hospitalization might be diverse. The overall PJI rate was 0.91% in the simBTHA group and 0.87% in the stgBTHA group. The overall PJI rate for both groups was higher than in previous studies [39, 61].

We investigated periprosthetic fracture between the two groups, and contrary to previous studies [5, 7, 41, 51], the incidence of fracture in simBTHA was higher than in stgBTHA. The unanticipated increased fracture risk in simBTHA can be attributed to the cemented or cementless fixation [62] and operation time in a single surgery. As in the previous meta-analyses [5, 7, 63], no clinically significant difference was seen in the occurrence of dislocation between the two groups in our study.

We found a significantly lower risk of DVT in simBTHA compared to stgBTHA. This finding is consistent with previous studies [7, 8]. Lower activity levels in stgBTHA due to pain in the contralateral hip can justify the elevated risk of DVT in stgBTHA [64]. Despite simBTHA patients having an associated lower risk of DVT, we observed an increased risk of PE in simBTHA compared to stgBTHA. Still, other investigations revealed no difference [5, 7, 57] or an elevated risk of PE in StgBTHA [8] PE, consuming a huge part of medical resources [65], can yield in-hospital and post-discharge mortality [66]. A large-scale data registry study by partridge et al. [48] suggested that simBTHA is associated with a greater risk of developing PE. This study included more than half of our study population and maybe has shifted the results toward itself. However, the quality of this study was high and might not have imposed bias on the results. We should consider that pharmacological thromboprophylaxis can reduce thromboembolic events [67], and many risk factors affect PE incidence [68].

The stgBTHA was associated with a higher risk for postoperative pulmonary complications. Malcolm et al. also reported a 1.42% respiratory complication rate for THA, similar to the simBTHA group in our study [69]. In our study, the pulmonary complications rate in simBTHA and stgBTHA was 1.69% and 2.38%, respectively.

On the other hand, a higher risk of systemic and local complications in the stgBTHA was evidenced. Similar results were reported by Aghayev et al. [28]. Poultsides et al. [43] and Guo et al. [47] also presented that the rate of systemic complications in simBTHA was lower than in stgBTHA.

Other outcomes

Combining the results of 10 studies revealed no significant differences in revision rate between the simBTHA and stgBTHA. Our findings are compatible with the previous study [46] published on this topic. Another study by Garland et al. [33] indicated a slightly higher risk of revision for stgBTHA. There were no significant differences among simBTHA and stgBTHA concerning readmission rates in keeping with previous studies [41, 47, 48].

Our research shows that simBTHA is superior to stgBTHA in terms of cumulative operation time, hospital cost, and LOS. The simBTHA surgery is performed in one session, while the stgBTHA surgery is performed in two sessions. Undergoing two operations, which obviously has a longer cumulative operation time, means a more extended anesthesia period which is correlated with increased risk of infection [70], venous thromboembolism (VTE) [71], neurologic deficit [72], revision, intraoperative blood loss, transfusion, and other critical adverse events [73, 74]. Operation time is a potentially modifiable risk factor that engages surgeons and healthcare systems interested in quality improvement. Sodhi et al. [75] saw that operation time is significantly associated with LOS, and LOS has also been a major driver of cost in THA [76]. Mean LOS for simBTHA was 4.8 days less than stgBTHA, which can justify more costs and complications in stgBTHA. However, operation time is varied by various factors such as operating technique, surgery approach, general or epidural anesthesia, patient's demographics, and surgeon's experience. Although almost all studies demonstrated a lower cost, and LOS in simBTHA, researchers utilized various methods to calculate these data. Therefore, high heterogeneity was observed in the pooled data.

The aggregate results of our study indicated that simBTHA outperformed stgBTHA in reducing perioperative total blood loss. Previous studies also showed a higher cumulative blood loss in stgBTHA compared to simBTHA [5, 18, 24]. Interestingly, in this meta-analysis, despite a lower total blood loss in simBTHA, analysis of transfusion units did not show any significant difference between the two groups. It should be taken into account that indications for blood transfusion in different studies were not the same. Another reason for similar rates of blood transfusion could be the interval between two operations in stgBTHA that provides enough time for hematopoiesis. In a retrospective study [39], comparing infection rates after THA, blood transfusion has found to be a powerful risk factor for PJI, and patients who underwent simBTHA had a higher blood transfusion rate than stgBTHA. In contrast, another study by Parvizi et al. [25] revealed that the cumulative blood transfusion was lower in simBTHA compared with stgBTHA. As higher blood loss is accompanied by more need for blood transfusion in which itself is associated with a higher risk for infection [77], immunosuppression [78], and even death [79], blood loss stands as a significant concern in major orthopedic surgeries [80].

Although the pooled results of analysis favored simBTHA in terms of the postoperative HHS, but a 1.37 point improvement is not clinically significant based on the prior evidence [81]. Kim et al. [42] found that the mean postoperative HHS was significantly higher in simBTHA than in stgBTHA, and they mentioned that better functional outcomes in simBTHA could be because of the accuracy of surgery, earlier starting rehabilitation for both operated hips, and reduced time lost from work in a simultaneous procedure. The diversity of functional outcome measure types did not allow us concluding precisely regarding hip joint function. Using a comprehensive and unified tool that includes important items for hip joint function evaluation can help us decide more precisely which type of surgery is appropriate for specific situation.

Concomitant to our results, several studies have exhibited no difference in LLD between simBTHA and stgBTHA [36, 37, 40]. However, LLD can yield patient dissatisfaction after THA [82]. It also has been indicated that LLD can worsen functional outcomes such as Oxford Hip Score [83].

The strength points of this meta-analysis comprise peer-reviewed comparative studies and a rigorous assessment of the methodological quality of the currently available data. This study enhanced the power to compare the clinical outcomes of simBTHA and stgBTHA through more excellent details. With respect to the previous meta-analysis [8], we used explicit exclusion and inclusion criteria. We also utilized a robust search strategy spanned multiple databases, yielding 38 published studies on the topic, twice the number of included studies in the previous meta-analysis.

Our study has several potential limitations. First, due to the limited number of RCTs, we included non-RCTs, too. As we know, retrospective studies vary in terms of quality, making our study susceptible to bias and confounding. Second, we also excluded non-English studies, which may cause language bias in our research. Third, lacking a specific definition for some outcomes like operation time and variety of measurements may bias our findings. Fourth, most of the studies did not report outcomes according to surgical approach, method of anesthesia, use of antibiotics and thrombosis prophylaxis, primary diagnosis, and demographic data. Although our goal was not to compare these data, they could have influenced the accuracy of our results. Fifth, some studies did not contain raw data for pooled analyses. Although we tried to contact the authors, we could not get these data. Sixth, each study’s criteria for blood transfusion were different or not mentioned. Seventh, the number of participants varied considerably among the included studies, ranging from 15 to 42,238. Eighth, National registry data studies have some missing information about patients and these studies may also underestimate complications rates which could have influenced the final result. Ninth, follow-up periods were heterogeneous among studies. Tenth, HHS measurements were done at different times, which might have biased our results. At last, we combined different complications to obtain two categories: systemic and local. However, some studies avoided reporting complications separately, so they put together all of them without paying attention to the different severity, which limits the conclusion's reliability.

Conclusion

Taken together, this meta-analysis demonstrated that simultaneous and staged THA have similar 90-day mortality, dislocation, and PJI rates. A statically significant risk reduction was identified in DVT, pulmonary, systemic, and local complications in the simBTHA group. Interestingly, stgBTHA is more promising in terms of PE and fracture rate. The present study also revealed that simBTHA is associated with lower total blood loss, length of stay, and total surgery cost. Reduced length of hospital stay and total surgery cost as essential advantages of simBTHA compared to stgBTHA may attract healthcare providers' and policy-makers' attention. After all, simBTHA remains noninferior to the stgBTHA in most postoperative outcomes. Anyhow, we recommend that well-designed randomized controlled trials should be conducted to elucidate the advantages of each surgery in order to help surgeons choose the proper surgical method hinged on their point of view and patient's benefits.