The demand for primary total joint replacement (TJR) in developed countries has been growing rapidly. It is expected to surpass the 4 million mark in the United States by 2030 (3.48 million total knee replacements and 0.57 million total hip replacements).1 With the expected increase in TJR procedures, it is important to investigate parameters that could optimize medical outcomes as well as resource allocation and utilization. The use of neuraxial anesthesia (NA), which includes both spinal anesthesia and epidural anesthesia, may reduce hypercoagulability following surgery, increase regional blood flow, reduce the stress response, and provide good pain control.2,3 Previous studies reported that orthopedic procedures under NA may be associated with a lower 30-day overall complication rate, better 30-day survival, a shorter hospital stay, and fewer blood transfusions than general anesthesia (GA).4-8 Considering the short-term benefit, NA may be preferable for patients undergoing TJR. Little is known about the effect of anesthetic management on long-term outcomes.

In recent years, some studies reported an association between anesthetic management and cancer recurrence.9,10 These findings may therefore imply an association between anesthetic management and long-term outcome. Although NA may reduce the number of postoperative complications and improve short-term mortality in patients undergoing TJR, it remains unknown whether it may also influence long-term survival in these patients.11,12

The primary aim of this study was to evaluate the effect of anesthetic management on long-term survival rate for patients undergoing TJR. The short-term survival rate, length of stay (LOS), and hospital treatment costs were also evaluated. We hypothesized that NA may improve the long-term overall survival rate when compared with GA in TJR patients.

Methods

Database and study design

The institutional review board (IRB) of Kaohsiung Medical University Hospital issued a formal written waiver of the need for consent (KMUH-IRB-EXEMPT-20140055, October 2014). Data for the Longitudinal Health Insurance Database (LHID) were collected by systematically and randomly sampling the National Health Insurance Research Database (NHIRD), which includes data for one million individuals. The National Health Research Institute of Taiwan reports that the patients in the LHID and those in the original NHIRD do not significantly differ in sex distribution, age distribution, or average insured payroll-related amount. Data in the NHIRD are de-identified and aggregated for analysis. Written consent of the patients analyzed in this study was not required. The NHIRD provided an opportunity to study the effects of different types of anesthesia on the short- and long-term outcomes of patients who underwent total hip or knee replacement procedures.

The principal procedure codes 81.51 and 81.54 were used to identify total hip and knee replacements, respectively. For patients who had undergone repeated total hip or knee replacement during the study period, we selected only the first procedure as the index hospitalization. We excluded patients for whom the type of anesthesia was not known. Santaguida et al. found that age, sex, comorbidities, and surgical volumes were statistically associated with outcomes after TJR.13 The International Classification of Diseases—9th revision—Clinical Modification codes (including primary and secondary diagnoses) were used to identify comorbidities for the day prior to the TJR. The primary study outcome was all-cause mortality during the 14-year follow-up. The patient’s date of death was determined according to the National Register of Deaths of the National Health Insurance Administration (NHI). The analysis period began from the index date and ended at the death of the patient, withdrawal from NHI, or the end of 2010, whichever occurred first. This is a nationwide population-based study. A formal sample size estimate was not performed.

Potential confounders

Clinical significance guided the initial choice of covariates: age, sex, comorbidities, surgery type, and the hospital’s characteristics (high or low volume of procedures). The comorbidities included in the model were coronary artery disease, congestive heart failure, cerebrovascular disease, dementia, diabetes, pulmonary disease, renal disease, liver disease, peptic ulcer disease, rheumatological disease, hemiplegia or paraplegia, malignancy, and acquired immune deficiency syndrome. Annual hospital and surgeon volume for TJR were categorized into a dichotomous variable. We chose cutoff points to divide the study population into approximately quarters for each of the years under study (1997 through 2010).14-16 Hospitals/surgeons that fell in the highest quartile for the numbers of TJRs performed were considered “high-volume” hospitals/surgeons, and the others were “low-volume” hospitals/surgeons.

Statistical analysis

To reduce the confounding factors, we conducted a propensity-score-matching model as described by Rosenbaum and Rubin.17 Stratification by propensity score replaces covariates that may be present in an observational study with variables for these factors. A non-parsimonious logistic regression model was performed to estimate a propensity score for type of anesthesia.18 We used a structured iterative method to refine this logistic regression model to achieve balance of confounders within the matched pairs.16 Confounder balance was measured by the standardised difference. An absolute standardised difference above 10% represented meaningful imbalance.19 We used the approximate nearest neighbour 1:1 without replacement approach by modifying a published macro.17-19 The matching macro started to select pairs with an identical propensity score on the tenth decimal and decreased step by step until the first decimal. An additional criterion was being insured for the whole matching calendar year.

Regarding hospital treatment costs, we adjusted hospital treatment costs for different hospital levels based on differences in Bureau of National Health Insurance (BNHI) reimbursements. For each cost center, the cost-to-charge ratio was used to convert charge data into cost data at the hospital level. The total costs included radiology, physical therapy, hospital room, anesthetist, pharmacy, laboratory, special materials, surgeon(s), and others. To reflect changes in real dollar value, cost data were adjusted by the consumer price index for each year under study. Hospital treatment costs were then converted from Taiwan dollars to US dollars at an exchange rate of 30.5:1, which was the average rate during 1997-2010.

Continuous variables are expressed as means with standard deviation (SD) or as medians with the interquartile range (IQR). Categorical variables are presented as the frequency and percentage. Shapiro-Wilk test was used to assess for normal distribution. Normally distributed data were analysed by the paired t test. The Wilcoxon rank sum test was used to analyse non-normally distributed data. Differences in categorical variables were assessed using McNemar’s test. Overall survival was estimated using the Kaplan-Meier method, and differences were compared using the log-rank test.

Statistical analyses were performed with SPSS version 20.0 (SPSS Inc., Chicago, IL, USA) and SAS Version 9.1 (SAS Institute, Carey, NC, USA). All tests were two-sided. P < 0.05 was considered statistically significant.

Results

From the database, we identified 7,977 patients who had undergone a primary total hip (n = 2,318) or knee (n = 5,659) replacement between January 1, 1997 and December 31, 2010. Of the 7,977 patients, 4,987 were given NA, and the other 2,990 were given GA (Fig. 1). This approach generated two propensity-score-matched subgroups, each containing an equal number of patients who underwent TJR with either GA or NA (2,847 patients in each group) (Fig. 1).

Fig. 1
figure 1

Patient selection and study design

As shown in Table 1, there were significant differences in age, comorbidities, hospital volume, surgeon volume, and surgery type between the two groups before propensity-score matching. Using propensity-score matching, 5,694 patients were equally allocated to two matched anesthesia groups (Table 1). The matched groups were similar with regard to patient age, sex, comorbidities, surgeon volume, surgery type, and hospital volume. The short-term mortality rates for the two groups were similar after propensity-score matching (0.1% for both groups at postoperative month 1 and 0.4% vs 0.6%, respectively, at postoperative month 3). As the follow-up continued, NA was found to be associated with a better survival than was shown for GA, with the differences between the groups becoming statistically significant five years after surgery (Fig. 2). After a maximum of 14 years of follow-up, patients who had been given NA were more likely to be alive (58.2%; 95% confidence interval [CI] 50.4 to 66.0) compared with those who had been given GA (57.3%; 95% CI 51.4 to 63.2, P = 0.009).

Table 1 Comparison between general anesthesia (GA) and neuraxial anesthesia (NA) before and after propensity score matching (PSM)§
Fig. 2
figure 2

Five-year overall survival rate after TJR between GA and NA. TJR = total joint replacement; GA = general anesthesia; NA = neuraxial anesthesia

During the study period, the NA group had significantly lower median (IQR) treatment costs, shorter median (IQR) LOS, and better long-term overall survival rates than the GA group. The costs were $4,079 (3,806-4,444) vs $4,113 (3,812-4,568), respectively (P < 0.001). LOS were 8 (7-10) days vs 8 (6-10) days, respectively (P = 0.024) (Table 2).

Table 2 Medical utilization and long-term outcomes between general anesthesia (GA) and neuraxial anesthesia (NA) after propensity score matching (PSM)#

Discussion

This retrospective longitudinal study evaluated the effect of NA on the long-term survival rate in Asian patients undergoing TJR. Although the short-term survival rates were similar in the two groups, we found that patients in the NA group experienced shorter hospital stays, lower hospital costs, and improved long-term overall survival compared to those in the GA group.

In this longitudinal study in Taiwan, NA was given to 62.5% of patients undergoing TJR. In a US study by Memtsoudis et al.,7 only 11.1 % of nearly 400,000 patients undergoing TJR were given NA. This finding may imply that the preferred anesthetic technique chosen for patients undergoing TJR was different in Taiwan than in Western countries. This disparity may be attributed to cultural diversity, different reimbursement policies, preferences of physicians, or other, unidentified reasons.

Previous studies had identified a small 30-day mortality benefit of NA over GA after surgery.7,20 After studying a large nationwide sample in the United States, Memtsoudis et al. reported that the incidence of pulmonary complications (e.g., pulmonary embolism) is higher in patients undergoing GA than in those undergoing NA.7 Lee et al.21 had reported that the incidence of pulmonary embolism was lower in Asian patients than in those from Western countries. Whether the difference in race leads to the conflicting short-term mortality results is unknown. The relatively small sample sizes in our study and the fewer short-term deaths after TJR also could cause some bias.

Our study of Asian patients undergoing TJR, with a follow-up of 14 years, revealed that NA may be responsible for improving the long-term survival rate compared with that after GA. This finding is consistent with the results reported by Monk et al., who found that the mortality rate during the first postoperative year might be affected by anesthetic management.22 In an observational study of 1,064 patients, they found that the Charlson comorbidity index score, cumulative deep hypnotic time, and intraoperative hypotension (systolic blood pressure < 80 mmHg) were significant independent predictors of the one-year mortality rate. Brown et al. also found that the hypnotic depth might be associated with the one-year mortality rate in elderly patients during repair of hip fractures under spinal anesthesia.23 We suggest that the long-term benefit of NA may be related to the more stable hemodynamic profile24 and the shallower hypnosis associated with NA than in those seen with GA. Another possible reason is that orthopedic surgery under NA is associated with less blood loss and blood transfusion requirement than are found with GA.4,5,25,26 Blood transfusion were identified as a predictor of long-term mortality following primary TJR in elderly individuals in a previous study.27 Although not statistically significant at the 0.05 level, it is apparent that there was a slightly lower co-morbidity index amongst patients in the NA group after propensity-score matching. This finding suggests a lower overall risk for the NA group, leading to improved outcomes later in life.

A meta-analysis evaluating the relative efficacy of regional anesthesia and GA in patients who had total hip or knee replacements did not find any discernible difference in the LOS between the groups (standardized mean difference: −0.55; 95% CI, −2.47 to 1.37; P = 0.120).4 In 2013, Pugely et al. studied the American College of Surgeons National Surgical Quality Improvement Program database and showed that patients who had undergone primary total knee replacement with spinal anesthesia had shorter LOS than those who had been given GA (3.45 vs 3.77 days).8 Memtsoundis et al. reported similar findings after studying data from a large US database.7 In our study, patients in the NA group had shorter LOS than those in the GA group, confirming that the findings noted in the North American health systems were similar to those from Asia.

Studies comparing the total hospital treatment costs of orthopedic surgery performed under NA and GA have been limited. After conducting a small randomized controlled trial to investigate the costs associated with GA and spinal anesthesia for elective hip or knee replacement, Gonano et al. found that spinal anesthesia is more cost-effective than GA as it is associated with lower fixed and variable costs.28 We investigated the total hospital treatment costs for primary total hip or knee replacements performed in Taiwan from 1997 to 2010. As there was a significant difference in the costs of different anesthetic techniques we investigated the hospital treatment costs with and without anesthesia-related expenses. According to the reimbursement policy of the BNHI, the average cost of GA is two to three times higher than that of NA for the same surgical procedure. The results reported in Table 2 indicate that the NA group clearly had significantly lower hospital costs than the GA group, even when the anesthesia cost was omitted. For example, the NA group had significantly lower pharmacy costs during hospitalization after TJR. The likely explanation is that the NA group had less postoperative nausea/vomiting4 and better postoperative pain relief.29 Memtsoundis et al. concluded that 3% of patients undergoing TJR required critical care service (CCS), and CCS patients had higher mortality rates, longer hospital stays, and higher hospital costs.30 Risk factors for CCS are GA, advanced age, and postoperative cardiopulmonary complications. We were not able to identify a relation between anesthetic techniques and CCS because of limitations in the administrative data used.

Previous observational studies have obtained similar results regardless of whether confounders were adjusted by conventional logistic regression or by propensity-score matching.31,32 We used propensity-score matching to reduce the confounding factors between the two anesthetic methods by matching the patients’ age, sex, comorbidities, surgery type, and hospital and surgeon volumes. High surgeon or hospital volume is associated with better outcomes in terms of medical utilization and complications/mortality for TJR procedures.14-16 High surgical volumes increase the opportunity to improve surgical skills and implement a cost-effective model for treatment. After adjusting for hospital and surgeon volumes, our results became more reliable.

Our study had certain limitations that are inherent to studies analyzing large databases. First, the clinical perspective obtained from the claims data is likely not as accurate as that obtained from prospective clinical trial data because claims data can include errors in the coding of primary diagnoses and surgical modalities. Second, information was not available regarding the American Society of Anesthesiologists class, body mass index, preoperative laboratory values, intraoperative parameters, postoperative pain management, or other confounding factors that could potentially influence postoperative outcomes and long-term survival. Third, complications associated with anesthesia or surgery were not assessed, which may limit the validity of our predictions. Finally, we did not examine outcome data, such as patient-reported quality of life and indirect costs incurred after discharge.

Conclusions

Our study results showed that the long-term survival of patients who underwent primary total knee or hip replacements was better for patients given NA than for those given GA. The NA group also experienced shorter LOS and lower hospital treatment costs than the GA group. The association of NA with improved long-term survival remains unexplained. Further studies are needed to both confirm the association of longer survival with NA and identify its mechanism.