Introduction

TKA and THA are two of the most common and costly inpatient surgical procedures performed on Medicare beneficiaries [8]. However, the demand for joint arthroplasties is increasing among patients younger than 65 years because of younger patients seeking better mobility and less pain attributable to arthritis [19, 31].

Total joint arthroplasty (TJA) has been identified as a procedure with substantial variations in inpatient and postacute care costs [3, 10, 28, 33, 39]. Consequently, several private and public payers have been testing alternative payment models in an effort to reduce costs, improve value, and emphasize quality in patients’ transition from surgery to recovery [14, 16, 40]. For example, The Centers for Medicare and Medicaid Services (CMS) implemented its first mandatory bundled payment model for joint replacement care in April 2016 [8]. Under this type of payment model, participant hospitals receive a single payment from Medicare for any care related to a TJA episode that begins with an admission to a participant hospital and ends 90 days after discharge. There are potential cost savings for hospitals that successfully align incentives, but there is a simultaneous risk associated with costs exceeding the negotiated target if hospitals fail in that goal.

Policies designed for Medicare also may have important effects on prices negotiated between nongovernmental payers and hospitals. As such, various bundled payments for TJA are increasing among private payers, such as insurance providers and employers [14, 16, 40]. Consequently, hospitals considering participating in such initiatives and involved in the TJA care cycle have sought to evaluate their specific service-component cost differences across each TJA episode of care and factors associated with episode payments [5, 10, 17].

Given the high variability in inpatient and postdischarge payments among TJA episodes of care and the rising incidence of TJA in the younger non-Medicare population, we sought (1) to assess the mean total payments for a 90-day primary TJA episode, including the proportion attributable to postdischarge care, and (2) to evaluate the role of procedure, patient, and hospital-level factors associated with 90-day episode-of-care payments in a non-Medicare patient population younger than 65 years.

Methods

We obtained a population-based cohort from Blue Cross Blue Shield of Texas enrollment and claims data for 2008 to 2013. Our dataset included payments for all claims for medical care and ancillary services (facility and professional) filed with Blue Cross Blue Shield of Texas and a member enrollment file for Blue Cross Blue Shield of Texas members from 2008 to 2013. Indicators of race and poverty levels of enrollee residential zip codes were obtained through the American Community Survey (ACS) data in lieu of patient-level indicators [36]. The classification of urban and rural zip codes was based on the United States Department of Agriculture Economic Research Service Nonurban-Urban Continuum Codes [38].

We received approval for performing this study from our institutional review board. Our analysis cohort consists of all Blue Cross Blue Shield of Texas members younger than 65 years enrolled in preferred hospital organizations (PPO and PPO+) or point of service plans (POS) who underwent an elective, primary total TJA (this includes unicompartmental, bicompartmental, and tricompartmental knee procedures and total hip replacement procedures as defined by ICD-9 Procedure Codes 81.51 and 81.54) between July 1, 2008 and October 1, 2013. Each member had a minimum enrollment in a PPO or POS plan for 9 months. We used patient diagnoses codes over the 6-month preindex admission period for a TJA (knee or hip) to calculate patient-morbidity burden using patient diagnoses codes. Patient-morbidity burden (categories ranging from low morbidity to very high morbidity) was assigned using The Johns Hopkins Adjusted Clinical Groups® (ACG®) Risk Adjustment System [18]. The morbidity-burden calculation assigns a higher value to conditions that are expected to have higher consumption of healthcare resources (based on condition severity, duration, need for specialty care, diagnostic certainty, and type of etiology) and thus represents a patient’s resource-use burden. The “index” hospital stay for analysis was the date of hospital admission for a THA or TKA through the date of discharge. Thus, we excluded joint surgeries performed in outpatient settings as indicated by the service-type code (inpatient versus outpatient) in our dataset (Fig. 1). The 90-day episode of care refers to the index stay and a 90-day postdischarge followup period. To ensure that the unit of analysis is a TJA episode and the observation period for each TJA episode was a total of 9 months, we excluded patients with more than one TJA during a 9-month interval. We only included patients younger than 65 years at the end of the postdischarge 90-day followup to exclude those who might qualify for Medicare (Fig. 1). We only included patients with diagnosis-related group (DRG) codes for major joint replacement or reattachment of lower extremity (469 or 470), because these codes have been found to be most conducive to bundling and are the only ones included in the Comprehensive Care for Joint Replacement program guidelines published by Medicare [8]. Our sample had less than 1% of those with a DRG of 469 (major joint replacement or reattachment of lower extremity with Major Complication/Comorbidity). As per Medicare guidelines [8], we included all facility and professional services during the 90-day followup period including all unplanned readmissions to an inpatient, acute care facility but excluded any planned readmissions (eg, staged bilateral procedures).

Fig. 1
figure 1

The inclusion and exclusion criteria for calculation of 90-day episode-of-care costs are shown. DRG = diagnosis related group.

We identified 20,527 TJAs between 2008 and 2013. After applying our inclusion and exclusion criteria (Fig. 1), our study cohort consisted of 11,653 TJAs from 242 different Texas hospital hospitals. Of the 11,653 episodes that met the inclusion criteria, 453 had missing service hospital information including zip codes and 69 had missing information for the patient economic status indicator. Since these constituted only 0.04% of our total observations, we removed them from our analyses. Thus, our final sample had 11,131 TJAs from 242 hospitals. Of the 11,131 total episodes, approximately 5% (n = 512) were primary TJAs repeated on the same patients.

The primary outcome of interest for our study was the total dollar amount associated with a 90-day TJA episode. We calculated the total amount by aggregating payments associated with the index hospital stay and included professional or facility claims that were triggered in the 90-day postdischarge period. All payments are amounts rendered to providers by enrolled patients and Blue Cross Blue Shield of Texas based on contractual agreements. All payments were adjusted to 2013 USD using the medical component of the Consumer Price Index. Independent variables used in our model included procedure-characterizing factors such as inpatient length of stay, type of joint involvement (hip or knee), the use of computer-assisted surgery, discharge status (home health or routine), and whether there was any subsequent, unplanned readmission during the 90-day postdischarge period. Other factors included patient characteristics such as age at the end of 90-day episode, sex, and patient-morbidity burden.

Economic status and race were measured at the patients’ residential zip codes, which included the percent of families below the poverty line as a proxy for socioeconomic status and the percent of the population which is nonwhite [36]. Such area-based indicators have been considered useful in risk-adjustment studies and for monitoring disparities in healthcare systems [2, 9, 15]. Both social-context variables were split in four categories using quartiles. We included the zip code distance between patients’ residential zip code and hospital location as an indicator of geographic access to care.

For hospital characteristics, we constructed a binary variable to indicate whether the hospital was in an urban area, and calculated the number of total discharges during one calendar year for each inpatient hospital as a proxy for hospital size. We calculated the hospital case mix, which is the average relative DRG weight of a hospital, to adjust for differences in the severity of cases across annual hospital discharges. Thus, the hospital size and case-mix variables were calculated using Blue Cross Blue Shield of Texas claims data.

Statistical Analysis

Analyses were conducted using STATA®, Version 13.0 (Stata Corp, College Station, TX, USA). Factors of interest were summarized using percentages and frequencies for categorical variables and means and SDs for continuous variables. Comparisons of patient and hospital characteristics between THAs and TKAs were made with chi-square tests for categorical variables and Mann-Whitney U tests for continuous variables.

Our dataset consisted of TJA episodes that were naturally nested in patients, with patients nested in hospitals. We used the generalized linear mixed model with the gamma distribution, logarithmic link function, and robust standard errors to model the right-skewed payments in our data [11, 13, 20, 21]. We estimated a series of five models (Supplemental Table 1. Supplemental material is available with the online version of CORR ®.) to understand how each set of procedure, patient, and hospital characteristics separately contributed to the payment variation among patients and variation among hospitals in our data. The first was an unconditional/null model with no patient, provider, or hospital factors. The second model included procedure-characterizing factors only, the third model included patient-level factors only, the fourth included hospital-level factors only, while the fifth model was the all-factor model with procedure, patient, and provider factors included simultaneously. The five models allowed us to calculate the proportional changes in the variance components for each model using the null model as the reference [29]. The all-factor model allowed us to estimate how each factor was associated with the mean payment for 90-day care episodes.

Table 1 Descriptive statistics for all TJAs

Using gamma regression analysis, we calculated the rate ratio (RR), which is the rate increase in the mean payment per episode for a discrete change in each procedure, patient, and hospital factor in our model. Corresponding marginal effects were calculated to determine the estimated changes in dollar amounts per discrete change in patient, procedure, and hospital factors based on the fitted model.

We ran our analysis after removing outlier payments (the highest 1%) and found no meaningful change in our conclusions. Thus, we did not exclude any payments based on outlier values in our analysis. Since there was evidence of baseline differences (p ≤ 0.05) between knee and hip procedures among several factors in our data (Table 1), we tested for interaction effects between type of joint procedure and other factors in our model such as computer-assisted surgery and discharge status. We found no meaningful interaction effects on payments for type of joint involvement and other factors in our sample.

Results

Mean Payments for 90-day Care Episode

After adjusting for the Consumer Price Index, the mean cost for a 90-day episode was USD 49,222 ± USD 23,152 (mean ± SD) for THA and USD 46,988 ± USD 20,177 for TKA (Table 2). Only 11.5% and 14.8% of total episode payments were attributable to postdischarge care for THA and TKA, respectively.

Table 2 Payments for 90-day episodes for TJA including THA and TKA

Factors Associated With Increased Payments

The set of procedure-characterizing factors alone explained 44% variation in payments among patients but 0% variation among hospitals (Model 2, Table 3). The length of stay, computer-assisted surgery, any unplanned readmission, and an initial home health discharge were procedure factors associated with increased estimated payments (Table 4), but the type of joint involved (hip versus knee) was not associated with payments in our all-factor model. A RR of 1.19 for length of stay greater than 4 days (Table 4) indicates that the mean per-episode payment was 1.19 times higher than the length of stay of 1 to 2 days (RR, 1.19; 95% CI, 1.15–1.23; p ≤ 0.001). This corresponds to an estimated difference of USD 7943 (95% CI, USD 6089-USD 9797; p ≤ 0.001) independent of other factors in the model (Table 4). Similarly, any unplanned readmission during the 90-day followup added approximately USD 27,000 (95% CI, USD 24,129-USD 30,347; p ≤ 0.001), and computer-assisted surgery added an additional estimated payment of USD 1200 (95% CI, USD 152-USD 2580; p ≤ 0.05) to the mean episode payment.

Table 3 Variance components (random-effects) of 90-day episode costs by multilevel gamma regression analysis
Table 4 Observed rate ratios and marginal effects for the all-factor model

Patient-characterizing factors alone explained only 5% of patient-level variation and 1% of hospital-level variation (Model 3, Table 3). Except for patient-morbidity burden, which was strongly associated with increased mean payments, area-based race and economic status indicator and distance to hospital were patient factors that were not associated with payments in our all-factor model (Table 4). A very-high morbidity burden was associated with 1.11 times higher mean payments (RR, 1.11; 95% CI, 1.062–1.150; p ≤ 0.001) than low patient-morbidity burden. This corresponded to a difference of USD 4529 (95% CI, USD 2664-USD 6394; p ≤ 0.001).

Hospital-characterizing factors alone (Model 4, Table 3) explained 10% of the hospital-level variation but only 3% of patient-level variation. Hospital location in an urban setting and larger hospital size were hospital factors associated with increased estimated payments while hospital case mix was associated with lower marginal payments (Table 4). Any hospital located in an urban area was associated with 1.29 times higher mean payments (RR, 1.29; 95% CI, 1.17–1.42; p ≤ 0.001) than a rural area hospital. Conversely, hospitals with higher case mix were associated with 0.94 times lower mean payments than low case-mix hospitals (RR, 0.94; 95% CI, 0.89–0.95; p ≤ 0.05) for 90-day episodes.

Overall, all factors (Model 5, Table 3) together explained only 11% of variation among hospitals and 49% of variation among patients, with procedure factors contributing most to payment variation among patients and hospital factors contributing most to payment variation among hospitals. The estimated mean payments for hospitals (Fig. 2) based on the fitted model showed mean payments ranging from USD 25,000 to USD 120,000.

Fig. 2
figure 2

The estimated means for 90-day episode payments (k = thousands) among hospitals are shown.

Discussion

The number of joint arthroplasties performed annually has been increasing in Medicare and non-Medicare populations and this trend is expected to continue as more people with osteoarthritis demand better quality of life and mobility [8, 19, 31]. With the CMS-mandated 90-day bundled-payment model going into effect recently in more than 800 hospitals, many payers are considering using similar payment structures for joint replacement procedures. Our study aggregated inpatient payments and followup payments for 90 days for patients with an initial home discharge after TJA in a younger, commercially insured population. We found that the inpatient component of payments contributed a considerably greater proportion of total payments for TJA compared with the postdischarge-care component. Additionally, several procedure-, patient-, and hospital-characterizing factors were associated with higher total 90-day-episode payments and high variation in payments for TJA. These were factors such as the inpatient length of stay, computer-assisted surgery, any unplanned readmission, an initial home discharge with home-health component, patient-morbidity burden, hospital location in an urban setting, larger hospital size, and lower hospital case mix.

Most limitations in our study arise owing to the use of administrative data based on insurance claims. We modeled joint arthroplasty payments with an assumption that reimbursed payments were reflective of actual costs incurred by hospitals. Thus, the gap between actual costs and total reimbursements is not captured by our study. Claims data also are subject to coding errors or incomplete capture of patient and clinical data. We addressed this limitation by using patient-morbidity burden, which is an indicator of patient severity that we calculated using diagnoses codes during a 6-month period for a more-comprehensive capture of patient comorbidities. Moreover, we used patient residential zip code-based indicators of race and economic status to supplement our dataset with additional demographic and socioeconomic information. We were unable to obtain additional hospital characteristics owing to payer-provider contractual limitations, and approximately 89% of hospital-level variation remained unexplained in our analysis. Therefore, we cannot make a definitive conclusion whether the variability not explained by hospital characteristics was outside the scope of hospital provider influence. We also were unable to account for how insurance reimbursement structures influenced payments to hospitals.

However, because episode-based payments were not introduced among commercial carriers in Texas during the study period examined, the factors affecting total payments and the care-component differences are still relevant in the context of commercial payers that are not participating in episode-based bundled payments in joint procedures. Furthermore, our data represent approximately 25% to 30% of the commercially insured population in Texas and should be generalizable to other commercial payers serving younger Texans. Although the generalizability of our results can be influenced by regional differences in payment structures, hospital contractual factors, and patient demographics, we believe that the overall conclusions regarding inpatient and postdischarge care-component differences among younger patients and high variation in payments among hospitals are generalizable to other commercial payers serving similar populations in other parts of the country. Nevertheless, replication of this study among other non-Medicare populations will allow for conclusions that are more robust and generalizable.

The mean payments associated with an inpatient stay (USD 41,100) and a 90-day episode of care (USD 47,700) for TJA in our population are consistent with payments reported in other studies during the same period that have reported TJA payments using commercial claims data [3, 25]. Although a previous estimate of postdischarge-care payments for Medicare showed that it can constitute up to 45% of total episode-of-care payments [22], we saw only 14% of episode payments attributable to postdischarge services. This finding can be attributed to our relatively younger and healthier population who were initially discharged home (with or without home health services) after surgery. Our findings are consistent with those of previous studies on episode-of-care payments, which suggests that whereas the index inpatient admission contributes a relatively smaller proportion of total episode-of-care cost component in the Medicare population, it contributes a much-larger proportion of total payments in commercially insured populations [28, 39]. Thus, the inpatient component of episode-of-care payments remains an important driver of variation in TJA in such populations and will continue to be an essential target for cost containment and the best delivery-improvement strategies.

Procedure-level and Patient-level Factors

Procedure-characterizing factors were a major source of variation in payments among patients in our sample. Procedure and patient factors associated with higher 90-day mean payments were the index length of stay, higher patient-morbidity burden, any 90-day unplanned readmission, and having a home health versus routine discharge. Previous studies on episode-of-care payments for primary TJA have shown factors such as length of stay, patient comorbidities, type of discharge destination, and readmission rates as drivers of TJA payments [5, 25, 27], and our study results underscore these findings. Clearly, too many 90-day readmissions to an inpatient facility during the postdischarge period will chip away at hospital margins, and future studies that look at factors and interventions that alleviate 90-day readmissions will be useful for hospitals practicing in bundled environments.

Notably, we obtained information regarding the prevalence of computer-assisted surgery in our population and its association with higher payments. Many payers have classified computer-assisted surgery as medically unnecessary or investigational [4, 24, 34] (eg, Blue Cross Blue Shield of Texas requires prior authorization for the use of computer-assisted surgery for billing purposes). The final verdict regarding the effectiveness of computer-assisted surgery is uncertain. Some studies have found computer-assisted surgery to be cost-effective in TKA [26] and beneficial in THA [32, 35], whereas others have found no discernible short-term, clinical benefit in TKA or THA [6, 7, 12, 23]. In the cost-reductive environment of the current healthcare system, in the absence of clinical and economic benefits to support the effectiveness of computer-assisted surgery in TJA, the entire additional cost burden of computer-assisted surgery might eventually shift toward the hospital and patients. However, the rapidly changing landscape of technology might yet find computer-assisted surgery to be useful in high-complexity joint surgery.

Hospital-level Factors

We confirmed that there was high variation in TJA episode payments owing to the nested structure of our data. While procedure characteristics were the major source of variation in payments among patients, hospital characteristics were the major source of variation for differences in payments among hospitals. Expanding on a previous study [10], we found that larger hospital size and urban location were associated with higher estimated payments among TJAs while higher case mix was associated with lower payments. The difference in estimated payments between urban and rural hospitals was driven by the inpatient component in our sample, and most likely reflects regional variation in pricing (eg, implant costs) [1, 41]. Hospital size and case-mix differences possibly reflect changes in negotiating leverage between the insurer and hospitals resulting from changes in hospital market power, which has been shown to be positively associated with size and negatively associated with higher hospital case mix [1, 30, 41]. Future research can add measures of competition, such as the Herfindahl-Hirschman Index [37], to measure the effect of market forces on hospital payments.

Overall, the results of our study can inform policymakers, hospitals, and payers considering a shift toward bundled payments in TJA by providing valuable insights in factors associated with higher payments and higher variation in payments for 90-day episodes of care in patients younger than 65 years. Large variation in payments suggests system inefficiencies that may be suitable for the implementation of episode-based bundling. Improved efficiency may be achieved through improved care coordination and standardization of care pathways [16] or through the examination of physician, vendor, and payer relationships for negotiating prices [41]. Future studies also should examine the role of other hospital characteristics such as organizational structure, staffing, physician arrangements, and type of services (such as remote monitoring of discharged patients) when assessing episode payments.