Introduction

Fusion of a symptomatic lumbar spinal motion segment is still considered the gold standard for operative treatment of patients with degenerative disc disease (DDD) not responding to conservative care [1,2,3]. However, spinal fusion is associated with side effects such as cranial facet joint violations, decrease in sagittal motion, pseudarthrosis and symptomatic adjacent-level disease [4,5,6,7].

Total lumbar disc replacement (TDR) has been introduced to avoid those fusion-related side effects based on the hypothesis that chronic low back pain (CLBP) originates from DDD. However, TDR has also been associated with drawbacks, such as subsidence, luxation or malposition of the implant, increasing axial rotational instability and excessive loads to the facet joints [8,9,10]. In a meta-analysis, an average reoperation rate of 7.8% at 2- to 5-year follow-up was found [9]. Several studies with mid- to long-term results reported that 6–14% of the patients had revision fusion surgery after TDR [11,12,13,14,15,16,17].

Data from randomized controlled trials with a follow-up of 5 years have shown that TDR is not inferior to spinal fusion [1,2,3]. Nonetheless, much debate remains on the use and effectiveness of TDR, in particular concerning high rates of late loosening and revisions [18]. Mid- to long-term studies on this subject are scarce [2, 3, 11,12,13,14,15,16,17, 19]. Only six studies have a mean follow-up of 10 years or more [11,12,13,14, 16, 17].

The purpose of this study was to evaluate the long-term clinical and functional outcome in terms of patient satisfaction and complication and revision rate after TDR. Additionally, an assessment was made to identify patient- or surgery-related risk factors for revision or worse clinical outcome.

Materials and methods

Patient selection

The current study was approved by the Medical Ethics Committee METC Z (16-N-22) and registered at the Dutch Trial Registry (NTR5710). The medical records of all patients who had undergone a TDR by a single surgeon using a SB Charité III between 1989 and 2000 at the Zuyderland Medical Centre, Sittard-Geleen, the Netherlands, were reviewed. Altogether, 405 consecutive patients were identified.

TDR had been performed as treatment for patients with predominantly axial low back pain with failure of appropriate conservative measures and the presence of lumbar DDD as determined by plain radiographs and/or MRI. Preoperatively, all patients had undergone fluoroscopically guided provocation discography to rule out non-discogenic pain sources. No facet joint injections were performed. Radiculopathy, spondylolisthesis or spondylolysis were considered a contraindication for TDR. Patients with a previous discectomy or small decompression were not excluded in this study (Table 1). Complications were recorded when a reoperation needed to be performed. Revision surgery by spinal fusion of the TDR was defined as a failure.

Table 1 Summary of subgroup patient demographic and surgical data

Clinical and subjective outcome evaluation

Patient-reported outcome measures (PROMs) were obtained in all TDR patients who had not been revised at latest follow-up (n = 237). Back- and leg-pain intensity was recorded with a visual analogue scale (VAS, 0–10, 10 being ‘worst pain’). General well-being was evaluated using the Short Form-36 survey (SF-36) and Oswestry Disability Index (ODI). In both, a score of 0 is equivalent to maximum disability and a score of 100 is equivalent to no disability. Quality of life was assed using the EuroQol-5D (EQ-5D, 0–1, 1 indicating the best health state).

The patient’s subjective outcome evaluation was assessed by three questions. The first question was whether they were satisfied with the TDR operation. The second question was whether their current situation was better, the same or worse in comparison with the first 5 years after TDR. The third question was whether they would choose the same surgery again.

Data analysis and statistics

Baseline patient characteristics were described using mean and standard deviation (SD) and absolute number and percentage. The independent samples t test was used to test for differences in the means of the baseline patient characteristics between patients with or without revision surgery by spinal fusion. Differences in categorical variables in the same groups were tested using the χ2 test.

Kaplan–Meier curves were constructed to assess the cumulative incidence of fusion surgery after TDR over time. Multivariable Cox proportional hazards regression was used to estimate associations between patient characteristics and survival of the TDR. Corresponding hazard ratios (HRs) and 95% confidence intervals (CIs) were obtained. A multivariable logistic regression model was utilized to identify independent risk factors associated with worse clinical outcome defined as a VAS-score for combined leg and back pain ≥ 5.0- or an ODI-score > 40 points at latest follow-up. A cut-off p value of ≤ 0.05 was considered statistically significant. All analyses were performed using IBM SPSS (Version 23.0).

Results

Study population

Altogether, 405 consecutive patients with a TDR were identified. At follow-up, 34 patients had deceased (8.4%), 14 patients (3.4%) refused to participate and 61 patients (15.1%) could not be traced. These 109 patients (26.9%) were excluded from further analysis. Informed consent was acquired from the remaining 296 patients (73.1%) with a mean follow-up of 19.4 years (median 19.3, range 0.2–25.6 years). A summary of the reasons for exclusion is listed in Fig. 1.

Fig. 1
figure 1

Reasons for exclusion

In 59 patients (19.9%), a revision by spinal fusion with or without removal of the TDR had been performed at a mean of 7.1 years (median 6.7, range 0.2–21.6 years). The mean follow-up after implantation for the 237 patients (80.1%) without a revision was 19.5 years (median 19.1, range 13.7–25.6 years). Patient characteristics are listed in Table 1. Patients with a TDR at level L2–L3 or multilevel TDR had significantly more risk of needing to undergo revision fusion surgery (p = 0.042 and p = 0.011, respectively).

Survival analysis

A Kaplan–Meier survival curve is depicted in Fig. 2. The vast majority of the fusion procedures (n = 48, 81.4%) occurred in the first 10 years after TDR.

Fig. 2
figure 2

Kaplan–Meier (survival of the TDR with fusion as endpoint n = 59)

Reoperations

An overview of all complications requiring a reoperation is provided in Table 2. A total of 108 reoperations were performed in 92 patients, so the overall reoperation rate was 31.1%. Again, the majority occurred in the first 10 years after TDR (n = 81, 88.2%). It was 29.5% for mono- (n = 69) and 39.0% for the bi-segmental (n = 23) TDRs (p = 0.109). The mean time until reoperation was 4.0 years (range 0.0–21.6 years). These were divided into three subgroups: reoperations for immediate device- or technique-related complications (n = 20, 6.8%), reoperations that were related to the surgery in general (n = 7, 2.4%) and reoperations for the treatment of persisting symptoms (n = 81, 27.4%).

Table 2 Different indications for a reoperation

In 34 patients (57.6%), a posterolateral instrumented fusion was performed. In 25 patients (42.4%) in addition to spinal fusion, the TDR was removed and the intervertebral defect was filled with a femoral head bone strut graft (Fig. 3). In all patients with complaints attributed to ASD, spinal fusion of the index and affected adjacent segment was performed. The remaining 49 reoperations consisted of eight procedures to resolve general complications such as a rectus haematoma or an incisional hernia of the abdominal wall. In 11 patients, the TDR was repositioned after anterior luxation or malposition of the TDR. The remaining 30 procedures were related to radiculopathy or relative spinal stenosis as described by the surgeon. In these patients, a decompression of the hypertrophied ligamentum flavum was performed, often in combination with a laminotomy and undercutting of the facet joints.

Fig. 3
figure 3

Example case before and after revision of the TDR by posterolateral instrumented spinal fusion and after second stage removal of the TDR, because of persisting complaints

To investigate whether learning curve had an impact on reoperation rate, we looked at complications indicative for the latter. In the first 4 years (100 cases), anterior luxation of the TDR occurred in six out the 13 cases (46.1%). For malposition, this was two out of six cases (33.3%).

Risk factors for TDR survival and worse clinical outcome

An overview of all potential risk factors for revision surgery is provided in Table 3. A BMI > 30 was associated with a significantly lower probability of revision surgery (HR 0.27, p = 0.026). Although not statistically significant, previous spinal surgery (discectomy or small decompression) before TDR increased the probability of revision spinal fusion (HR = 1.66, p = 0.086). In contrast to the univariable analysis (Table 1), no significant differences were seen in the multivariable (adjusted) analysis for multilevel TDR or the level of placement. Therefore, they are not independent risk factors for failure. We tested for associations between all potential risk factors in relation to the different complications requiring a reoperation. A significant association was found for previous spinal surgery and ASD (occurrence of 4% vs. 14.3%, p = 0.004).

Table 3 Potential risk factors for revision surgery by spinal fusion

An overview of all potential risk factors for a poor clinical outcome is provided in Table 4. Rheumatoid arthritis (RA) and age at the time of surgery < 45 years were both associated with a higher probability of a VAS ≥ 5.0 (OR 5.08 and 1.95, respectively). Previous spinal surgery, osteoporosis (defined as taking medication for this condition, no data on bone mineral density were available), BMI > 30 and RA were all associated with a higher probability of an ODI-score ≥ 40 (OR 2.19, 4.03, 2.40 and 7.15, respectively). The potential risk factors of a TDR at L2–L3 or L3–L4 were not included in this multivariable analysis because of too few events to estimate the OR.

Table 4 Overview of potential risk factors for a poor clinical outcome

When we define failure as a VAS ≥ 5.0 and/or a revision operation (n = 144, 48.6%), both RA (OR 5.01, p = 0.014) and age at the time of surgery < 45 years (OR 3.10, p = 0.001) were associated with a higher probability of failure.

Clinical and subjective outcome evaluation

Analysis of the clinical parameters, at latest follow-up, showed mean scores for: SF-36 physical of 39.9 (n = 212, 12.1 SD), SF-36 mental of 50.9 (n = 212, 11.2 SD), VAS-leg of 2.1 (n = 235, 2.8 SD), VAS-back of 3.4 (n = 235, 2.9 SD), ODI 26.7 (n = 26.7, 18.7 SD) and for EQ 5D of 0.737 (n = 225, 0.232 SD).

In total, 79.6% of all patients (n = 235) were satisfied with the outcome of their TDR at latest follow-up: 173 patients (73.0%) would be willing to undergo the same surgery again, 29 patients (12.2%) were not sure, and 33 patients (13.9%) would not be willing. When asked to compare their current situation with the first 5 years after surgery, 60 patients (25.3%) reported a deteriorated clinical outcome.

Discussion

This study reports the long-term clinical follow-up of the Charité III lumbar TDR implanted by a single surgeon for the treatment of symptomatic DDD. In our study, the overall reoperation rate was 31.1%. In 59 patients (19.9%), revision spinal fusion at the index level was performed. RA, osteoporosis, age at the time of surgery < 45, BMI > 30 and previous spinal surgery were all associated with a poor clinical outcome.

There is still much debate concerning the use of TDR in terms of fear of deteriorating effect and high rates of late revision operations [15, 18]. To assess this issue, the accumulation and analysis of long-term data are paramount. There are few studies with a minimal follow-up of 10 years [11,12,13,14, 16, 17]. They report a reoperation rate between 5 and 33%. Most of these studies have a mean follow-up around 11 years. In our study, with a mean follow-up of 19.4 years, the vast majority of reoperations, including revision spinal fusions, occurred in the first 10 years after TDR (88.2% and 81.4%, respectively). It appears that there is no need for fear of late revision operations according to our data.

Our overall reoperation rate of 31.1% was similar to the rate of 33% in the study of Laugesen et al. [16] but higher in comparison with other long-term follow-up studies [11,12,13,14, 16, 17] reporting incidences between 5 and 9%. The mean follow-up in our study is higher in comparison with those studies [11,12,13,14, 16, 17], which may account for a relatively higher incidence of reoperations. Furthermore, this difference may have been caused by sub-optimal patient selection for the index TDR surgery. In later years, improvements have been made in terms of surgical technique, imaging of the spine and more appropriate patient selection based on social profile [11, 15]. This might explain the tendency of earlier studies, or those with longer follow-up, to report less favourable outcomes than those of more recent studies [16]. Learning curve does not seem to be a factor, since complications indicative for the latter such as anterior luxation or malposition of the TDR are evenly distributed over the years.

TDR is associated with progression of facet joint arthropathy (FJA) at the index level [10]. In our study, in 14 out of 59 patients (23.7%), FJA was reported as the reason for revision spinal fusion. However, it is likely that in patients classified as suffering from ASD, spinal stenosis or subsidence, FJA has been a factor as well. This assumption is supported by the observation that in all patients (n = 15) with revision spinal surgery for ASD, the index level was also fused.

A prospective study of Siepe et al. [15] with 181 patients and a mean follow-up of 7.4 years reported a reoperation rate of 16%. The incidence of either general surgery- or device-related complications in that study amounted to 7.2% and is quite similar to the 9.2% in the current study. It must be noted that in that study, as in the RCTs comparing TDR with fusion [1,2,3], numerous exclusion criteria such as previous spinal surgery, obesity and chronic steroid use were applied. As that was not the case in the current study, our population might be a better representation of patients with DDD in daily clinical practice.

In addition, the reoperation rates should similarly be compared with the rates that have been published on lumbar fusions. A large retrospective cohort study in adults who underwent lumbar fusion for degenerative spine disorders between 1990 and 1993 (n = 24.882) showed a cumulative incidence of reoperations of 21.5% after 11-year follow-up [20]. This is similar to the revision fusion rate in our study, but our overall reoperation rate is higher. However, when we look at our reoperation rate after 11-year follow-up (27.7%, n = 82), the difference in reoperation rate is less pronounced.

Risk factors for survival and worse clinical outcome

A BMI > 30 was associated with a significantly lower probability of revision surgery by spinal fusion (HR 0.27). It is possible that the surgeons were less inclined to perform revision surgery because of associated higher complication rates in the obese patients [21]. Moreover, it is possible that these obese patients had a lower activity resulting in less wear of the TDR, as has been published for obese patients with hip and total knee replacements [22]. The fact that in our population, BMI > 30 was associated with an ODI-score > 40 might be indicative of the latter. However, not statistically significant (p = 0.086), previous spinal surgery before TDR tended to increase the probability of revision spinal fusion (HR = 1.66) in our population.

In contrast to the study of Siepe et al. [15], multilevel TDR did not significantly increase the probability of revision surgery or poor clinical outcome in our multivariable analysis. These findings are consistent with several other studies [16, 19, 23]. The level of TDR was not associated with an increase in revision surgery or a decrease in clinical outcome as well. This again is consistent with the literature [24, 25].

Rheumatoid arthritis, osteoporosis, age at the time of surgery < 45 and previous spinal surgery were all associated with a VAS-score ≥ 5.0 or an ODI-score ≥ 40. Tropiano et al. [19] showed similar results for age and previous spinal surgery.

Clinical and subjective outcome evaluation

PROMs were obtained at latest follow-up in all patients without a previous revision by spinal fusion. This means that these outcome measurements are not a reflection of our total population. Furthermore, no preoperative questionnaires were available for comparison. However, it is possible to compare our outcome measurement with those reported in the other mid- to long-term follow-up studies on TDR.

Siepe et al. [15], Lu et al. [13] and Park et al. [17] reported at latest follow-up an ODI-score of 20.3, 13.2 and 22.4 and a VAS-score of 3.3, 1.5 and 3.4, respectively. These numbers are comparable to those in our study (VAS-score of 3.4, ODI-score of 26.7) although it must be noted that in these three studies, patients with a revision by spinal fusion were included (11%, 6.3% and 9.3%, respectively). Laugesen et al. [16] reported a VAS-score of 2.4 and a SF-36 physical of 31.9 in their group without a revision, both similar to our population (SF-36 physical of 39.9).

In our study, 79.6% of the patients were satisfied with their TDR and 73.0% would be willing to undergo the same surgery again for similar complaints. When assuming that all patients with a fusion were not satisfied and not willing, these percentages would drop to 63.6% and 58.16%, respectively. Previously mentioned studies reported percentages between 86.3–64.9% and 79.3–52.6%, respectively [13, 15,16,17]. To our opinion, the clinical status of patients after TDR at a follow-up of almost 20 years is not substantially different from those at 8–12 years follow-up [13, 15,16,17].

Limitations and strengths

The current study’s main limitation is its retrospective nature. PROMs were obtained at latest follow-up, and no preoperative questionnaires were available for comparison. Unfortunately, most of the other long-term follow-up studies had a retrospective design [11, 12, 14, 17]. A lack of control group (either non-operative or index spinal fusion) can be attributed to all these studies.

The number of patients included in any study has a vital influence on the outcome and whether a study is representative or not. We included a total of 296 patients. Our mean follow-up of 19.4 years is the longest available in the literature. Despite our long follow-up, only 18.5% of our patients were lost to follow-up or refused to participate. Consequently, there is a high chance of generalizability of our study.

The results presented in this study demonstrate a revision spinal fusion rate of 19.9% after a mean follow-up of 19.4 years. Fear of excessive late complications or reoperations following the primary TDR cannot be substantiated since the vast majority of all reoperations occurred in the first 10 years after TDR. Proper patient selection considering the identified risk factors for worse outcome may help to improve the clinical and functional outcome in these patients.