Introduction

Cervical disc arthroplasty was developed to mitigate problems attributed to fusion, that is, maintaining a greater range of motion and preventing adjacent segment degeneration [1,2,3,4,5,6,7,8,9,10]. However, cervical disc replacements (CDRs) have introduced new complications to the spine, previously observed at the larger, synovial joints [11,12,13,14]. While the materials and surface treatments used for CDRs and larger joint replacements are similar, physiological loading conditions, and kinematics differ. Nonetheless, complications such as periprosthetic bone loss have recently been reported for CDRs [15,16,17,18,19,20,21].

While bone loss has been reported as a radiographic or surgical observation in CDRs, its role in clinical performance is not well understood. This problem is compounded by a lack of uniform terminology which blurs the distinction between non-inflammatory and inflammatory bone loss (osteolysis) [19]. As a result, these distinct terms are often used interchangeably, as evidenced by a recent systematic review, which combined clinical outcomes from patients with non-specific bone loss and bone loss due to osteolysis to draw conclusions on treatments [22]. Osteolysis is characterized by lesions resulting from an inflammatory response (Fig. 1a&b) [23], whereas, non-inflammatory bone loss is a process of adaptation to mechanical changes (Fig. 1c&d) [20]. Consequently, the mechanism of bone loss is important to identify. A schematic image was created to depict two representative progressions of osteolysis versus non-inflammatory bone loss that are often observed. Despite the stark contrast between the two phenomenon, incorrect identification is prevalent. This points to the damaging nature of the terminology and accentuates the need for remedy.

Fig. 1
figure 1

Schematic depicting the difference between a mild, moderate, and severe inflammatory bone loss due to osteolysis, b mild, moderate, and severe non-inflammatory bone loss

This distinction is pivotal in understanding true incidence of the complications and providing useful information as to the symptoms and timeline associated with either complication. While resorption may be a natural mechanical change with little long-term significance, osteolysis may require a complete redesign or reevaluation of materials. By understanding the clinical relevance of these complications, engineers and implant designers will be better equipped to reassess resulting issues with the device and its design.

Here, we present a systematic review of the literature on CDRs to assess radiographic bone changes and document-reported causations of bone changes. By surveying the reported causes, we can identify common threads among the literature and use this knowledge to establish a lexicon for non-inflammatory bone loss versus osteolysis in the field. The information provided in this study may aid surgeons in assessment of radiographic changes in patients, and in the advancement of implant design and materials.

Methods & materials

Search methodology

Two of the authors (J.M.W. and S.N.S.) systematically searched electronic databases following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [24]. Specifically, a comprehensive search of the PubMed, Google Scholar, and ScienceDirect databases was conducted for studies reporting outcome of CDR including evaluation and assessment of periprosthetic bone loss or osteolysis. The keyword search terms used were “cervical disc replacement/arthroplasty,” “osteolysis,” “bone loss,” “radiograph,” and “complications.” To be included, studies must include a radiographic review, incidence of bone changes, and be written in English. Studies were categorized as prospective, randomized, or retrospective reports.

Data extraction

The following categories of data were extracted for further analysis from each article: (1) general information such as type of device, patient age, and any patient reported clinical outcomes (2) data pertaining to specifics of bone loss or osteolysis, such as time of observation, implant materials, subsequent surgery, and explant analysis results, (3) patient-reported clinical outcomes, and (4) incidence of bone loss or osteolysis. Additionally, any specific information pertaining to bone loss grading criteria, such as that of Kieser et al., was included [25]. In prospective studies in which the number of patients lost to follow-up was not provided, 100% follow-up was assumed.

In a number of studies, the terms “bone loss” and “osteolysis” were used interchangeably, exemplifying ambiguity that the present study aims to address and clarify. In these cases, the present study’s authors analyzed each articles descriptions, provided images, and clues to assess whether the outcome being reported could be deemed as bone loss or osteolysis beyond a reasonable doubt.

Results

Literature review

The searches yielded 1061 articles. Abstracts were screened and reviewed and, after removing studies that did not qualify or were duplicates, 49 articles were further evaluated (Fig. 2). Exclusion criteria included studies lacking bone loss or osteolysis data, focusing on the lumbar spine or anterior cervical discectomy and fusion data only, or lack of radiographic data. Inclusion criteria included fell-text studies assessing patients with a CDR, radiographic evidence of bone loss or osteolysis, and cervical spine data. Nine clinical articles were further excluded, leaving 40 full-text articles to be reviewed. Another 17 articles did not assess bone changes radiographically and were excluded, leaving 23 articles for the present review (Table 1).

Fig. 2
figure 2

PRISMA flow chart depicting the identification, screening, eligibility, and inclusion process for the included studies in this review

Table 1 Studies that met the inclusion criteria

Study characteristics

From the 23 included articles, the following information was collected: author(s), number of patients, follow-up, study type, mean patient age, country of origin, and overall reoperation rates (Table 1). Studies that included multiple devices were then separated according to implant, by implant name, implant materials, number of patients, follow-up time, and reported term(s) of bone change. The majority of cases reporting osteolysis distinguished between non-inflammatory bone loss and inflammatory osteolysis, whereas articles focused on bone loss were more inconsistent in their terminology and definitions. Overall, nine articles used bone loss and osteolysis interchangeably to describe the respective bone remodeling seen (Tables 2 and 3). Additionally, when reported, progression, removal, and implantation duration were noted. Results for histology or explant analyses were recorded when presented (Tables 4 and 5).

Table 2 Implants investigated and type of bone loss reported in osteolysis cases
Table 3 Implants investigated and type of bone loss reported in bone loss cases
Table 4 Time of onset, progression, and reported cause of osteolysis
Table 5 Time of onset, progression, and reported cause of bone loss

Reported osteolysis

Overall, fourteen articles reported the resorption of periprosthetic bone due to osteolysis in six implant designs (Table 2). Duration of implantation prior to reoperation ranged from 15–96 months. Radiographic analysis was performed at multiple time points, with progressive osteolysis observed in eight articles [26,27,28,29,30,31,32]. In twelve articles, the implants were removed. In one article, a case report for a 2-level disc replacement, a ventral mass was removed, the devices were retained, and a 3-level posterior fusion was performed [33].

Early reoperation rates were reported for the devices composed of stainless steel or CoCr components, previously reported to be associated with metal sensitivity. Specifically, the Prestige-ST, composed of steel plates and steel core, was reported to have high incidence of osteolysis between 7 and 30 months [28, 34]. Osteolysis was noted later for devices with polymeric components, ranging from 24–120 months following implantation (Table 4).

In eight articles, histological analysis was reported to confirm the radiographic and/or intra-operative diagnosis of osteolysis [29,30,31,32,33, 35, 36]. Two articles reported positive cultures for Propionibacterium acnes[29, 30] and two for Staphylococcus epidermidis.[35, 36] Some of the observations included abundant giant cells, granulomatous formation, macrophages, metal particles, and flakes of polymeric debris, which are features of a potentially osteolysis-inducing foreign body response [23].

Neck pain was most frequently reported for patients with osteolysis [26, 27, 30, 33,34,35,36,37,38]. Radicular symptoms were reported in four studies[28, 33, 35, 38] and paresthesia was reported in three case reports [29, 30, 33]. Other symptoms accompanying osteolysis were: dysphagia, respiratory compromise, myelopathy, shoulder pain, distal upper extremity numbness, and decreased fine motor dexterity. Clinical outcome scores were reported in two case studies with VAS and NDI scores both reporting severe pain [36, 38].

Reported bone loss

Nine articles reported non-inflammatory bone loss in seven implant designs (Table 3). In the majority of studies, the location of bone loss was specified as the anterior region of the vertebral bodies [39,40,41,42], occurring 3–6 months post-operative. Only one article reported failure of an implant with severe bone loss; however, other adverse events were reported in that case, so it was unclear what the main cause of revision was [43]. In contrast, the remaining eight articles reported no progression or correlation of bone loss and device failure (Table 5). For these studies, mean length of follow-up ranged from 24–120 months; therefore, some studies may not have sufficient follow-up time to assess long-term ramifications of early bone loss.

Five articles reported range of motion (ROM) data for the patient cohorts [39,40,41,42, 44]. Of these, only one study compared ROM of patients with or without bone loss, reporting mean pre- operative ROM of 8.99° and final follow-up of 8.25° in patients with bone loss, and pre-operative ROM of 9.1° and post-operative of 7.09° in those with no-bone loss, but this difference was not statistically significant [44]. The remaining articles reported ROM for the total patient population, but did not distinguish between patients with or without bone loss [39,40,41,42]. Combining the results of these nine articles, the ROM at final follow-up was comparable to the pre-operative ROM (P = 0.2).

Among included articles, 515 of 949 (54.2%) patients experienced bone loss of some severity. One article was excluded from this calculation due to duplicity of the patient cohort from another included article. Five articles used grading criteria to rank bone loss, with mild bone loss most commonly reported [25, 39,40,41, 44]. Bone loss differed in both severity and incidence among devices, but because data was not reported consistently, no definitive conclusions could be drawn.

Clinical outcome scores for NDI, VAS arm and neck were measured for patients with bone loss in three articles [39, 40, 44]. However, a combined average could not be calculated due to differences in reporting methodologies. For example, one study reported VAS arm and neck scores using numerical values while another reported scores qualitatively, in terms of minor and major changes [39, 40]. Nevertheless, each concluded that bone loss did not have an effect on clinical outcome scores.

Pain symptoms were reported in four articles; however, three improved over the course of follow-up [39, 41, 43, 45]. Two studies reported early neck pain with long-term improvement [39, 45]. One study reported radicular pain with long-term resolution [41]. The final study reported recurrent neck and arm pain with no resolution until revision. This was the only study reporting bone loss in a failed device [43].

Discussion

Radiographic bone changes in periprosthetic bone are commonly reported after arthroplasties in large joints, and their causes, clinical impact, and management have been the subject of extensive research over the last several decades [23, 46,47,48]. Many factors can stimulate radiographic bone changes including differences in mechanical load distribution, reaction to wear debris, implant micromotion, and infection [20, 46, 47, 49]. While it may seem intuitive that the cervical spine might be vulnerable to these same factors, particularly wear debris and infection, the loading conditions, kinematics, and biological environment differ from large joints [21, 50]. Specifically, the a functional spine unit, or two vertebra and the replaced disc, does not have the “effective joint space” seen in the hip, created by the hip joint’s synovial membrane which can trap wear particles [51]. Consequently, it is unclear whether wear debris particles generated by a CDR would have the potential to generate a host response comparable to that at other joints.

In hip and knee arthroplasties, the terminology used to describe non-inflammatory bone loss and osteolysis has evolved as more information has been gained regarding their etiology and presentation. As a result, clear distinctions between the two processes can be made. Bone loss has been established as a response to mechanical changes, whereas the term ‘osteolysis’ has become synonymous with a progressive bone resorption induced by particulate debris or due to infection, hypersensitivity or fluid pressure [19, 23, 49, 52]. However, this terminology has not translated to spine literature, creating confusion and threatening the ability to correctly assess the impact of a given change on clinical outcome. In the present review, we highlighted the importance of defining type of bone changes to clarify the etiologies behind both non-inflammatory bone loss and osteolysis in the cervical spine [22].

The results of the present review confirmed that non-inflammatory bone loss and osteolysis presented as distinct clinical observations in CDR patients with authors offering several explanations for each, discussed below. Additionally, this review emphasizes the importance of clarifying the type of complication, especially between these terms, so future designs can accurately utilize reported data to address serious failures and potential material concerns, as well as prognostication of outcome. Just as clinical observations show clear distinctions between osteolysis and non-inflammatory bone loss, the subject’s vernacular should follow suit.

Presentation of osteolysis vs. bone loss

The present study emphasizes the stark differences between osteolysis and non-inflammatory bone loss. Inflammatory osteolysis may present around the implant and appear as large lesions being scooped out of the bone (Fig. 1a&1b). By contrast, non-inflammatory bone loss appears as an erosive remodeling of the vertebral body (Fig. 1c&1d). Osteolysis may be present surrounding the implant, whereas bone loss is more common around the peripheral or anterior side of the vertebral body. These clear visual distinctions demand new terminology that accurately describes these differences.

Wear debris

In larger, synovial joints, wear debris has long been recognized as a potential cause of osteolysis. However, in several studies in the present review, the early onset of bone resorption led authors to conclude that wear debris was an unlikely primary cause [39, 41, 44, 45, 53]. In late onset cases, progressive osteolysis was attributed to mechanical failure of the M6-C in five studies, despite findings consistent with device failure and histopathology confirming the presence of particulate polymeric debris and giant cells [29, 30, 33, 35, 36]. This device was noted to include a sheath intended to contain and decrease debris migration, and rupture of the sheath contributed to particle-induced osteolysis [35, 36]. These findings were consistent with those reported for another annulus-nucleus analog device with a sheath, the Bryan [54].

The use of histopathology in confirming the presence of wear debris is pivotal in understanding the etiology behind osteolysis. As with most joint arthroplasties, material selection was crucial for minimizing debris and failures. Although cervical disc replacements utilize similar materials, little is known about how the body responds to these materials in the spine. Unlike most large joints, the spine is not a synovial joint so we cannot use these examples as a precedent. This suggests that material reactions may differ slightly from those observed in other joints and may affect the clinical outcome. Therefore, cases with suspected osteolysis should prioritize histopathology tests to confirm the presence or lack of wear debris in the surrounding tissues.

Stress shielding

In the articles reviewed, stress shielding was frequently cited as a cause of non-inflammatory bone loss. Referencing Wolff’s Law, which states that bones will adapt to applied stresses [55], some articles hypothesized that the implants created higher stresses posteriorly, resulting in remodeling to adapt to the modified stresses [45, 53, 56]. Some articles suggest that resection of the anterior longitudinal ligament (ALL) contributes to reduced stresses in the anterior vertebra after surgery [25]. The common “Smith-Robinson” surgical approach for CDR includes this resection which may increase lordosis, decreasing stresses anteriorly [57]. The ALL increases stability and provides a natural constraint for motion [58]. Radiographic observations of anterior bone loss could result from a change in natural motion and stress distribution after ALL resection. Wang et al. reported that patients with bone loss experienced more lordosis following surgery, possibly due to the resection [42]. They hypothesized that this change in stress patterns left a hypo-pressure area on the anterior portion of the vertebra, leading to resorption. Similarly, Kieser et al.[25, 40] reported the resection of the ALL caused bone loss due to decreased anterior stresses. Although these authors believe the timing is inconsistent with this theory, spinal vertebrae are predominantly composed of trabecular bone, which may remodel much faster than cortical bone surrounding hips or knees.

As most bone loss cases have not been associated with pain symptoms or device failure, this etiology would be consistent with remodeling in the cervical spine in the short post-operative time period while the bone adapts to change in stress and is non-progressive. In order to confirm this, longer follow-up with bone loss patients, as well as biomechanical studies to analyze the force distribution of a healthy and implanted cervical spine, are necessary.

Micromotion

Micromotion is another mechanism that has been attributed to bone loss[49], largely resulting in bone loss immediately anterior or posterior [44]. One study suggested that early micromotion during the first 3–6 months lead to bone loss, but after 6 months, no further change was seen, presumably due to a reduction in micromotion through enhanced fixation [41]. Another study reported that patients with hybrid surgeries who were immobile during the first three months had reduced micromotion and a lower incidence of bone loss [44]. The authors suggested that bone loss may have occurred during the process of osseointegration but stopped following successful integration. It has been widely accepted that cyclic motions under 40–150 microns will allow for bone formation [59]. However, as there are no studies that have quantified micromotion to assess the early fixation of cervical disc replacements, it is unknown if the same threshold can be applied to spine devices.

Infection & metal hypersensitivity

Several authors speculated that infection was the cause for observed osteolysis; yet the majority presented incomplete evidence to confirm infection. Most articles did not provide laboratory results to confirm the infection diagnosis[60] and those that did had conflicting results. Cultures tested positive for either for Propionibacterium acnes or Staphylococcus epidermidis; however, there were reports of some negative cultures or normal inflammatory markers in these cases. Due to this inconsistency, most articles could not conclude that infection was the primary cause [26, 29, 36].

Propionibacterium acnes is well documented to be of low virulence and slow growing, which renders it difficult to culture and diagnose as problematic. Its slow growth requires lengthy periods of incubation, which can lead to contamination and false negatives, further adding to the confusion [61]. Propionibacterium acnes has been reported to show poor sensitivity to standard lab markers such as white blood cell counts and erythrocyte sedimentation rates, making it difficult to detect [62]. Additionally, some positive cultures presented with normal inflammatory markers, suggesting inflammatory markers may not even be sensitive enough in the cervical spine to confirm infection. Therefore, conflicting evidence of an infection associated with osteolysis is expected, making it difficult to confirm an association.

In the absence of a confirmed diagnosis of infection, an immune-mediated response due to metal or polyethylene hypersensitivity was considered a possible cause of the osteolysis [26, 27]. Although hypersensitivity has long been observed in orthopedic implants, no validated test is available to confirm this [63, 64]. Studies that could not confirm infection concluded that wear debris was the main cause of osteolysis, despite no explant images or histological quantification of wear debris being provided. Evidence like this further questions reliability of causation analysis and demands a change.

Incidence & Time of Onset

Osteolysis in large joint replacements may take years to manifest as a clinically significant phenomenon [48, 65]. In contrast, in the spine, some authors reported onset less than a year post-operative, while others reported it after more than five years [26, 28, 29, 35, 36, 60]. The time disparities between onset and failure could be due to the natural history and severity of different causes.

Among the 14 articles reporting osteolysis, 48 patients were reported to have osteolysis and all required reoperation. Because most of these studies were case reports, the incidence of osteolysis could not be determined. Consequently, the prevalence of osteolysis in the general CDR patient population is still unknown. Case reports have highlighted the severity of osteolysis once diagnosed, but to recognize if this is a serious problem with disc replacements, larger patient cohorts and longer follow-up periods are crucial. By contrast, the incidence of bone loss in the nine articles reviewed was 515 out of 949 patients (54.2%), proving bone loss to be highly prevalent post-operatively.

Limitations

This review had several limitations, partly due to gaps in the literature for radiographic bone changes. The reviewed articles included many single case reports or small patient cohorts, limiting confidence in the combined averages tabulated. However, nearly all reported cases of osteolysis that could be attributed to reaction from wear debris were in single case reports. This finding alone emphasizes the importance of including all studies in the present systematic review, and not only randomized clinical trials. Specifically, some previous review articles that have addressed this issue have limited their study to randomized trials and may have severely under-reported the incidence and importance of osteolysis.

Studies were not uniform with their grading system or terminology. In some studies, a large number of cases were reported without clearly specifying how bone loss was assessed, quantified, or reported. Future studies would benefit from a universal grading scale of bone changes and radiographic reviews of larger patient cohorts with longer follow-up.

Conclusions

Short-term, non-inflammatory bone loss is a common observation following CDR, while osteolysis is less frequently reported.  Further, while non-inflammatory bone loss rarely leads to revision, osteolysis often progresses and can lead to device removal. Without histology and explant analysis, it is difficult to determine the specific mechanism responsible for bone loss. Although our analysis shows clear differences between these bone changes, these terms are often confused. The practice of interchanging these terms ultimately puts the patient at risk as recommended clinical treatments may not align with the actual prognosis. Therefore, this review highlights the need for uniform terminology to differentiate between non-inflammatory and inflammatory bone changes to identify the nature and severity of the reported bone loss and the need to assess these complications in preclinical trials.