Introduction

Osteoarthritis (OA) is currently defined as a ‘progressive disease of synovial joints, representing failed repair of joint damage that results from stresses initiated by an abnormality in any of the synovial joint tissues… this ultimately results in the breakdown of cartilage and bone, leading to symptoms of pain, stiffness and functional disability’ [1]. Such a concept of OA as being only a degenerative disease or ‘wear and tear’ is now considered outdated [2, 3]. OA is a complex disease involving all the tissues of the joint as well as inflammation. Emerging research suggests that OA is a collection of heterogeneous pathologies that result in a common outcome (pain and joint destruction), rather than being one homogeneous disease. This heterogeneity includes different subtypes and structures affected, where OA processes begin, and risk factors, all depending on the joints affected.

In addition to cartilage degeneration, specific phenotypes/subtypes of OA proposed in the literature include a bone-specific subtype [4], predominantly bone marrow lesions in the subchondral bone [5], traumatic (acute or repetitive) which may be initiated by lesions at the anterior cruciate ligament following mechanical disruption [68], or as a result of knee meniscectomy, metabolic (including systemic inflammatory), ageing, genetic and pain [9], as well as atrophic (few osteophytes with severe joint space narrowing (JSN)), hypertrophic (large osteophytes, little JSN) [10, 11]) and traumatic (acute or repetitive). These also vary in the main joint site(s) affected. Hip and knee OA can result from the majority of the subtypes, and post-traumatic stresses are proposed to result in less common types of OA including the thumb, shoulder, elbow and ankle OA [9].

The majority of studies assessing measurement of OA progression have been done in the knees, first using radiographs, and more recently employing other tools including MRI and ultrasound, but the latter to a much lesser extent and targeting specific and fewer joint structures than MRI.

Radiographs

Radiographic definitions of osteoarthritis

Historically, radiography has been the most common imaging modality to assess OA. The most commonly used radiographic definitions of OA are the Kellgren and Lawrence (KL) classification [12], which grades the joint as a whole (grade 0, no OA; to grade 4, severe), based on marginal osteophytes, JSN, subchondral bone sclerosis and altered shape of the bone.

The more recent OARSI system [13] grades individual features of OA (JSN, subchondral lucency, marginal osteophytes, subchondral sclerosis) on a 0–3 scale (where 0, absent and 3, severe), with no grading for the joint overall.

Although assessment of joint structural features on radiographs remains widely used for diagnosis and for staging OA, this technology is two dimensional, subject to measurement error due to joint positioning issues, typically used in a semi-quantitative manner and does not allow visualisation of non-calcified tissues [14], as well as being a poor predictor of clinical outcomes [15] and only weakly associated with symptoms [1619].

Assessing OA using radiographs: radiographic grade and joint space width

Radiographic assessment of knee joints has historically been used as a proxy measure of cartilage volume, due to favourable associations between joint space width (JSW) and cartilage volume cross-sectionally. However, JSN is not sensitive for cartilage loss since cartilage loss frequently occurs without a corresponding progression of JSN. Correlations between changes in JSW and cartilage volume in OA cohorts were observed in some studies, but were weak over long periods of observation [2022], while others reported no correlation over 2 years [23, 24]. In addition, this method also shows unexplained variability, which could reflect change in meniscal position or structure alteration as well as cartilage morphology [22, 2528]. Therefore, change in JSW is a poor proxy for change in cartilage volume longitudinally. Moreover, although this method can be assessed in a quantitative manner (mJSW), the assessment of JSN in both the KL and OARSI classifications is only semi-quantitative (range 0–3 or 0–4, respectively).

Overall, although change in grade of radiographic assessment of OA knee joints remains the “gold standard” for assessing disease modification in clinical trials [29] and is mandated by the USA Food and Drug Administration and European Medications Agency as a proxy endpoint to determine efficacy of disease modifying OA drugs (DMOAD), this method is very insensitive to cartilage change.

Measuring progression of OA using radiographs

Measurement of knee JSW obtained from radiographs in persons with knee OA is moderately responsive to change (standardised response mean (SRM) of 0.33, increasing to 0.57 in studies longer than 2 years duration); however, the inter-reader coefficient is high (ICC of 0.97) [30]. These estimates are inferior to comparable MRI measures of cartilage [21, 24, 31, 32]. Radiographic grade (as assessed by KL grade) does predict risk of knee replacements over 3–8-year periods [33, 34], which is unsurprising, given that decisions for joint replacement are based on radiographs. However, this association is not independent of cartilage loss in some cohorts [35], and is only one of the predictors, for example for pain [33, 36].

In hand OA, site-specific scoring systems have been developed for the assessment of radiographic progression, such as the Verbruggen–Veys anatomical phase score [37], and the quantitative Ghent University score system (GUSS) used in erosive hand OA [38]. Several reviews have investigated the reproducibility and sensitivity to change of the numerous hand scoring methods using radiographic progression [3941], and data showed that the estimated SRM values for four of them were modest (0.24 to 0.29) [40] and all semi-quantitative methods were comparable [39, 40]. However, with radiographic methods, the time required to demonstrate a change in hand OA is less than for knee OA, as it can detect differences in treatment in 6 months or less [42]. Quantitative JSW of the hand is a valid method to measure joint space and showed a good relation with clinical features in a case–control study [43], but this remains to be demonstrated in longitudinal studies.

Magnetic resonance imaging

Assessment of osteoarthritis

MRI allows assessment of the whole joint; it enables visualisation of cartilage and a large range of other structures, including those associated with pain. These can be assessed over a continuum from normal to diseased joint (OARSI grade 1 medial JSN, where more than 10 % of cartilage is already lost [14]), through to end-stage OA (OARSI grade 3, KL grade 4).

Diagnosis of OA using MRI

Based on a systematic review of 25 MRI studies [44], tibiofemoral OA is currently defined as requiring a combination of features: either an osteophyte and full thickness cartilage loss or an osteophyte or full thickness cartilage loss, plus two or more of the following features: bone marrow lesion (BML) or cyst not associated with meniscal or ligamentous attachments, meniscal pathology, partial thickness cartilage loss (in the absence of full thickness cartilage loss), or bone attrition. Patellofemoral OA requires an osteophyte and partial or full thickness cartilage loss, all in the patella and/or anterior femur regions [44]. MRI-based scoring systems have also been proposed in hand OA [45], a refinement of the Oslo hand OA scoring system, including osteophytes, JSN, bone erosions/attrition, cysts, malalignment, synovitis and BML in the distal and proximal interphalangeal (PIP) joints.

The MRI-based definitions of OA include cartilage, but also other important structures that would be considered part of putative OA phenotypes, such as BML, meniscal alterations, synovitis (synovial membrane thickness, effusion) and ligament pathology (Fig. 1) especially if they lead to functional disabilities [1]. All these structures could be the causal pathways of pathological processes of OA and are therefore potentially suitable as targets for therapy or as proxy endpoints for assessing disease progression.

Fig. 1
figure 1

Progression from a normal joint to a joint affected by end-stage osteoarthritis. Note: reference: [14].

Measuring disease progression

MRI assessment of cartilage morphology is now recommended for the evaluation of disease progression as an endpoint for clinical trials [29, 46]. Tissues other than cartilage are also increasingly being used and include BMLs, meniscal alterations, synovitis and effusion. Associations between structural features of knee OA as seen on MRI and knee OA outcomes are summarised in Table 1.

Table 1 Associations between MRI structural features of OA knee and endpoints

Cartilage assessment: cartilage volume/thickness

Cartilage volume as assessed by MRI is a direct measure, and its level of loss has been shown to be better responsive to change than JSW [21, 24, 30, 32, 47]. Unsurprisingly, quantitative measures also have superior sensitivity to cartilage changes compared to semi-quantitative scoring [31, 48•].

Rate of cartilage loss is a good measure of progression as it predicts risk of knee replacements [35, 46, 49]. Every 1 % increase in the rate of tibial cartilage loss over 2 years confers a 20 % increased risk of knee replacement at 4 years [35]. Data from clinical trials in knee OA have shown that reduction in cartilage loss by potential DMOADs can reduce the cumulative incidence of total knee replacement over time [49, 50]. Cartilage thickness has also been used to assess rates of disease progression, and data demonstrate that both the measurement of cartilage thickness and that of volume provide the same level of sensitivity to estimate cartilage loss in a clinical trial [51]. Loss of cartilage thickness was also found associated with increased risk of knee replacement [52].

It is now possible to accurately and feasibly measure change in cartilage volume over a 6-month period [53, 54], although time frames of 24 months and sample sizes of ∼200 are usually used for clinical trials with cartilage endpoints; this is much shorter and involves fewer patients than required for knee radiographic endpoints.

Cartilage assessment: other measures

Cartilage defects

Defects in the cartilage are common in people with knee OA and have poor capacity for repair; although there is some evidence that reduction in weight is associated with reduction in defect severity, at least in a midlife cohort [55]. Such defects are associated with reduced cartilage volume and patellar cartilage volume cross-sectionally in people with symptomatic knee OA [56]. Longitudinally, change in cartilage defects was associated with tibiofemoral cartilage loss over 10 years in a midlife cohort [55], and greater number of cartilage defects predicted joint replacement over 4 years, which was not confounded by age, gender, BMI, symptoms, annual percentage cartilage loss or radiographic severity [56]. SRMs for cartilage defects were similar in magnitude to those obtained for cartilage volume [32].

A recent trend in OA research is the investigation of changes in cartilage using various MR imaging modalities. These should enable assessment of progression of cartilage loss in earlier stages of OA and even before loss of this tissue. The methods most tested are the T1 cartilage signal intensity, delayed gadolinium-enhanced MRI of cartilage (dGEMRIC), and T2 relaxation time mapping [57], all of which require specialised MRI sequences. Reduction in mean signal intensity of cartilage on T1-weighted MR images is associated with reduced cartilage thickness over 2.9 years, suggesting that low cartilage signal intensity on T1 MRI may reflect early OA changes [58]. dGEMRIC is a measure of cartilage composition and is used to assess the relative distribution of glycosaminoglycans in cartilage, on the T1 sequences of MR images [59]. A low dGEMRIC index indicates normal cartilage, and a high dGEMRIC index indicates degraded cartilage. Reduced dGEMRIC values are associated with increased cartilage thickness in the medial tibiofemoral compartment [60], which may represent swelling of cartilage in the early stages of degeneration. Weight loss has been demonstrated to improve quality (increased proteoglycan content) and quantity (reduced cartilage thickness loss) of medial knee articular cartilage using dGEMRIC over 12 months [61], as has physical activity in young women over 3 months, as demonstrated by an improvement in T1(Gd) compared with the control group [62]. However, this technique uses a contrast agent, gadopentate dimeglumine (Gd-DTPA2−) and the USA Food and Drug Administration issued a report concerning the potentially serious side effects related to such contrast agent, particularly in older patients who constitute the great majority of OA patients, precluding its routine use or use in large clinical trials. The T2 relaxation time of articular cartilage is a function of the water content and collagen structure of the tissue. Measurement of the spatial distribution of the T2 relaxation time is associated with areas of increased or decreased water content, correlating with cartilage damage [57]. Higher baseline T2 is associated with greater cartilage thickness, higher cartilage volume [63] and fewer cartilage defects [64] cross-sectionally, and cartilage loss longitudinally [64]. However, longitudinal change in T2 relaxation time was inversely associated with grade of cartilage defects at baseline and follow-up, suggesting that once morphological cartilage defects occur, T2 values may be of limited use for evaluating further cartilage degradation [64].

Bone marrow lesions

BMLs predict site-specific progression of knee JSN in those with symptomatic knee OA (ORs 8.9 medially and 5.9 laterally, reducing to 2.8 and 5.6, respectively, following adjustment for alignment and other covariates) [65]. Moreover, a combination of cartilage damage and BMLs was associated with progression of JSN [66]. BMLs are strong and independent predictors of knee cartilage loss, especially site-specific cartilage loss [67, 68, 69••], and progression of knee cartilage lesions [24, 28, 67, 68, 7072]. BMLs also predict total knee replacement over periods of up to 4 years [49, 50, 7375]. In some studies, BMLs predicted knee replacements more strongly than other predictors assessed in the same cohort [49, 74]. Although these studies were of small size, this suggests that BMLs are a more sensitive predictor of development or progression of knee OA than other factors. An advantage of using BMLs as a measurement of disease progression or response to therapy is that response to treatment can be observed in a shorter period of time. In trials using BMLs as an outcome, differences in BML size were seen over periods as short as 6 weeks when looking at a brace for knee patellofemoral OA [76•] and 12 months when looking at the discrimination between OA treatment [53].

In vivo evidence for a subchondral bone phenotype of OA includes data on persons treated with bone remodelling drugs including bisphosphonates and strontium ranelate. Laslett et al. reported reduction in pain and reduction in size of BMLs in people with knee pain and clinical knee OA over 6 months using zoledronic acid in only 59 study participants [77••]. Zoledronic acid also reduced pain over 1 month in a similar pathology (Modic changes) in 40 people with low back pain, but there was no data regarding the effect of this drug on other structural changes in this cohort [78]. In a subset of the observational Osteoarthritis Initiative cohort, data showed improvements in pain for up to 3 years, and a trend toward reduction in JSN by year 4 in participants using any type of bisphosphonate [79]. In a phase III knee OA trial, strontium ranelate reduced both cartilage volume loss and BML and demonstrated an association between the decrease in BML score and the reduction in cartilage volume loss over 36 months [69••]. Other clinical trials evaluating drugs not directly related to bone remodelling revealed a strong association for the compounds tested between BML score change and cartilage volume loss in the same or related topographical sites [80] as well as the need for total knee replacement [49, 50, 53].

There is much less data for other anatomical sites. However, in a study on the hip, large BMLs were found to be longitudinal predictors of increased likelihood of high cartilage signal [81]. In the hand, BMLs were also shown to predict radiographic progression [82].

Therefore, targeting subchondral bone has allowed assessment of response to therapy for preventing disease progression using surrogate endpoints in relatively small numbers of participants over short periods of time.

Effusion and synovitis

Effusion and/or synovitis are the hallmarks of joint inflammation. Such features can be measured using MRI without contrast agent quantitatively for the synovial fluid [83] and semi-quantitatively for the synovial membrane [84].

Data from the Osteoarthritis Initiative demonstrate that presence of either effusion or Hoffa synovitis predicts incident radiographic knee OA over 2 years [85]. Longitudinal observational data demonstrates that knee effusion/synovitis in the suprapatellar pouch predicts cartilage defects, BMLs and increased cartilage loss over 2.6 years, in a dose–response manner [86]. Interestingly, associations between synovial inflammation/effusion and cartilage alterations could be the ‘chicken or the egg’ dilemma in that, although synovial inflammation could lead to cartilage damage [87], the reverse is also possible [87, 88]. The presence of MRI-detected effusion or synovitis predicts knee replacement [75]. Data also showed that chondroitin sulphate, a symptomatic slow acting drug for osteoarthritis (SYSADOA) with DMOAD activity, at 6 months in patients concomitantly taking NSAIDs, significantly reduced the synovial membrane thickness (as assessed by MRI without contrast agent [84]) compared to the placebo group, in addition to decreasing cartilage volume loss [53].

MRI-detected synovitis has also been used as a measure of disease progression in hand OA, with the presence of moderate/severe synovitis predicting radiographic progression (JSN) over 5 years, along with BMLs, bone damage and osteophytes [82]. Trials are starting to use effusion/synovitis as a treatment target. In a trial using low-dose oral prednisolone for painful hand OA, MRI-assessed effusion–synovitis was associated with pain cross-sectionally, but not hand pain as assessed by VAS; however, effusions did not change over time with treatment or predict response to treatment [89].

Meniscal alterations

Meniscal alterations are associated with increased risk of developing radiographic knee OA in middle-aged and elderly individuals, with meniscal tears increasing the odds by 5.7 in one study [90] and meniscal damage increasing the hazard ratio of radiographic OA in 2 years by 1.81 in another [85]. Meniscal extrusion independently predicts knee cartilage volume loss at about 2 years [54, 91], as do meniscal tears [92]. Both meniscal extrusion [75] and tears [49] predict knee replacement surgery [49]. Meniscal tears also demonstrate effect modification through changes in weight, with increase in weight associated with increased cartilage loss over 2 years in those with meniscal tears; this effect was absent in those without meniscal tears [93•]. Importantly, co-localised knee meniscal extrusion/damage and BML further increased the risk of cartilage loss [94•, 95].

Although the presence of meniscal extrusion has been shown to be associated with greater cartilage volume loss and to be a strong marker of OA progression [27, 28], to our knowledge, there has been no clinical trial using MRI investigating the effect of meniscal extrusion as a primary outcome on response to drug treatment. However, post hoc analyses of studies exploring the effect of potential DMOADs argue in favour of an MRI-based diagnosis of meniscal extrusion in clinical practice, which would help physicians identify knee OA patients who are more likely to benefit from such treatment, namely those with mild-moderate OA with meniscal extrusion or those with more symptomatic disease but without meniscal extrusion [54, 94•, 96].

Ultrasound

Ultrasound is not yet widely used for assessing OA disease progression. Ultrasound is operator dependent, which increases measurement error if more than one operator images joints in the same research study. However, ultrasound is less expensive than radiography and MRI and could be used in clinical practice as both a diagnostic and explanatory tool for clinical symptoms. Ultrasound is more appropriate for superficial joints such as the hand rather than joints located deeper within the body including the hip.

Ultrasound-detected inflammation

In a study detecting inflammation using ultrasound, inflammation was associated with radiographic features, with the presence of power Doppler signals (but not presence of grey scale (GS) synovitis or effusions) and associated with new radiographic central erosions in the hand over 4 years [97]. In another study, grade of ultrasound-detected inflammation (GS synovitis, power Doppler signals) showed a positive dose–response association with progression of KL grade in hand OA over 5 years [98].

Effusion

Ultrasound-detected knee effusion predicts knee joint replacement [33]. In terms of treatment, the presence of ultrasound-detected knee effusions has been used to select people who might respond well to oral methotrexate, which demonstrated efficacy in an open-label trial, albeit without a control group [99]. Overall, although there is some evidence of use of ultrasound indices to assess disease progression in OA; mostly for effusion, there are very few studies for associations between abnormal ultrasound findings and other OA features or disease progression.

Conclusions

Progression of OA has historically been measured using radiographs; however, this method has major limitations including its inability to assess other important joint structures such as synovium, menisci and BMLs, in addition to being insensitive to change. MRI is increasingly used to assess OA disease progression and as a marker for diagnosis of the disease. This method has increased and will continue to increase our knowledge of the predictors of progression, which will in turn be important for optimal treatment allocation and clinical investigations of DMOAD therapies. Not all patients will progress symptomatically, radiologically, or to joint replacement, and treatments need to be focused on those most in need. Methodologies in regard to quantitative measures of MRI structures are now well used and provide a better understanding of factors associated with disease markers and progression. In turn, the joint structures as evaluated by MRI could help us to understand the extent of response to treatment as well as being used as targets in clinical investigations of new therapies. In this context, fully automated quantitative MRI systems for cartilage volume [100•], BML [101], osteophytes [102] and synovial fluid [83] were recently developed. A primary advantage of such fully automated systems for knee tissue structure is the possibility of intensive and autonomous computation, enabling images from large-scale studies to be analysed in a shorter time and, very importantly, compared to manual or semi-automated methods, increased stability of the reading, and hence increased accuracy of results.

Ultrasound requires more studies on joint features other than effusion/synovitis in order to be a choice of method for OA patient diagnosis and disease progression.