Introduction

Hip osteoarthritis is an enormous health burden estimated to affect one in four individuals in the USA during their lifetime [1]. Frustratingly there are no disease-modifying therapies currently approved for the treatment of any form of osteoarthritis, with substantial breakthroughs in therapy development hampered by an ongoing need for reliable biomarkers [2, 3].

Clinical and epidemiological studies continue to define disease with radiographic Kellgren and Lawrence (K&L) grading [4], but this system has been shown to suffer from variable interpretation [5]. The current recommended definition of disease progression for clinical trials is change in radiographic minimum joint space width (JSW) [6]. However, relevant progression needs to be defined within each study, which can complicate comparisons between different populations, while techniques of JSW measurement have been reported to have poor reproducibility [7, 8]. Radiographic assessment is also limited by 2D interpretation of a 3D structure.

Magnetic resonance imaging (MRI) stands at the fore of imaging in osteoarthritis research, but despite promise from a variety of structural and biochemical techniques, quantitative MRI has not yet delivered a substantiated osteoarthritis biomarker. However, clinical computed tomography (CT) has gained momentum in imaging of osteoarthritis in recent years [810]. This is because bone has become increasingly apparent as a key player alongside cartilage and synovium. It is also apt as a modality given the proposed structural mechanisms of femoro-acetabular impingement [11]. Although not yet approved for clinical use in humans, there have been encouraging reports on the efficacy of a bone-related agent, strontium ranelate, from one clinical trial [12], providing motivation to monitor mineralised tissues. Quantitative CT of bone is further supported by recent opinion that careful alignment of therapeutic action, tissue response and imaging should yield the greatest chance of detecting longitudinal change with the required sensitivity [13].

In line with these strategies, our primary aim was to assess the relationship between cortical bone thickness and radiological hip osteoarthritis with a view to developing a new method for disease quantification. We propose this through the application of a recognised 3D image analysis technique called cortical bone mapping (CBM) [1416]. Our hypothesis was that CBM would be able to identify regions of cortical bone thickening around the proximal femur associated with worse radiological osteoarthritis, predicting this would be in the distribution of recognised peri-articular bone-related phenomena such as osteophytes and subchondral sclerosis.

Material and methods

Participants

Participants for this observational study were taken from a cohort of 204 female control volunteers involved in existing Cambridge trials investigating hip fracture risk (57 from FEMCO: LREC 07-H0305-61; 38 from MRC-Hip fx and 100 from MRC-Ageing: LREC 06/Q0108/180; 9 from MRC-Stroke: LREC 01/245). The local research ethics committee approved each of these studies, with each participant giving informed consent for the analysis of their CT data for the investigation of hip disease. All participants were free of hip fracture, osteomyelitis, bone malignancy, unilateral metabolic bone disease, terminal illness and prior treatment with teriparatide or strontium ranelate. All examinations were reported by a consultant radiologist as part of the routine clinical care of patients. One of the 204 examinations was unavailable for analysis on account of data corruption and so imaging was used from 203 individuals. The mean [± standard deviation (SD)] age of this final group was 65 ± 18 years; mean weight was 70.1 ± 14.4 kg; mean height was 1.62 ± 0.07 m; mean body mass index (BMI) was 26.9 ± 5.4 kg/m2. No height data was available for one individual and so she was given the sample mean value.

Imaging acquisition

Imaging of the hips was acquired helically in the supine position on five different clinical whole-body multi-detector CT machines (Siemens SOMATOM Sensation 16, Sensation 64, Definition Flash, Definition AS+, and a GE Medical Systems Discovery 690). The acquisition protocol involved imaging inclusively from the superior margin of the acetabuli to the lesser trochanters. All acquisitions were processed with a standard smooth-edge body kernel and reconstructed with axial slice separation ranging from 0.75 to 1.5 mm, equivalent to slice thickness in all cases. In-plane pixel spacing varied from 0.59 to 0.78 mm, with field of view consistently 512 by 512 pixels. Peak kV was routinely 120 kV. When available from the fully anonymised metadata, recorded exposure ranged from 67-274 mAs, varying due to routine use of dose limiting.

Radiological measures of hip osteoarthritis

Since there were no clinical measures of disease available in this cohort, we used radiological scores to grade disease severity. Digitally reconstructed radiographs (DRRs) were created using an anatomically standardised coronal mean intensity projection with a window level of 200 Hounsfield units (HU) and a width of 700 HU. DRRs were reviewed by a consultant radiologist (AF) for K&L grading (0 to 4) [4] and minimum JSW, recorded to the nearest 0.1 mm with electronic callipers. Multi-planar reformats were reviewed by another consultant radiologist (TT) for osteophyte load, a reviewer-generated cumulative score of osteophytes graded from 0 to 3 in 42 separate sectors from around the proximal femur [10]. The maximum score in this cohort was 47, and so this was the upper limit used in our analysis. Full details of DRR and scoring methodologies (including reliability) have been previously published [8, 10].

We investigated the effect of these radiological measures on cortical thickness in the proximal femur, with the assumption of a linear relationship with disease for each. Frequency histograms were created for each radiological score using MATLAB 2013a (© 1984-2013, The MathsWork, Inc.) to demonstrate the spread of disease severity (Fig. 1). The assumption of linearity was validated by scatter plots and calculation of Kendall’s tau correlation coefficient using StatPlus®:mac 2009 (AnalystSoft, StatPlus:mac - statistical analysis program for Mac OS. Version 2009; http://www.analystsoft.com/en/). Each covariate showed a linear relationship and significant correlation with the other (Electronic Supplementary Material 1). When both femurs were available, the side with the worse disease score was selected for analysis in order to minimise the risk of including only those with no or minimal disease. If disease scores were equal, then a side was chosen randomly for inclusion. Repeating analyses with different randomisation did not affect the pattern or significance of our results.

Fig. 1
figure 1

Frequency histograms for each of the radiological scores. a Kellgren and Lawrence (K&L) grade; b osteophyte load; and c minimum JSW at 0.1-mm gradations

Cortical thickness measurement

Cortical thickness measurement was performed using CBM, implemented by a freely available in-house program called Stradwin (http://mi.eng.cam.ac.uk/~rwp/stradwin/). The main strength of this technique is its ability to accurately estimate cortical bone thickness below the pixel spacing of the imaging system, irrespective of partial volume effect. It has been validated for clinical CT against high resolution quantitative CT data acquired at 82 microns per pixel, and applied in clinical CT data sets up to a slice separation of 3 mm [14, 17, 18]. Following semi-automatic segmentation and proximal femur mesh generation, cortical thickness was measured at each of ~6,000-mesh vertices at a normal to the underlying cortical surface. Creation of a single 3D thickness map takes, on average, 15 minutes; the majority of this time is spent on semi-automatic segmentation, which varies according to noise in the data, while automatic thickness analysis usually takes under 1 minute on a standard desktop computer. Measurement error (± SD) at any given point has been previously reported as 0.12 ± 0.39 mm, well below the pixel spacing size in this study and otherwise in daily clinical practice [18]. Precision has also been assessed in repeat analysis of 19 individuals re-imaged with CT and re-analysed after an average of 61 days (SD 22 days, range 34 to 107 days) giving a measurement precision of 5 % of the mean at individual points, dropping to 1 % of the mean when measurements were averaged over a defined region of interest (ROI) [19].

In order to account for differences in hip morphology, each proximal femur was spatially re-aligned with a canonical right femur using B-spline free-form deformation as calculated by an iterative closest-point registration algorithm [20]. This was performed using freely available in-house software called WxRegSurf (http://mi.eng.cam.ac.uk/~ahg/wxRegSurf/). Spatially normalised cortical thickness maps were then smoothed with a 10-mm full width at half maximum (FWHM) filter, required for statistical parametric mapping (SPM). After registration of each bone to the canonical femur, SPM was performed to determine regions of significantly different cortical thickness for changes in radiological disease score.

Statistical parametric mapping (SPM)

SPM was initially developed for applications in neuroimaging to allow regionally located statistical inferences to be made from the comparison of data at multiple points in space. It uses a general linear model to account for variability in measurement data (here, cortical thickness) in terms of experimental and confounding effects. Effects in our general linear model were radiological disease score (experimental), age and weight (confounding, as revealed by preliminary analysis). Standard two-tail F testing across the cohort gave p-values at each measurement location that were uncorrected for multiple comparisons. As required for SPM, random field theory delivered p-values corrected for these multiple comparisons to control for false positive results (type I errors) [21]. SPM was performed using the SurfStat package [22] in MATLAB 2013a (© 1984-2013, The MathsWork, Inc.). A mean thickness map was also produced for the whole cohort (Fig. 2), allowing change in cortical thickness to be represented as a percentage of the mean. Therefore, three sets of results were calculated for each disease score:

Fig. 2
figure 2

Mean cortical thickness of the proximal femur. Mean cortical thickness map of the proximal femur from the 203 individuals in the study cohort, presented as a colour wash on a canonical model of the right femur viewed from three perspectives: (left to right) anterior, superior and posterior

  1. (a)

    Absolute cortical thickening (in mm) for each unit worsening in radiological measure;

  2. (b)

    Relative cortical thickening (as a percentage of the mean value) for each unit worsening in radiological measure; and

  3. (c)

    Subsequent regions of significant differences in cortical bone thickness across the cohort.

Significant regions of cortical bone thickness difference are “clusters” that meet statistical significance when tested after random field theory correction. For the purposes of visualising results effectively, all non-significant regions were indicated by washed out colour over the relative thickness maps (Figs. 3, 4, and 5). Absolute cortical thickness maps are included in the Electronic Supplementary Material (2 to 4) for reference against these relative thickness maps. An overview of the CBM pathway from data acquisition to results is presented as a flowchart in Fig. 6.

Fig. 3
figure 3

Cortical bone mapping (CBM) results for Kellgren and Lawrence (K&L) grade. CBM results for K&L grade presented as a colour wash on the canonical right proximal femur. Significant regions of cortical bone thickness difference for each unit increase in K&L grade (0 to 4) are represented by the colour map (p < 0.05), with non-significant regions indicated by faded colours (p > 0.05)

Fig. 4
figure 4

Cortical bone mapping (CBM) results for osteophyte load. CBM results for osteophyte load presented as a colour wash on the canonical right proximal femur. Significant regions of cortical bone thickness difference for each unit increase in osteophyte load score are represented by the colour map (p < 0.05), with non-significant regions indicated by faded colours (p > 0.05)

Fig. 5
figure 5

Cortical bone mapping (CBM) results for minimum joint space width (JSW). CBM results for minimum JSW presented as a colour wash on the canonical right proximal femur. All colours were faded for this variable, meaning that there were no regions of significance (p > 0.05)

Fig. 6
figure 6

Cortical bone mapping (CBM) flowchart. Each stage of the CBM pathway is represented to summarise the study methodology. SPM = statistical parametric mapping

Results

Cortical bone mapping results for three radiological measures of disease (K&L grade, osteophyte load and minimum JSW) are shown as a colour map on a canonical model in Figs. 3, 4, and 5, with non-significant regions according to SPM indicated by faded colours. Therefore, significant regions of cortical bone thickness difference with worsening disease score (p < 0.05) are represented in full colour in contrast to faded regions of non-significance.

Kellgren and Lawrence (K&L) grade

There was up to 25 % significantly thicker cortical bone for each unit increase in K&L grade at the superolateral and anterior femoral head-neck junction, with extension into the superior subchondral bone plate (Fig. 3). The map for absolute differences in cortical thickness is presented in Electronic Supplementary Material 2, showing that 25 % was equivalent to a 0.1 mm increase per each unit increase in grade (0 to 4).

Osteophyte load

There was up to 7 % significantly thicker cortical bone for each unit increase in osteophyte load score circumferentially around the head–neck junction, extending medially across the superior subchondral bone plate to the fovea, and laterally into the posterior and medial femoral neck (Fig. 4). The map for absolute differences in cortical thickness is presented in Electronic Supplementary Material 3, showing that 7 % was equivalent to a 0.02 mm increase per each unit increase in osteophyte load (0 to 47 in this cohort).

Minimum joint space width (JSW)

There was no significant difference in cortical bone thickness for each mm decrease in minimum JSW (Fig. 5), although cortical thickness at the superolateral head-neck junction still increased by up to 20 % for each mm decrease in minimum JSW. The map for absolute differences in cortical thickness for minimum JSW is presented in Electronic Supplementary Material 4, showing that 20 % was equivalent to a 0.08 mm increase per each mm decrease in minimum JSW.

Discussion

We have applied a quantitative 3D image analysis technique to clinical CT imaging data to identify significant cortical bone thickening in association with worse radiological disease. More specifically, this effect was localized to a defined region at the femoral head–neck junction. Participants had statistically significant (10–25 %) thickening of bone at the superior and anterior aspects of this junction for each unit increase in K&L grade, with significant effects more widely distributed for osteophyte load. Although not significant for minimum JSW, a relative increase in cortical bone thickness at the same site across all scores suggests that this region may be involved in a particular manifestation of disease (as demonstrated by the red regions consistent in Figs. 3, 4, and 5). Alternatively, minimum JSW may be related to an independent factor that we did not include in our model. Three distinct bone phases have been recognised in osteoarthritis: (i) initial osteoclastic subchondral bone resorption; (ii) osteoblastic bone formation progressing to sclerosis; then (iii) osteophyte formation at the articular margin [23]. Our results suggest that CBM has picked up the latter two of these effects.

In order to understand the implications of these results, it is important to consider what CBM is registering as cortical bone thickening. Our distribution of cortical involvement fits with current biomechanical hypotheses of impingement at the hip, covering regions of predicted increased contact pressures at the superior and superolateral femoral head [24]. Another study of individuals with cam impingement showed that maximal shear stresses on squatting were located in the supero-anterior acetabulum, immediately opposite the site where we have demonstrated thickened bone (Figs. 3 and 4) [25]. The same study also localised stress maxima within subchondral bone, supporting the notion that superolateral impingement is influencing structural change here. It is, therefore, possible that we are detecting a subclinical response in cortical bone to mechanical effects of impingement at the superolateral head–neck junction. CBM may also be detecting a cortical manifestation of osteophyte growth. Figure 7 shows that this may not necessarily be an osteophyte component outside the cortex, since our algorithm selects the innermost of two layers as the true cortex for analysis. Importantly, peri-articular cortical bone and subchondral bone plate can appear thickened as separate from osteophytes, or even in their absence, as review of the underlying imaging data reveals (Fig. 7).

Fig. 7
figure 7

Cortical thickness at the superolateral femoral head-neck junction. Axial CT images of the right hip in two individuals from the cohort viewed in Stradwin, the software used to perform CBM. Dotted green lines represent the approximate contour for the femoral isosurface. Cyan lines represent the path of thickness measurement. Red dots represent the estimated limits of cortical bone, neither of which need be coincident with the original green contour. Individual (a) had a Kellgren and Lawrence (K&L) grade of 1, an osteophyte load of 4 (out of a maximum of 126), and a minimum JSW of 2.0 mm. Note that the measurement line is at a site with apparently greater cortical thickness than surrounding bone yet in the absence of an osteophyte. Individual (b) had a K&L grade of 3, an osteophyte load of 44 (out of 126), and a minimum JSW of 1.2 mm. Note that the measurement line in (b) has picked up two layers of bone, the underlying cortex and the osteophyte outside it. Note that there is also apparent increased cortical bone thickness in the anterior femoral head anteromedially and separate to the osteophyte (arrowhead)

Before considering the next phase of validation for assessment of osteoarthritis with CBM, it is important to evaluate the implications of using clinical CT for this purpose. CT is important for imaging joints and bone because it excels at depicting mineralised structures in contrast to surrounding soft tissue. In addition to providing a framework for finite element studies [2426], CT has also shown strength in representing bone shape and early bone-related changes of disease at the hip [27, 28]. The promotion of CT for imaging large joint osteoarthritis should, therefore, be balanced against its weaknesses and the inherent values of other modalities such as MRI and radiography. This is of particular importance given the exposure to ionising radiation that it entails along with the proximity of radiation-sensitive tissues in the pelvis (which is, therefore, relatively less dose at more distal joints such as the knee). However the continual development of dose reduction strategies should encourage consideration for CT in clinical trials [29, 30].

We recognise that this study has limitations. Our assessment of hip osteoarthritis severity was based on imaging features alone, which do not necessarily correlate with clinical osteoarthritis. Thus, the next step would be to test the hypothesis that increased peri-articular cortical bone thickness can predict worse disease according to clinical outcome measures. We also recognise that individuals from this cohort were pre-selected on the basis of volunteering for studies in which the aim was to establish hip fracture risk, and so there will be according selection bias. Finally, subchondral bone sclerosis is a recognised radiological feature used in assessment with K&L grading, meaning that there will be some dependence of the output of our cortical thickness analysis on K&L grade, since DRRs were created from the same CT data. Nonetheless, we believe that CBM should now be tested to show whether quantitative 3D measurement of cortical bone with clinical CT can predict clinical disease with the necessary levels of sensitivity and precision [31]. The role of CBM in the prediction of osteoarthritis at other large joints, such as the knee, should also be evaluated.

Conclusion

CBM applied to the proximal femur in clinical CT imaging data identified significant structural changes in peri-articular cortical bone thickness that were associated with worse radiological hip osteoarthritis, particularly at the superolateral head–neck junction. These findings are best explained by the manifestation of osteophytes and subchondral sclerosis in this region, which may be either in response to local biomechanical stresses or the biochemical environment as the joint fails. CBM could, therefore, have an impact on both osteoarthritis research and subsequent clinical practice, especially with potential for application in disease monitoring and the development of structurally modifying therapies.