Introduction

The growing population suffering from knee osteoarthritis (OA) and the lack of early biomarkers and therapeutics prompt the need for efficient imaging methods [1]. Magnetic resonance imaging (MRI) allows assessment of the whole knee joint, making it ideally suited for imaging in knee OA, which is a multi-tissue disease [2, 3]. Several potential MRI-based biomarkers have been proposed in this context [4]. In particular, the role of quantitative MRI (qMRI) techniques is emerging. qMRI techniques, such as T2 mapping, have the ability to non-invasively detect subtle changes in biochemical composition of tissues such as cartilage and menisci. Increased T2 relaxation times have been shown to be associated with cartilage and meniscus degeneration, potentially enabling early-stage detection of knee OA and similar conditions [5,6,7,8]. T2 mapping does not require a contrast injection or special MRI imaging hardware and numerous techniques for post-processing of T2 images are available [5, 7, 9, 10].

Besides quantitative MR imaging, structural evaluation of the knee is fundamental in the assessment of knee OA, given its multi-tissue nature [2, 3]. The semi-quantitative MRI Osteoarthritis Knee Score (MOAKS) [11] is a widely used and well-validated instrument for evaluating knee OA and has been applied in large-scale epidemiological OA studies such as the Osteoarthritis Initiative (OAI) [11,12,13,14].

T2 mapping and MOAKS are potential biomarkers to non-invasively assess joint health; however, acquiring them efficiently is a challenge. In general, multiple sequences are used in knee OA imaging, often resulting in time-consuming MRI protocols that take 30–45 min or longer [6, 15]. In particular, in the context of large-scale clinical trials and repeated measurements, MRI acquisition can create a significant burden for patients, hospitals, and research budgets. In the context of quantitative MRI, multiple sequences also bring up the need for registration between sequences. Hence, creating more streamlined MRI protocols and reducing acquisition time are of great interest.

In the present study, we evaluated a novel MRI technique to reduce scan time in the context of knee OA: the quantitative double-echo steady-state (qDESS) sequence. qDESS generates two echoes: one echo with T1/T2 weighting (resembling proton-density contrast) and one echo with T2 weighting. It has the potential to provide diagnostic images as well as quantitative measurements (i.e., T2 maps) of the knee in a single sequence with an acquisition time less than 5 min [16, 17].

Proof-of-concept of qDESS for T2 mapping of cartilage and meniscus and structural knee assessment (using MOAKS) has been provided by Chaudhari et al [16]. Focusing on healthy subjects, they validated qDESS against routine methods for T2 measurements and MOAKS and reported high accuracy in most tissues. Also, a pilot study in 10 patients with knee OA, performed in the same work, provided promising qDESS-based T2 mapping and MOAKS outcomes, suggesting that accurate knee OA measurements are possible with qDESS [16]. Building upon this work, we further assessed the discriminative power of quantitative and structural qDESS-based biomarkers, in a larger OA cohort against radiography, widely accepted as the gold standard for knee OA imaging [18, 19]. We evaluated structural MOAKS scores and T2 measurements of the knee cartilage and meniscus in a clinical OA population. In contrast to the approach of Chaudhari and colleagues, which comprised a global assessment of cartilage and menisci, in the present study, we evaluated predefined subregions of cartilage and menisci. Regional assessment is relevant as knee OA is a focal disease with a heterogenous disease pattern [6, 20, 21].

The aim of the present study was to assess the discriminative power of a single 5-min qDESS MRI sequence for simultaneous T2 measurements of cartilage and meniscus, and structural knee OA assessment, in a clinical OA population, using radiographic knee OA as reference standard.

Methods

Study population

This study was performed with approval from our Institutional Review Board and in compliance with Health Insurance Portability and Accountability Act (HIPAA) regulations. Written informed consent was obtained from all participants after receiving full explanation about the study. Consecutive patients who were referred by the Department of Orthopedic Surgery for knee MRI at Stanford Medical Center between December 2016 and July 2017 were screened for eligibility. The eligibility criteria for this study are shown in Table 1. Based on radiographic (Kellgren and Lawrence grade (KLG) [22]) and clinical (American College of Rheumatology (ACR) criteria [23]) degree of knee OA, three subject groups were selected: subjects with no OA (KLG0 and ACR negative), subjects with mild OA (KLG2 and ACR positive), and subjects with moderate OA (KLG3 and ACR positive).

Table 1 Eligibility criteria

Scoring of radiographic knee OA

The assessment of radiographic knee OA was performed according to the KLG criteria [22], by a researcher with a medical degree and 4 years of experience in musculoskeletal imaging research (SE) who was blinded to any patient data. Standardized, weight-bearing AP radiographs were used. A second reader, a musculoskeletal radiologist with 15 years of experience (EO), also performed the KL grading in a random selection of 20 subjects from the study population to assess inter-observer reliability. To assess intra-observer reliability of the primary observer (SE), 20 randomly selected subjects from the study population were re-evaluated 14 days after initial grading.

MRI data acquisition

MR imaging was performed on one of two identical 3-Tesla MR scanners (Discovery MR750, GE Healthcare), using a 5-min 3D sagittal qDESS scan with an 8-channel transmit-receive knee coil (InVivo). qDESS generates two echoes per repetition time: S+ (with T1/T2 contrast; echo time (TE) 5.7 ms; Fig. 1a) and S− (with T2 weighting; TE 30.1 ms; Fig. 1b) [16]. The sagittal qDESS images were used to generate axial and coronal reformats (Fig. 1d–f). Sequence parameters of qDESS are described in Table 2.

Fig. 1
figure 1

Representative example of first (a) and second (b) sagittal qDESS echo in a 37-year-old female without OA, lateral compartment. In a, femoral cartilage ROI is indicated by red dots, tibial cartilage ROI is indicated by blue dots, anterior meniscal horn is indicated by orange dots, and posterior meniscal horn is indicated by green dots. c Corresponding T2 colormaps of femoral cartilage and the anterior and posterior horns of the lateral meniscus (color bar on the right shows the range of T2 values). Sagittal qDESS images are used to generate reformatted reconstructions in the axial (d, e) and coronal (f) plane. Sag = sagittal; Ax = axial; Cor = coronal

Table 2 qDESS MRI sequence parameters

Quantitative MRI analysis (T2 mapping)

The two echoes of qDESS were used to compute T2 relaxation time parameter maps, by inverting the qDESS signal model [24]. qDESS T2 measurements have shown to have high concordance with multi-echo spin echo T2 measurements [25] and limited sensitivity to T1 and signal-to-noise ratio variations in cartilage and meniscus [26]. The first echo (S+) of sagittal qDESS was used for manual segmentation of cartilage and menisci for the calculation of T2 relaxation times (Fig. 1c). Segmentation was performed on single slices, by the same researcher (SE) blinded for the patient’s clinical data. For femoral and tibial cartilage segmentation, the centermost slice through the medial and lateral femoral condyle (defined as the slice midway between the slice on which the femoral condyle was first visible and the slice on which the femoral condyle was last visible) was identified. Four cartilage regions of interest (ROIs) were defined per patient: medial and lateral femoral cartilage and medial and lateral tibial cartilage. Trochlear cartilage was not included in quantitative analysis because of the potential influence of the magic angle effect on T2 relaxation times [27].

For meniscus segmentation, the sagittal slice depicting the maximum dimension of the anterior horn and posterior horn as individual triangles was used. Four meniscus ROIs were defined per patient: the anterior and posterior horn of the medial and lateral menisci. To avoid partial volume effects of joint fluid in case of a meniscal tear, the torn area was not included in segmentation. All segmentations and subsequent T2 analyses were performed using custom in-house software created in MATLAB (version R2011b; The Math-Works).

Structural analysis of knee OA (MOAKS)

Structural, semi-quantitative assessment of cartilage and meniscus was performed using MOAKS [11] by the same researcher (SE). Both qDESS echoes with multiplanar reformatting were used. Criteria for MOAKS grading for cartilage (MOAKScartilage) and meniscus (MOAKSmeniscus), used in this study, are described in Supplementary Materials 1 and 2, respectively. We performed no second reading because high intra- and inter-observer reproducibility for MOAKS scoring using qDESS with separated echoes, especially for cartilage and meniscus, was reported in a previous study [16].

Statistical analysis

We assessed the intra- and inter-observer reproducibility for KLG scoring by calculating weighted Cohen’s kappa. Tests for normality of baseline characteristics and outcomes were performed using Shapiro-Wilk tests. Between-group differences in overall (i.e., pooled across all ROIs) T2 values and MOAKS scores were evaluated using ANOVA (for parametric data) or Kruskal-Wallis tests (for non-parametric data). In case of statistically significant differences in mean age and/or sex among the three subject groups, a multivariate model with linear regression was used to assess the potential influence of these differences on T2 values and MOAKS scores. Associations between radiographic OA and T2 values and between radiographic OA and MOAKS were assessed in predefined cartilage and meniscus ROIs, and for overall scores using correlation analysis for ordinal variables (Spearman’s correlation). Differences were considered statistically significant at p < 0.05. All statistical analyses were performed using SPSS (version 24.0.0.0, 2018).

Results

Characteristics of study population

Out of the 196 potentially eligible patients, 53 subjects were included in this study: 20 subjects with no knee OA, 18 subjects with mild knee OA, and 15 subjects with moderate knee OA. A flowchart of the selection of the study population is presented in Fig. 2. Characteristics of study participants, stratified by degree of OA, are summarized in Table 3. There was a slight overall male predominance of 60%, yet no statistically significant differences in sex distribution were found across the three subject groups. The mean age of patients with mild and moderate OA was statistically significantly higher (p < 0.001) compared with subjects with no OA. No statistically significant association between age and T2 values or MOAKS scores was found (data not shown).

Fig. 2
figure 2

Flowchart showing the selection process of the study population. In the rectangles on the right, the number and nature of exclusions are described. MR = magnetic resonance; Dec = December; OA = osteoarthritis; KL = Kellgren and Lawrence grade; ACR = American College of Rheumatology; ACL = anterior cruciate ligament

Table 3 Characteristics of the study population, stratified by degree of OA

Reproducibility of KLG scoring

Inter-observer reproducibility for scoring the degree of radiographic knee OA according to KLG was good (weighted kappa, 0.78), while intra-observer reproducibility was excellent (weighted kappa, 0.85).

qDESS T2 mapping and MOAKS in cartilage

Overall qDESS cartilage (i.e., pooled across all ROIs) T2 values were 36.1 ± SD 4.3, 40.6 ± 5.9, and 47.1 ± 4.3 ms for no, mild, and moderate OA, respectively. The delta value (difference) in T2 was 4.6 ms between no OA and mild OA and 6.5 ms between mild OA and moderate OA. Overall qDESS cartilage T2 values were similar to T2 values in previous literature (33.8–38.8, 34.9–41.8, and 40.5–46.9 ms for no, mild, and moderate OA, respectively [7, 16, 28]). Differences in qDESS T2 values were statistically significant between the three subject groups (p < 0.01; Fig. 3a). Likewise, overall MOAKScartilage scores were consistently higher with increasing stages of OA with statistically significant differences found between the three subject groups (p < 0.001; Fig. 3b). The delta value (difference) in MOAKScartilage was 4 between no OA and mild OA and 6.8 between mild OA and moderate OA. A representative example of qDESS MOAKScartilage findings in a subject with moderate OA, compared with a corresponding fat-suppressed T2-weighted image, is provided in Fig. 4. Osteophytes were not included in the analyses of the present study, but they were identified on qDESS images. Subchondral cysts and surrounding bone marrow lesions (BMLs) were not included in the analyses of this study but identified as well (see Fig. 4). Overall qDESS T2 and MOAKS scores for cartilage, stratified by degree of OA, are summarized in Table 4.

Fig. 3
figure 3

Discriminative power of quantitative and structural qDESS-based measurements in cartilage. Statistical significantly differences in (a) cartilage T2 and (b) MOAKScartilage scores were found among subject groups. Data is shown as overall mean values (pooled across all ROIs); vertical bars represent standard deviation. Horizontal bars represent statistically significance between two subject groups; **p < 0.01, ***p < 0.001, ****p < 0.0001. ms = milliseconds; OA = osteoarthritis; ROI = region of interest

Fig. 4
figure 4

Example of MOAKScartilage assessment in a 71-year-old male with moderate OA on qDESS images (a, b), compared with corresponding fat-suppressed T2-weighted image (c) (TE 54 ms; flip angle 142°; FOV 14 cm; matrix 384 × 192). Sagittal images of first (a) and second (b) qDESS echo show thinning of medial femoral cartilage (dotted arrow). Subchondral cysts and surrounding BML (dashed arrow) and osteophytes (triangles) were not included in the analysis of the present study, but they were identified on qDESS images. Note the underestimation of BML size on qDESS images compared with T2-weighted image. OA = osteoarthritis; BML = bone marrow lesion

Table 4 Cartilage T2 values and MOAKScartilage scores per ROI and overall scores, and their correlation with radiographic degree of OA

T2 mapping and MOAKS in menisci

In menisci, overall (i.e., pooled across all ROIs) qDESS T2 values were 15 ± SD 3.6, 17.5 ± 3.8, and 20.6 ± 4.7 ms for no, mild, and moderate OA, respectively. The delta value (difference) in T2 was 2.5 ms between no OA and mild OA and 3.1 ms between mild OA and moderate OA. Overall qDESS meniscus T2 values were similar to T2 values in previous studies (11.4–21.3, 13.5–22.4, and 16.8–24.2 ms for no, mild, and moderate OA, respectively [7, 16, 29]). Differences in qDESS T2 values were statistically significant between the three subject groups (p < 0.01; Fig. 5a). Differences in qDESS MOAKSmeniscus scores were statistically significant between the three subject groups (p < 0.001; Fig. 5b), except for the difference in MOAKSmeniscus scores between subjects with mild and moderate OA. The delta value (difference) in MOAKSmeniscus was 2.2 between no OA and mild OA and 1.5 between mild OA and moderate OA. An example of qDESS MOAKSmeniscus assessment in a subject with mild OA, compared with a corresponding proton-density-weighted image, is provided in Figure S1. Overall qDESS T2 values and MOAKS scores for menisci, stratified by degree of OA, are summarized in Table 5. With regard to meniscus extrusion, the presence of meniscus extrusion was consistent with the degree of OA. We found a medial extrusion of 0.3 ± SD 0.1, 0.9 ± 0.3, and 1.1 ± 0.3 in non-OA subjects, subjects with mild OA, and subjects with moderate OA, respectively. A lateral extrusion of 0.0 ± SD 0.0, 0.4 ± 0.2, and 0.7 ± 0.3 was found in non-OA subjects, subjects with mild OA, and subjects with moderate OA, respectively. Statistically significant differences in medial and lateral extrusion grade were found among the three subject groups (p = 0.04 and p = 0.03 for medial and lateral extrusion, respectively).

Fig. 5
figure 5

Discriminative power of quantitative and structural qDESS-based measurements in menisci. Statistical significantly differences in meniscus T2 (a) and MOAKSmeniscus (b) scores were found among subject groups. Data is shown as overall mean values (pooled across all ROIs); vertical bars represent standard deviation. Horizontal bars represent statistically significance between two subject groups; **p < 0.01, ***p < 0.001, ****p < 0.0001. ms = millisecond; OA = osteoarthritis; ROI = region of interest

Table 5 Meniscus T2 values and MOAKSmeniscus scores per ROI and overall scores, and their correlation with radiographic degree of OA

qDESS T2 mapping and MOAKS in cartilage and meniscus ROIs

qDESS T2 values and MOAKS scores for each cartilage and meniscus ROI, stratified by degree of OA, are summarized in Tables 4 and 5, respectively. In all cartilage and meniscus ROIs, statistically significant correlations were found between qDESS T2 values and radiographic OA and between MOAKS scores and radiographic OA. The strongest correlation (r = 0.71) between MRI findings and radiographic OA was found in the medial femoral cartilage; the weakest correlation (r = 0.29) was found in the anterior horn of the medial meniscus.

Discussion

In the present study, we demonstrated that quantitative and structural measurements in cartilage and meniscus, obtained with a single 5-min qDESS sequence, can differentiate between OA stages. T2 values in cartilage and menisci were similar to T2 values reported in previous studies [5,6,7,8].

The disease distribution of OA within the knee joint is often compartmental, with high variability regarding compartmental involvement [6, 20, 21]. Therefore, we assessed the validity of qDESS-based biomarkers in various cartilage and meniscus ROIs. The discriminative power to distinguish degree of OA was the greatest in the medial femoral cartilage, and the least in the anterior horn of the medial meniscus. These findings were most likely caused by the uneven distribution of OA features; the anterior horn of the medial meniscus showed relatively low T2 values and MOAKS scores in subjects with mild or moderate OA while the medial femoral cartilage showed relatively high T2 values and MOAKS scores in those subjects. Despite the differences in discriminative power, T2 values and MOAKS outcomes in all ROIs were found to be statistical significantly correlated with radiographic knee OA.

The qDESS sequence in the present study was optimized to simultaneously generate high-resolution images and quantitative measurements, by combining high spatial resolution with high SNR, in one single, rapid scan. While twice as fast, the resolution and voxel volume of this qDESS sequence (0.18 μL) was over 10x better than the resolution of established quantitative T2 sequences [7, 30]. In a previous study, qDESS has shown high T2 accuracy compared with multi-echo spin echo sequences, as well as high accuracy for MOAKS measurements compared with conventional spin echo–based sequences, with high intra- and inter-observer reproducibility [16, 25]. qDESS has been thought to underestimate the size of bone marrow lesions (BMLs), which seems to be the case in our study as well (see Fig. 4, not studied), likely due to T2* susceptibility effects [15]. A separation of the two qDESS echoes may enhance accuracy of BML detection compared with previous qDESS studies [31]. Although outside the scope of this study, further work is needed to test and optimize BML detection with qDESS.

Building upon the work of Chaudhari et al [16], the present study assesses the discriminative power of a 5-min qDESS sequence to obtain T2 values and MOAKS in a clinical knee OA population. We validated T2 measurements and MOAKS against radiographic OA, which remains the gold imaging standard for diagnosing and monitoring knee OA [18, 19]. In OA research, KLG2 is considered the cut-off point for the presence of radiographic knee OA [4, 18, 19, 32]. Although potentially a relevant group in the context of early OA imaging, we did not include patients with KLG1, indicating doubtful radiographic OA. The reproducibility of scoring KLG1 (i.e., doubtful narrowing of joint space and possible osteophytic lipping) is relatively poor, most likely due to differences in the interpretation of radiographic findings, especially concerning osteophytic lipping [18]. Also, patients with severe radiographic OA (i.e., KLG4) were not included in the present study, as bony deformity and bone-to-bone contact precludes accurate segmentation of cartilage.

OA is among the top ten burdensome diseases, with the knee being the most affected joint [1]. In the light of increased numbers associated MR imaging studies [2, 33], reducing MR imaging acquisition time is highly relevant. Reducing scan time saves costs and increases patient comfort and may reduce motion artifacts in longer acquisitions [16]. Because qDESS rapidly provides rich structural and quantitative information, there is a great promise for using this technique in large clinical OA studies. Recent advances in deep learning and simultaneously imaging both knees with qDESS may further reduce scan time, without loss of image quality or quantitative accuracy [34,35,36].

This study has some limitations that must be acknowledged. First, segmentation of quantitative analysis and MOAKS scoring was performed by a single, experienced researcher. As evidence of high intra- and inter-observer reproducibility for cartilage and meniscus segmentation and MOAKS assessment with qDESS images has been reported previously [16], analyses performed by a single researcher were considered sufficient. Second, our validation study was cross-sectional. The lack of a longitudinal aspect may limit interpretation regarding the potential use of qDESS in clinical trials. Therefore, future studies on the sensitivity of qDESS-based biomarkers for longitudinal changes in the knee are required. Third, KLG was used as reference standard, which is considered the gold standard for imaging-based knee OA classification [4]. Radiographically detected joint space narrowing (JSN) is currently the only structural endpoint accepted by the European and US regulatory bodies (European Medicines Agency and FDA) to assess knee OA progression [37] and is commonly used in qMRI validation studies [6, 7]. We opted for this method because we aimed to explicitly use qDESS in a clinically relevant matter. However, an important drawback of the KLG method is the low reproducibility of JSN measures reported in literature, in particular in longitudinal assessment of knee OA [4, 38]. Given the cross-sectional design of our study without longitudinal measures, challenges concerning longitudinal KLG measures are unlikely. To optimize reproducibility, we used standardized radiographs (weight-bearing AP). To assess reproducibility, both inter- and intra-observer reproducibility of KLG were carefully evaluated in the present study (weighted kappa of 0.78 and 0.85 for inter- and intra-observer reproducibility, respectively). Finally, although osteophytes and BMLs are important OA features, they were not studied. The primary objective of this study was to assess the validity of qDESS for cartilage and menisci in OA subjects. We focused on those tissues as they have conclusively been shown to be strong indicators for OA and because of their possibilities in both quantitative (T2) and semi-quantitative (MOAKS) [4, 7, 8, 11, 39]. To assess the external validity of our study results, further studies evaluating other relevant OA features will be essential, in particular regarding BML detection. In addition, future validation studies on qDESS T2 values in OA patients against histological degree of degeneration (the gold standard for tissue changes) are desirable.

In conclusion, quantitative T2 and structural assessment of cartilage and meniscus with a single 5-min qDESS scan can distinguish between different grades of OA and show significant correlations with the reference standard. These results demonstrate the potential of qDESS as an efficient and accurate imaging tool for OA research.