Introduction

Magnetic resonance imaging (MRI) of the ankle is frequently used to investigate pain of unknown origin and characterization of painful conditions [1,2,3,4]. The complexity of the ankle anatomy with oblique orientations of numerous cartilaginous, ligamentous and tendinous structures [5,6,7] often requires multiple planes when using two-dimensional (2D) TSE techniques. In contrast, three-dimensional (3D) TSE techniques can acquire isotropic data sets that are capable of axial, sagittal, coronal, oblique and curved planar reformations from single datasets to optimize the display of ankle structures [7,8,9,10].

Despite these favorable attributes, 3D TSE techniques thus far have required trade-offs between long acquisition times, edge blurring, limited anatomical coverage and suboptimal image contrast [8, 11,12,13,14]. The implementation of a CAIPIRINHA (Controlled Aliasing In Parallel Imaging Results IN Higher Acceleration) sampling pattern with a shifted, bi-directional, two-fold accelerated, parallel imaging scheme into 3D SPACE (Sampling Perfection with Application optimized Contrast) TSE remedies many of the shortcomings by affording higher accelerations factors, image contrast and quality similar or better to 2D TSE images, minimized aliasing artifacts and reduced image noise [15, 16].

The purpose of our study was to test the hypothesis that MRI of the ankle with a 10-min 3D CAIPIRINHA SPACE TSE protocol is at least equivalent for the detection of painful conditions when compared to a 20-min 2D TSE standard of reference protocol.

Material and methods

Study design

Our institutional review board approved our study. Written informed consent was obtained from all patients for prospective data collection and use of investigational pulse sequences. None of the patients included in this study have been previously reported.

We recruited patients from our orthopedic practice who presented with painful ankle conditions that met the criteria for ankle MRI [17, 18]. Inclusion criteria were an agreement to participate in our study and adult age. Exclusion criteria were a degree of ankle swelling preventing positioning into an ankle coil, inability for supine positioning, severe claustrophobia and general contraindications for MRI (Fig. 1). All subjects underwent MRI of the ankle in one session using both, our standard 2D TSE MRI pulse sequence protocol and investigational 3D TSE prototype protocol (Table 1).

Fig. 1
figure 1

Study design

Table 1 MRI study protocol

Subjects

Between January 2016 and June 2016, 70 subjects (46 men, 24 women; mean age, 42 years, range: 18–72 years; mean body mass index, 27 kg/m2, range: 18–35 kg/m2) were prospectively included for unilateral MRI of the ankle.

MR imaging technique

All MRI exams were performed on a commercially available, clinical wide-bore 3 Tesla MR imaging system (MAGNETOM Skyra, Siemens Healthcare, Erlangen, Germany) with a commercially available boot-shaped foot and ankle surface coil with one transmit channel and 16 receiver channels (Siemens Healthcare). All subjects underwent the MRI study protocol with a total acquisition time of 30 min, which consisted of a 3D TSE pulse sequence protocol with an acquisition time of 10 min and a 2D TSE pulse sequence protocol with an acquisition time of 20 min (Table 1). Either the 2D or 3D protocol portion was acquired first with equal proportions throughout the study. The investigational 3D protocol was based on a 2 x 2-accelerated 3D SPACE TSE sequence prototype that utilized a shifted, bi-directional parallel imaging and CAIPIRINHA sampling pattern [16]. One non-fat-suppressed intermediate-weighted (IW) and one fat-suppressed T2-weighted (T2FS) 3D CAIPIRINHA SPACE TSE dataset with isotropic voxel resolution were acquired. Our institutional 2D TSE protocol for MRI of the ankle served as the standard of reference, which is characterized by fully sampled data sets without undersampling or acceleration techniques to ensure the highest image quality. All sagittal 2D and 3D TSE pulse sequences were aligned to the anteroposterior axis of the talus and rotated 45° in the sagittal plane, which resulted in an anterosuperior to the posteroinferior in-plane phase-encoding direction across the ankle that obviated the use of phase oversampling. The final images were digitally rotated back on the MRI console immediately after completing the pulse sequence. To avoid phase wrap artefacts in the slice partition direction, the contralateral ankle was either elevated above the surface coil or pulled back to reside proximal to the surface coil.

Image evaluation

The 70 pairs of 2D and 3D TSE datasets were separated, anonymized and randomized, resulting in 140 individual studies. Each of those 140 studies was evaluated by two fellowship-trained, full-time musculoskeletal radiologists, with 5 years and 10 years of experience in interpreting musculoskeletal MRI examinations, respectively. Each reader evaluated the studies independently. Before the evaluation, both readers underwent a joint training session with 15 complete studies that were not included in our study. Readings were performed in a standardized condition [19]. All data sets were displayed with a 2 x 3 viewport layout. For 2D TSE, individual image stacks were displayed, whereas 3D data sets were displayed using the interactive multi-planar reformation mode of the viewer software with the original isotropic voxel size (MERLIN Diagnostic Workcenter, Phönix-PACS GmbH).

Outcome variables

Variables included the degree of motion, overall image quality defined as the reader’s subjective impression of the suitability of the study for accomplishing an MRI interpretation of the ankle, frequency of structural abnormalities, inter-reader reliability defined as the degree of agreement between the two readers for assessments made, inter-method concordance defined as the degree of agreement of readers’ diagnoses between corresponding 2D and 3D studies, diagnostic definitiveness defined as the degree of readers’ certainty for diagnoses, and interchangeability defined as common pair exact match proportions between readers of different combinations of 2D and 3D studies [20].

For the assessments of the degree of motion and overall image quality, equidistant 5-point Likert scales were used (Table 2).

Table 2 Qualitative assessment

For evaluation of the frequency of structural abnormalities, a total of nine bones, six joints, 12 ligaments and ten tendons were assessed. Bones were assessed for fractures, bone marrow edema or both, whereas tendons and ligaments were assessed for tears, using an equidistant 5-point Likert-scale as certainly not, probably not, equivocal, probably yes and certainly yes. A fracture was defined as discontinuity of cortical bone or a sharp hypointense osseous line on IW and T2FS images with surrounding bone marrow edema. Bone marrow edema was defined as ill-defined osseous signal hyperintensity on T2FS images. Tears were defined as greater than 50% short-axis fibre disruption and long-axis splits. Each joint was evaluated separately for the absence (modified Noyes grade 0 and 1) or presence (grade 2 and 3) of cartilage defects [21, 22]. In addition, each reader rated their level of confidence as less than 50%, 50–90% and > 90% certainty.

Statistical analysis

Statistical analyses were performed using R version 3.3 with the lme4 and epiR packages (http://cran.r-project.org). Motion and diagnostic quality assessments are given as the median value with minimum, first–third quartile and maximum, and the chi-squared test was used to test for reader differences. For the analysis of frequencies and proportions, reader responses were dichotomized into ‘present’ (for certainly yes and probably yes) or ‘absent’ (equivocal, probably no and certainly no). All assessments were performed on a location-by-location basis; for example, for cartilage defects the same joint had to have a defect to be considered as having an agreement. Differences in the frequencies of structural abnormalities on 2D and 3D MRI were tested with McNemar’s test. Inter-reader reliability of measurements was assessed with the exact agreement and Kendall’s coefficient of concordance, W using dichotomized data. Kendall’s coefficient of concordance, W was qualified into poor (0–0.20), fair (0.21–0.40), moderate (0.41–0.60), good (0.61–0.80) and very good (0.81–1.00), paralleling the criteria that were defined for Cohen’s kappa [23]. The degree of concordance between 2D and 3D TSE assessments was assessed with dichotomized average reader values due to a high degree of inter-reader reliability using exact agreement and Kendall’s coefficient of concordance, W. The diagnostic definitiveness was used to describe the proportion of times that a common diagnosis was given more strongly after evaluation of 2D TSE or 3D TSE images, whereas cases with a discrepant finding between 2D and 3D images were excluded for this part of the analysis. For comparison of diagnostic definitiveness of each MRI technique, average reader values were used to describe the proportions of certainty levels of 2D and 3D MRI in the case of an agreement on the presence or absence. Interchangeability of 2D and 3D techniques for diagnoses of structural abnormalities was assessed with likelihood ratio testing, using proportions of exact matches when both readers were reading 2D studies, one reader was reading 2D TSE and one 3D studies, and both readers were reading 3D studies and final testing for significant differences. P-values of less than 0.05 were considered statistically significant.

Results

There was no substantial motion degradation of either 2D TSE (5 [2, 5]) or 3D TSE (5 [4, 5]) studies (p = 0.368), with very good inter-reader agreement (Kendall W, 0.889).

3D TSE studies had non-significantly (p = 0.441) higher image quality ratings (5 [2, 4, 5]) than 2D TSE studies (4 [3, 5]), with very good inter-reader agreement (Kendall W, 0.816). The average ratings for the overall image quality for 3D TSE exceeded the rating of 2D TSE in 51/70 (73%) of cases, whereas the overall image quality ratings for 2D TSE exceeded the rating of 3D TSE in 5/70 cases (7%).

The frequencies for the detection of structural abnormalities and inter-reader reliabilities are given in Table 3, which are based on the total number of 420 cartilage segments, 840 ligaments, 630 tendons and 630 bones (Figs. 2, 3, 4, 5 and 6). Reader 1 diagnosed 6% fewer and reader 2 diagnosed 9% more cartilage defects with 2D MRI when compared to 3D MRI. In-between readers, the difference of frequencies of cartilage lesions was 19% for 2D MRI and 4% for 3D MRI (p < 0.001). The overall difference between 2D and 3D MRI for the diagnosis of cartilage lesions was 1%, which includes normal cartilage diagnoses (p = 0.440). Both readers detected 3% more ligament tears with 3D MRI than 2D MRI (p < 0.001), while the detection rate differences of tendon tears (p = 0.125) and osseous bone abnormalities (p = 0.729) were 0–1 %. Overall, there was very good inter-reader agreement for 2D MRI (Kendall W, 0.925; range, 0.657–0.948) and 3D MRI (Kendall W, 0.936; range, 0.778–0.958).

Table 3 Frequency of structural abnormalities and inter-reader reliability
Fig. 2
figure 2

A 57-year-old man with chronic right heel pain. Sagittal intermediate-weighted 2D (a) and 3D (b) MR images and sagittal fat-suppressed T2-weighted 2D (c) and 3D (d) MR images show an insertional partial-thickness tear of the Achilles tendon (white arrow), fluid in the retrocalcaneal bursa (asterisk) and intra-ligamentous heterotopic ossification (grey arrow)

Fig. 3
figure 3

A 34-year-old man with acute left ankle inversion-type injury. Axial intermediate-weighted 2D (a) and 3D (b) MR images and axial fat-suppressed T2-weighted 2D (c) and 3D (d) MR images show a complete tear of the anterior talofibular ligament (arrow)

Fig. 4
figure 4

A 42-year-old woman with chronic plantar pain. Sagittal fat-suppressed T2-weighted 2D (a) and 3D (b) MR images show a non-insertional tear of the central cord of the plantar fascia (arrow) on a background of focal fibrosis

Fig. 5
figure 5

A 55-year-old woman with chronic medial ankle pain. Axial fat-suppressed T2-weighted 2D (a) and 3D (b) MR images show a split tear of the posterior tibial tendon (arrow)

Fig. 6
figure 6

A 27-year-old man with an acute ankle injury. Sagittal intermediate-weighted 2D (a) and 3D (b) MR images and coronal fat-suppressed T2-weighted 2D (c) and 3D (d) MR images show a non-displaced, subchondral fracture (arrow) of the anterior tibial plafond

There was overall very good inter-method concordance (Kendall W, 0.919; range, 0.732–0.947) between 2D and 3D MRI, with the highest concordance for osseous abnormalities and lowest concordance for ligamentous injuries (Table 4).

Table 4 Inter-method concordance between 2D and 3D MRI

The diagnostic definitiveness of readers for cartilage assessments was higher for 3D MRI than 2D MRI 27% of the time, while the reverse was true 11% of the time. Table 5 presents the diagnostic definitiveness for ligaments, tendons and bones as the proportion of times the confidence of 3D TSE reader responses exceeded the 2D TSE reader responses and vice versa for the cases where both readers agreed on the presence or absence of findings. For the assessments of ligaments, tendons and bones, the confidence of 3D reader responses exceeded the 2D reader response in 10% of cases, whereas the confidence of 2D reader responses exceeded the 3D reader response in 7% of cases (p < 0.01). The proportions of excluded cases on which both readers disagreed on the diagnosis on 2D and 3D images were 5.5% for ligaments, 1.4% for tendons, 5.5% for bones and 4.2%.

Table 5 Diagnostic definitiveness

The interchangeability analysis (Table 6) shows that the common pair exact match proportions were similar between 2D and 3D studies when readers diagnosed bone abnormalities, which indicates bi-directional interchangeability of 2D and 3D technique. The common pair exact match proportions were significantly higher (p < 0.05) for the 3D technique when readers diagnosed abnormalities of cartilage, ligaments and tendons, indicating higher internal reliability of the 3D technique and unidirectional interchangeability, meaning that only the 3D technique can replace the 2D technique.

Table 6 Interchangeability analysis

Discussion

We compared a novel 10-min 3D CAIPIRINHA SPACE TSE protocol to a 2D TSE standard of reference protocol for the diagnosis of painful ankle conditions. Readers detected more ligament tears with 3D than 2D MRI, while the overall differences for cartilage, tendons and osseous abnormalities were small. The inter-method concordance and inter-reader reliability were overall high, whereas the diagnostic definitiveness and internal reliability were higher for 3D CAIPIRINHA SPACE TSE than 2D TSE images, indicating that the 3D CAIPIRINHA SPACE TSE protocol is at least equivalent and can replace the 2D technique for the detection of painful ankle conditions.

Owing to rising economic constraints and the demand of MRI for non-invasive diagnosis of painful ankle conditions, there is a need for faster MRI acquisition without comprising diagnostic accuracy. 10-minute MRI of the ankle offers short table times for patients and can contribute to a high institutional throughput and potentially compensate for decreasing reimbursements. The 10-min acquisition time of our 3D TSE protocol was mainly based on a 2 x 2 acceleration with a shifted, bi-directional CAIPIRINHA undersampling in phase and partition encoding directions [24, 25]. This approach optimized the use of coil sensitivities, which helped to minimize aliasing artefacts and reduce image noise [16]. The isotropic 3D TSE voxel size allowed for the interactive reformation of the standard axial, sagittal and coronal as well as oblique planes.

When compared to 2D TSE, the higher spatial resolution that is achievable with 3D TSE pulse sequences relates to signal gains from volume, rather than slice selective excitations, and the number of additional phase-encoding steps of the slice partitions. However, the number of slice partitions results in longer acquisition times, which can range between 20 and 30 min for fully sampled data sets. Applying different acceleration strategies and techniques, previous 3D TSE studies investigating high-resolution MRI of the ankle reported acquisition times of 5–8 min for a single 3D TSE dataset [8, 9, 13, 14, 26, 27]. Applied strategies included long echo train lengths of up to 100 [13, 14, 26], anisotropic voxel size acquisitions [14, 27], in-directional parallel imaging acceleration of up to three [8, 13] and partial Fourier undersampling [8, 9, 13, 26], which are all inversely proportional to the acquisition times. While those techniques in combination can mathematically achieve net accelerations of three to six, they result in various but often meaningful degradation of image quality including decreased signal-to-noise ratios, increased image blurring and aliasing artefacts. Additional strategies to reduce the acquisition time included limited field-of-view dimensions and a reduced number of slice-encoding partitions that cover specific structures selectively, rather than the entire ankle [8, 13, 26].

The bi-directional acceleration scheme of the 3D CAIPIRINHA SPACE pulse sequence prototype affords a fourfold acceleration through parallel imaging without degrading noise levels or aliasing artefacts [15]. The afforded reduction of acquisition time allowed us to limit the echo train length to 42 and 54 to minimize image blurring, acquire truly isotropic voxels to prevent degrading extrapolation effects and avoid partial Fourier undersampling in both phase-encoding directions, which degrade image quality to various degrees. Additionally, we could realize whole ankle coverage with a sagittal plane field-of-view of 16 x 16 cm, which extended from the muscle-tendon junction of the Achilles tendon to the midfoot, and mediolateral skin-to-skin coverage of 9 cm, providing identical coverage than our 2D TSE protocol. Lastly, driven equilibrium pulse technique to restore the longitudinal magnetization, low repetition times of 1,000 and 1,100 ms, and the use of elliptical k-space sampling resulted in additional time savings [16].

Previous studies have suggested equal or better visualization capabilities of 3D TSE MRI of the ankle for cartilage lesions as well as tendon and ligament tears when compared to 2D TSE ankle MRI [8, 9, 13, 14, 16, 26, 27]. Our study showed a high inter-method concordance for the detection of abnormalities, whereas the inter-reader reliability was higher for 3D TSE images. For evaluation of cartilage lesions, however, there was a notable discrepancy between readers. Reader 1 detected fewer and reader 2 detected more cartilage lesions with 2D than 3D MRI, whereas the inter-reader reliability was notably higher for 3D images. On the other hand, both readers detected more ligament tears with 3D than 2D MRI with overall low inter-reader reliability, but higher for 3D MRI. While the lower 2D inter-reader reliability for diagnosing cartilage defects may relate to different sensitivity levels of the readers, the thin articular cartilage layers of the ankle and a lack of robustly designed cartilage grading systems with previously reported low inter-reader reliabilities were likely contributing factors [28]. The high inter-reader reliability and diagnostic definitiveness of 3D TSE for both cartilage lesions and ligament tears suggest an advantage of 3D TSE. The 3D CAIPIRINHA SPACE pulse sequence is Conformité Européenne marked and received US Food and Drug Administration 510(k) clearance. Following a transition period of acquiring both 2D and 3D techniques, we have since replaced our conventional 2D protocol with the new 3D protocol for MRI of the ankle at the Johns Hopkins Medical Institutions.

Similar to our results, improved detectability of syndesmotic and lateral collateral ligaments tears on 3D TSE images has been suggested before [9, 14, 27], which likely relates to limited partial volume effects and the ability for interactively align image planes to obliquely oriented ligaments. As multi-dimensional image plane rotation is currently a standard interactive feature of professional image viewers, the additional time required to perform these tasks is small. Therefore, we believe that replacing an existing 2D TSE protocol with the more time-efficient 3D CAIPIRINHA SPACE TSE protocol is feasible in a clinical setting and can be recommended for the purpose of reducing scan time without degrading image quality and detection of structural abnormalities.

Our study has limitations. First, there was no standard of reference to confirm the detected anatomical abnormalities. Surgical exploration is often used as a standard of reference when assessing injuries in other joints such as the knee and the hip. In the ankle joint, however, only a few abnormalities, such as osteochondral defects of the talus and complete tendon ruptures, are amenable for a definitive and accurate intra-operative diagnosis, while surgical exploration frequently does not provide a diagnosis of certainty for most of the ligamentous and chronic tendinous ankle injuries. As such, we used statistical metrics to compare both techniques. While this approach can determine equivalence, the superiority of one technique over the other can only be suggested. Second, while both readers were blinded to corresponding 2D and 3D data sets, the differences in the number of slices and spatial resolutions allowed readers to recognize the technique. Therefore, personal preferences may have influenced the study results. Both readers did not, however, use 3D TSE techniques in their daily clinical practice. Third, the novel 3D TSE protocol was compared to our institutional standard 2D TSE protocol for MRI of the ankle. However, MRI ankle protocols vary in part substantially between institutions, and therefore sequences (for example by the inclusion of a T1-weighted pulse sequence), acquisition times and image quality of our study may be different from other established protocols.

In conclusion, a 10-min 3D CAIPIRINHA SPACE MRI protocol produces equal or better image quality and the performance for the detection of painful ankle conditions is similar when compared to a 20-min 2D TSE MRI reference standard. Significantly higher inter-reader reliability and reader definitiveness indicate improved visibility of abnormalities on 3D TSE images, which may have increased accuracy for the detection of ligament tears.