Besides surgery, neoadjuvant treatment with chemo- and/or radiotherapy has become a crucial part of the treatment of rectal cancer. The choice of the type of neoadjuvant treatment and surgery is based on preoperative imaging. During the past decade magnetic resonance imaging (MRI) has been proven to be the most accurate staging modality for primary rectal cancer [1]. With preoperative imaging the T-stage, involved circumferential resection margin (CRM), nodal involvement, and location of the tumor are evaluated, which determine the type of (pre)operative treatment. So far, MRI has not been able to accurately predict T-stage unless an endorectal coil is used [2, 3]. However, an endoluminal technique leads to less visibility of surrounding structures in the mesorectum because of the limited field of view. Furthermore, it is difficult to apply this technique in high and stenosing tumors. Therefore, endorectal MRI has been less adopted for staging small rectal tumors than endorectal ultrasound (EUS) [4, 5]. EUS has equally good accuracy as endorectal MRI for identification of T1 tumors, is less cumbersome and is therefore the exam of choice for staging superficial rectal cancer [6]. It is, however, less accurate for staging of the more advanced tumors, in specific those with threatened CRM because EUS cannot identify the mesorectal fascia. Therefore, MRI with phased-array coils is widely adopted as the most accurate technique for staging of these bulky T3 and T4 tumors with sensitivities ranging from 74% to 94% and specificities from 74% to 76% [7]. It is the borderline tumor, the T2 or small T3 tumor, that remains a diagnostic challenge. The distinction between T2 and borderline T3 tumors has important implications for clinical decision making. When a tumor is confined to the bowel wall (T1-2) the patient will undergo surgery immediately without neoadjuvant treatment, but when the tumor penetrates the bowel wall (T3) the current trend is to administer preoperative neoadjuvant therapy (radiotherapy with or without chemotherapy) prior to surgery. Furthermore, when a tumor is confined to the bowel wall local excision—leading to less morbidity and mortality than with standard surgery—could be considered [8], whereas in tumors outgrowing the bowel wall this is not possible. A large proportion of the pT2 tumors is overstaged, because desmoplastic benign reactions are mistaken for tumoral extensions associated with a borderline pT3 tumor [9]. Overstaging would thus lead to overtreatment of patients with T1-2 tumors, with increased risk for treatment related morbidity and mortality.

Most studies evaluating the accuracy of phased-array MRI for rectal cancer staging have focused on 1.5 Tesla (1.5T) MR units. MR imaging at higher field strength (3 Tesla) could improve the diagnostic performance of phased-array MRI for T-stage prediction. Three Tesla (3T) MRI has a theoretical two-fold increase of signal-to-noise ratio compared to 1.5T MRI [10]. This could allow for better resolution and enhanced image quality, resulting in a more accurate distinction between T2 and borderline T3 tumors. Only few studies exist on 3T MRI for T-stage prediction in rectal cancer. To the best of our knowledge, no report exists on the intra-patient comparison of the diagnostic performance of 3T MRI with that of 1.5T MRI [9, 1116].

Therefore, the aim of this study is to determine, with an intra-patient comparison, whether 3T MRI compared with 1.5T MRI leads to improvement of accuracy for the differentiation between tumors confined to the bowel wall (T1-2) and tumors penetrating the bowel wall (T3) and to evaluate reproducibility.

Materials and methods

Patients

Between November 2007 and July 2008, 13 patients with primary non-locally advanced rectal cancer who underwent MR imaging of the pelvis at 1.5 and 3T (with a mean interval of 19.8 ± 10.8 days) were included. The 3T MRI scans were performed as part of another MRI study protocol, which was approved by the local institutional review board. Inclusion criteria were: (1) patients had to have undergone pelvic MRI at 1.5 and 3T, (2) had non-locally advanced rectal cancer and underwent 5 × 5 Gray followed by immediate surgery, which has been shown to have no downstaging effect on the tumor [17]. Non-locally advanced rectal cancer was defined as tumors with a T-stage up to borderline T3 and uninvolved CRM, based on histology after surgery, which was the reference standard.

MRI techniques

Patients were placed in feet first supine position. No antiperistaltic agent or endorectal filling was administered prior to imaging. Phased-array multichannel coils were used for signal reception. The 1.5T MR unit was either an Intera unit (n = 5, Philips Medical Systems, Best, The Netherlands) or an Avanto unit (n = 8, Siemens Medical Solutions, Erlangen, Germany). The MR sequences followed the standard rectal MR protocol as described in the literature and consisted of 2D T2-weighted fast spin echo (FSE) sequences (TR/TE 3500-4780/94-150 ms, with in plane resolution ranging from 1.14 × 0.78 to 1.09 × 1.56 mm2, slice thickness 3–5 mm) in two planes: sagittal and axial. Acquisition time per sequence ranged from 3:17 to 5:08 min. All patients underwent a second scan at a 3T MRI unit (Intera Achieva (software release 1.5.4.0); Philips Medical Systems, Best, The Netherlands) with standard 2D T2-weighted FSE sequences (TR/TE 3107/150 ms, in plane resolution 0.65 × 1.06 mm2, slice thickness 4 mm) in two planes: sagittal and axial. Acquisition time per sequence was 2:52 min. This sequence was based partially on the sequence as published in the literature by Kim et al. [14].

Image analysis

Three readers with different experience in reading rectal cancer MRI retrospectively evaluated the images, independently and blinded for each other’s and histological results. Image evaluation was performed according to earlier published criteria [18, 19]. Reader 1 has 7 years of specific experience in reading rectal MR in a referral hospital. Reader 2 is an experienced MR rectum reader who has 5 years of experience in a referral hospital. Reader 3 is a general radiologist who works in a general hospital, without specific MR rectum experience.

All readers evaluated the MR images for the T-stage as follows: the likelihood for tumor confined to the bowel wall (T1-2) was scored using a confidence level system ranging from: 0 = definitely outgrowing wall, 1 = probably outgrowing wall, 2 = possibly confined to the wall, 3 = probably confined to wall, and 4 = definitely confined to wall. All readers evaluated the images at the same Picture Archiving and Communication System.

All readers evaluated the 1.5T scans first. After a 30 min interval, the 3T images were evaluated at random patient order, besides blinding for each others’ and histology results, the readers were also blinded for the 1.5T results. These evaluations were used for analysis and comparison of the accuracy between 1.5 and 3T MRI for T-stage prediction.

Furthermore, for reader 2 an additional scoring procedure of both 1.5 and 3T scans was organized to determine intra-observer agreement (with an interval of 2 weeks between scoring procedures).

Histopathologic examination

Results for T-stage prediction by all readers at both field strengths were compared with histopathological staging of the surgical specimen, which was the reference standard. Surgery was performed directly subsequent to the short course of radiation. The surgical specimens were handled according to standard clinical practice as advocated by Quirke et al. [20] and were pathologically examined in accordance with the Tumor Node Metastasis staging system.

Statistical analysis

For statistical analyses Statistical Package for the Social Sciences (SPSS, version 15, Inc., Chicago, IL) was used. Receiver operator characteristics (ROC) curve analyses were performed and areas under the curve (AUC) with 95% confidence intervals (CI) were calculated. AUCs were compared according to the method as described by DeLong [21] using Stata software (Stata, release 9.2; StataCorp, College Station, Texas) to determine whether the difference in diagnostic performance between both field strengths was statistically significant. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) with 95%CI were calculated for all readers on both field strengths. For these analyses, it was decided at the start of the study to use a cut-off level between confidence level 2 (possibly confined to the bowel wall) and 3 (probably confined to the bowel wall), with tumor confined to the bowel wall as the positive outcome measure. Mean overstaging rate of tumors limited to the bowel wall and understaging rate of tumors outgrowing the bowel wall were calculated for all the three readers. The paired samples t test was used to test whether the difference in mean over- and understaging rates between 1.5T MRI and 3T MRI was statistically significant. Intra- and inter-observer agreements were analyzed by using weighted kappa values with quadratic weighting. Degree of agreement was stated as follows: κ value 0.00–0.20 poor agreement; κ value 0.21–0.40 fair agreement; κ value 0.41–0.60 moderate agreement; κ value 0.61–0.80 good agreement, and κ value 0.81–1.00 excellent agreement. P values smaller than 0.05 were considered statistically significant.

Results

Patient characteristics

Median age of the 13 included patients was 67 years (range 53–78), 10 were male and 3 were female. Of the 13 patients, 3 had pT1 tumors, 4 had pT2 tumors, and 6 had pT3 tumors, thus 7 patients had tumor confined to the bowel wall (pT1-2) and 6 had tumor outgrowing the bowel wall (pT3). Total mesorectal excision was performed in all 13 patients. Median interval between final radiation dose and surgery was 1 day (range 1–5).

Diagnostic performance

Sensitivity, specificity, PPV, NPV, AUC, overstaging, and understaging rates for all readers at both field strengths are shown in Table 1. ROC curves for all readers at both field strengths are shown in Figure 1. There was no difference in AUC for 1.5 and 3T between the readers except between reader 1 (0.821) and reader 3 (0.524) at 3T MRI (P = 0.04).

Table 1 Overview of results for all readers at 1.5 and 3T
Fig. 1
figure 1

ROC curves for T-stage prediction for all readers at 3 and 1.5T.

Mean rates of overstaging and understaging for all readers were comparable at both field strengths (Fig. 2). Mean overstaging was 43% at 1.5T vs. 57% at 3T (P = 0.23). Mean understaging was 11% at 1.5T vs. 17% at 3.0T (P = 0.42).

Fig. 2
figure 2

Mean over- and understaging rates for 1.5 and 3T MRI.

Agreement

At 1.5T, inter-observer agreement was moderate to good (κ 0.502–0.708). Inter-observer agreement was lower at 3T MRI (κ 0.153–0.681). Intra-observer agreement (reader 2) was good: at 1.5T weighted κ was 0.710 (95% CI: 0.544–0.875) and at 3T weighted κ was 0.762 (95% CI: 0.662–0.862). Results for agreement are presented in Table 2.

Table 2 Inter-observer and intra-observer agreement

Discussion

This intra-patient comparison study between 3 and 1.5T MRI does not show an improved diagnostic performance for T-staging with 3T MRI regardless of the experience of the reader, with sensitivities from 29% to 57% and specificities from 50% to 83%. For the reader from a general hospital, prediction at 3T MRI was least accurate. Overstaging was higher at 3T MRI and highest for the general reader: 86% at 3T compared to 57% at 1.5T. Inter-observer agreement was moderate to good at 1.5T and poor to good at 3T MRI. Intra-observer agreement was measured for reader 2 and was good.

Our finding that 3T is not better than 1.5T MRI for T2 vs. T3 differentiation might be surprising; nevertheless, it could be explained given the fact that with MR the differentiation between T1-2 and T3 tumors is associated with interpretation difficulties in the distinction between desmoplastic benign reactions in a T2 tumor and malignant tumor stranding in a T3 tumor (Fig. 3). This results in substantial overstaging errors in this group of borderline tumors, confirming the findings of other authors [18, 22]. Overstaging rates at 1.5T MRI are reported to range from 25% to 46% [9]. In our study at 1.5T, overstaging occurred in 29%–57%, slightly higher figures than in the literature because of our study population, with mainly patients with borderline T1-2 and T3 tumors. At 3T, it was higher ranging from 43% to 86%. This confirms the hypothesis that despite the inherent higher resolution and the more detailed depiction of the rectal wall, 3T does not help us to obtain a better performance because the interpretation difficulties of desmoplastic reactions with or without tumor cells remain.

Fig. 3
figure 3

Stranding (arrows) around a pT2 tumor which cannot be distinguished from tumor extension in a T3 tumor (left 1.5T and right 3T).

Our results are in contrast to other publications. Only small studies on 3T MRI have been reported and these showed controversial findings with overstaging rates ranging from 15% to 30% [11]. In the largest study to date by Kim et al. [13] in 109 patients who were evaluated at 3T MRI, only equal overstaging rates were found as earlier reported with 1.5T MRI. Their findings suggest that 3T MRI is, unlike our results, not worse than 1.5T MRI.

The inter-observer agreement was lower at 3T MRI compared to 1.5T MRI, which indicates higher degrees of uncertainty between readers. This may have been caused by the more detailed depiction of the tumor and of the normal anatomy. Improved visibility at 3T MRI of small vessels penetrating the rectal wall may cause more interpretation difficulties (Fig. 4A, B). Increased variation between readers just by chance would be less likely because of the relatively high intra-observer agreement at both field strengths, although tested in only one of the readers (reader 2). We believe that the solid intra-observer agreement at both field strengths can be explained by the type of case mix that reader 2 is exposed to in the referral hospital, which had resulted in a stable and consistent learning curve, making him more flexible to handle different and new techniques.

Fig. 4
figure 4

A, B A small vessel penetrating the bowel wall is depicted in more detail on the 3T image (A) than on the 1.5T image (B). This could lead to more interpretation difficulties and thus overstaging of T2 tumors.

The highest overstaging rate was found in the reader from a non referral center (86% at 3T). The fact that this reader reads rectal MRI on a less frequent basis than the others might explain the inferior results compared to the others. This trend may be even aggravated by the more detailed depiction at 3T MRI which can lead to more interpretation difficulties in the less experienced reader in particular. The PPV for staging of T2 tumors with MRI is known to be high [23], because the existence of a hypo-intense bowel wall on T2W FSE sequences (Fig. 5) is a reliable predictive criterion for a tumor limited to the bowel wall. Yet this sign is subject to some learning curve. Therefore, readers from a referral center are more confident to identify a tumor that is restricted to the bowel wall based on this bowel wall sign.

Fig. 5
figure 5

An intact hypo-intense line (arrows) is seen surrounding the tumor, which indicates that the muscularis propria is intact (1.5T image). Therefore, the tumor is confined to the bowel wall (pT1-2).

There are some limitations to this study. First, the sample size is small. Nevertheless, this is the first study performing an intra-patient comparison between 1.5 and 3T. Thus, the results are important and suggest that it is not so obvious that 3T MRI would be better than 1.5T MRI. Second, it can be argued that other published studies had a higher resolution at 3T than our study [11, 1315]. Furthermore, it would have been ideal to have a similar acquisition time at 3T as at 1.5T. We acknowledge that our 3T protocol could be improved; however, due to the retrospective nature of the study this was not possible. Nevertheless, compared to our 1.5T standard rectal MR protocol, our 3T protocol was still superior in resolution and we could not achieve better results with 3T MRI. Third, we performed the 1.5T scans on two different MR units and the 3T scans on one unit. If this would have influenced our findings, we would expect an advantage for the diagnostic performance of 3T MRI (because of a uniform image acquisition at 3T for all patients), which we did not find in this study.

Clinical relevance

Accurate staging of tumors limited to the bowel wall vs. tumors outgrowing the rectal wall is important because it determines treatment stratification. When tumors are confined to the bowel wall (T1-2), patients are referred for immediate TME. When the tumor penetrates the bowel wall (T3-4), the patient will be stratified for a more intensive treatment arm with a long course of chemoradiation treatment followed by resection. In the Netherlands and Northern Europe, patients with T3 tumors without CRM involvement are given preoperative 5 × 5 Gray radiation treatment (instead of a long course of chemoradiation) followed by immediate TME. Overstaging of T2 tumors by MRI would lead to a substantial overtreatment of patients with a T1-2 tumor, who will receive unnecessary preoperative (chemo)radiotherapy. Furthermore, the recent trend toward organ sparing treatment propagates local excision for tumors that are limited to the bowel wall [8]. When local excision becomes standard treatment for T1-2 tumors, the high overstaging rate leads to overtreatment of these patients as well, with not only unnecessary neoadjuvant treatment, but also an unnecessary TME.

Conclusions

In this first intra-patient comparison between 3T MRI with 1.5T MRI for the clinically relevant distinction between T2 and borderline T3 tumors, no improvement was found with 3T MRI, regardless of the expertise of the readers. Even though there is a better visibility of the rectal wall at 3T, it does not aid in the distinction between T2 tumors with desmoplasia and borderline T3 tumors with tumor stranding in the mesorectal fat, a distinction which is crucial for the differentiation and known to be difficult on MRI. The lower inter-observer agreement at 3T MRI reflects less certainty about the prediction of the T-stage, most prominent in the reader from a general hospital. Future studies have to validate these findings in a larger sample size by comparing local staging of rectal cancer at 3T with 1.5T MRI in one patient group.