INTRODUCTION

Multiple sclerosis (MS) is an inflammatory autoimmune disorder of the central nervous system that is frequently associated with the development of significant disability. There are approximately 400,000 cases of MS in the USA, or 0.1% of the general population. The age of onset is in young adulthood (ages 20–40), and women are affected more commonly than men [1]. The female:male ratio of 2:1 is usually cited, but newer studies suggest an increasing female preponderance, ranging from 4:1 to 7:1 [2, 3]. The reason for this greater skewing by sex is not clear, although several intriguing theories have been proposed [4, 5]. Acute attacks of neurological impairment (relapses) followed by variable degrees of improvement (remission) characterizes relapsing remitting MS (RRMS), representing 85% of newly diagnosed cases. The majority of RRMS patients will eventually transition to a disease stage in which relapses are less common or even absent, and a slowly progressive disability ensues, classified as secondary progressive MS (SPMS). Another form is primary progressive MS, which is associated with a slowly progressive myelopathy, an older age at onset, and equal distribution among men and women; this form accounts for 10% of new MS diagnoses. Progressive-relapsing MS in which relapses occur after an initially progressive form accounts for the remaining 5% of cases [6]. In addition to these classical MS subtypes, there are several MS variants, including neuromyelitis optica and acute disseminated encephalomyelitis [7].

DIAGNOSIS

The specific diagnostic criteria for MS have undergone considerable modification in the past few decades with recent changes incorporating findings from magnetic resonance imaging (MRI) to a greater extent. However, it is important to recognize that the principles that underscore MS diagnosis have not changed; namely, the need to establish dissemination in time and space after ruling out other MS mimics [8] and other disorders that can have associated MRI abnormalities [9]. In the past, clinical events and findings on examination were required to establish dissemination in both time and space [10]. Under the new McDonald criteria, findings on MR images can fulfill both requirements [11]. Classic examples of first demyelinating events include optic neuritis or transverse myelitis. When a patient presents initially with one of these clinically isolated syndromes (CIS) or other suggestive symptoms, findings on MR images are critical in establishing the likelihood that this represents the first clinical episode of MS. Standard MRI sequences, such as double echo proton density/T2-weighted or, especially, fluid-attenuated inversion recovery (FLAIR) imaging, are used to detect areas of high signal in the white matter that are the hallmark of inflammatory demyelination. New white matter lesions will frequently show enhancement with the uptake of gadolinium (Gd)-based contrast agents. The presence of Gd + lesions is more sensitive and specific for MS and is a useful measure of acute inflammation, with lesions typically showing enhancement for 6–8 weeks. The appearance of 2 or more white matter lesions at the time of a clinically isolated event is associated with a high risk of developing clinically definite MS [12]. Key characteristics of white matter lesions that are associated with MS were originally identified by Barkhof et al. [13] and subsequently modified [14]; these are helpful in differentiating likely demyelinating lesions from non-specific changes in white matter. See Table 1 for the Barkhof MRI criteria.

Table 1 Barkhof Magnetic Resonance Imaging Criteria

MRI criteria for diagnosis: A moving target

The original McDonald diagnostic criteria for MS incorporated the modified Barkhof criteria to establish dissemination in space and a somewhat complicated algorithm to establish dissemination in time using findings on serial MR images [11]. Modifications in 2005 reflected the important role that spinal cord imaging plays in the diagnosis of MS and simplified the establishment of dissemination in time [15]. Most recently, the MAGNIMS group has proposed new guidelines that would allow a diagnosis of MS in a CIS patient with a single MRI scan that demonstrated typical white matter lesions meeting Barkhof criteria (dissemination in space) and consisting of a mixture of both enhancing and non-enhancing lesions (dissemination in time) [16].

Individuals with characteristic MRI findings in the absence of clinical symptoms and with an entirely normal neurological examination have been described as having a “radiologically isolated syndrome” or preclinical MS. Recent longitudinal cohort studies suggest that about one-third of such patients will convert to CIS over the course of 2 years [17, 18] and that the presence of spinal cord lesions is associated with a faster clinical conversion [19].

MRI as a surrogate marker

Acute inflammatory activity.

Initial studies of MS using serial MRI scans revealed the presence of new inflammatory activity occurring at a rate seven- to tenfold greater than the clinical events [20, 21]. The sensitivity of MRI to detect ongoing disease activity in the absence of less frequent and sometimes hard to define clinical worsening has made it an attractive readout for assessing treatment effects in clinical trials [20]. Subsequent studies have convincingly demonstrated that the presence of active inflammation, as detected with Gd + lesions, is associated with a higher likelihood of relapse activity [22]. As a readout of acute inflammation and short-term clinical activity, Gd + enhancing lesion numbers and volumes are among the most robust MRI biomarkers of MS [23, 24] and short-term disability progression [25].

Cumulative disease burden.

While Gd + lesions reflect transient inflammatory events, changes in T2/FLAIR lesion numbers and volumes are measures of cumulative lesion formation and can be used in lieu of Gd + lesion counts [26]. The relationship to future disability is only modest [27], and other measures that reflect global and gray matter changes have been shown to be better predictors of long-term outcomes [28, 29]. Despite this evidence, MRI outcomes for exploratory Phase II studies of MS therapies continue to use Gd + lesions and T2 volumes. While clinical metrics (decrease in relapse rate or slowing of disability progression) are the primary outcome measures in definitive Phase III clinical trials, changes in lesion burden detected by MRI remain important secondary outcome measures.

TREATMENT EFFECTS

All currently available disease-modifying therapies for MS have been shown to have significant effects on MRI measures of white matter inflammation. The original studies with interferon-β showed a dramatic reduction in Gd + activity [30] and subsequently a more modest decrease in relapse rate [31]. Treatment with the first-generation injectable therapies, such as interferon beta-1b (Betaseron; Bayer Healthcare Pharmaceuticals, Wayne, JF), and interferon beta-1a [Avonex (Biogen Idec, Weston, MA); Rebif (Merck Serono, Geneva, Switzerland)] and glatiramer acetatate (Copaxone; Teva Pharmaceuticals, Petah Tikva, Israel), leads to variable decreases in Gd + activity and the number and volume of T2 lesions [3137] and, on average, a 30% reduction in relapses [7, 31, 36, 38, 39]. However, a dose/response relationship is evident with high doses of interferon-β associated with more pronounced reductions in MR disease activity and relapses [36, 4042].

Newer therapies have also shown pronounced effects on inflammatory activity and significant decreases in relapse rates. Natalizumab (Tysabri; Biogen Idec and Élan, Roscommon, Ireland) was tested first in combination with interferon beta-1a [43], which may have increased the risk for developing progressive multifocal leukoencephalopathy. Subsequent Phase III trials showed what appeared to be more prominent treatment benefits of natalizumab versus the first-generation injectable therapies. In both the SENTINAL [44] and AFFIRM [45] studies, MRI activity and relapse rates were suppressed significantly. However, the patient cohorts tested were diagnosed using McDonald criteria, were younger, and had overall lower disease activity which may have led to an overestimation of the treatment response [46], especially if the relative risk reduction is reported instead of absolute risk reduction [47].

The limitations of relying solely on markers of inflammation came into focus when these therapies were used to treat patients with progressive forms of MS. The European trial of interferon beta-1b in SPMS found benefits in markers of inflammation and disease progression [48, 49]. However, a North American trial failed to find a similar effect [50]. Subsequent analysis of the two patient cohorts revealed that the benefits of interferon therapy on disease progression was limited to the group with higher levels of Gd + activity and continued relapses [51]. The results from a series of studies with alemtuzamb (Campath; Genzyme, Cambridge, MA) provided more evidence for the complex role that inflammation plays in causing future disability. When used in progressive patients, Campath eliminates all traces of inflammatory lesion activity, but it has no impact on disability progression. However, treatment in early RRMS appears to stabilize the disease, suggesting that early inflammatory events lead to future neurodegenerative changes and worsening neurological function [52, 53]. Results from these therapeutic trials provide further evidence for the early treatment of MS with potent therapies.

NEURODEGENERATION

TI black holes.

T2/FLAIR lesion volumes are only modestly correlated with concomitant disability measures due to several factors, including limitations in commonly used disability measures, the lack of pathological specificity of MR signal changes, and functional compensation that limits the clinical expression of accumulated disease burden [54]. Several other measures have been explored to overcome these limitations. T1 “black holes” are areas of hypointensity detected on T1-weighted scans; they represent a subset of T2 lesions with greater degrees of tissue destruction [55] and are more highly correlated with disability measures [56]. Serial studies demonstrate that there is considerable evolution of these initially hypointense lesions over many months and that their development is related to the intensity and persistence of initial inflammatory activity [57, 58] (FIG. 1). The ratio of T1 black hole volume to overall T2 volumes has been incorporated in composite MR scales as a measure of disease severity that predicts future disability progression [59]. The ability to limit the development of T1 black holes has been suggested as a surrogate marker for reparative or neuroprotective therapeutic effects [60]. Modest effects have been reported in short-term studies of Copaxone [61], although the methodology used generated some controversy [62]. Trials of interferon beta-1b in SPMS showed a reduction in T1 hole formation, although this may have been due to an overall decrease in new lesion activity [63]. In RRMS, interferon beta-1b decreased the accumulation of T1 black holes but did not decrease their persistence [64]. A similar benefit was reported with interferon beta-1a SQ [65]. The BECOME study provided a head-to-head comparison of treatment effects on T1 black hole accumulation over 2 years and found a trend towards a lower conversion rate of acute black holes (ABH) to chronic black holes (CBH) with interferon beta-1b compared to copolymer-1 [66]. Notably, while the majority of the newly enhancing lesions were associated with ABH (62.7% overall), only a small proportion of Gd + lesions (12%) became CBH when assessed over at least a 1-year period. These relatively low conversion rates make designing treatment trials powered to detect effects on the conversion of ABH to CBH challenging [67].

FIG. 1.
figure 1

Serial cerebral magnetic resonance (MR) images demonstrate lesion evolution in relapsing remitting multiple sclerosis (MS). Conventional MR imaging is used to capture inflammatory disease activity in MS, including gadolinium (Gd)-enhanced T1 scans (A), T1-weighted scans without contrast (B), and fluid-attenuated inversion recovery (FLAIR) scans ©). A newly enhancing lesion (white arrow) is seen at baseline (A, left column) which leads to the development of a chronic black hole at 2 years (A, B, right column, white arrow) and increased area of FLAIR abnormality ©, right column, arrow). Note the increase in overall FLAIR white matter lesion volumes as well as the ventricular enlargement occurring over a 2-year period in relapsing remitting MS (RRMS) with minimal disability progression, as measured by the Expanded Disability Status Scale (EDSS; 1.5 to >2.0). (High resolution version of this image is available in the electronic supplementary material.)

Atrophy.

MRI-derived measures of atrophy have emerged as the most frequently used metric to capture the neurodegenerative process in MS, while Gd + activity and T2 lesion volume accumulation are measures of the “inflammatory” component of the disease. In practice, the distinction is not so neat, as inflammatory activity and brain volume measures are intertwined, making assessing treatment effects nuanced. Initial studies using global brain volume measures convincingly demonstrated that atrophy could be detected in RRMS [68] and could be measured accurately and reproducibly using automated techniques, such as the brain parenchymal fraction. In turn, these automated measures were used to monitor treatment effects on brain atrophy rates [69]. However, it has became clear that global brain volume measures are sensitive to changes in brain water content, which in turn fluctuate with disease activity and fluid status. For example, therapies with potent anti-inflammatory effects produce a more rapid, but transient rate of brain volume loss—the so-called pseudoatrophy effect—but may have differing effects in slowing brain volume loss with longer term treatment [70].

A recent study found that the rate of global brain atrophy varies across patients with MS; however, when baseline atrophy is accounted for, the rate is similar across the disease spectrum [71]. Long-term studies have validated the relationship between global brain atrophy and disability, as measured by the Expanded Disability Status Scale (EDSS) and the MS functional composite score (MSFC) [72], and also indicated that brain atrophy rate during the RR phase was a significant predictor of subsequent disability [53], suggesting that atrophy is clinically relevant.

Interferon preparations have been shown to slow the rate of brain atrophy in the second year of treatment in RRMS [7375] but not in SPMS [76]. An initial analysis of a study using a less sensitive central brain slice methodology failed to find an effect of copolymer-1 in slowing the rate of brain atrophy [77], but a subsequent re-analysis using the automated SEINA approach did demonstrate a benefit of treatment [78].

Measures of specific tissue types reveal that there are distinct atrophy rates in gray matter versus white matter regions in the brain, and that while white matter regions show a steady decrease, gray matter atrophy accelerates over time and is a more sensitive predictor of future disability [28, 29]. There is emerging evidence that the appearance of white matter lesions is linked to subsequent cortical gray matter changes, although the specific location of the lesions may influence the degree of gray matter change detected [79]. This fits with the treatment effects on whole brain atrophy reported above, which found a slowing of brain atrophy after several months of treatment, thus reflecting the delayed impact of suppressing white matter inflammatory changes.

Only a few studies have specifically assessed treatment effects on gray matter atrophy, although measuring neuroprotective and remyelination effects is of great importance for future MS therapeutics [60]. One study suggests a benefit of interferon beta-1a (Avonex) in slowing the rate of gray matter atrophy, detectable at 2 and 3 years after initiating treatment [80]. Another recent study found that gray matter loss was detected in patients treated with interferon-β but not in untreated or Copaxone-treated patients assessed over a 2-year period using a voxel-based morphometry approach [81]. However, interpretation of the latter study is limited by the small sample size and the lack of randomization.

Regional gray matter changes.

There is increasing evidence that gray matter regions are also impacted in MS and that some gray matter regions are especially susceptible. For example, newer techniques at higher fields as well as detailed pathological analyses have revealed widespread demyelination in the cerebral cortex [8285]. The impact of disease modifying agents (DMAs) on cortical plaques has not been studied in detail to date. and such studies will be complicated by the lack of sensitivity of current imaging techniques in detecting the majority of cortical plaques [86]. In addition to the cortical rim, deep gray matter structures are also impacted in MS, with the thalamus showing early and significant atrophy associated with axonal and neuronal loss [87, 88]. Thalamic atrophy has been linked with fatigue and cognitive impairment [89, 90], but the effect of DMAs on thalamic volume or function has not been assessed. The hippocampus is an archicortical gray matter structure that is impacted by MS and related to cognitive impairment [91]. Pathological studies show demyelination and neuronal loss throughout the hippocampal formation [92, 93]. Recent evidence suggests that there may be subregional susceptibility to distinct MS disease processes linked to depression and cognitive impairment in the hippocampus [94, 95]. Although changes in these distinct gray matter areas are well documented, few current studies are targeting MR outcome measures to assess specific treatment effects on these key brain areas.

MEASURING THERAPEUTIC EFFICACY

Although it is an important outcome measure for the assessment of new therapies, the use of MRI to monitor treatment response lacks consensus among practitioners [59, 96]. It is difficult to define “breakthrough” disease, although most experienced clinicians seem to “know it when they see it”. However, there is some evidence that in the setting of clinical events, the presence of new inflammatory activity detected on MR images should prompt a consideration to change therapy [97]. The search for relevant biomarkers continues, with a recent finding that baseline cytokine levels can predict the response to interferon treatment a promising advance [98], although other approaches, such as monitoring biological response markers, seem to be unhelpful in identifying non-responders [99].

NEW APPROACHES

Future clinical trials will incorporate newer techniques and use high-field-strength MRI scanners to better capture the diffuse disease effects discussed above that are currently not captured by inflammatory lesion counts. (FIG. 2). For example, MRI performed at high field (≥3T) and specific sequences, such as double inversion recovery imaging, capture larger number of cortical lesions [100, 101]. Magnetization transfer imaging, which can be used to quantify intact myelin, has been proposed as a biomarker for remyelination [102]. Spectroscopy can be used to detect metabolites that reflect axonal/neuronal integrity and has been used as a measure of neuroprotective effects [103] and excitotoxicity [104]. Diffusion tensor imaging can be used to monitor tract-specific changes that may be more closely linked to clinical measures [105, 106] and may be particularly powerful when combined with functional measures, such as functional MRI [107]. T1/T2 relaxation is another quantitative approach that may provide a better metric for diffuse brain changes occurring beyond traditional white matter regions [108] and to quantify myelin content [109]. Quantification of brain iron levels is an area attracting intense interest [110]. Other more technically challenging approaches include molecular imaging [111] and sodium imaging [112]. In general, these latter two approaches have not been applied to multicenter treatment trials because of technical complexities and a lack of standardizations across sites and between different MR equipment manufacturers [113]. However, these limitations are not insurmountable, but they do require a consortium approach, as recent publications demonstrate [114].

FIG. 2.
figure 2

Diffusion tensor imaging assessed with tract-based spatial statistics (TBSS) detects widespread white matter abnormalities in RRMS. Statistical maps of a group comparison between healthy controls (n = 19) and RRMS patients (n = 29) based on fractional anisotropy (fA) reveal widespread decreases in fA in RRMS patients with a low disability (EDSS <3). Shown are sagittal (A), axial (B), and coronal ©) views. The TBSS skeleton is overlaid onto a group atlas and color-coded to reveal regions with statistically significantly decreased fA RRMS compared to healthy controls. White matter disruption involves extensive regions of the corpus callosum, optic radiations and temporal lobe white matter. Future and current therapies should be assessed using this and other non-conventional techniques that can detect tissue disruptions beyond traditional white matter lesions. (High resolution version of this image is available in the electronic supplementary material.)

CONCLUSIONS

Findings on conventional MRI scans have contributed significantly to the clinical care of persons with MS by allowing for early diagnosis and treatment. In addition, key pathophysiological insights have been gleaned from MRI data obtained in practice and in clinical trials. New challenges are to develop sensitive and more specific markers of disease effects beyond detectable white matter lesion areas. Newer therapies developed to slow neurodegeneration or promote remyelination will require the use of advanced multimodal MRI techniques that capture structural and functional changes. The adoption of these cutting edge techniques into multicenter trials will be technically challenging, but feasible in the near future.