Introduction

Relapsing–remitting multiple sclerosis (RRMS) often originates from a single demyelinating event, known as clinically isolated syndrome (CIS) [1]. CIS events are isolated events of symptomatic neurological disturbance lasting more than 24 h, which indicate the first clinical demyelination of the central nervous system [2]. Clinical syndromes are usually mono-focal but occasionally multi-focal in nature. Based on the previous diagnostic classifications of multiple sclerosis (MS) [3, 4], people with CIS could be categorised as high risk of developing MS but without being formally diagnosed with MS. With the revisions in the magnetic resonance imaging (MRI) component of the 2010 McDonald criteria [5], it is now possible to be diagnosed with MS after CIS if repeated MRI imaging shows multiple demyelinating lesions of the central nervous system developing over time, which has lead to an increased rate of MS diagnosis in patients with CIS [6].

Disease-modifying therapies (DMTs) are commonly prescribed for people diagnosed as having MS. Most randomised controlled trials (RCTs) conducted in RRMS have sought to include patients with at least two acute clinical exacerbations in the prior years. First-generation DMTs, such as beta-interferon (IFN-β) and glatiramer acetate (GA), have also been evaluated in people with CIS with the aim of delaying a second clinical attack leading to the diagnosis of clinically definite MS (CDMS).

In 2008, a systematic review assessed these two types of DMTs in this specific population [7] but newer evidence has become available since then. Moreover, this systematic review assessed the effectiveness of DMTs over a short follow-up duration (2–3 years), which is insufficient to evaluate the long-term clinical effectiveness of DMTs over placebo. This is of particular importance when accounting for the frequent occurrence of adverse events with first-generation DMTs. For example, flu-like symptoms occur in 40–75% of patients with IFN-β, and injection-site reactions and pain are reported by at least one-third of patients with IFN-β or GA [8]. These adverse events can affect the quality of life of patients who may otherwise have only limited disability at this very early disease stage. While an RCT comparing an early to a pre-planned delayed initiation of DMTs would be the best study design to investigate the optimal treatment sequence after CIS, the long-term benefit of early DMTs can be partially assessed from the open-label extension phases of studies which originally compared DMTs to placebo. Thus, in this systematic review and network meta-analysis (NMA), we examined the short- and long-term clinical effectiveness of first-generation DMTs in people with CIS.

Methods

Search method

The current work follows a systematic review (PROSPERO identifier: CRD42016043278) that was undertaken as part of a larger project assessing the clinical and cost effectiveness of IFN-β and GA acetate for MS within the UK [8]. The following electronic databases were searched for RCTs in CIS in February 2016: Cochrane Multiple Sclerosis and Rare Diseases of the CNS group specialised register; MEDLINE (Ovid); MEDLINE In-Process and Other Non-Indexed Citations (Ovid); Embase (Ovid); Cochrane Library (Wiley), including Cochrane Database of Systematic Reviews, Cochrane Central Register of Controlled Trials (CENTRAL), DARE, NHS EED, and HTA databases); Science Citation Index and Conference Proceedings—Science (Web of Science); UKCRN Portfolio Database; WHO ICTRP; Current Controlled Trials; ClinicalTrials.gov. No date restriction was applied. The included and excluded study lists from previous relevant Cochrane systematic reviews were also checked [9,10,11]. Additionally, any CIS RCTs identified during the broader MS systematic review were also considered for inclusion. These additional searches are described in the report of this larger project [8] (Online resource 1). During our original work, we also identified long-term follow-up studies of immediate versus delayed treatment at title/abstract level, but excluded them at full-text review. For the purpose of the current review, we retrieved these records from our initial list of excluded references and undertook targeted searches using more than one source to find other potential subsequent reports from the original CIS studies.

Selection criteria

Our original systematic review included RCTs conducted in people with RRMS, SPMS, or CIS, evaluating all forms of beta-interferon (IFN β-1a; pegylated IFN β-1a; IFN β-1b) and GA compared against each other or placebo/best supportive care, and where clinical outcomes were reported such as relapse rates, progression to multiple sclerosis, or disability progression as measured by the Expanded Disability Status Scale (EDSS). We consider here a subset of studies reporting RCTs conducted in people with a single clinical attack diagnosed as having CIS and where the interventions were used at the authorised dose regimen (IFN β-1b 250 µg subcutaneously (SC) every other day for Betaferon® (Bayer, Leverkusen, Germany) and Extavia® (Novartis, Basel, Switzerland); IFN β-1a 30 µg intramuscularly (IM) once weekly for Avonex® (Biogen Idec Ltd, Cambridge, MA, USA); GA 20 mg SC daily for Copaxone® (Teva Pharmaceutical Industries, Petah Tikva, Israel); IFN β-1a 44 µg SC three times weekly for Rebif® (Merck, Darmstadt, Germany)).

Our main outcomes of interest were: (1) time to CDMS using Poser criteria [12] and involving a second relapse or neurological deterioration, measured using hazard ratios (HR); we also undertook a sensitivity analysis examining time to ‘McDonald MS’ instead of time to CDMS, in which MRI findings could be used with clinical findings to arrive at a diagnosis; (2) discontinuation due to AE, measured using risk ratios (RR). Although these were not included in our original systematic review, we also included long-term open-label extension studies following the double-blind period of RCTs that had enrolled patients with CIS. Open-label extension studies were defined as studies that compared all those randomised to active treatment who continued on treatment in extension phase (early DMT) versus all those randomised initially to placebo who then crossed over to active treatment in extension phase (delayed DMT). We used these studies to explore the impact of DMTs on disability progression, which was measured using the Expanded Disability Status Scale (EDSS). We also examined long-term annualised relapse rates (ARR) from open-label studies.

Study selection process

In our initial work, we first examined relevant past systematic reviews from the Cochrane MS group [7, 10, 11] for studies meeting the inclusion criteria. For updated and new searches, we collected all retrieved records in a specialised database and duplicate records were identified and removed. Subsequently, two reviewers applied the inclusion/exclusion criteria and screened all identified bibliographic records for title/abstract and then for full text. Any disagreements were resolved through discussion consensus or by recourse to a third-party reviewer.

Data extraction, quality assessment, and synthesis

Data extraction was performed by two reviewers and was cross-checked. Any disagreement was resolved through discussion or recourse to a third reviewer. We undertook the quality assessment of included studies using the Cochrane risk of Bias tool for randomised-controlled trials [13]. The quality assessment was cross-checked within the reviewer team. Any disagreements were resolved as in earlier stages. The included papers were organised in text and tables. The summary tables tabulated characteristics and results for included primary studies. Tables for primary studies present summary data on participants, interventions, and outcomes.

For RCTs reported up to the end of double-blind periods, we performed a conventional pairwise meta-analysis using a method-of-moment random effects model (inverse-variance weighted) to directly compare each pair of interventions for which direct evidence exists, and to compare included DMTs as a class against placebo. We examined these pairwise meta-analyses for heterogeneity, measured as Cochran’s Q and I2. We then estimated random effects network meta-analyses in the frequentist framework, using package network in Stata v.14. For each outcome of interest, a network diagram of pairwise direct comparisons between multiple treatments was created. Tests for inconsistency were inapplicable because the network was star shaped. The summary estimates (with 95% CIs) for all pair-wise comparisons were provided in a league table [14]. We aimed to determine the rankings of all treatments (e.g. based on the relative probability of an intervention being superior) with respect to effectiveness and safety outcomes in reference to placebo/standard care and we used the surface under the cumulative ranking curve (SUCRA) [14].

For open-label extension studies, we calculated a pooled estimate of the effect of early DMT, including all DMTs as a class, compared to delayed DMT on time to CDMS (effect again measured as HR) through a method-of-moments random effects meta-analysis. When HR were reported at different follow-up durations, we used the most mature data, i.e. those with the longest follow-up. In these studies, the outcomes were only reported for patients originally randomised to active treatment or placebo, who entered extension phase. Our exploratory work measuring the impact of early DMT on disability progression and examining long-term ARR was summarised narratively.

Results

Study identification and characteristics

Our original searches identified 6420 records. Of these, 6146 were excluded based on title and/or abstract, leaving 274 to be examined at full-text (Fig. 1).

Fig. 1
figure 1

PRISMA flowchart

Among these, 14 records [15,16,17,18,19,20,21,22,23,24,25,26,27,28] (including two additional ones from our later targeted search) met our inclusion criteria and corresponded to five trials: BENEFIT comparing IFN β-1b 250 µg SC every other day to placebo [15]; CHAMPS [16] and Pakdaman et al. [17] both evaluating IFN β-1a 30 µg IM once a week against placebo; PRECISE comparing the effect of GA 20 mg SC once daily to placebo [18]; and REFLEX comparing IFN β-1a 44 µg SC three times a week against placebo [19]. In these trials, all included patients had a single clinical event and also had evidence of clinically silent lesions based on MRI. The double-blind period in these trials was 2 years after randomisation in BENEFIT and REFLEX, and 3 years in PRECISE, Pakdaman et al. [17], and CHAMPS (Table 1).

Table 1 Baseline characteristics of patients in included studies

Nine of the 14 selected records were open-label extension studies of four of the five above-listed trials (Online resource 2): BENEFIT with long-term follow-up at 3, 5, 8, and 11 years [20,21,22,23]; CHAMPS with results at 5 and 10 years (CHAMPIONS study) [24, 25]; PRECISE with a prolonged follow-up at 5 years [27]; and REFLEX with a 5-year follow-up (REFLEXION study) [26, 28]. We found no subsequent long-term follow-up for the Pakdaman et al. [17] RCT. In open-label extension studies, patients initially allocated to placebo were offered active treatment from the time of a second clinical attack or after the end of the double-blind period (delayed DMT group). Conversely, people initially allocated to active treatment were offered to continue on treatment during an extension study (early DMT group). Overall, the proportion of patients entering open-label extension studies ranged from 53 to 86% after completion of the double-blind phase with no significant differences between the two groups (early or delayed DMTs) within each study (Online resource 2).

We judged there was a substantial proportion of studies with insufficient information to rate the risk of selection bias (60% of studies rated as unclear risk on random sequence generation and 40% of studies rated as unclear risk on allocation concealment) (Online resource 3 and 4). Although studies were placebo controlled, we found that four of the five (80%) studies were at high risk of performance bias (due to loss of blinding of participant and personnel). This was because of the much increased rates of side effects such as injection-site reactions in patients allocated to active treatment which led to high risk of participant unblinding.

Overall, we found no major issue of detection (blinding of outcome assessment) or reporting bias across studies. Only one study [17] was rated as at high risk at attrition bias (incomplete outcome data). Understandably, the open-label extension studies were at much higher risk of attrition bias with a range of complete follow-up rates after entering the extension phase of 44–96%. Finally, all studies funded by drug manufacturers were designated as high risk of bias under the ‘other’ category.

Short- and long-term clinical outcomes

Short-term clinical outcomes were informed by the results from the double-blind period of RCTs.

Time to CDMS: short-term outcomes

Direct evidence from comparisons showed all drugs reduced the time to CDMS compared to placebo (Fig. 2). There was little heterogeneity (I2 = 0%, p = 0.72) for the IFN β-1a 30 μg IM once a week vs. placebo comparison, which was the only one that included more than one study. Accounting for all DMT and considering a class effect for DMTs used in CIS, the pooled HR for time to CDMS was 0.51 (95% CI 0.44, 0.61) with low heterogeneity (I2 = 0%, p = 0.98).

Fig. 2
figure 2

Pairwise meta-analyses, time to clinically definite MS (active drug vs placebo)

The set of studies reporting hazard ratios for time to CDMS formed a connected, star-shaped, network (Fig. 3). Our random-effect NMA showed similar results compared to pairwise comparisons for active vs placebo comparisons (Table 2). There was no evidence from indirect comparisons suggesting superiority of any one active drug over another. Rankings from the NMA suggested that IFN β-1a 44 μg SC thrice weekly was ranked best, followed by IFN β-1b 250 μg SC every other day, IFN β-1a 30 μg IM once a week and GA 20 mg SC once daily while placebo was ranked last.

Fig. 3
figure 3

Network of studies, time to clinically definite MS. ifn1a30: IFN β-1a 30 µg IM once a week; ifn1a44: IFN β-1a 44 µg SC three times weekly; ifn1b250: IFN β-1b 250 µg SC every other day; ga20: GA 20 mg SC once daily; plac: placebo

Table 2 Network meta-analysis: time to CDMS

As a result of our sensitivity analysis, we found that effectiveness estimates were robust to the use of time to McDonald MS instead of time to CDMS [pooled HR for time to ‘McDonald MS’ 0.52 (95% CI 0.46, 0.60); I2 = 0%; p = 0.93].

Discontinuation due to AEs: short-term outcomes

All but Pakdaman et al. [17] reported discontinuation due to AEs. However, the time point to assess this outcome was not consistent across the four studies (36 months in PRECISE and CHAMPS; 24 months in REFLEX and BENEFIT). Therefore, we chose not to undertake a NMA for discontinuations due to AE. Estimates from published papers are presented in Table 3 and do not show clear trends across studies.

Table 3 Discontinuation due to AEs in CIS studies

Time to CDMS: long-term outcomes

Open-label extension studies informed the long-term clinical outcomes. Results presented below include data reported over both original and extension phases (Online resource 2).

At 5 years (3 years of open-label extension in REFLEXION [26] and CHAMPIONS [25]), early IFN β-1a 30 µg IM once weekly led to a delayed time to CDMS compared to the late use of these agents. This was also observed at 10 years (open-label extension in CHAMPIONS [24]) for early IFN β-1a 30 µg IM once weekly. These findings were robust to the use of time to McDonald MS instead of time to CDMS. In CHAMPIONS [24, 25], ARR in patients with delayed DMT was doubled over the 5- and 10-year periods compared to early DMT (0.31 vs 0.14, respectively; p = 0.03). At 10 years, 81% of patients had EDSS score ≤ 2.5 and only 6% with EDSS ≥ 6.0, with no significant difference between early and delayed IFN β-1a 30 µg IM once weekly.

In BENEFIT, results from open-label extension follow-up showed a reduced risk of developing CDMS with early IFN β-1b 250 µg SC every other day compared to delayed use of the same DMT for up to 9 years of additional follow-up [20,21,22,23]. At 5 years (i.e. including 3 years of open-label extension), risk of EDSS progression in early DMT vs delayed DMT was 25 and 29%, respectively, annualised relapse rates were lower with early IFN β-1b 250 µg SC every other day (0.21 vs 0.27, respectively; p = 0.014) while EDSS scores were stable in both groups (− 0.03 compared to baseline with early DMT and + 0.07 compared to baseline with delayed DMT) [23]. At 8 years (6 years of open-label extension), overall ARR was reduced by 22.9% and relapse risk was reduced by 23.4% (HR 0.77, 95% CI 0.589–0.988; p = 0.048) with early DMT [20]. EDSS scores at the end of the 8-year period were on average 1.87 with early DMT (+ 0.38 change from baseline) and 1.56 with delayed DMT (+ 0.07 compared to baseline). At 11 years (9 years of open-label extension), overall ARR was lowered with early IFN β-1b vs delayed IFN β-1b (0.21 vs 0.26, respectively, p = 0.0018) while EDSS scores had minimal changes remaining low in both groups (mean of 2.04 with early IFN β-1b, + 0.55 compared to baseline; mean of 2.22 with delayed IFN β-1b, + 0.72) [21].

With early GA, the time to CDMS was delayed by 41% (HR 0.59; 95% CI 0.44–0.80) compared to delayed GA at 5 years [27]. ARRs were 0.11 and 0.16, respectively (p = 0.0241). EDSS score changes over the 5-year period were minimal.

Accounting for all DMTs, the pooled HR for time to CDMS was 0.64 (95% CI 0.55, 0.74) with low heterogeneity (I2 = 0%, p = 0.89) (Fig. 4) demonstrating the benefit of early vs delayed DMT.

Fig. 4
figure 4

Pooled estimate of time to CDMS (early vs delayed DMT)

Discussion

Meta-analyses confirmed that IFN-β and GA delayed the time to CDMS after CIS compared to placebo. No active drug showed superiority over another. Similar findings were observed in our sensitivity analysis using time to McDonald MS instead of time to CDMS. We were not able to compare drugs in relation to the risk of discontinuation due to AE.

Findings from open-label extension studies suggested a persistent long-term benefit in terms of time to CDMS for earlier treatment, despite the relatively short delay in treatment initiation (2–3 years). However, these results should be treated with caution because of a high-risk of selection bias which may make delayed DMT appear worse than early DMT. The fact that patients initially allocated to placebo were offered active treatment from the time of a second clinical attack, and that the risk of developing CMDS was reduced with DMT compared to placebo over the double-blind period means that those in placebo arms who had a second attack were more likely to get into extension phase. This would result in higher risk of MS in the delayed DMT group than in the early DMT group. Moreover, most of these studies had large losses to follow-up (14–47%), which gives the potential for attrition bias. This makes it difficult to be certain of the long-term benefit in delaying the time to CDMS.

The evidence showing the benefit of IFN-β and GA in delaying time to CDMS after CIS could underline the importance of initiating specific therapies in a timely manner following CIS. However, the decision about whether to start DMTs is more complex. Some patients with CIS will never have a second attack even off treatment. This is suggested by cohort studies reporting high proportions of CIS patients having no second attack despite not receiving DMTs, 55% after a mean follow-up of 5.8 years according to Brownlee et al. [29], or 27% at 10 years in the paper by Kerbrat et al. [30]. Also, some patients with CIS may only have a few minor non-disabling attacks. These patients with benign disease would not gain any/much benefit from treatment but would be exposed to unnecessary risk and/or experience frequent local or systemic adverse events which may reduce their quality of life.

For those who are at high risk of developing more severe MS, there may be some merit in delaying a second attack, but the key aim (from both a clinical and cost-effective perspective) would be to delay irreversible, progressive disability and there is no evidence at present that early treatment after CIS does this. An analogous situation might be whether to use anticonvulsants after a single seizure, given that some patients will never have another seizure. This is generally not recommended, at least in the UK, because the evidence shows early treatment does not improve long-term outcome [31].

The main strength of our work is that we present an up-to-date review on the clinical outcomes of first-generation DMTs, namely IFN-β and GA, used in people with CIS. For that purpose, we used a rigorous and comprehensive search to identify all existing primary studies. This is also the first review to include open-label extension studies, which provide a signal (albeit biased as previously discussed) of the longer term clinical effectiveness of DMTs in this population.

There were several limitations. We were interested in people with CIS and we only included studies where patients had a single clinical attack. It is possible that some of these patients would now meet diagnostic criteria for MS based on new MRI criteria [5], which means that some of these would be considered eligible to begin DMTs. However, given that our main interest was to assess the evidence related to the clinical effectiveness of DMTs in people with a single clinical event we feel this is a minor limitation. To the best of our knowledge, there is no RCT which evaluates the effectiveness of DMTs in people with CIS using the new diagnostic criteria. A second limitation is the inconsistent reporting of safety outcomes, which led to an inability to compare DMTs on discontinuation due to AE. However, we believe that the safety profile of DMTs in CIS can be understood through the results observed in people with RRMS. Indeed, there is no a priori reason why the safety profile of IFN-β and GA would differ in people with CIS compared to those with RRMS because the drugs, strength and dose regimens are similar across both indications. In our work conducted in RRMS based on follow-up at 24 months, we found no evidence that one drug was more likely to lead to discontinuation than other, although confidence intervals for estimates from mixed treatment comparisons were wide. Therefore, we are fairly confident that these findings also apply in CIS patients.

Although we originally reviewed studies reporting long-term open-label extensions of RCTs conducted in people with CIS, this work was conducted post hoc after the original work for our initial systematic review. However, we conducted additional searches for this analysis and are, therefore, reasonably confident that we retrieved all extension studies published as original articles and that no bias was introduced in this subsequent analysis. Finally, a potential limitation of our work, inherent to all systematic reviews, is publication bias since, for instance, some studies finding no impact of DMTs in people with CIS might have remained unpublished. There were too few studies to formally test for this.

In this review, we have focused on first-generation DMTs (IFN-β and GA) since these were all evaluated in CIS-specific RCTs. Oral teriflunomide, a newer generation DMT, has also been tested for patients with CIS in a RCT [32] that reported a benefit over placebo in delaying time to CDMS, yet with the same uncertainty on long-term clinical benefit.

At the European level, we are aware of the 2015 guidelines for prescribing disease-modifying treatments in MS [33]. Our findings agree with these guidelines, stating the benefits of use of the only two DMT types with a licensed indication in CIS in delaying CDMS, namely IFN-β and GA. The Association of British Neurologists (ABN) guidelines also emphasise the uncertainty around the effectiveness of these drugs over a long-term perspective.

In summary, IFN-β and GA reduce the short-term (up to 2–3 years) risk of a second clinical attack after CIS and so delay the diagnosis of CDMS. Open-label extension studies suggest a possible long-term effect of IFN-β and GA in delaying CDMS but there is less certainty because of the risk of bias of these studies. There is currently no evidence supporting the effectiveness of DMTs following CIS in delaying irreversible or progressive disability. Further research with RCTs is needed to specifically assess early vs delayed DMT after CIS to establish the long-term benefit/risk balance of DMTs and whether high-risk patients who might benefit more from early treatment can be identified.