Review

Introduction

The mortality, morbidity and economic burden of Chronic Obstructive Pulmonary Disease (COPD) exacerbations is well documented [14]. Reduction in the frequency of exacerbations is a major therapeutic aim in COPD and treatment with inhaled corticosteroids (ICS) has been associated with a 25% reduction in exacerbations [5]. In a Cochrane review by Yang, ICS were found both to reduce the frequency of COPD exacerbations by 0.26 per patient per year (weighted mean difference: 95% CI -0.37 to -0.14) and to slow the rate of decline in health related quality of life as determined by the St George's Respiratory Questionnaire (weighted mean difference: -1.22 units/year, 95% CI -1.83 to -0.60) [6]. Despite this, Yang's review failed to demonstrate any effect of ICS on the decline in lung function or on mortality, although a more recent paper identifies a small clinically insignificant effect on lung function [7]. It is now recognised that ICS in stable COPD are associated with an increased risk of pneumonia (relative risk 1.34 95% CI, 1.03-1.75) [8], and the long-term use of high-dose ICS has been associated with adverse effects including cataracts [9], glaucoma [10] and osteoporosis [11]. UK National Institute of Health and Clinical Excellence (NICE) guidance discourages the use of ICS as monotherapy, but does encourage their use with bronchodilators if patients have moderate or severe COPD and are still symptomatic, or are experiencing two or more exacerbations requiring treatment per year [3].

In light of the debate around the role of long term ICS in COPD [8], consideration of the potential effects of withdrawing ICS becomes important, but there are no reviews of withdrawing treatment. The aims of this systematic review were to determine the effects of withdrawal of ICS, to use the review process to explore the reasons for different results between trials, and to make recommendations for improving methodology in future trials where medication is withdrawn.

Method

Search strategy and selection criteria

The search strategy involved text words for COPD and the names of inhaled steroids used in COPD and relevant Medical Subject Headings (MeSH). Boolean logic was used to truncate and combine terms and the search strategy was modified several times before being finally performed (full search available from the authors). The Cochrane Library and the Database of Abstracts and Reviews were searched for existing systematic reviews on this topic. None were found so we searched for published original articles in PubMed, EMBASE and CINAHL from inception to February 2007. Criteria for the inclusion and exclusion of studies were based on the four facets: Population, Intervention, Comparison and Outcome. Studies were included if they were randomised controlled trials, comparing patients withdrawn from ICS with those not withdrawn, in which the diagnostic criteria for COPD were consistent with GOLD, NICE or ATS [1, 3, 12]. Studies were included if they assessed any of the following outcome measures: lung function, exacerbations, health related quality of life, exercise tolerance and the use of healthcare facilities because of respiratory symptoms. Due to time and resource constraints it was not possible to include non-English language papers.

Procedures

Two reviewers independently reviewed papers identified by the electronic search for potential inclusion on the basis of their titles and abstracts and independently scrutinised full texts of papers considered potentially includable to confirm inclusion/exclusion. Any discrepancies were resolved through discussion between the two reviewers. To identify further studies, we searched citations of included reports and contacted experts in the field. Data from each included trial were extracted independently by two reviewers. Again, disagreements were resolved through discussion. We extracted data on outcomes, the design and conduct of the trial, the way in which exacerbations were defined, the protocol for managing severe exacerbations, and the methodological quality of the trials.

Trial quality

To assess quality we used the Cochrane collaboration classification scheme for assessing bias [13]. We used the following criteria: method of randomisation, sequence generation, and allocation concealment for selection bias; blinding for performance bias; whether the loss of patients were accounted for and whether the analysis was by intention to treat for attrition bias; whether the outcome assessor was blinded for detection bias, and whether there was evidence of selective reporting of outcome measures for reporting bias.

Analysis

We only conducted a meta-analysis on outcome data from trials which we rated as acceptable in terms of bias on at least three of the five bias criteria listed above, and only when relevant data could be obtained from at least three trials. For other outcomes we report results from individual trials. We meta-analysed outcome data using the random effects method of DerSimonian & Laird [14], with the estimate of heterogeneity being taken from the Mantel-Haenszel model and assessed for statistical significance using a Cochran Q-test which follows a chi-squared distribution. All statistical analyses were performed in Stata, version 10.1. We identified features of the included trials that we felt were important in interpreting results. Based on an examination of the treatment of these features in the included papers, we make recommendations for the future design of these types of trial. Statistical significance was reported at the 5% level in all trials and we report all results at that level.

Results

The electronic search identified 107 publications of which 103 were excluded on the basis of their titles and abstracts (Figure 1). The full texts of four papers were retrieved. One was excluded because it did not compare patients withdrawn from ICS with those not withdrawn. A further trial published during the course of this research was included; one of the authors of the present paper is an author on this trial. Searching through references of these trials or contacting experts did not identify any further studies.

Figure 1
figure 1

Summary of the research process.

Characteristics of the included trials are shown in Table 1 and characteristics of the patients included in Table 2. The number of patients in these trials ranged from 24-615. Three of the trials were parallel group trials (COPE, COSMIC, WISP) [1517] ranging in duration from 10-12 months with a run-in period of between 2 weeks and 4 months. The reason for a run-in is usually to screen patients so that those who are unsuitable for the trial can be identified and excluded before randomisation. In two of the trials, however, the run-in period involved all patients being administered the same inhaled corticosteroid treatment, and in the COSMIC trial authors explain the length of their run-in, which involved all patients being put on the same treatment as 'to be on the safe side of the GOLD guidelines which recommend a trial period of 6 weeks to 3 months to get a plateau in the response to FEV1 to ICS'. The fourth trial was a cross-over trial with a total duration of 12 weeks and no run-in period [18]. The parallel trials used fluticasone as the inhaled steroid but the cross-over trial used beclomethasone. During the run-in period, WISP patients received their usual medication, COSMIC and COPE patients were given the inhaled steroid which was to be used for the remainder of the trial.

Table 1 Description of studies
Table 2 Patient characteristics

The three parallel trials all had an acceptable level of bias for three or more bias criteria (Table 3). The fourth trial appeared to have a much greater potential for bias, was substantially different in size and design, and had a different active treatment. We did not consider data from this trial for our meta-analyses. Thus we could only meta-analyse data from a maximum of three trials. For most of the outcomes we considered, data were either not reported or not reported clearly for at least one trial. As a result we only conducted a meta-analysis for one outcome: whether or not a patient had an exacerbation during the study period.

Table 3 The methodological quality of the trials

Meta-analysis indicated that patients who had been withdrawn from ICS were slightly more likely to experience an exacerbation in the study period (odds ratio 1.11, 95% CI 0.84 to 1.46), but the effect was not statistically significant (Figure 2). Statistical heterogeneity was not present (P-value = 0.369), but we felt that clinical heterogeneity was. We outline this clinical heterogeneity, and reasons that we had to be cautious about conducting this meta-analysis, in the discussion. Results relating to exacerbations were reported in more than one way in all three parallel trials. In these trials, patients who were withdrawn had more exacerbations than those who were not withdrawn. The difference was significant in COPE [16] but not in WISP [15] or COSMIC [17] (Table 4). In WISP, patients who were receiving placebo experienced an exacerbation on average 19 days earlier than those receiving inhaled steroid (median = 44 days (CI 29-59) v. 63 (CI 53-74) days, P = 0.05) [15]. The mean difference in the COPE trial was 34.6 days (95% CI 15.4-53.8) [16]. Change in lung function from baseline was worse in the placebo group in all four trials but only in COSMIC was the difference statistically significant (Table 4).

Figure 2
figure 2

Meta-analysis of the odds ratio of patients experiencing at least one exacerbation.

Table 4 Effect size (withdrawn compared with active treatment) 95% confidence intervals and denominators for study outcomes

All three parallel trials [1517] assessed health related quality of life by means of the St George's Respiratory Questionnaire (SGRQ) [19]. In COSMIC and WISP [15, 17] there was no significant difference in the total SGRQ scores, whereas in COPE [16], patients in the placebo group experienced significantly poorer health status in terms of overall score (Table 4). Only COPE and O'Brien measured reported exercise tolerance and there was no evidence that this was affected by withdrawing treatment. In COPE, the use of health care facilities was greater in the placebo group.

Discussion

There is no conclusive evidence from the trials included in our review that withdrawal of ICS has an effect on the frequency or number of exacerbations. Two trials showed a statistically significant decrease in the time to first exacerbation. Effects on other outcomes are inconclusive. These results are not inconsistent with a recent review in which the authors suggest that the benefits of ICS in preventing COPD exacerbations may be overstated [20].

Strengths and limitations

We conducted a robust systematic review. Nevertheless, we did not assess publications in languages other than English and, given the small number of studies identified, could not assess publication bias. Limited data within the trial reports meant that we were only able to conduct one meta-analysis.

Meta-analysis

There are reasons for being cautious about the one meta-analysis we conducted and, in general, meta-analysing results from these types of trial. First, there are differences in patient characteristics between trials, in this case duration of use of ICS and exacerbation rate prior to entry to the trial which may be related to a patient's dependency on inhaled steroids, and differences in outcome definition, in this case definition of exacerbations (Table 5). The two trials conducted most recently also showed less effect of withdrawal which may reflect the increased efficacy of more recent medications which might have been taken alongside the inhaled steroids, although the use of other medication is not reported in detail in the trial reports. An additional reason for being cautious about meta-analysis in trials where medication is withdrawn relates to patient management during the trial, specifically in these trials, the management of exacerbations severe enough to make it unethical for patients to continue on the randomised treatment (Table 4). The WISP study had the highest number of patients discontinuing randomised treatment; 46% in the placebo group and 26% in the fluticasone group. In this trial, a patient's own doctor managed any exacerbation according to usual guidelines. The decision to stop the study inhaler and return to the usual (pre-randomisation) inhaler was made by the patient and doctor without a clearly defined protocol. The procedure was similar in the COSMIC study. By contrast, in the COPE study patients only returned to their usual medication following an examination and decision by one of the trial physicians. Thus, patients in WISP and COSMIC may have had a lower threshold for accepting changes in their normal level of symptoms than in the COPE study and may have left the randomised treatment earlier, especially in the placebo group, resulting in a smaller observed difference in outcome between this group and the fluticasone group in these trials. It is not possible to be prescriptive about a protocol for managing severe exacerbations; different approaches reflect the point of the trial on the pragmatic/explanatory spectrum, and trials at all points are useful in this context. For example, COPE suggests that withdrawing ICS results in an increase in the likelihood of an exacerbation but from WISP there is no indication that doing so in a routine clinical setting with patients participating in the decision to discontinue randomised treatment is likely to result in a change in proportions exacerbating.

Table 5 Summary of the way in which trials define and manage exacerbations

Reporting of individual trials

We recommend that authors make it clear in these trials what other medication patients were likely to have been taking during the trial period and whether there were differences between intervention and control, particularly in situations where the drug being withdrawn is generally seen as an add-on to other medication, as in the case of inhaled corticosteroids. In the COSMIC trial, all patients were prescribed a long acting bronchodilator as part of the trial. In the COPE trial, data on the use of rescue β2 agonists during the study were collected but not presented in the paper. In WISP it is reported that those in the placebo group used a reliever inhaler more frequently during the trial than those in the treatment group. No information is given in the O'Brien trial regarding other medications. Another factor that needs to be clearly reported is the length of time patients have been on inhaled corticosteroids prior to the start of the trial as this may influence the effect of withdrawal; two trials did not give any details of this.

The differences between the trials in the treatment of those experiencing severe exacerbations highlight the need for clear reporting in this area. Currently there is no agreed classification of exacerbations [12]. Providing a clear classification is useful to compare the severity of exacerbations between the groups in a trial, but there needs to be some flexibility in defining events likely to warrant a withdrawal from randomised treatment in these trials. Nevertheless, it is important for authors to report the method of managing exacerbations so that those reading reports are able to interpret results.

A further issue that we became aware of in reviewing these trials is the description of those who discontinue randomised treatment. In the COSMIC trial these individuals were described as drop-outs, although they appeared to provide outcome measures. We suggest that these individuals would be better described as 'discontinuers' and those who do not provide outcome data described as drop outs from the trial. Using this terminology, there were no drop outs in the COSMIC or WISP studies, and only two (both deaths) in the COPE study. There were 134 discontinuers (51%) in the WISP study (56 ICS, 78 placebo), 80 (21%) in COSMIC (34 ICS, 46 placebo) and 32 (13%) in COPE (6 ICS, 26 placebo). Furthermore, taking a pragmatic approach, which is necessary in these trials on ethical grounds, differences in discontinuation between groups are not of paramount importance in assessing the effect of withdrawal.

Analysis and interpretation of individual trials

How should the results of these trials be analysed and how should these analyses be interpreted? Intention-to-treat (ITT) is the recommended analysis for pragmatic superiority randomised controlled trials [21]. Per protocol analyses were originally suggested as the preferred way to analyse explanatory trials [22], but are now generally regarded as at best an adjunct to ITT analyses because of the potential for selection bias [23]. Here we exemplify this argument for trials where medication is withdrawn.

In such trials, the ethical imperative to allow patients to discontinue randomised treatment and the fact that more in the placebo group usually do so, means that in a per protocol analysis there are likely to be more severely ill individuals in the group receiving ICSs. Thus a per protocol analysis will often underestimate the effect of the withdrawal of treatment on the target population as a whole, and its usefulness in trials at the explanatory end of the spectrum is diminished because an underestimate of effect might lead to treatment being withdrawn when in fact this withdrawal may be harmful. Intention-to-treat analyses make more sense in these trials, but should be interpreted in the context in which the trial is being undertaken. Given the potential differences between the management of patients in different trials, as illustrated in this paper, this interpretation may be unique to the context, and comparisons between different trial results should be made with caution.

Whether there are more suitable analyses for trials in which treatment is withdrawn is an important question. For trials with binary outcomes, Graham Dunn developed a method of allowing for non-compliance while effectively keeping all individuals in an analysis thus overcoming one of the criticisms of per protocol analyses that the balance achieved by randomisation is broken [24]. To our knowledge, there are no publications extending this method to survival analysis, particularly when outcomes are repeated, or to trials in which medication is withdrawn. However, a more important question than the overall effect of withdrawal of treatment on the population may be the identification of patients for whom withdrawal likely to be non-detrimental either in the short or long term.

A further issue in relation to interpretation is the risk of bias introduced by unblinding of patients who have had an exacerbation severe enough to warrant a discontinuation of randomised treatment. It seems likely that there will be more of these patients in the placebo group and this may introduce performance bias.

Clinical implications

At the moment there are no national or international guidelines advocating the withdrawal of inhaled corticosteroids in those with COPD. Our review does not indicate that patients withdrawn from inhaled corticosteroids will, in general, have substantially worse outcomes than those not withdrawn. Nevertheless, we cannot advocate the withdrawal of inhaled corticosteroids in patients with COPD until further studies confirm which subsets of COPD patients benefit from this treatment and which subsets are at most risk of developing side effects.

Conclusion

The withdrawal of ICS in patients with COPD may cause exacerbations sooner with a smaller effect on the number of exacerbations. Effects differ according to the definition of outcome and setting and management of patients within the trial. Clarity is needed in the reporting of these trials, particularly in defining exacerbations, reporting the management of exacerbations severe enough to warrant discontinuation of randomised treatment, the description of individuals who discontinue, and the collection and analysis of their outcomes. ITT analyses are appropriate for these trials, but other methods of analysis could be explored including analyses which attempt to identify those for whom discontinuation results in little in the way of detrimental effects.