Background

An Interrupted Time Series (ITS) study is a powerful quasi-experimental design for evaluating effects of interventions when random assignment is not feasible [1]. In an ITS study, a series of observations on the same outcome before and after the introduction of an intervention are used to test immediate and gradual effects of the intervention. A major strength of this design is its ability to distinguish the effect of the intervention from secular change, that is, change that would have happened even in the absence of the intervention. Estimating the intervention effect is done by comparing the trend in the outcome after the intervention to the existing trend in the pre-intervention period, and is achieved through modifications to the standard regression analysis. In a basic segmented regression analysis [24], the time period is divided into pre- and post-intervention segments, and separate intercepts and slopes are estimated in each segment. Statistical tests of changes in intercepts and slopes pre- to post-intervention are carried out. By making a few simple changes to the data set-up and model specification, segmented regression analysis can easily be implemented in standard statistical software packages. An additional adjustment is usually required to account for serial autocorrelation, which arises because observations taken over time are usually correlated. Technical details for data preparation, model specification, and adjustment for autocorrelation are presented elsewhere [2, 4]. Several examples of the use of segmented regression analyses in studies of quality improvement interventions have been published [57].

Illustration

To illustrate the segmented regression analysis approach, we analysed data from a previously published study [8] that used an ITS design to evaluate the effectiveness of a collaborative intervention to improve quality in pre-hospital ambulance care for acute myocardial infarction (AMI) and stroke at 11 publicly funded ambulance organizations in England. A series of weekly measurements (the percentage of patients with a pre-hospital diagnosis of AMI and stroke who received a defined care bundle) between January 2010 and February 2012 was used to measure the impact of the collaborative intervention. The six-month pre-intervention period was defined as January to June 2010. The authors used logistic regression analysis of the data at each site, with the outcome being delivery of the care bundle and the predictor being time, modeled in two ways: first as a continuous variable across the entire study period, and then as a dichotomous indicator representing pre- and post-intervention periods. The estimated odds ratios (ORs) from the models that specified time as a continuous variable were combined across sites using fixed effects meta-analysis. The authors concluded that, over all sites, the collaborative intervention led to statistically significant improvements in ambulance care for AMI (OR 1.04 per month, 95% Confidence Interval [CI] 1.04, 1.04) and stroke (OR 1.06, 95% CI 1.05, 1.07).

We used the plots of weekly data aggregated across sites provided in the additional file [9] to re-analyse the data using segmented regression analysis. The details of our results are presented in Table 1 and displayed graphically in Figures 1 and 2. Before the intervention, the increase in AMI performance was OR = 1.02 per month; after the intervention, there was an additional increase of OR = 1.04 (95% CI 0.98 to 1.10) per month, which was not statistically significantly different from the pre-intervention trend (p = 0.20). Over the entire 86-week intervention period, the estimated increase in AMI performance is given by OR = 3.2 (a 220% relative increase); in the absence of the intervention, we would have expected a 45% increase (OR = 1.45). After accounting for the secular trend, the additional improvement associated with the intervention would, if it were real, likely be clinically important. For stroke, the pre-intervention increase in performance was OR = 1.05 per month; after the intervention, there was an additional effect of OR = 1.02 (95% CI 0.97 to 1.07) per month, which was not statistically significantly different from the pre-intervention trend (p = 0.52). Over the entire 86-week intervention period, the estimated increase in stroke performance is given by OR = 4 (that is, a 300% increase in odds). But even if the intervention had not been introduced, we would have expected a relative increase of 190% (OR = 2.9).

Table 1 Segmented logistic regression analysis of care bundles for AMI and stroke: all sites combined
Figure 1
figure 1

Segmented logistic regression analysis of care bundle for AMI: all sites combined.

Figure 2
figure 2

Segmented logistic regression analysis of care bundle for stroke: all sites combined.

Discussion

A challenge in the use of the ITS design to evaluate complex quality improvement interventions is that the intervention may consist of several different components introduced at different times. For example, in this study, the authors’ preliminary investigations suggested that the two most effective interventions may be provider prompts and individualized feedback, while education and passive information dissemination did not appear to bring about change. One way to estimate the effects of different intervention components is to add multiple 'interruptions’ to the time series, but this requires a sufficient number of time points between interventions to allow their independent effects to be estimated [4]. In some studies, the intervention may need to be phased in or introduced gradually over a period of time, before being fully implemented. Thus, there may be a time lag from the initial introduction of an intervention to when its full effects can be observed. One way to allow for this in the segmented regression analysis is to fit a model with three segments, corresponding to the pre-implementation, implementation, and post-implementation periods. An alternative is to fit the model excluding the time points corresponding to the phase-in period [4, 10]. Although the authors did not specify a phase-in period in their original analyses, we explored the effect of allowing for a phased introduction by censoring the first three months of observation after the start of the intervention. The results from this analysis are presented in Table 2, and our conclusions about the effect of the collaborative intervention remain unchanged.

Table 2 Segmented logistic regression analysis of care bundles for AMI and stroke: all sites combined, allowing for a ramp-up period of 12 weeks after introduction of the intervention

In ITS studies, it is not uncommon to have different participating sites contributing data. In our re-analyses of these data, we used a single time series of data aggregated across all sites. An analysis of aggregated data is likely to have less power than a multilevel logistic regression analysis of the time series from the individual sites. Gebski et al. [3] describe how to conduct segmented regression analysis when there are multiple sites and different intervention start times. One approach is to conduct separate segmented regression analyses at each site, and then estimate the overall effect by pooling the estimates of intervention effect across sites using inverse variance weights in a meta-analytical model [11]. Another approach is to fit a single model to the data from all sites and account for heterogeneity across sites by incorporating random effects for the sites.

Sensitivity analyses may be conducted to consider the effect of outliers on the results. Outliers may be censored from the analyses or modeled explicitly via dummy variables [2]. Alternatively, the effect of outliers may be reduced by using moving averages. Yet another approach is to combine data points prior to analysis, by using, for example, bi-weekly instead of weekly measurements. Although we did not explore these issues in our re-analyses, variability in outcomes over time can substantially affect power in an ITS study.

Conclusion

In conclusion, our results demonstrate the importance of using segmented regression analysis in an ITS study. When a standard regression analysis is used with time modeled as a single continuous variable, an estimate is obtained for the slope over time, but it is impossible to distinguish the effect of the intervention from the underlying secular trend and to make causal claims about the effects of the intervention. Based on our re-analyses of these data, we conclude that the quality improvement collaborative resulted in no statistically significant improvements in the quality of AMI and stroke care, but that potential clinically important effects for AMI cannot be ruled out.