Four hundred and thirty-three publications were identified electronically, of which 10 met our inclusion criteria. Requests to authors for unpublished data provided one further manuscript [24]. In total, 11 studies were added to our original dataset. We identified one publication [25] describing studies included in our original review as a conference abstract and unpublished data (Clark 2005 and Clark 2006 in Banwell et al. [11], confirmed through personal communication); these original data are therefore excluded from the current analysis. We identified one publication from the original review where median data were reported (an exclusion criterion in the current review), and this study is also excluded from the current analysis [26]. Our updated dataset includes 25 studies in total (Supp Fig. 1); study characteristics are shown in Table 1.
Table 1 Study characteristics
The range of evidence met 11 of a possible 13 STAIR criteria assessed. Criteria newly met in the current study are evidence from animals with co-morbidities which include aged, aged corpulent (a model of metabolic syndrome) and acute infection with pneumonia or LPS, and outcome assessed at 4 weeks post-ischaemia. The dataset now includes experiments where IL-1 RA is administered up to 6 days after induction of ischaemia, with 12 comparisons where administration is more than 3 h post-stroke. The number of neurobehavioural outcomes reported increased from 1 to 33 comparisons. All studies published post-2009 administered IL-1 RA peripherally including intravenous, intraperitoneal and subcutaneous routes. In our original review, over half of studies used central administration via intracerebroventricular or intracerebral stereotactic routes. Relevant biomarker endpoints including MRI assessment of injury have been reported. Although IL-1 RA has been studied in animals also treated with tissue plasminogen activator (tPA), no in vivo interaction studies with medications commonly used by stroke patients such as statins, blood pressure-lowering medication and aspirin were identified. Evidence is still lacking in female animals and in species other than rodents.
Overall, the number of study quality items met is greater in studies published post-2009 (median 11.5/15 interquartile range [IQR] 9.75–12) than pre-2009 (median 6/15 IQR 5–7; Table 2).
Table 2 Change over time: A comparison of the data prior to the publication of our 2009 review, afterwards and with all data pooled
In particular, the proportion of studies reporting randomisation, blinded induction of ischaemia, blinded assessment of outcome, prespecified exclusion criteria and animal exclusions increased substantially. Clear differences are also evident in the proportion of studies using co-morbid animal models and those reporting a sample size calculation (Supp Table 1). Pre-2009, no studies reported a statement regarding possible conflicts of interest. Post-2009, seven out of eight studies included a statement; of these seven, one reported no conflict while six made disclosures. Use of laser Doppler or perfusion imaging to confirm ischaemic injury was assessed as a quality item; however, alternative methods of confirmation were reported in some studies: through behavioural observation in one study and visually (microscopically) in two studies.
Infarct volume was reported in 76 comparisons from 1283 animals, neurobehavioural score in 98 (33 nested) comparisons from 473 animals and mortality in 10 comparisons from 227 animals. These data met our prespecified criterion for a minimum 30 % increase in the number of independent comparisons required to justify an updated meta-analysis (original dataset 44 infarct volume, 1 neurobehavioral and 2 mortality comparisons).
Overall, IL-1 RA reduced our primary outcome, infarct volume, by 36.2 % (95 % confidence interval [CI] 31.6–40.7). Administration of IL-1 RA in protein form resulted in a 35.5 % (30.3–40.7) reduction, administration of IL-1 RA transgenic bone marrow (BM) cells a 34.7 % (15.9–53.6) reduction and vector transfection a 44.7 % (29.9–59.6) reduction. One comparison involved transgenic mice overexpressing IL-1 RA resulting in a 43.2 % (19.1–67.3) reduction in infarct size (Fig. 1a).
For study quality, we observed greater reduction in infarct volume in studies that did not report that investigators were blinded to treatment allocation during the induction of ischaemia (p = 0.045, tau2 = 181.6, I
2 = 82.4 %, adj R
2 = 3.2 %, Fig. 2). Other potential sources of bias that did not account for a significant proportion of heterogeneity were reporting of randomisation to group, blinded assessment of outcome, prespecified exclusion criteria, reasons for excluding animals, sample size calculation and statement of potential conflict of interest.
Due to uncertainty around the timing and effective dose achieved in transgenic and transfection studies, analyses of study design characteristics are restricted to 19 sources describing 65 experiments where IL-1 RA was administered in protein form. Mode of IL-1 RA delivery (peripheral or central delivery) is not a significant source of heterogeneity (p = 0.412), and therefore, data were analysed together.
We observed substantial heterogeneity in this dataset (tau2 = 231.9, I
2 = 83.7 %) that was explained, in part, by two of the variables investigated with univariate meta-regression. Firstly, for route of delivery, studies using intracerebroventricular administration reported the greatest magnitude of effect (p = 0.0003, tau2 = 121.3, I
2 = 77.54 %, adj R
2 = 43.3 %). Large effects were also observed with the more clinically relevant peripheral routes of delivery, intravenous and subcutaneous (Fig. 3a). Secondly, dose-response relationships for central (intracerebroventricular, stereotactic) and peripheral (intravenous, intraperitoneal, subcutaneous) administration were analysed separately. Dose is a significant source of heterogeneity in experiments where IL-1 RA was administered centrally (p = 0.005, tau2 = 194.8, I
2 = 89.6 %, adj R
2 = 45.2 %) with larger reductions in infarct volume evident at higher doses (Fig. 3b).
Effect sizes for data stratified by publication date (pre- or post-2009) were similar with less statistical heterogeneity observed in more recent studies: we observed a reduction in infarct volume pre-2009 of 36.1 % (27.9–44.2), tau2 = 279.4 and I
2 = 88.1 % versus 35.0 % (28.2–41.7), tau2 = 159.4 and I
2 = 74.2 % post-2009, p = 0.97.
Variables that do not contribute significantly to heterogeneity include the following: species and sex of animals, time of IL-1 RA administration, whether single, multiple or continuous administration was used, whether infarct volume calculation involved a correction for oedema, method of infarct quantification, presence of co-treatments, co-morbidity studied, method of induction of ischaemia, type of ischaemia, anaesthetic used during model induction and whether mechanical ventilation was used, and time of outcome assessment relative to model induction.
Funnel plot asymmetry is detected with Egger’s test (p < 0.001), suggesting the presence of publication bias (Fig. 4a). Trim and fill analysis imputed the presence of 30 “missing” experiments, with an adjusted reduction in infarct volume of 21.9 % (17.3–26.4, Fig. 4b), 14.3 % lower than before adjustment.
Overall, IL-1 RA improves neurobehavioural measures by 35.9 % (28.2–43.5; n = 33). No improvement was observed in experiments where IL-1 RA transgenic BM cells were administered (n = 3, p = 0.200) (Fig. 1b). Neurobehavioural measures were categorised as tests of motor/sensory behaviours, social interaction/anxiety/depressive behaviours or thermal nociception. Using this categorisation, the type of neurobehavioural test is not a significant source of heterogeneity (post hoc analysis; p = 0.4480). Most experiments (28/33) tested motor/sensory outcomes, and further analyses are restricted to these data due to the divergent biology underlying the behaviours tested in the remaining outcomes. Only experiments where IL-1 RA was administered in protein form were investigated for sources of heterogeneity (27 comparisons); in all of these experiments, IL-1 RA was administered peripherally.
For motor/sensory behaviours, there is an improvement in outcome of 35.7 % (27.5–43.9; tau2 = 219.1, I
2 = 63.8 %, n = 27). Route of delivery accounted significantly for this heterogeneity (p = 0.0008, tau2 = 71.8, I
2 = 32.6 %, adj R
2 = 67.2 %) with the greatest improvement seen with subcutaneous administration (Fig. 5a). Greater effects are also observed in experiments that administered multiple rather than single doses of IL-1 RA (p = 0.018, tau2 = 133.1, I
2 = 49.9 %, adj R
2 =39.2 %; Fig. 5b). Sex of the animals is a significant source of heterogeneity (p = 0.040, tau2 = 186.6, I
2 = 61.3 %, adj R
2 = 14.8 %), with no effect seen in experiments where the sex of the animal was not reported (Fig. 5c). The anaesthetic used during induction of ischaemia also contributes to heterogeneity (p = 0.0023, tau2 = 62.6, I
2 = 31.7 %, adj R
2 = 71.4 %). The greatest effect is seen in studies using isoflurane while there is no effect in those using ketamine, tribromoethanol or halothane (Fig. 5d). In addition to the effects of anaesthesia, post-operative analgesia can affect stroke outcome in rodents. Only two of the included studies reported using an analgesic (buprenorphine); therefore, we were unable to assess the impact of this variable on the recorded outcomes.
We further subdivided motor/sensory behavioural measures into the more specific categories: gross neurological score (n = 26), skilled movement task (n = 4) or sensorimotor asymmetry test (n = 7). Post hoc analysis revealed that type of motor/sensory measure was not a significant source of heterogeneity (p = 0.8182). Other variables not contributing significantly to heterogeneity are species of animals, dose and time of IL-1 RA administration, co-morbidity studied, method of induction of ischaemia, type of ischaemia and time of outcome assessment relative to model induction. Only one comparison involved a co-treatment (tPA), and for all comparisons, it was unknown whether mechanical ventilation was used; therefore, these variables were not analysed.
Egger’s test suggests significant funnel plot asymmetry (p = 0.028) (Fig. 6a). Trim and fill analysis imputed the presence of 18 “missing” experiments, with improvement in neurobehavioural outcome adjusted from 41.4 % (34.9–47.9) down to 38.6 % (31.9–45.3) (Fig. 6b). These values differ from the estimate of efficacy calculated using meta-regression due to use of a moment-based rather than REML estimate of between-study variance.
Mortality was unaffected by administration of IL-1 RA with an odds ratio of 1.03 (0.45–2.38), n = 10, 227 animals, Q = 2.87 and p = 0.97. Mortality was not analysed further due to the limited data.
Sensitivity Analyses
We performed stratified meta-analysis, rather than meta-regression, as a sensitivity analysis to assess the impact of study quality and design on infarct volume using an alternative statistical method. For study quality, studies reporting a formal calculation of study size reported larger treatment effects than those that did not (Q = 13.1, df = 1, p = 0.0003) (Supp Fig. 2A). For study design, we similarly observed larger effects with intracerebroventricular delivery of IL-1 RA (Q = 125.6, df = 4, p < 10−25) (Supp Fig. 2B) and a dose response for central drug administration (Q = 68.8, df = 2, p < 10−14) (Supp Fig. 2C). In addition, we observed other variables to account for significant sources of heterogeneity. We observed the largest effects in studies using ketamine anaesthesia (Q = 53.2, df = 4, p < 10−10) (Supp Fig. 2D), in thrombotic models of ischaemia (Q = 13.0, df = 2, p = 0.002) (Supp Fig. 2E) and where ischaemia was induced via thrombin injection (Q = 24.5, df = 3, p < 10−4) (Supp Fig. 2F) and in studies which reported correcting for oedema in infarct quantification (Q = 53.9, df = 2, p < 10−11) (Supp Fig. 2G). We observed an inverse dose response for peripheral drug delivery (Q = 60.8, df = 3, p < 10−12) (Supp Fig. 2H) and a significant but unclear relation between the time of drug administration and effect (Q = 72.3, df = 4, p < 10−14) (Supp Fig. 2I).