Background

Glyceryl trinitrate (GTN) has several effects that may be beneficial in acute stroke. High blood pressure (BP) is common in the acute phase of stroke and associated with poor outcome [1]. In-hospital BP lowering is recommended for patients with intracerebral haemorrhage [2], and the application of a glyceryl trinitrate (GTN) skin patch is a simple and efficient approach. GTN has other effects which may be beneficial in stroke such as topping up nitrate-depleted endothelium.

Stroke can be difficult to diagnose in the prehospital setting as there is no perfect or readily available diagnostic test. Conditions such as seizures, migraine and functional disorders can present with symptoms suggestive of a stroke, hence use of the term ‘stroke mimics’ [3]. Mimics are estimated to account for 31% of presentations at hospital with suspected stroke [4]. An incorrect working diagnosis of stroke may delay appropriate treatment for patients and expose them to unnecessary risk as some may receive stroke treatments such as thrombolysis before the correct diagnosis is confirmed. Equally, patients with a stroke may be deprived of life-changing treatment if their initial diagnosis is thought to be a mimic. Numerous factors have been reported to be associated with a greater probability of an event being a mimic rather than stroke, including younger age, female sex, fewer vascular risk factors, history of seizures and less severe presenting symptoms including a lower likelihood of facial or limb weakness, speech difficulty or acute hypertension [5, 6]. The Face-Arms-Speech-Time (FAST) tool is widely used by ambulance paramedics to diagnose suspected stroke, and has a sensitivity of 79% [7]. However, as it is limited to examining the patient for facial palsy, altered motor functioning of the arm and abnormal speech, FAST is less likely to identify mild or severe strokes and those affecting only the posterior circulation [8, 9].

The second Rapid Intervention with Glyceryl trinitrate in Hypertensive stroke Trial-2 (RIGHT-2) investigated the effects of ultra-acute administration of transdermal GTN versus sham patch by paramedics in 1149 patients with suspected stroke in the UK [10]. The primary outcome was the 7-level modified Rankin Scale (mRS) score at 90 days. The RIGHT-2 trial tested the null hypothesis that GTN will not shift the mRS in participants with ultra-acute stroke. The alternative hypothesis, that GTN will shift the mRS between those stroke participants randomised to GTN versus sham, was 2-sided. Overall, no difference in the mRS was observed between the groups in participants with a final diagnosis of stroke or transient ischaemic attack. However, among the 297 (26%) participants with a final hospital diagnosis of a non-stroke condition, mRS scores were better in those randomised to GTN compared to sham [10]. The aim of this pre-specified subgroup analysis was to characterise stroke mimic cases in a FAST positive population and to examine in detail the RIGHT-2 primary outcome findings among participants with a stroke mimic as their final diagnosis.

Methods

Study design and population

RIGHT-2 was a multicentre, prospective, randomised, sham-controlled, participant-and-outcome-blinded, randomised-controlled trial in adults with ultra-acute presumed stroke in the UK. Paramedics from eight UK ambulance services (East Midlands, East of England, London, South Central, South West, Wales, West Midlands and Yorkshire) delivered the trial within the pre-hospital ambulance environment [10, 11]. Briefly, adult patients were eligible for inclusion if they accessed care through an emergency ambulance telephone call for presumed stroke and were assessed within 4 h of onset of their symptoms by a trial-trained paramedic from a participating ambulance service and could be transported to a participating hospital. Patients had to have at least two positive signs in the FAST test assessment (the number of positive signs was scored as a value out of a maximum of 3) and a systolic blood pressure (SBP) ≥ 120 mmHg. Patients from a nursing home, or with reduced consciousness (Glasgow coma scale [GCS] < 8/15), hypoglycaemia (capillary glucose < 2.5 mmol/l) or a witnessed seizure, were excluded. A sample size calculation determined that a total sample size of 850 participants (425 in each group) was required to detect a shift in mRS with a common odds ratio [OR] of 0·70 assuming an overall significance level of 5%, 90% power, 3% loss to follow-up, mimic and transient ischaemic attack rate of 20%, and reduction for baseline covariate adjustment of 20%. However, during the trial, the non-stroke diagnosis rate exceeded 30% and so the overall sample size was increased to 1050 to maintain the overall effect size and statistical power [10]. Detailed inclusion and exclusion criteria and additional information on the methods are given in the published protocol paper and in the Supplement to the main trial publication [11, 12].

The final diagnosis was made by the principal investigator based on clinical and neuroimaging findings, and categorised as intracerebral haemorrhage, ischaemic stroke, transient ischaemic attack (TIA) or stroke/TIA mimic. Brain scans were reviewed centrally by an expert panel of neuroradiologists who were aware of the time to scan and the side of symptoms but blinded to all other information to confirm diagnosis of ischaemic, haemorrhagic stroke or mimic with structural lesion. Diagnostic adjudication was completed without knowledge of the primary outcome. Diagnosis of stroke mimic was made from the final diagnosis recorded in the patient notes. Patients who had a stroke mimic received the same follow-up at Day 365 as confirmed stroke and TIA cases.

Treatment

Participants were randomly assigned to receive transdermal GTN (nitroglycerin; 5 mg as Transiderm-Nitro® 5, Novartis, Frimley UK) or a similarly-appearing sham treatment not known to exert any pharmacological effect (DuoDERM® hydrocolloid dressing, Convatec, Flintshire UK) in a 1:1 ratio. Randomisation was stratified by ambulance station with blocks of four packs (two active, two control) in a random permuted order that was generated by the trial programmer at the Nottingham Stroke Trials Unit. Ambulances carried only one pack at a time and paramedics signed-out the treatment pack with the lowest randomisation number from their ambulance station at the start of their shift and returned it if unused at the end of their shift [10]. The first treatment (GTN or sham) was administered by the paramedic immediately after randomisation in the ambulance; further treatments were given daily for up to three days in hospital but were stopped earlier if a non-stroke diagnosis was made. Each treatment pack was sealed to maintain blinding of paramedics. Participants were effectively masked since the patches and dressings themselves were unlabelled, and a gauze dressing was taped over the top of the patch or dressing to provide additional masking.

Outcomes

The primary outcome was functional status assessed across the 7-levels of the mRS (0 = no dependency to 6 = death) [13], measured at 90 days after randomisation. A trained assessor, masked to treatment allocation and using a structured questionnaire, obtained outcomes during a telephone interview with the participant. In cases where the participant was incapable of providing this information, the relative or carer was interviewed. If telephone contact could not be made after multiple attempts, a questionnaire was sent by post.

Participants were seen at Day 4 (or at hospital discharge, if earlier) to determine adherence to treatment and assess neurological deterioration. Duration of stay and discharge destination (to another hospital, institution or home) were also recorded. Pre-specified secondary clinical outcomes at Day 90 included activities of daily living (Barthel Index [BI]); cognition (modified telephone mini-mental state examination [MMSE]; telephone interview for cognition scale-modified [TICS-M]; and categorical verbal fluency using animal naming); health-related quality of life on the European quality of life-5 dimensions-3 level [EQ-5D-3L], from which a health status utility value [HSUV] was calculated with an EQ-visual analogue scale; and mood (abbreviated Zung depression score [ZDS]), all as used in the preceding Efficacy of Nitric Oxide in Stroke (ENOS) trial of GTN in hospital [10, 14]. Home-time was calculated as the number of days between discharge and Day 90. As a secondary assessment time, clinical outcomes were re-obtained by telephone at one year.

Statistical analysis

Analyses followed the RIGHT-2 statistical analysis plan [15]. The primary outcome (shift on 7-level mRS) was analysed using ordinal logistic regression with adjustment for age, sex, pre-morbid mRS, baseline FAST score and SBP, and time from the onset of symptoms to randomisation. The assumption of proportional odds was tested using the likelihood ratio test. We also performed unadjusted, per-protocol and imputed (missing mRS data estimated using multiple regression-based imputation) sensitivity analyses for completeness. Heterogeneity of the treatment effect on the primary outcome was assessed for the purpose of hypothesis-generation in pre-specified subgroups by adding an interaction term to an adjusted ordinal logistic regression model. Death was analysed using adjusted Cox proportional hazards regression models. Other outcomes were assessed using adjusted binary logistic regression, Cox regression, ordinal logistic regression, multiple linear regression and analysis of covariance (BP). A pre-specified global outcome (comprising ordered categorical or continuous data for mRS, BI, ZDS, TICS-M and EQ-5D-HSUV) was analysed using the Wei-Lachin test [16]. Data are shown as number (%), median [interquartile range, IQR], mean (standard deviation, SD), difference in mean and odds ratio, with 95% confidence intervals (CI).

Results

Demographics

From October 2015 to May 2018, 516 trial-trained paramedics enrolled 1149 participants into RIGHT-2 with follow-up continuing to 365 days. Among these 1149 patients, 297 (26%) were subsequently diagnosed with a stroke mimic (Fig. 1). Compared to stroke cases and prior to randomisation, the mimic group were on average significantly younger, had a longer interval from symptom onset to randomisation, a lower proportion of atrial fibrillation/flutter, lower SBP, and fewer positive signs in their FAST assessment (Additional Table A).

Fig. 1
figure 1

Trial profile for mimic group

Among the 297 patients with a stroke mimic, the mean age was 67 years (SD 18), 53% of participants were female, and 13% were non-white (Table 1). The GCS score was less than 14 in one quarter of mimic cases, and less than half were positive on all three FAST indicators (face/arm/speech). In the mimic group, the most common pre-existing medical conditions were hypertension (51%), previous stroke (31%), diabetes mellitus (21%) and heart disease (21%). Twenty-eight percent of participants in the study had a pre-existing dependence of a moderate or greater severity (mRS > 2). The median time from symptom onset to randomisation was 75 min [IQR 47, 126].

Table 1 Baseline ambulance and hospital admission characteristics of Mimic patients enrolled in the RIGHT-2 trial. Data are number (%), median [IQR], or mean (standard deviation)

Within the mimic group, 134 (45%) participants had been randomised to GTN and 163 (55%) to sham. Demographic and baseline clinical characteristics were similar between the GTN and sham groups except that the mean SBP was lower by 7.0 mmHg [95% CI  12.9– 1.1; p = 0.021] in the group randomised to GTN.

Mimic diagnoses

The three most common mimic diagnoses of epileptic seizure (17%), migraine or primary headache disorder (17%) and functional neurological disorder (14%), together accounted for 47% of the mimic group (Additional Table B). Other neurological (16%) and cardiovascular (7%) events represented a further 25% of presentations. A final diagnosis was unavailable in 9% of mimic cases with discharge records reporting exclusion of a stroke or TIA event but no clear diagnosis. The remaining 29% of mimic events represented a wide range of diagnoses. There was no significant difference in the proportions of final diagnoses between the treatment and sham group. In addition, for 36 participants, their qualifying event was diagnosed as an infection during at least one follow-up (Additional Table C).

Randomised treatment

Data on adherence to the trial protocol are available for 281 (95%) cases (Additional Table D). Adherence to the first randomised treatment was near complete in both GTN and sham groups (99.3% vs 100%) but overall, only 20% of participants with a stroke mimic received the first two patches. This decreased to 9% for application of all four patches. Adherence for treatment over 2 and 4 days were much lower than for stroke/TIA participants [12]. The most common reason for non-adherence in the stroke mimic group was discontinuation following an early diagnosis of non-stroke (66%). There was no difference in adherence to protocol between the GTN and sham groups. However, patients with a final diagnosis of mimic received less treatment than those with a stroke diagnosis (Additional Table E).

There were 25 protocol violations in the ambulance; these were related to the inclusion of patients beyond 4 h, low FAST score (< 2), low SBP (< 120 mmHg), resident in a nursing home, and failure to notify the hospital (Additional Table F). There were three protocol violations in hospital; two involved not administering the treatment on Day 2 and one was failure to obtain secondary consent.

Primary clinical outcome

The primary outcome (mRS score) was measured at 90 days in 274 (92%) participants in the mimic group. A minority of participants refused or were lost to follow-up. Participants randomised to sham treatment had a median mRS of 3 [1, 4] at 90 days (Table 2). Among participants with a mimic, GTN was associated with reduced likelihood of poor 90-day mRS score, compared to sham treatment: odds ratio 0.54 (95% CI 0.34–0.85; p = 0.008) (Fig. 2). In a post hoc analysis, this finding was also observed when death was excluded, i.e. mRS 0–5 (OR: 0.55 (0.34, 0.91), p = 0.019, N = 247). When considering the primary outcome, no differences were found between GTN versus sham in any subgroup of participants with a stroke mimic (Fig. 3). In a further post hoc analysis, the positive effect of GTN was not localised to any particular type of mimic (Fig. 3) or other post hoc subgroups (Additional Table B).

Table 2 Primary and main secondary outcomes at days 4 and 90 in participants diagnosed with a stroke mimic. Data are number (%), median [IQR], or mean (standard deviation)
Fig. 2
figure 2

Distribution of mRS scores at day 90 for GTN versus sham among 274 stroke mimic participants. Comparison of GTN versus sham, adjusted common odds ratio 0.54 (0.34, 0.85), p = 0.008, by ordinal logistic regression, with adjustment for age, sex, pre-morbid modified Rankin Scale, face-arms-speech-time test, pre-treatment SBP, final diagnosis (stroke mimic) and time to randomisation. The effect of treatment for GTN versus sham is shown as adjusted common odds ratio (acOR)

Fig. 3
figure 3

Forest plot showing modified Rankin Scale, analysed as adjusted ordinal outcome, in subgroup of participants with stroke mimics, with p-value for interaction. Heterogeneity of the treatment effect on the primary outcome was assessed in by adding an interaction term to an ordinal logistic regression model with adjustment for age, sex, pre-morbid modified Rankin Scale (mRS), face-arm-speech time test, pre-treatment systolic blood pressure (SBP), final diagnosis (stroke mimic) and time to randomisation

At the Day 365 follow-up, mRS scores were measured for 279 (94%) stroke mimic patients. Those randomised to the GTN group continued to have a significantly better functional outcome than those in the sham group: OR 0.53 (95% CI 0.33–0.84; p = 0.007) (Additional Table G).

Secondary outcomes

Overall, the median length of stay was 4 days [IQR 2, 8] with no significant difference between the GTN and sham groups (Table 2). The course of BP over 4 days of treatment did not reveal any sustained difference between the treatment groups (Additional Table A). There was no difference for in-hospital interventions (Additional Table H) or neuroimaging results (Additional Table I). The only significant difference between the two groups at 90 days was in the EQ-5D health utility scores (Table 2) with the group randomised to GTN scoring higher than those who received the sham treatment [aMD 0.1; 95% CI 0.0–0.2; p = 0.031]. However, this difference was not sustained at Day 365 (Additional Table G).

Safety

There was no difference in the proportion of patients who died in hospital or were discharged to a residential care (Table 2, Fig. 4). The causes of death did not differ between GTN and sham (Additional Table J). Similarly, there was no difference in serious adverse events (Additional Table K).

Fig. 4
figure 4

Kaplan-Meier curve for time to death in participants with a stroke mimic, by assigned treatment group. Comparison of GTN versus sham, adjusted hazard ratio 0.49 (95% confidence intervals 0.20, 1.19), p = 0.11, by Cox proportional hazards regression with adjustment for age, sex, pre-morbid modified Rankin Scale, face-arms-speech time test, pre-treatment SBP, final diagnosis (stroke mimic) and time to randomisation

Discussion

Summary of results and comparison with other studies

In our further analysis of the RIGHT-2 study, we found that 26% of the 1149 cases suspected by paramedics to be a stroke had a non-stroke final diagnosis. Patients with a stroke mimic were younger, had less atrial fibrillation, lower BP, fewer FAST positive signs, and a longer onset-to-randomisation compared to those with a confirmed stroke. The most common stroke mimics were neurological conditions; epileptic seizures, migraines and primary headache disorder, and functional neurological illness accounted for almost half of all the cases. The only significant difference between groups at baseline was a lower SBP in the GTN group. At 90 days, patients with stroke mimics had better mRS scores than those with stroke and this finding was maintained in sensitivity analyses and at 365 days. The lack of difference in the rate of serious adverse events between the two groups supports the safety of the GTN intervention among patients with stroke mimic conditions.

These findings add to prior work on prehospital stroke recognition. The rate of 26% stroke mimics is consistent with pooled results for 6870 patients in physician and paramedic-led EMS systems and larger reviews that included pre and in-hospital settings [6, 17]. Our results corroborated previous reports that stroke mimic patients are younger, less likely to display atrial arrhythmias, have a lower BP, and milder stroke signs at presentation compared to stroke patients [5, 18,19,20,21]. In contrast to earlier findings, we did not observe that mimic patients are more often female, have more vascular risk factors or a history of previous stroke [5, 18,19,20, 22]. The common stroke mimic conditions were similar to those seen in other studies [5, 18,19,20, 22].

The key but unexpected finding was that 90-day and one-year functional outcomes were better with GTN than the sham. This was despite the absence of any significant demographic or clinical differences between the two treatment groups at baseline (other than SBP), or during their in-hospital care. In addition, the 90-day quality of life score was higher in the GTN group. We suggest several possible explanations.

First, although there were no imbalances in measured prognostic factors between the groups at baseline, there may have been imbalances in unmeasured factors. Second, it is possible that cardiovascular and cerebrovascular events were missed, possibly due to atypical presentation, and included among the mimics. The subgroups with the highest odds of a better outcome with GTN were aged over 80, female, AF, hypertension, previous stroke, normal GCS, and high score on FAST. Third, it is possible that bias in the assessment of outcomes favoured GTN. However, this is unlikely due to the trial design which utilised remote assessment of outcomes at follow-up by a blinded assessor. Fourth, the greater number of deaths among mimic cases who received the sham intervention could have influenced the results. However, a comparison of the 90-day mRS scores for surviving cases was still in favour of GTN and anyway if GTN were effective, it might well reduce death as well as dependence.

Fifth, the results could have been caused by chance, particularly given the small sample size and the moderate rate of non-adherence to the study protocol in the mimic group. Stroke mimic cases in both treatment groups included a wide variety of different neurological and non-neurological diagnoses and it is difficult to explain the effect of GTN across these disorders. Further, EQ-5D differed at day 90 (with a tendency at day 365) in favour of the GTN group, and the point estimate of the BI also favoured the GTN group (although not meeting significance).

Last, the results may reflect an actual treatment effect whereby GTN improves outcome in non-stroke mimics. GTN dilates the blood vessels, increases blood supply and lowers BP due to smooth muscle relaxation. This vasodilatory effect of GTN may improve vasospastic migraine which can present as hemiparesis or hemianopia. The improvement in the seizure group could again be attributed to vasodilation by GTN. Brain oedema has been observed in patients scanned shortly after seizure activity and would cause compression of smaller vessels. Further, NO has generic antimicrobial effects (P Bath, review in preparation) and so might have attenuated the infectious causes of mimic.

This study has direct implications for pre-hospital stroke research. In particular, the frequency of stroke mimic conditions may have an unexpected impact in any intention-to-treat analysis. Mobile stroke units (where available) may still not be the solution with high rates of mimics observed among call-outs [23]. Even hospital hyperacute stroke trials are not immune to mimics with 17% of the patients enrolled into the NOR-TEST trial having a final diagnosis of a stroke mimic [24]. For now, prehospital trials will need to be designed with the impact of mimics in mind. Developing point of care diagnostics to improve accuracy in selecting the intended trial population of stroke patients is vital [25].

Strengths and limitations

This study used high-fidelity data and the potential for bias was reduced by the limited inclusion criteria and the use of community recruitment. However, at 297 cases, the sample size was relatively small, and some cases were lost to follow-up. We also noted baseline blood pressure differences between the two study groups. Further, the use of simple randomisation may have contributed to potential undetected baseline imbalance. This approach allowed for rapid randomisation and treatment administration, but future trials could consider using phone or internet-based randomisation in the pre-hospital arena at greater expense.

Conclusions

Close to a quarter of patients suspected by paramedics to be having an acute stroke are subsequently diagnosed with a non-stroke condition. In this study, it is unclear why administration of transdermal GTN was associated with an improvement in mRS score at 90 days and one year but is likely to represent an undetected baseline imbalance or chance. The lack of difference in the rate of serious adverse events between the two groups supported the safety of GTN intervention in this population with stroke mimic conditions. Future trials should try to improve the discrimination of stroke mimics.