Rapid response teams (RRTs) have been systematically introduced into hospital practice in many countries (Australia, New Zealand, UK, USA, Canada, the Netherlands, Denmark and Sweden) under a variety of names including “medical emergency teams”, “outreach teams” or “emergency response teams”. However, they remain either absent or only sporadically available in a small minority of centres in most of Western Europe and elsewhere in the world. Irrespective of their nomenclature, the aim of RRTs is to improve the safety of deteriorating ward patients [1]. The RRT approach is based on (a) identification of patients at risk; (b) early notification of a team; (c) rapid intervention by the team and (d) audit of the system’s performance [2].

Despite their widespread introduction in Anglo-Nordic countries, it remains controversial whether RRTs are effective in achieving their stated aims. Importantly, their introduction has not been supported by large multicentre randomised controlled trials. Rather, it has been supported by overwhelming evidence that many hospital patients experience an episode of unpredicted clinical deterioration, that the response to such deterioration is often suboptimal (in the areas of identification, action and intervention) and that, by logical inference, early intervention by a suitably trained, skilled, equipped team would likely increase safety and improve the outcomes of such deteriorating ward patients.

After their introduction, their continued application and uptake by English-speaking and Nordic countries have been sustained by limited evidence. Such evidence derives from multiple before-and-after single or multiple centre studies and meta-analyses of available data [3]. These studies have typically reported a reduction in hospital cardiac arrests, greater effect with greater “RRT dose” (RRT reviews/1000 patient admissions) and variable impact on mortality. However, a major multicentre cluster randomised controlled trial called MERIT failed to demonstrate a benefit. Moreover, meta-analyses have been variable in their conclusions; several authors have questioned whether tangible benefits really exist and suggested that further higher level research and trials are required [2]. In this setting, a major practice question for many institutions in 2016, therefore, is whether the evidence is now strong enough to be sure that their introduction is clearly justified or, instead, so weak or poor that their introduction is likely a pointless and expensive misallocation of resources.

In this regard, we believe that the evidence available to support the introduction of RRTs is inconclusive and that we cannot be sure that such teams improve patient outcomes. Put in legal terms there is “reasonable doubt” about their effectiveness. This is because there have not been multicentre randomised controlled trials of sufficient power and methodological rigour. Such lack of suitable level 1 evidence is not deliberate. It is simply due to the inherent difficulties associated with designing and executing such a trial. They include the inability to blind treatment; the presence of contamination; the ethical issues surrounding the allocation of patients to a control group; the logistic challenges of training; maintaining and deploying such a team for the purpose of a trial 24/7, and the known need for such systems to become accepted and operative for a period of time before they are applied optimally to deliver an effective “treatment response”. In an attempt to overcome at least some of these challenges, investigators resorted to conducting a cluster-randomised controlled trial, where hospitals instead of individuals were randomised to introduce or not introduce an RRT. This study, called MERIT [4], remains a milestone effort in this regard and has spawned multiple papers seeking to investigate several key and often unappreciated aspects of its findings [5].

On its primary analysis, the MERIT trial did not find any evidence of an effect of RRTs on cardiac arrests, mortality or ICU admissions. The MERIT trial, however, was not a “negative” trial as several commentators have argued; it was an “inconclusive” trial. This is because the number of clusters and the marked variance in outcomes from hospital to hospital led to a grossly underpowered trial exposed to a very high chance of type II error. Moreover, as studies of RRTs have accrued over time since then, several additional important logistic aspects have emerged that could not be adjusted for in the design and execution of MERIT. They include the need for a long run-in period; the benefit of team training; the advantage of ensuring critical care medical input; the need to deliver a sufficient dose of intervention (number of calls/1000 patients); the need to optimise the performance of the afferent limb of the system (the component responsible for calling the team); the need for sufficient administrative and political institutional support, and, finally, the full commitment of the critical care team and department to ensure performance, monitoring and continued improvement of the RRT. Subsequent studies, which analysed the data from different perspectives, have argued that there was in fact an effect on cardiac arrests and mortality [2] (Table 1). Unfortunately, however, such post hoc analyses carry a significant risk of type I error, leaving clinicians still uncertain about the presence of a true beneficial effect.

Table 1 Post hoc studies using the MERIT study database

Given the above observations, we cannot be sure whether RRTs deliver better patient outcomes. In many ways, studies that report a clear effect describe what RRTs can do under optimal operating circumstances but not necessarily what they will actually do under average or perhaps even suboptimal levels of clinician and institutional commitment or support. Importantly, some of these optimal conditions include the organizational functionality of the whole hospital system, its clinical communication processes and commitment to continuous improvement processes. In supportive systems, RRTs will likely impact significantly on clinical outcomes, while in less functional systems any effect is likely to be lost.

However, despite the above considerations, the key question may well not be are we are sure or not sure that RRTs “work”? The key question may instead be how sure do we need to be that RRTs work before we introduce them? In this commentary, we have used a legal analogy and have argued that we are not sure about their effectiveness beyond “reasonable doubt”. However, the law also uses different levels of evidence according to different situations and, for civil cases, damages are paid to the plaintiff if the “preponderance of the evidence” supports the case. In this case, it is clear that the preponderance of evidence is supportive and much greater than the evidence behind trauma teams or, indeed, cardiac arrest teams, which now operate every day in almost all major hospitals in developed countries in the absence of any controlled trials.

Whether such evidence for RRTs is sufficiently compelling for clinicians, institutions and health care systems to introduce them, however, is ultimately a question that can only be addressed locally. In this regard, most Anglo-Nordic countries have already “voted with their feet”. As the patient safety agenda spreads throughout Europe, it seems likely that other countries will also progressively adopt similar systems in the future. Yet RRTs, as with any safety initiative, must always also address the need for hospital-wide and system-wide continuous improvement methodology and implementation.