The impact of anesthetic management on short- and long-term postoperative outcomes is a subject of current research in anesthesiology [1]. There is increasing evidence that certain aspects of anesthetic management—including perioperative goal-directed blood flow and fluid optimization [2, 3] and monitoring of the depth of anesthesia [4, 5]—can have a beneficial impact on postoperative complication rates.

Lima et al. [6] are to be commended for their study investigating the effects of implementing a multimodal “hemodynamic and depth of anesthesia treatment protocol” on patient outcome after major open abdominal cancer surgery [6]. The authors demonstrated that in patients treated after the implementation of the treatment protocol (which included mean arterial pressure, stroke volume variation, cardiac index, central venous oxygen saturation, and bispectral index as targets), the incidence of postoperative delirium was significantly lower and the median hospital length of stay significantly shorter than in patients who had undergone the same kind of surgery before protocol implementation [6].
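To make the nature of such a two-period comparison concrete, the following minimal sketch (in Python, using entirely hypothetical numbers that are not the data reported by Lima et al.) illustrates how an incidence such as postoperative delirium and a skewed outcome such as hospital length of stay might be compared between a pre- and a post-implementation cohort.

```python
# Minimal sketch with entirely hypothetical numbers (NOT the data of Lima et al. [6]):
# how the incidence of postoperative delirium (a proportion) and hospital length of
# stay (a skewed, non-normal outcome) might be compared between the pre- and
# post-implementation cohorts of a before-and-after study.
import numpy as np
from scipy.stats import chi2_contingency, mannwhitneyu

rng = np.random.default_rng(42)

# 2x2 table of [delirium, no delirium] counts per period (hypothetical)
before = [18, 82]  # period before protocol implementation
after = [7, 93]    # period after protocol implementation
chi2, p_delirium, dof, expected = chi2_contingency([before, after])

# Hospital length of stay in days (hypothetical, right-skewed distributions)
los_before = rng.gamma(shape=4.0, scale=3.0, size=100)
los_after = rng.gamma(shape=4.0, scale=2.4, size=100)
u_stat, p_los = mannwhitneyu(los_before, los_after, alternative="two-sided")

print(f"Delirium incidence: {100 * before[0] / sum(before):.0f}% vs "
      f"{100 * after[0] / sum(after):.0f}%, chi-square p = {p_delirium:.3f}")
print(f"Median length of stay: {np.median(los_before):.1f} vs "
      f"{np.median(los_after):.1f} days, Mann-Whitney p = {p_los:.3f}")
```

The sketch only reflects the type of comparison such a design entails; the actual analysis and effect sizes are those reported by the authors.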

The study adds another piece to the puzzle of how perioperative anesthetic management can improve postoperative patient outcome. It also prompts a discussion about different types of studies in anesthesiological research and raises the question: What is the best study design to investigate a potentially beneficial impact of perioperative goal-directed therapy on patient outcome?

There is no definite answer to this question because a single perfect study design does not exist and the optimal design depends on the specific research question being asked. Perioperative care is characterized by multidisciplinary teamwork, and postoperative outcomes have various pathophysiologic mechanisms and complex multifactorial etiologies. It is therefore unrealistic to expect that a study can easily be designed that provides generalizable evidence that one specific intervention changes one specific postoperative outcome.

Rather, it takes multiple steps and differently designed studies to progress from hypothesizing that a certain intervention may have a beneficial impact on patient outcome to eventually demonstrating that it works in daily clinical practice. As a first step, animal studies and small observational “proof of concept” studies in humans may be necessary to show that a certain intervention can have an effect on certain physiologic and biologic variables or outcomes. Further explanatory studies may then be performed to describe the details of the therapeutic intervention (dosing, timing, etc.). Feasibility studies might also be necessary to demonstrate that the therapeutic intervention and the measurement systems needed to guide it can be used safely and precisely in a clinical setting. Finally, the therapeutic intervention might be tested in a randomized controlled trial (Fig. 1). Smaller exploratory monocentric randomized controlled trials (with complex treatment algorithms, strict inclusion and exclusion criteria, and optimal conditions for the study intervention to work) allow the efficacy of an intervention to be tested, answering the question “Can this intervention work?” [7]. Large pragmatic multicenter randomized controlled trials (with less complicated treatment algorithms, relatively broad inclusion and exclusion criteria, a well-defined clinical endpoint, and real-world clinical conditions) can ultimately be used to demonstrate the clinical effectiveness and practical feasibility of an intervention in usual care (“Does this intervention work?”) [7]. However, a randomized controlled trial is never solely exploratory or pragmatic, but rather has varying degrees of exploratory and pragmatic components [8]. According to the principles of evidence-based medicine, high-quality randomized controlled trials and systematic reviews and meta-analyses of these are supposed to provide the highest quality evidence [9, 10]. However, high-quality pragmatic randomized controlled trials are lacking for many anesthesiology treatment strategies. In addition, as discussed previously, in heterogeneous patient populations—such as “surgical patients”—with complex multifactorial outcomes, randomized controlled trials and their meta-analyses have their own limitations and need to be meticulously designed [7, 11,12,13].

Fig. 1 Schematic illustration of the study design of randomized controlled trials and before-and-after studies

Instead of performing a randomized controlled trial, Lima et al. [6] chose an uncontrolled before-and-after study design [14] (Fig. 1), comparing two periods: one before (March to August 2013) and one after (March to August 2014) the implementation of their treatment protocol.

Before-and-after studies are observational quasi-experimental effectiveness studies that compare the periods before and after the implementation of any new diagnostic or therapeutic strategy, e.g., a therapeutic intervention, treatment protocol, assessment tool, safety initiative, quality improvement project, guideline, or educational program [14]. Because they are simpler to perform, require fewer resources, and raise fewer ethical concerns than randomized controlled trials, before-and-after studies are frequently used in perioperative research [14] and have been performed to show that perioperative goal-directed therapy can improve patient outcome in high-risk abdominal surgery patients [15,16,17,18,19].

Before-and-after studies can also be used to assess the effectiveness of a newly implemented intervention in comparison with a historical group in a specific setting (institution, hospital, certain operating rooms). Before-and-after studies are therefore supposed to enable the impact of an implemented treatment protocol on patient outcome to be assessed under “real-life conditions”. To reflect real-life conditions, however, medical caregivers should be blinded to the fact that the implementation of a new treatment is accompanied by a clinical study [14]. This implies that no research coordinators or study nurses should be present during patient treatment, which may impair data quality in general and makes it particularly challenging to ensure and systematically record compliance with the protocol. In addition, if possible, researchers assessing the outcome data should be blinded to the study period in which the patient data were recorded to avoid “observer bias” and should use the same outcome definitions for control and intervention group patients to avoid “measurement bias” [14]. Because the intervention group and the historical group are treated at different time points, before-and-after studies are, by definition, not randomized and are thus prone to confounding by any change in patient management that is not related to the study intervention but occurs before or during its implementation (“history bias”) [14]. For example, the standard of care might simply be influenced by changes in the expertise of the medical personnel between the historical period and the intervention period. Moreover, landmark studies or new guidelines suggesting a change in patient management published after the historical period but before or during the intervention period might induce a “learning contamination bias”. Both history and learning contamination bias are less likely to occur when the time span between the historical and the intervention period is short. To avoid “selection bias”, all eligible patients should be treated with the intervention during the intervention period [14].
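Because such confounding can never be fully excluded by design, analyses of before-and-after data often at least adjust the period effect for measured baseline covariates. The following minimal sketch (in Python, with invented variable names such as age and ASA physical status and entirely simulated data, not taken from any cited study) illustrates this kind of adjusted comparison and why it can only address measured confounders.

```python
# Hypothetical sketch: adjusting the "period" effect of a before-and-after comparison
# for measured baseline covariates (age and ASA class are invented variable names).
# Adjustment can only handle measured confounders; history bias from unmeasured
# changes in care between the two periods remains.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(7)
n = 200

df = pd.DataFrame({
    "period": np.repeat([0, 1], n // 2),      # 0 = before, 1 = after protocol implementation
    "age": rng.normal(68, 9, n).round(),      # hypothetical baseline covariate
    "asa3_or_higher": rng.integers(0, 2, n),  # hypothetical baseline covariate
})
# Simulate delirium risk depending on age, ASA class, and period (hypothetical effect sizes)
linpred = -4.0 + 0.04 * df["age"] + 0.6 * df["asa3_or_higher"] - 0.8 * df["period"]
df["delirium"] = rng.binomial(1, 1 / (1 + np.exp(-linpred)))

# Period effect on delirium, adjusted for the measured covariates
model = smf.logit("delirium ~ period + age + asa3_or_higher", data=df).fit(disp=False)
print(model.summary())
print("Adjusted odds ratio for the 'after' period:",
      float(np.exp(model.params["period"]).round(2)))
```

Such covariate adjustment narrows, but does not close, the gap between a before-and-after comparison and a randomized comparison.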

In summary, before-and-after studies are used in anesthesiology to evaluate the effectiveness of perioperative goal-directed therapy on patient outcome [6, 15,16,17,18,19]. A before-and-after study can give valuable insights into the feasibility and effectiveness of an intervention under real-life clinical conditions and can therefore complement evidence from randomized controlled trials. Before-and-after studies, however, are prone to several types of bias and can easily be confounded by factors that are difficult to account for. A profound understanding of the advantages and inherent limitations of before-and-after studies is therefore key for their conception, analysis, and interpretation, especially regarding definite conclusions about a causal relationship between a certain intervention and a certain clinical outcome. To eventually demonstrate that perioperative goal-directed therapy based on a pathophysiologic rationale actually improves patient outcome in real-life daily clinical practice, we should combine the evidence from complementary studies with different study designs, including both randomized controlled trials and before-and-after studies.