Introduction

Approximately 2–4% of adults acutely admitted to an intensive care unit (ICU) develop upper gastrointestinal (GI) bleeding [1,2,3,4]. For bleeding prevention, approximately 70% of critically ill patients are prescribed stress ulcer prophylaxis [3], most commonly proton pump inhibitors (PPIs). In a recent European, multicenter, blinded trial, intravenous pantoprazole (PPI) was associated with a reduction in clinically important upper GI bleeding compared with placebo following an unplanned ICU admission of adults who were at risk for gastrointestinal bleeding [4]. Similarly, in PEPTIC, a large cluster-crossover trial, upper GI bleeding rates were lower during treatment periods when ICUs preferentially used PPIs for stress ulcer prophylaxis rather than histamine-2 receptor blockers (H2RAs) [5]. In both of these trials, upper GI bleeding was a secondary outcome; lack of adjustment for multiplicity, and high rates of non-adherence with study treatment respectively limited inferences from these observations. Additionally, among patients with high acuity of illness in both trials, there was a trend toward higher mortality in those receiving PPI. These findings highlight the need for further research to establish the efficacy and safety of PPIs for stress ulcer prophylaxis in contemporary ICU practice.

REVISE (Re-Evaluating the Inhibition of Stress Erosions) is an ongoing international randomized trial, the objective of which is to determine the effect of pantoprazole versus placebo on the primary efficacy outcome of clinically important upper GI bleeding and the primary safety outcome of 90-day mortality [[6]: NCT03374800]. Secondary outcomes are ventilator associated pneumonia (VAP), Clostridioides difficile (C. difficile) infection, renal replacement therapy, ICU and hospital mortality, and patient-important GI bleeding.

This document outlines the statistical analysis plan (SAP) for REVISE, reported using current guidelines [7].

Methods

REVISE is a multicenter, randomized, stratified, concealed, blinded parallel group trial in patients ≥ 18 years old in the intensive care unit (ICU), designed in collaboration with the Canadian Critical Care Trials Group (CCCTG) [[8]:www.ccctg.ca], the Australian and New Zealand Intensive Care Society Clinical Trials Group (ANZICS-CTG) [[9]:www.anzics.com.au/clinical-trials-group], and other international colleagues. Inclusion and exclusion criteria are summarized in Table 1. In brief, eligible patients are adults who are admitted to the ICU, receiving invasive mechanical ventilation and expected to remain ventilated beyond the calendar day after randomization, and without a clear indication or contraindication to PPI. Eligible patients are enrolled by either a priori informed consent or by informed consent to continue or deferred consent, or opt out, in alignment with local approvals [10].

Table 1 Inclusion and exclusion criteria

When notified by research staff or investigators about eligible patients, research pharmacists or other blinded study personnel use a password-protected website to access the automated central randomization program to ensure concealed 1:1 allocation, stratified by center and pre-hospital acid suppression (i.e., PPI or H2RA or neither). Randomized patients receive either 40 mg pantoprazole intravenously or an identical placebo while mechanically ventilated in the ICU up to 90 days after randomization or until death, discontinuation of mechanical ventilation, or clinically important GI bleeding. All patients, families, clinical, and research staff are blinded to allocation.

Research staff upload daily data from hospital charts to a secure web-based electronic data capture system (iDataFax, Seattle, Washington) including baseline data, study drug administration, daily data in ICU up to 90 days post randomization, relevant laboratory values, cointerventions, trial outcomes, reports for adjudication, duration of mechanical ventilation, ICU and hospital stay, and mortality. Patients discharged alive from hospital before 90 days are followed for 90 days to obtain vital status. Further details are reported elsewhere [[6]: NCT03374800] and in our protocol [10].

Ethics

REVISE is approved by the Research Ethics Boards (REBs) and Human Research Ethics Committees (HRECs) of all participating regions and hospitals. REVISE is also approved by Health Canada [HC6-24-c210404], Clinical Trials Ontario Research Ethics Project ID: 1360, the Northern Sydney Local Health District Human Research Ethics Committee (HREC) [2019/ETH08405], and Comissão Nacional de Ética em Pesquisa (CONEP) [5.734.590]. Regulatory oversight in Canada is by Health Canada [HC6-24-c210404].

Outcomes

The primary efficacy outcome is clinically important upper GI bleeding occurring in the ICU or resulting in ICU readmission during the index hospital stay [10]. Research staff prospectively collect data related to GI bleeding, allowing blinded central duplicate adjudication as described elsewhere [1, 2, 10].

The primary safety outcome is all-cause mortality at 90 days after randomization.

Secondary outcomes include VAP in the ICU, C. difficile infection in the hospital, new renal replacement therapy in the ICU, ICU and hospital mortality, and patient-important GI bleeding as defined through interviews and focus groups with ICU survivors and family members [11].

Tertiary outcomes include total red blood cells transfused in the ICU, peak serum creatinine concentration in the ICU, and duration of mechanical ventilation and duration of stay in the ICU and hospital, measured in days [10].

Adverse events and serious adverse events are as follows: key adverse events (AEs) and serious adverse events (SAEs) relevant to REVISE are already pre-defined primary or secondary trial outcomes. REVISE follows 5 published recommendations for rational reporting of SAEs in investigator-initiated ICU trials of drugs in common use [12].

Sample size

The sample size of 4800 patients was chosen on the basis of plausible baseline risks of GI bleeding and plausible effect sizes of pantoprazole versus placebo and a target of 85% power [10]. Also, across the range of higher baseline risks of death due to high illness severity, 4800 patients will provide at least 70% power to detect an increase in risk of death associated with pantoprazole that would deter its use in such patients [4]. Further details are found elsewhere [[6]: NCT03374800] and in the REVISE protocol [10].

Statistical analysis

All analyses will be conducted blinded to treatment allocation; the randomization code will only be broken when the analyses are complete, unless serious safety concerns arise. We will describe baseline characteristics of enrolled patients using descriptive statistics (count and percentage for categorical variables and mean or median with standard deviation or interquartile range for continuous variables, as appropriate).

Details regarding the main analysis plan, sensitivity analyses, and subgroup analyses are found in Table 2. The main analyses will be conducted per the intention-to-treat principle by analyzing patients according to their randomized group, regardless of protocol compliance. We will compare the time to the primary and secondary binary outcomes in the two groups using Cox proportional hazards regression. We will adjust for the stratification variable of pre-hospital acid suppression in the analyses [13, 14]. As the APACHE II score is strongly associated with mortality, to maximize statistical efficiency, we will also adjust for baseline APACHE II score for the mortality outcomes. For the primary efficacy and primary safety outcomes, we will present Kaplan-Meier curves. We will report hazard ratios with 95% confidence intervals (CIs) as well as the absolute risk differences and their respective 95% CIs. For the outcome of total red blood cells transfused in the ICU, we will compare the two groups using Poisson regression, adjusting for pre-hospital acid suppression. For all other continuous outcomes, we will compare the two groups using linear regression on the original scale or on the log-scale (to normalize the distribution, if needed), adjusting for pre-hospital acid suppression. If the distribution remains non-normal after transformation, we will employ a nonparametric approach. We will use graphics and other relevant methods to examine the assumption of proportional hazards for Cox regression analyses [15,16,17].

Table 2 Summary of analysis plan

Since all data are collected in hospital, except 90-day mortality for those discharged alive from hospital prior to 90 days, we anticipate very few missing data. For continuous outcomes with data missing for more than 2% of patients, we will perform multiple imputation analysis using multiple imputation by chained equations and will combine using Rubin’s rule [18]. Evidence has shown that multiple imputation is one of the optimal methods for handling data that is assumed missing at random [19]. For the time-to-event analyses, patients with incomplete follow-up will be censored at time of last follow-up.

All estimates of effect will be reported to two decimal places. p-values will be reported to three decimal places with those less than 0.001 reported as p < 0.001. The criterion for statistical significance will be set at alpha = 0.05, using 2-sided tests. Separately applied to secondary and tertiary outcomes, and to subgroup analyses, the sequential Holm-Šidák approach will be used to adjust for multiple significance testing [20, 21]. The sequential Holm-Šidák approach consists of ordering all p-values from smallest to largest, and then comparing them to an adjusted level of significance calculated as 1-(1-0.05)1/C, where C indicates the number of comparisons that remain. In the case of 11 outcomes, the smallest p-value would be compared to 1-(1-0.05)1/11, the second p-value to 1-(1-0.05)1/10, and so on, with the last one being compared to 1-(1-0.05) (i.e., 0.05). The sequential testing procedure stops as soon as a p-value fails to reach the corrected significance level [20, 21]. Analyses will be performed using SAS 9.4 (Cary, NC).

Sensitivity analyses

We will perform sensitivity analyses as follows (Table 2).

  • 1) For both the primary efficacy outcome (clinically important upper GI bleeding) and the primary safety outcome (90-day mortality), we will conduct a sensitivity analysis which is not adjusted for pre-hospital acid suppression.

  • 2) For both the primary efficacy outcome (clinically important upper GI bleeding) and the primary safety outcome (90-day mortality), we will conduct a sensitivity analysis which includes center in the model as a random effect [22]. To avoid possible problems with model convergence due to a small number of events in some centers, we will not adjust for the stratification variable of center in the main models.

  • 3) For our primary efficacy outcome (clinically important upper GI bleeding), we will conduct a sensitivity analysis using a competing risk analysis approach [23] with death as the competing risk using the Fine and Gray proportional sub-distribution hazards model [24, 25].

  • 4) For both the primary efficacy outcome (clinically important upper GI bleeding) and the primary safety outcome (90-day mortality), we will conduct a sensitivity analysis restricted to patients receiving study drug on ≥ 80% of study days while mechanically ventilated, to investigate the effect of pantoprazole under conditions of optimal protocol adherence as clinically appropriate.

Subgroup analyses

We will conduct analyses of both the primary efficacy outcome of clinically important upper GI bleeding and the primary safety outcome of 90-day mortality in five a priori subgroup pairs. Analyses will be adjusted for pre-hospital acid suppression. In Table 2, we pre-specify the hypothesized direction of effect, or lack thereof, also described here. For the overall trial, we hypothesize that pantoprazole will reduce the risk of clinically important upper GI bleeding and that pantoprazole will increase the risk of death within 90 days.

  • 1) Pre-hospital use of acid suppression (PPIs or H2RAs) vs. none: We hypothesize that pantoprazole is more effective at preventing clinically important upper GI bleeding among patients with versus without pre-hospital acid suppression, given concerns about hypergastrinemia upon discontinuation of acid suppression [26, 27]. We hypothesize no modification of the effect of pantoprazole on 90-day mortality.

  • 2) Illness severity measured by APACHE II score of ≥ 25 or < 25: We hypothesize no modification of the effect of pantoprazole on clinically important upper GI bleeding. We hypothesize that pantoprazole is more harmful in terms of increased risk of 90-day mortality among patients with higher versus lower illness severity, given the high illness severity subgroup finding in the SUPICU trial [4].

  • 3) Medical vs. surgical/trauma ICU admitting diagnosis: We hypothesize no modification of the effect of pantoprazole on preventing clinically important upper GI bleeding. We hypothesize no modification of the effect of pantoprazole on 90-day mortality.

  • 4) SARS-CoV-2 positive vs. negative status: We hypothesize no modification of the effect of pantoprazole on clinically important upper GI bleeding. We hypothesize that pantoprazole is more harmful in patients with versus without SARS-CoV-2 in terms of increased risk of 90-day mortality, given findings of prolonged course and worse outcomes of infected patients exposed to PPIs [28]. Patients with SARS-CoV-2 positive status will be considered those with active COVID-19 infection, not those with an incidental SARS-CoV-2 positive test result.

  • 5) Female vs. male: We hypothesize no modification of the effect of pantoprazole on clinically important upper GI bleeding. We hypothesize no modification of the effect of pantoprazole on 90-day mortality. Sex is recorded for all patients, ascertained per medical chart documentation, and will also be analyzed and reported at baseline by randomized group. The sociocultural, environmental, and behavioral factors influencing a person’s self-identity according to gender are beyond the scope of this ICU trial.

We will include an interaction term between each subgroup variable and treatment group to test for subgroup effects. For any interaction p-value below 0.10, we will conduct a formal assessment of subgroup credibility using the only structured 4-category (high to very low credibility) instrument for assessing subgroup effects developed by an international expert panel, rigorously tested by multiple stakeholders with detailed guidance for each judgment [29].

Interim analyses

The independent REVISE data monitoring committee (DMC) includes professor Roberts (Oxford), Dr. McAuley (Queen’s University of Belfast), and Dr. Thomlinson (Chair, University of Toronto). When 1200 patients were enrolled, the DMC reviewed 90-day mortality data and recommended continuing REVISE.

When 2400 patients (50% recruitment) had 90-day mortality ascertained, we conducted a single interim analysis of all outcomes. To maintain the overall type-I error rate (i.e., α), for the interim analysis, we used a conservative Haybittle-Peto stopping rule with a critical value of three standard deviations with a fixed conservative α = 0.001 [30, 31]. The DMC also examined reports regarding the following: (1) recruitment (center and patient), screening, consent, and coenrolment rates; (2) protocol procedures (randomization, stratification, study drug, and other adherence issues); and (3) trial outcomes.

Guided by their charter [32, 33], the DMC considered the balance of benefits and harms when reviewing the blinded interim analysis data. On December 5, 2022, the DMC advised the REVISE steering committee to continue enrolment to trial completion.

Publication

The results manuscript will be submitted to a general clinical journal regardless of the results. Authorship will follow ICMJE guidelines. All trial sites, investigators, and research coordinators will be cited. Funders will have no influence on data handling, analysis, or writing of the manuscript. Our final report will follow guidance from CONSORT (Consolidated Standards of Reporting Trials) 2010 [34, 35] and CONSERVE (CONSORT and SPIRIT Extension for RCTs Revised in Extenuating Circumstances) [36]. The scientific protocol of REVISE was not modified due to the pandemic.

Discussion

Ongoing uncertainty about the consequences of daily PPIs in critically ill patients have raised concerns about the risk-benefit ratio being shifted towards a neutral effect or harm. REVISE will provide low risk of bias estimates and more than double trial evidence on the impact of PPIs on health outcomes that will markedly increase the strength of inferences regarding clinically important bleeding, mortality, VAP, and C. difficile infection. Results will not necessarily be generalizable to patients who are breathing spontaneously or are receiving no enteral nutrition. Recruitment started on July 9, 2019. As of May 1, 2023, 4124 patients have been recruited in 63 centers internationally in Canada, Australia, the UK, USA, Saudi Arabia, Kuwait, and Pakistan.

Despite the low cost of pantoprazole per dose, acid suppression costs worldwide are sizable, given nearly universal daily prescribing in the ICU. Furthermore, integrating healthcare resources consumed for bleeding and infectious outcomes is necessary to inform policy. A parallel externally funded economic evaluation (E-REVISE) will address the cost-effectiveness of acid suppression in the ICU for healthcare systems. If acid suppression causes more harm than good or consumes resources without sufficient benefit, pantoprazole will be a candidate for de-adoption.

In this SAP, we have presented plans for using an intention-to-treat approach for analyses of all outcomes, including assessment of the robustness of the findings using a series of sensitivity analyses and subgroup analyses. We have followed the guidance for SAP for clinical trials to outline the analysis details of the trial [7]. Aligned with the World Health Organization's Joint Statement on Public Disclosure of Results from Clinical Trials, requiring timely public disclosure of results [37]:https://www.who.int/news/item/18-05-2017-joint-statement-on-registration, data will be analyzed and disseminated as soon as possible to inform bedside care and practice guidelines worldwide.