Postoperative pain might be ameliorated by substituting propofol for sevoflurane anesthesia. Support for this theory comes from human pain models in which propofol reduced hyperalgesia and allodynia in response to pinprick and electric stimulation.1 Benefit may result from central and peripheral analgesic effects of sub-hypnotic doses of propofol, including suppression of spinal sensitization.

For example, animal experiments show that propofol depresses nociceptive transmission in neurons, thus diminishing the continued nociceptive barrage that normally follows tissue injury.2,3 Additional analgesic effects of propofol likely result from interactions with N-methyl-D-aspartate (NMDA), non-NMDA receptors, and via activation of gamma-aminobutyric acid A (GABAA) receptors in the dorsal root ganglion nociceptor cells.

Consistent with multiple analgesic mechanisms, some studies report that patients anesthetized with propofol have less postoperative pain than those anesthetized with volatile anesthestics.4,5 Other studies, however, do not support a postoperative analgesic effect of intraoperative propofol.6-8 In fact, Boccara et al. showed that isoflurane in N2O/O2 with 1-1.5 MAC provided better analgesia than propofol in patients recovering from abdominoplasty.6

Since it remains unclear whether intraoperative propofol analgesia ameliorates postoperative pain, we tested the primary hypothesis that postoperative opioid requirements (from the end of surgery until the next postoperative day) are greater in patients anesthetized with sevoflurane than in those anesthetized with propofol. Our secondary outcome was the average pain score for each group during the first four postoperative hours.

Methods

With Institutional Review Board and EudraCT Number approval (2009-011038-82) and patients’ written informed consent, we enrolled participating inpatients scheduled for open vein stripping under general anesthesia at the two biggest hospitals in Vienna, the Medical University of Vienna General Hospital and the Donauspital. Each patient had insufficiency of the vena saphena magna and/or the vena saphena parva valve. A single surgeon in each hospital performed each study case. All participating patients were aged 18-75 yr and American Society of Anesthesiologists physical status I-III.

We excluded patients who routinely took any analgesics. Similarly, we excluded patients with a history of allergies to morphine or morphine derivatives, patients incapable of using an intravenous patient-controlled analgesia pump, and patients who could not reliably report pain intensity.

Protocol

Participating patients were given oral midazolam 7.5 mg preoperatively on the morning of surgery. They were then randomized 1:1 and stratified by site to either propofol-based total intravenous anesthesia or sevoflurane-based volatile anesthesia. Randomization was based on computer-generated codes that were maintained in sequentially numbered opaque envelopes and opened shortly before induction of anesthesia.

Anesthesia started with a remifentanil infusion at a rate of 0.15-0.4 µg·kg−1·min−1. Thereafter, the designated anesthetic was initiated per randomized allocation: 1) propofol at 3-5 mg·kg−1 or 2) sevoflurane at 5-6%. Each patient’s airway was secured with a supraglottic airway device or, after administration of rocuronium 0.5 mg·kg−1, an endotracheal tube. Their lungs were mechanically ventilated with oxygen in air to maintain an end-tidal partial pressure of CO2 from 35-40 mmHg.

In patients assigned to propofol, anesthesia was maintained with an infusion of 2.5-3.0 mg·kg−1·hr−1. For those assigned to sevoflurane, anesthesia was maintained at an end-tidal concentration of 1.5-2.5%. Anesthetics were adjusted as necessary to target a bispectral index (BIS) near 50 or a Narcotrend value near 40 (see below for details). The remifentanil infusion, which continued throughout surgery, was adjusted to target a mean arterial pressure within 20% of each participant’s preoperative baseline pressure.

About ten minutes before the end of surgery, administration of anesthetic drugs and remifentanil was reduced and then discontinued when appropriate. The supraglottic airway device or endotracheal tube was removed, and patients were taken to the postanesthesia care unit where they remained for a minimum of four hours.

A patient-controlled analgesia pump containing morphine hydrochloride in a concentration of 2.5 mg·mL−1 was connected and set to deliver 2-mg boluses with a seven-minute lockout window and no basal rate. Patient-controlled analgesia was maintained until the first postoperative morning, typically about 8 AM.

Pain scores exceeding three points prompted administration of 1.5 mg of the opioid piritramide, which was repeated as necessary.

Measurements

Use of an 11-point Likert verbal response scale was explained to patients preoperatively. Hypnotic depth was estimated from a Narcotrend Compact M Monitor (MT MonitorTechnik GmbH, Bramsted, Germany) or a BIS Monitoring System (BISx, Covidien, Dublin, Ireland). Routine anesthetic monitor values as well as propofol and remifentanil infusion rates were recorded at five-minute intervals.

Surgeons were blinded to randomization and anesthetic management. Independent clinicians and investigators, also blinded to randomization and anesthetic management, performed all postoperative measurements. To maintain blinding, an unblinded anesthesiologist retained the electronic printout of the anesthesia record, and the blinded study personnel were instructed not to access the electronic medical record. A study nurse blinded to group assignment controlled all postoperative pain scores and opioid administration.

Verbal response pain scores as well as opioid consumption and adverse effects were assessed at rest at 30-min intervals for the initial four postoperative hours. The next morning, patients were asked to rate their average pain since leaving the postanesthesia care unit. The amount of opioid consumed during this period was recorded. Piritramide (a morphine-like opioid) was converted to morphine equivalents using the formula: 10 mg morphine = 15 mg piritramide. Adverse effects, including nausea and antiemetic treatments, were recorded.

Endpoints

The primary efficacy endpoint was postoperative opioid consumption from the end of surgery until the first postoperative morning. The pre-specified secondary outcome was postoperative pain measured during the first four hours of recovery after surgery. An exploratory outcome was pain on the first postoperative morning.

Other exploratory outcomes included intraoperative remifentanil consumption, total amount of fluids, BIS, intraoperative and postoperative blood pressure, heart rate, SaO2, and postoperative side effects (vomiting and nausea). Variables captured at repeated intervals (i.e., blood pressure, heart rate, SaO2, and BIS) were averaged within each patient. The occurrence of nausea and vomiting was evaluated at 30-min intervals throughout the first four hours of recovery, and the next morning, patients were asked if they had experienced nausea or vomiting since leaving the recovery unit. Any positive response for nausea or vomiting at any time was considered evidence of postoperative nausea and vomiting.

Statistical analysis

Balance on baseline characteristics and intraoperative factors was assessed between the randomized groups using the standardized difference, i.e., the difference in means or proportions divided by the pooled standard deviation. Imbalance was defined as a standardized difference greater than the absolute value of 1.96\( \sqrt {\frac{{n_{1} + n_{2} }}{{n_{1} \times n_{2} }}} \) = 0.41.9 Where n 1 = 42 and n 2 = 48.

We assessed the effect of sevoflurane vs propofol on the primary outcome of opioid consumption using a two-tailed Student’s t test on the log-transformed data along with a two-sided interim-adjusted 95% confidence interval (CI). We added 0.1-mg iv morphine equivalents to six patients who received zero opioids before taking the logarithm. Results were back-transformed (i.e., exponentiated) to obtain the estimated ratio of geometric means and 95% CI on the raw scale.

For postoperative pain scores repeated over time, a linear mixed-effects model was used to estimate the difference in means between sevoflurane and propofol, assuming an unstructured within-subject correlation matrix. If our findings showed an interaction between intervention and time (P < 0.10), superiority would be assessed separately at each time point.

Superiority testing was also used for tertiary outcomes. Randomized groups were compared on continuous secondary outcomes with the Student’s t test or Wilcoxon rank-sum test and on binary secondary outcomes with the Chi square or Fisher’s exact test, as appropriate.

We planned to assess efficacy and futility at each 25% of the maximum sample size using two-sided superiority tests. We used a group sequential design with a gamma spending function (gamma parameter of −1 for efficacy [alpha] and −4 for futility [beta]). The P values constituting the efficacy (futility) boundaries at each of the four planned analyses were P ≤ 0.0016 (>0.90), P ≤ 0.0048 (> 0.61), P ≤ 0.0147 (> 0.20), and P ≤ 0.0440 (> 0.0440), respectively. Results are expressed as mean (SD) or difference in means.

The significance level of two-sided superiority testing was 0.05 for each of the primary and secondary outcomes. Tertiary outcomes were assessed at an overall significance level of 0.05, with Bonferroni correction for multiple testing.

In fact, we conducted only two interim analyses so that results at the final analysis (n = 90) were reported using the Z-statistic efficacy criterion of 3.191, corresponding to an interim-adjusted significance criterion of P < 0.0014 for the two-sided tests (overall alpha of 0.05). All statistical analyses were performed using SAS® statistical software, version 9.3 (SAS institute, Carey, NC, USA).

Sample size

A previous study of hysterectomy patients reported 24-hr mean (SD) morphine consumption of 32 (15) mg with propofol and 49 (18) mg with sevoflurane, with n = 40 patients per group.2 Vein stripping is presumably less painful than a hysterectomy. We thus estimated that our patients would require about half the opioid dose, but the treatment effect expressed as a percentage would be comparable.

We planned for 136 patients (68 per group). A planned interim analysis indicated more variation than expected and resulted in our expansion to 310 patients (155 patients per group). The revised study with 155 participants per group had 80% power at the 0.05 significance level (two-tailed) to detect superiority of sevoflurane to propofol on mean opioid consumption using a ratio of means of 1.33 and assuming a coefficient of variation of 1.0, including adjustments for interim monitoring. The study was in fact stopped at 90 patients (42 sevoflurane, 48 propofol) for logistical reasons.

Results

We report the results of 90 (42 sevoflurane, 48 propofol) of the 310 required participants who completed the study.

Ninety patients were randomized to sevoflurane (n = 42) or propofol (n = 48) during the study period which lasted from September 2010 through May 2014 (Fig. 1). Thereafter, open vein stripping at both participating hospitals was virtually completely replaced by an endovascular laser treatment which causes much less pain. The Executive Committee (D.I.S. and E.J.M.) stopped the study because patients meeting the original enrolment criteria were no longer readily available. The decision was made based on patient availability without access to study results.

Fig. 1
figure 1

Enrolment, randomization, and analysis

Baseline characteristics were well balanced between groups, i.e., all absolute standardized differences < 0.41 (Table 1).

Table 1 Baseline characteristics of the patients (n = 90)

The observed median [interquartile range (IQR)] morphine sulfate equivalents were 9.8 [4-19] mg in the sevoflurane group and 10 [6-20] mg in the propofol group. Superiority of sevoflurane to propofol on postoperative opioid consumption was not found, with an observed ratio of geometric means of 0.91 (95% interim-adjusted CI, 0.33 to 2.45; P = 0.74) (Table 2, Fig. 2).

Table 2 Comparing sevoflurane and propofol on primary and secondary outcomes
Fig. 2
figure 2

Cumulative morphine equivalents over time by two interventions. Results are shown as mean (SD). PostOP = postoperative

No difference was found between sevoflurane and propofol on postoperative pain scores in the postanesthesia care unit, first postoperative morning, or overall, with a mean difference on an 11-point Likert verbal response scale of 0.20 points (95% interim-adjusted CI, −0.36 to 0.73; P = 0.31) collapsing over time (Table 2, Fig. 3). No interaction was found between the intervention effect and measurement time (P = 0.56). The mean (SD) pain score was 1.6 (1.3) in the sevoflurane group and 1.4 (1.2) in the propofol group. A sensitivity analysis using Wilcoxon rank-sum tests showed similar results and the same conclusions.

Fig. 3
figure 3

Plot of verbal response scores (VRS) on a scale from 0-10 for pain over time by two interventions. Morning = first postoperative morning. Verbal response scores were recorded at 30-min intervals in the postanesthesia care unit and during the first postoperative morning. Results are shown as mean (SD)

Furthermore, no differences were found between sevoflurane and propofol on secondary outcomes using two-tailed tests for superiority and adjusting for multiple testing and interim monitoring (all P values > 0.00011) (Table 3).

Table 3 Comparison of randomized groups on intraoperative and postoperative exploratory outcomes

Discussion

We found that opioid use and pain scores were no worse with sevoflurane than with propofol anesthesia. In fact, similar median pain scores were observed throughout the initial four postoperative hours and on the first postoperative morning. Furthermore, we found no differences in opioid consumption, although the plausible range for the ratio of mean consumption for sevoflurane to propofol was 0.33 to 2.45. The observed median [IQR] morphine sulfate equivalents were 9.8 [4-19] mg for sevoflurane and 10 [6-20] mg for propofol.

Our results diverge from some reports4,5 but are consistent with many clinical trials6-8 and animal studies showing that sub-hypnotic concentrations of propofol alone are not analgesic.2,3,10 They are also consistent with human pain models showing that, after propofol is discontinued, pain scores, hyperalgesia, and allodynia return to normal. Similarly, studies in volunteers indicate that opioids must be added to propofol to provide satisfactory pain relief.11,12 Conflicting reports may result, at least in part, from differing methodology, including induction strategy, use of opioids with various durations of action, analgesic regimes (non-opioids, nitrous oxide), various postoperative evaluation periods, and differing amounts of tissue trauma.4-6,8,11,13

Our results are generally consistent with a 2012 meta-analysis.13 Among the 25 randomized-controlled trials, ten studies with a total of 773 patients were considered suitable for comparing pain after propofol or volatile anesthesia. Four studies did not show significant differences in postoperative pain or analgesia requirements,14-17 and one study even showed significantly less pain 30 min postoperatively in children anesthetized with isoflurane vs propofol or short-acting analgesics.18 Five other studies suggested benefit from propofol.13,19-22 The overall reported effect was a 40% non-significant reduction in pain amongst patients treated with propofol compared with a volatile anesthetic. Nevertheless, the results are not directly comparable because many of the protocols differed substantially from ours. Substantive differences included use of nitrous oxide,14,15,18,20 volatile anesthetics other than sevoflurane,14,15,19,20 opioids of differing durations,21 various postoperative analgesic strategies, multimodal pain regimes,21 and various primary outcomes.14-17,20

Our methods extend previous results. For example, propofol was not administered to our sevoflurane patients, even for induction, whereas in previous studies, anesthesia was often induced with propofol in both study groups.4,6,8,21 Furthermore, for maintenance of analgesia, we used remifentanil rather than fentanyl or morphine as they accumulate and thus remain active postoperatively.4,6,16 Importantly, we avoided nitrous oxide, an NMDA antagonist,23 thus potentially providing analgesia at the level of the spinal cord. Ohashi et al. showed that brain stem opioidergic and GABAergic neurons mediate the antinociceptive effect of nitrous oxide in Fischer rats.24 The effect was inhibited by the opioid receptor antagonist naloxone and the GABAergic receptor antagonist muscimol. Consistent with this mechanism, Fassoulaki et al. 25 used nitrous oxide in each of their anesthetic groups (propofol, sevoflurane, and desflurane) and showed no differences in pain scores and opioid consumption among the groups.

Pain scores in our patients were only moderate, with average initial averages being 3-4 points on an 11-point Likert scale. As might be expected, opioid use was modest, averaging only 13-14 mg over the initial 24 hr — which was much less than in other studies.4,26 It remains possible that the putative analgesic effects of propofol would be more apparent in patients experiencing more postoperative pain. More likely, however, the moderate pain levels in our patients provided the best chance of identifying subtle propofol-induced analgesia.

Perhaps the most serious weakness of our study is recruitment of only 90 of the 310 participants required to show the pre-specified clinically important difference in opioid consumption. This is clearly reflected in our confidence interval for the ratio of means in opioid consumption, which ranges from a 67% reduction to a 2.4- fold increase. Our findings cannot be considered definitive but do add to our knowledge. Using previously published data,4 we initially estimated that 68 patients per group would be sufficient. The first interim analysis showed that population variability was twice what we expected and that more patients would be needed, up to a maximum of 155 patients per group. Nevertheless, we were forced to stop the study with a total enrolment of just 90 patients because of a radical shift in surgical technique. This shift from the conventional open approach to an endovascular laser technique causes much less pain and thus was not comparable with our initial surgical technique. A strength of our study is that only two surgeons participated and each used a similar operative technique. Consequently, tissue trauma and consequent pain was presumably comparable.

In summary, although our trial was stopped early, we did not find evidence consistent with the hypothesis that opioid consumption or pain scores were greater with sevoflurane than with propofol anesthesia. Our results add to a body of evidence in both clinical and animal models that question a propofol-related analgesic effect. Thus, while clinicians might have reasons for favouring propofol anesthesia, postoperative analgesia should not be among them.