Background

Good practice guidelines for obstetric care [1,2,3] has become increasingly important following reports on poor judgement as a cause of perinatal mortality and severe morbidity [4,5,6,7]. Furthermore, in order to meet the possible negative consequences of increasing rates of cesarean section [8], the argument for safe operative vaginal delivery is crucial. In Sweden, a national initiative of educational and policy interventions has been introduced to generally improve delivery care [9], including vacuum extraction delivery, but evaluating these policy changes has proven to be a significant challenge, and the effects on perinatal asphyxia is ambiguous [10]. Some studies, however, have shown measurable results from educational endeavors, such as a 50% decreased risk of obstetric anal sphincter injuries in Norway [11]; a decreased prevalence of severe asphyxia following the implementation of a national educational program in Australia [12], and a simultaneous decrease in emergency cesarean section and operative vaginal delivery in Sweden [13]. Some of the components of effective team training are in-house-setting, multi-professional teams, realistic training tools, and regular recurrence [14]. Suitably designed educational programs can potentially increase quality and enhance safety, but their effects require evaluation.

One possible quality and safety indicator is failed extraction, since this has been identified as a risk factor for adverse perinatal outcome [15,16,17,18,19], and recommended as audible standards by Royal College of Obstetricians and Gynaecologists [1]. A further risk increase seems to occur when multiple modes are needed, that is vacuum and forceps attempt preceding emergency cesarean section [18, 20].

This pre-post intervention study aimed to observe measurable effects on the safety of vacuum extraction deliveries when introducing team training and increased monitoring aimed at vacuum extraction. We compared three time periods characterized by different clinic-based activities that might influence how vacuum extraction is performed: Baseline (period 0), implemented team training (period 1 and 2) and monitoring of traction force during vacuum extraction (period 2). These activities are presented in more detail in the Material and methods section. Our primary hypothesis was a decreased failed extraction rate following team training (period 1), due to an increased adherence to guidelines in the delivery team, including obstetricians´ technique. In period 2, after the additional introduction of objective traction force measurement, we wanted to observe any alteration in the failed extraction rate.

Methods

We retrospectively included all women at Karolinska University hospital in Huddinge, Sweden who underwent delivery by complete or attempted vacuum extraction at fetal head station low or mid during three time periods, N = 1074. Period 0: 2007–2008; Period 1: 2011–2012; Period 2: 2013–2014. (2009–2010 are excluded as they were transition years when the educational program was in a start-up phase). See Fig. 1 for timeline of exposure. The design is conceptually equivalent to a cross-sectional study with three prevalence periods. We identified the cohorts retrospectively in the electronic medical records system (Obstetrix©). Each individual vacuum extraction protocol was examined to confirm and separately classify mid vs low deliveries according to American and British guidelines [1, 2], since this distinction of low vs mid is not available in Swedish registers. Vacuum extractions in Sweden are nearly exclusively carried out by doctors, and for non-outlet extractions metal cups are often used, nearly exclusively Bird metal cup size 50 mm. In this material only one non-outlet extraction was carried out by a midwife. All extractions reported as outlet extractions were excluded, since clinical experience and our previous data indicate that outlet extractions seldom lead to failed extractions or other complications [21]. Preterm delivery (< 36 full weeks of gestation) and multiple pregnancies were excluded. See Additional file 1 for flow chart. Data collection of clinical variables was carried out by two experienced medical doctors (residents) in obstetrics and gynecology. In addition, we identified all cases with emergency cesarean section at fully dilated cervix by scrutinizing the partogram and medical record text of all emergency cesarean section records. A description is presented of the overall registered rates of vacuum extraction, the proportion of non-outlet (low and mid) extractions, and emergency cesarean sections at the clinic during the time periods.

Fig. 1
figure 1

Timeline of exposure periods

Exposure (interventions)

Period 0 constitutes a conceptual null exposure, that is the vacuum extraction management prior to implementing active structured measures to improve delivery care. These structured improvement actions therefore form the exposure or intervention in period 1, and are described in detail in Additional file 2. In summary, they consist of a time out check list for vacuum extractions, annually recurrent clinical setting multi-professional team training based on vacuum extraction cases, and education regarding risk factors for complicated extractions. The team training sessions are led by a senior consultant and experienced midwife, and set in a delivery room using a mock pelvis and live actor from the staff, with direct feed back from a peer group rather than video recordings. The exposure or intervention of period 2 is the research based introduction of an electronical extraction handle which objectively measures and records the magnitude and duration of traction force employed in metal cup extractions.

All term, singleton deliveries were eligible, and there was no feedback on traction force to the obstetrician. These measurements infer an unintended but foreseen monitoring of individual performance. Both exposures may also be afflicted with a general highlighting effect that we believe is inevitable when all personnel are participating in a specific project, a concept known as the Hawthorne effect [22]. The equipment and results of traction force measurement have been described in detail by the authors elsewhere [21, 23].

Clinical characteristics

Maternal variables included age (years); short stature (< 1.55 m); obesity (Body mass index ≥30) and parity (0/1). Obstetric variables included gestational age (days); use of synthetic oxytocin (y/n); epidural analgesia (y/n); duration of first and second stage (minutes); fetal head station identified in the vacuum extraction protocol (low vs mid); fetal head position (occiput anterior or non-occiput anterior); indication of the procedure (dystocia or fetal distress, where dystocia included maternal fatigue and prophylactic); large for gestational age ≥ 4500 g. A registered double diagnosis of indication was scrutinized by the data collectors, requiring either a non-reassuring cardiotocogram or an elevated scalp lactate to be indexed as fetal distress); duration of the extraction (minutes from beginning of extraction until final cup removal).

Outcome variables

The primary outcome variable, failed extraction, was identified in the medical records. Failed vacuum was defined as an attempt at extraction followed by forceps, emergency cesarean section, or both. The procedural secondary outcomes were failed extraction with more than one sequential mode of delivery (followed by both forceps and cesarean section), as well as complicated extractions, defined as one or more of the following: more than 15 min duration; more than six pulls, or more than one cup detachment. Clinical secondary outcome included shoulder dystocia (y/n); asphyxia (pH < 7,0 y/n); APGAR score < 7 at 5 min (y/n); admission to neonatal intensive care unit (y/n) and anal sphincter injury (y/n).

Statistical analyses

Clinical characteristics were analyzed using descriptive frequency measures and hypothesis tests as applicable: Mean (standard deviation) and t-test for normally distributed numerical variables, median (interquartile range) Rank sum and Kruskal-Wallis for skewed continuous data, and Chi2 for categorical data. As advised for analytical cross-sectional studies, Poisson regression was used for multivariate analysis of association between the exposure(s) and outcome, providing an approximation of a relative risk estimate based on prevalence proportions [24, 25]. P-values in the multivariate analyses were Bonferroni corrected due to multiple testing. The primary and secondary procedural outcomes were adjusted by year of birth. No other available variables were identified as confounders. We have not performed repeat measurement calculations as generalized estimating equation or mixed models effect, as the recurring individuals constituted only 1 % of the study sample. Confidence intervals regarding differences involving total delivery numbers in the sample (all vacuum extractions, emergency cesarean section) were calculated by hand, since they are based on group data rather than individuals.

Missing data

There was no missing data in primary outcome. The secondary procedural outcome complicated extraction was missing in 3 %. Maternal body mass index was the only clinical variable missing in more than 1 %: (7.5%). Since these were not primary variables, we decided not to handle this missing data further.

Results

The proportion of failed extractions among low and mid extractions was 12 %, and sequential instruments were used in 1.5%. In approximately half of the cases, the indication for vacuum extraction was dystocia. More than half of the extractions were carried out at mid cavity fetal head station.

Maternal and obstetric clinical characteristics are shown in Table 1. Epidural and oxytocin use, parity, occiput anterior position, obesity and large for gestational age showed no significant differences between the three exposure periods. All other independent variables displayed some degree of difference throughout the study periods.

Table 1 Clinical characteristics

Primary and secondary procedural outcome is shown in Table 2. The primary outcome failed vacuum extraction decreased significantly in period 1 compared to period 0, adjusted relative risk 0.48. A significant decrease in complicated extractions was seen between period 1 and 2, adjusted relative risk 0.42.

Table 2 Primary and secondary procedural outcome, relative risk

The secondary outcome multiple sequential modes of delivery could not be calculated due to few cases: thirteen cases in period 0, and two cases in period 1 and 2 respectively. Secondary clinical outcome showed no significant differences (Additional file 3: Table S1).

The distribution of operative modes of delivery in the study sample is shown in Fig. 2. The total rate of vacuum extraction has decreased; period 0 9.7%; period 1 8.8%; period 2 6.5% (p < 0.05). The proportion of low and mid extractions (as compared to outlet) increased markedly throughout the periods: period 0 36%; period 1 45%; period 2 60% (p < 0.05). There was no significant increase in emergency cesarean sections between the first two periods, but from period 1 to period 2 the rate increased by 1.3% units (p < 0.05). The rate of emergency cesarean section without prior attempt of vacuum extraction at fully dilated cervix and engaged fetal head showed a non-significant increasing trend: period 0: 2.1%; period 1: 2.7%; period 2: 3.4%.

Fig. 2
figure 2

Mode of delivery. Mode of delivery (total VE, proportion low and mid VE, total ECS and proportion ECS at fully dilated cervix without prior attempt at VE) during the exposure periods. * = p < 0.05

Discussion

Main findings

The main finding of this study is the decreased frequency of failed extractions after introduction of clinical team training, and a decreased risk of complicated extraction at the additional introduction of monitoring through traction force measurement. The total frequency of vacuum extraction also decreased during the whole study period, while the proportion of emergency cesarean section was increased.

Interpretation (in the light of other evidence)

To the best of our knowledge, no previous study has specifically investigated the effect of clinical team training or monitoring traction force on failed extraction rates. However, previous studies have shown positive effects of team training [14], and educational efforts have been found to decrease the frequency of anal sphincter injury and asphyxia, and to increase rates of normal delivery [11,12,13].

The subsiding effect on the failure rate after period I complicates the interpretation; does the team training effect fade out, or is it counteracted by an opposing effect of traction force measurement? Or was there no true exposure effect between period 0 and period I, but merely a regression towards the mean and residual confounding? Interestingly, the proportion of low and mid cavity extractions increased during the study period, a situation with known increased risk for failure compared to outlet vacuum extraction. Unfortunately, the design does not allow for a separate analysis of the two exposure effects.

An increased emergency cesarean section rate as the price to pay for fewer (failed) extractions may seem intuitive, but this observation is not necessarily a direct effect of fewer vacuum extractions; emergency cesarean sections performed at a fully dilated cervix without a prior attempt at vacuum or forceps did not increase significantly during the studied time periods. This may indicate that obstetricians did not become more prone to choose cesarean section instead of vacuum, since the increase in emergency cesarean section rates were not predominantly in the category that could have undergone an attempted extraction. A recent Swedish hospital based study on approximately 8000 nulliparous women further support the notion that obstetric care enhancing efforts can lead to decreased vaginal instrumental delivery without a simultaneous increase in emergency cesarean section [13]. The monitoring of period 2 seems to have inferred an increased adherence to clinical guidelines, measured as a decreased risk of complicated extraction (prolonged duration or more than six pulls or more than one cup detachment). These factors are recurring in some guidelines [1, 3], whilst others stress the lack of evidence regarding concrete limitations [2]. Studies investigating the clinical effect of restricting extractions regarding duration, number of pulls and cup detachment show inconsistent results [6, 26, 27], and the relative importance of avoiding failed extractions vs adhering to guidelines therefore remains an open question.

Strengths and limitations

The major limitation of the pre-post test study design is arguably the restricted possibility of making causal inference: this design, also known as quasi experimental design, has inherent bias regarding comparison groups due to either a total absence of parallel control group (as in this particular study) or non-randomized control group, as well as differences due to the passage of time. Some confounding is arguably avoided by adjusting for year of birth, but there will still be residual confounding of which we do not know the magnitude. This might include known confounders where we lack data, such as policy changes, staff turnover and proportion of resident vs consultant doctors, as well as variables that we have overlooked. We are also aware of the specific problem of regression towards the mean; normally, an intervention is introduced because of an identified problem, and one can therefore expect a non-representatively extreme value of crucial variables at the study start. With a 19% failure rate among mid and low extractions in period 0, the pre-post comparison is likely to over-estimate the effect size. To overcome this, a prospective study with pre-specified evaluation protocol and a parallel control group would be a suggested alternative method.

In period 1, the imprecision of the components mediating the training effect to the decreased failure rate makes it difficult to interpret the results. Reflecting upon the clinical characteristics of Table 1, the effect was clearly not mediated by a population of taller and less dystocic women, nor smaller infants or an increased use of oxytocin. In this material, therefore, we lack an obvious explanatory model. One strength regarding exposure is that the team training set-up contained most of the active components of effective team training identified previously in a review [14].

In exposure period 2, it is reasonable to think that monitoring traction force would make obstetricians more cautious and prone to adhere to guidelines when selecting cases for and performing vacuum extraction. In a monitoring situation, a more cautious selection of candidates for vacuum extraction might be recognized as a decreasing proportion of mid cavity station and non-occiput anterior position extractions from period 1 to period 2, but this is not confirmed by our results. However, the decreased risk of complicated extraction in period 2 might support a possible effect of awareness during monitoring. In a population based study on failed extractions [19], the authors argue that a small increase in failed extractions during a period of educational efforts might actually be an effect of sensible obstetric decisions to consciously convert a difficult extraction instead of employing maximum traction force. Our finding of a partly negative association between the rate of failed extractions and rate of complicated extractions makes a possible illustration of this phenomenon.

Conclusion

Some of the decrease in failed extractions is likely an effect of active educational effort. The possible monitoring effect of the traction force measurements in period 2 did not seem to deter obstetricians from mid station extractions, but the monitoring may have led to an increased adherence to clinical guidelines.

Despite the limitations to inferring causality in this study design, the results support that alterations in clinical practice should be accompanied by clinical team education.