Introduction

Spontaneous rupture of membranes (SROM) is a normal physiological event. In about 10% of the population, it happens before labour starts [1]. It is believed that prelabour rupture of membranes increases the risk of infection and therefore induction of labour is recommended in an attempt to reduce such risk [2]. However, there is controversy about whether the induction reduces that risk. Moreover the risk of infection is always present, even when the membranes are intact. One of the downfalls of the routine induction of labour is that it limits the potential for women and her infant to experience a normal/physiological birth and its long term benefits.

Therefore, the management of prelabour rupture of membranes has been an issue of debate since the 1960’s and the pendulum has swung between inducing labour as soon as possible in an attempt to reduce the risk of infection, and giving women time to start labour spontaneously in an attempt to increase the chances of having a physiological birth and reduce the risk of caesarean section that can be associated with the induction of labour for women with prelabour rupture of membranes [3]. A Cochrane systematic review published by Middleton et al. [4] found an increase in maternal infectious morbidity (chorioamnionitis and/or endometritis combined) for women who had expectant management following term prelabour rupture of membranes, and their infants were more likely to have definite or probable early-onset neonatal sepsis combined. However, importantly for this review, Middleton et al. [4] found no statistically significant differences for caesarean section, serious maternal morbidity and mortality, definite neonatal sepsis alone, or perinatal mortality.

In addition, vaginal examinations have been known to be associated with an increased risk of chorioamninionitis. One of the first studies to highlight this issue was carried out by Schutte et al. [5]. They discovered that what was more significant was the length of time between the first vaginal examination and the birth rather than the time between the rupture of membranes and the birth. Further studies have demonstrated that vaginal examinations increase the risk of chorioamnionitis [6]. However, since vaginal examinations are a very common procedure in clinical practice, these are often overlooked and its effects have not been extensively researched in the context of prelabour rupture of membranes.

The incidence of induction of labour keeps rising, increasing in England from 22% in 2011-2012 to 33% in 2021-2022 [7]. Prelabour rupture of membranes at term is a common cause for routine induction of labour due to national guidelines’ recommendations [2]. The aim of this systematic review is to identify, evaluate and synthesise the results from observational and RCTs studies over the past three decades that compare active vs expectant management, that include vaginal examinations, and that had chorioamnionitis and/or normal birth as outcomes.

Methods

This section of the paper outlines the process that was followed to identify the primary research studies that answered the research questions. Due to its complexity, three research questions were identified using the PICO diagram and subsequently, the results from these searches were combined. The research questions are:

1) For term prelabour rupture of membranes, is expectant management associated with a lower rate of chorioamnionitis compared to active management? 2) For term prelabour rupture of membranes, is expectant management associated with a higher rate of normal birth compared to active management? 3) For term prelabour rupture of membranes, are vaginal examinations associated with chorioamnionitis?

The systematic review aimed to identify randomised controlled trials (RCTs) and observational studies that compared active vs expectant management, vaginal examinations and had chorioamnionitis and/or normal birth as outcomes. The following databases were used: MEDLINE, EMBASE, Maternity and Infant care, LILACS, CINAHL and the Cochrane Central Register of Controlled trials (CENTRAL). The systematic review was last updated in November 2019. No time limit was set on the searches as the aim was to identify all research studies that met the inclusion criteria to see how the management of prelabour rupture of membranes had evolved over time. The three research questions stated above refer to term pregnancy (36 weeks or more). The searches were performed initially without specifying gestational age but were subsequently screened manually for term pregnancy. This is because it was the approach that identified more studies. The studies that referred to less than 36 weeks gestation were excluded manually. It was decided to include papers in all different languages. Therefore, all the published studies that met these criteria were listed regardless of language. However, only papers written in English, Spanish or French were read and analysed. The inclusion criteria are outlined on Table 1.

Table 1 Inclusion and exclusion criteria

Quality assessment

There are several published tools to aid the quality assessment of research studies, as well as different tools depending on the type or methodology of the research. The quality assessment process carried out for this review is based on a synthesis of both the CASP tools [8,9,10] and the Cochrane risk of bias assessment tool developed by Higgins et al. [11]. This was because this systematic review included observational studies, as well as randomised controlled trials. Therefore, 13 questions were used for the RCTs and 12 questions for the observational studies. Tables 2 and 3 present the questions that were used to assess the quality of the RCT and observational studies respectively, and which published tool they were conceptually drawn from.

Although the risk of bias assessment tool developed by Higgins et al. [11] is well known and well accepted by the academic community to assess the quality of RCTs, It was decided to add some of the questions and concepts developed by the Critical Appraisal Skills Programme to complement it because they would contribute to assess the quality of all the studies included in this systematic review. The implications of this choice are that two similar lists of questions were created that made the process of assessing the quality of the studies less complicated, without compromising the quality assessment. Both, the Cochrane risk of bias assessment tool and the Critical Appraisal Skills Programme are deemed good tools to assess the quality of the studies.

Table 2 Quality assessment questions for RCT studies

Furthermore, the results from the systematic review are in agreement with a recent Cochrane systematic review published by Middleton et al. [4] in that the quality of most studies in this topic is generally low. Since there was a high degree of heterogeneity in the outcomes to be measured, it was not possible to do meta-analysis. All the studies that were found in the searches that met the criteria are presented in Table 4, and no studies were omitted due to their quality. Therefore, there is no bias in reporting.

A total of 13 questions were to be answered by the RCTS and 12 questions for the observational studies. All questions were deemed equaly important and had the same weight. Therefore, all studies were given a score between 0 and 12 or 13, in which 0 indicated very poor quality and 12-13 extremely good quality and then a percentage was obtained, for example 8/13 (61.5%). It was decided a priori that studies that scored less than 40% were considered low quality, studies that scored between 41% and 79% were considered intermediate quality and studies that scored 80% or more were considered of high quality. In order to maintain consistency and rigor, in the case of RCT studies where it was not clear what the primary and secondary outcomes were or in cases where these were not stated, these studies were given a score of 0 as an answer to question: “Were the primary and secondary outcomes clearly stated?”. Also in the case of RCT studies, if the randomisation system used was either not stated or the allocation to treatment was done by the day of the week, or the number of the hospital number or by means of sealed envelopes, these studies were given a score of 0 as an answer to the question “In the case of RCT, was the allocation to treatment randomised?”.

Table 3 Quality assessment questions for observational studies

Results

In total, there are 32 studies included in this review after the final search in November 2019, 27 studies were RCT (Randomised controlled trials) or quasi-randomised and 5 were observational studies [12,13,14,15,16]. In this systematic review, what is understood by randomised controlled trial is a study that has a truly random method of allocating participants to the different treatment groups, such as a random list of computer generated numbers or a computer that does the randomisation online, which means it cannot be predicted which treatment group the participant will be allocated to. On the other hand, a quasi-randomised trial, is one in which the allocation of participants can be easily predicted, because the study uses a method of allocation that is not random, for example, when the allocation of participants is based on the last digit of the date of birth, or the last digit of the medical record number, or odd numbers are allocated to group 1 and even numbers to group 2. Using these easily predictable methods to allocate participants to different treatment groups can introduce selection bias into the study. In this systematic review several studies were deemed to be quasi-randomised controlled trials, such as [17,18,19,20,21,22]. Figure 1 summarises the results obtained through the three searches mentioned earlier. This figure shows the number of papers that were relevant and met the inclusion criteria prior to assessing their quality.

Fig. 1
figure 1

Summary of results from the three searches

The 32 studies were undertaken all over the world, the TERMPROM was an international multicentre study that was carried out in 6 countries (Canada, UK, Australia, Sweden, Denmark, and Israel), six were undertaken in Europe, five in the USA, one in Canada, one in South America, two in Africa, six in Middle East, and four in Asia.

With regard to the length of time of the expectant management, out of these 27 RCT studies, two had expectant labour up to 12 hours, two studies compared IOL (induction of labour) at 12 hours vs IOL at 24 hours, 12 studies had EM (expectant management) up to 33 hours, three studies had an expectant arm up to 48 hours, three RCT had an expectant management up to 96 hours and five RCT did not state a time limit on the expectant management [21,22,23,24,25].The Supplementary file presents the RCT and observational studies found, organised in different tables according to the length of SROM (spontaneous rupture of membranes) in the expectant management.

The other aspect of study in these research studies was which agent/drug is associated with better clinical outcomes, across the included studies, the three drugs used to initiate labour were: Intravenous oxytocin, prostaglandings (PGE\(_{2}\)) and misoprostol (PGE\(_{1}\)). This systematic review, was not focused on the drugs that were used during the induction of labour but on the comparison between expectant and active management.

With regard to the primary outcome, the majority of studies focused on caesarean section or neonatal infection. No studies used “normal birth” or an equivalent term, or chorioamnionitis, as a primary outcome. Making the choice of primary outcomes included in this systematic review is one of the elements of originality. The majority of the studies were of poor quality and only four studies were scored 60% or more [3, 26,27,28]. The main problems were that the primary and secondary outcomes were not stated, the lack of definition of the outcomes, studies that are not properly randomised (i.e studies where the allocation could be predicted, for example allocation by the day of the week, or the number at the end of the case notes) or cases of selective reporting amongst other issues.

The small sample size, was another issue. With the exception of the TERMPROM trial, other studies whose primary outcome was neonatal infection were underpowered.

In the TERMPROM study [27], the rate of chorioamnionitis was higher when women had expectant management in comparison to those who had active management and were induced with IV oxytocin. However, the study authors do not report that the difference in chorioamnionitis between Active management and Expectant management [78/1259 (6.2%) vs 99/1261 (7.8%)] when inducing with prostaglandins was not statistically significant (\(X^{2}\)=2.446, Dof=1; p=0.104). Figure 2 provides a graphical representation of these results.

Fig. 2
figure 2

Chi-square test results on chorioamnionitis in TERMPROM study

The results found when inducing with intravenous oxytocin have been generalised, when, if labour is induced with prostaglandins, the difference is not significant.

This finding is important since nowadays labours are more likely to be induced with prostaglandins than with oxytocin alone. Therefore, the results that are relevant to current practice suggest that there may be less risk of chorioamnionitis associated with expectant management than has previously been assumed based on the TERMPROM results [27].

In terms of the mode of birth, as stated earlier no studies used “normal birth” or an equivalent term as primary outcome. However, the studies conducted by Grant et al. [26], and Natale et al. [28] had caesarean section as primary outcome, but only the study carried out by Grant et al. [26] had enough statistical power to address caesarean section as a primary outcome.

Grant et al. [26] compared active management (Inmediate induction of labour with IV oxytocin) with expectant management (up to 33 hours) and concluded that women allocated to the expectant management had fewer caesarean sections [38/219 (17.4% vs 25/225 (11.1%)] OR0.60; 95%CI 0.35 to 1.02; P=0.06] but the difference was not statistically significant.

The number of vaginal examinations that women received during labour was not the primary focus in any of the included studies, although the TERMPROM study [6] highlighted that the number of vaginal examinations was the strongest correlator of chorioamnionitis. Figure 3 shows a graphical representation of the relationship between VEs and chorioamnionitis based on the data published by Seaward et al. [6].

Fig. 3
figure 3

Relationship between number of VEs and chorioamnionitis

The number of vaginal examinations during labour appeared to be retrospectively analysed in some studies [26, 27, 29]. While a secondary analysis performed by Seaward et al. [6] on the TERMPROM trial concluded that vaginal examinations were associated with higher rates of infection, none of the included studies prospectively included an intervention to try to minimise chorioamnionitis by reducing the number of vaginal examinations. The studies performed by Akyol et al. [29] and Grant et al. [26] reported the number of vaginal examinations, the former as a categorical variable and the latter as a continuous variable. Neither of them conducted any analysis to see if the number of vaginal examinations was associated with chorioamnionitis. Akyol et al. [29], Hannah et al. [27], and Grant et al. [26] analysed the number of vaginal examinations whereas the studies conducted by Natale et al. [28] and Ottervanger et al. [3] did not.

Table 4 Summary of studies included in the review

Discussion

The search strategy for this systematic review did not identify any prospective studies that answered the question of whether expectant management and a reduced number of vaginal examinations are associated with a higher rate of normal birth and a lower rate of chorioamnionitis.

Although some studies looked at caesarean section, they did not provide information about physiological labour and birth. Systematic reviews of what matters to women around the world indicate that “normal/physiological birth” is valued by most [30], and the recent Lancet Series on reducing caesarian section noted that one effective way of doing this is to increase physiological labour and birth [31].

Through the process of searching and gathering studies, it also became evident that there are no studies on this topic that looked at normal birth as an outcome, most of the studies looked at reducing caesarean section as opposed to increase physiological birth. The Lancet midwifery series supports more studies with physiological birth as an outcome.

The studies found identified that the management of prelabour rupture of membranes is a matter of global interest, as there were studies published in the five continents, both in developed and high income countries, as well as in developing countries. Apart from the fact that the use of epidural was reported more often in Europe than other parts of the world, there were no particular trends depending on the country, making the findings more generalisable.

The following limitations of this systematic review were identified; This systematic review was not registered, most of the studies found were of poor quality, very few had computerised randomisation, the primary and secondary outcomes were not stated or these were not clear, the diagnosis of chorioamnionitis or neonatal infection was not blinded, the diagnosis of chorioamnionitis and neonatal infection varied a lot and in some cases the definitions were not appropriate, making it very difficult to perform a meta-analysis due to the high degree of heterogeneity.

On the other hand, most of the studies were published more than ten years ago, and although the issue of prelabour rupture of membranes has not changed, and women continue to break their waters before going into labour, practices around birth have changed.The secondary analysis undertaken for this review on the induction agent used in the TERMPROM study, which is the largest study to date, published by Hannah et al. [27] revealed that there was no statistical difference in the risk of chorioamnionitis when labour was induced with prostaglandins followed by IV oxytocin if needed in comparison to when labour was induced with IV Oxytocin on its own. Given the more widespread use of prostaglandins for labour induction in recent years, there is now a need for updated studies taking this change of practice into account.

The lack of attention to the impact of frequency of vaginal examinations is also a concern, both from the potential impact on infection (and the consequent potential need for antibiotics, in light of the increasing antibiotic resistance) and because vaginal examinations can often cause anxiety to woman [32]. A study published in Sweden, showed that 45% of women found the gynaecological examination to be a “negative” experience [33]. Therefore, it is important to minimise the number of vaginal examinations performed during their labours.

Conclusion

There are no published studies (RCTs or observational) that have looked at expectant management and an approach to minimise vaginal examinations during labour for prelabour rupture of membranes to maximise the chances of physiological birth and minimise chorioamnionitis. Considering that vaginal examinations are a routine intervention during most labours and that there is evidence that vaginal examinations are one of the strongest correlators of chorioamnionitis, it is crucial to carry out more studies that find ways to monitor the progress of labour using other means. Future studies in the management of prelabour rupture of membranes should be designed and powered to include both physiological birth and chorioamnionitis as birth outcomes.