Introduction

Urinary incontinence (UI) is the complaint of involuntary loss of urine [1]. It is a common and embarrassing problem, evoking substantial individual morbidity, loss in quality of life and socio-economic costs [2, 3]. In addition to the loss of bladder control, the need to wear incontinence pads often harms the individuality and self-confidence of young pre-partum women [4]. UI ranges from occasionally leaking urine when coughing or sneezing [stress UI (SUI)] to UI preceded by urgency [urgency UI (UUI)], or a combination of both [mixed UI (MUI)]. In the peri-partum period women often experience UI for the first time. In general, SUI is more related to the peri-partum period, whereas the prevalence of UUI and MUI increases with age [5]. Pregnancy and (vaginal) delivery are important risk factors in the development of UI in life [2, 6]. Moreover, when SUI presents during pregnancy, the risk of having SUI at 12 years post-partum is significant [7].

The prevalence and incidence of UI in pregnancy is widely researched. However, these prevalence and/or incidence figures vary greatly throughout published reports, depending on the local setting, case definitions applied, recruited population (trimester of pregnancy and parity), and study methodology [8, 9]. Former systematic reviews focused on the prevalence of pelvic floor disorders (PFDs) among community-dwelling women [10], the prevalence of UI in nulliparous women [11] or in female athletes [12]. To our knowledge, no systematic review and meta-analysis on the prevalence and incidence of UI in pregnancy is available. Reliable prevalence and incidence rates on UI in pregnancy are not only needed to indicate the burden of the health problem, but also to better inform health professionals, policy makers and researchers to set priorities and to assist in planning management of UI[13]. Furthermore, it is known that not all pregnant women are bothered by experiencing UI. It is reported that the crude UI prevalence rate is higher and probably overestimated compared to the prevalence rate of significant or bothersome UI [3]. As bothersome UI is associated with help-seeking behaviour, this discrepancy may have crucial consequences for research planning, health care providers and policy makers [14]. However, a clear and widely accepted definition of bothersome UI still does not exist, which results in the use of heterogeneous terminology and measurement instruments.

Therefore, the primary aim of this systematic review and meta-analysis was to examine the pooled overall prevalence and incidence of UI in pregnancy in the general population, specified for relevant subcategories (trimester of pregnancy, parity, type of UI, frequency and amount). A secondary aim was to provide an overview of the measurement instruments and their outcomes for bother in relation to UI as used in included studies.

Methods

The MOOSE statement for reporting systematic reviews and meta-analyses was followed [15]. The research protocol was published in the PROSPERO database (registration number CRD42018111991).

Search strategy

We performed a systematic review and meta-analysis of observational studies reporting on the prevalence and/or incidence of UI during pregnancy and experienced bother in relation to UI. We searched the electronic databases of PubMed, EMBASE and CINAHL.

We used the following search terms to search all databases: pregnancy, pregn*, prepartum, pre-partum, pre partum, peripartum, peri-partum, peri partum, nulliparous, primiparous, primigrav*, primipar*, multiparous, multigrav*, multipar*, urinary incontinence, urine loss, pelvic floor disorders, pelvic floor dysfunctions, leaking urine, incontinence, prevalence, incidence, epidemiology, bothersomeness, bother* and quality of life. In the Appendix the complete search strategy for PubMed is provided. This search string was adapted for use in the other databases.

Eligibility criteria

Observational studies published between January 1, 1998, and January 1, 2019, in Dutch, English, Portuguese, German and French were included. All studies examining prevalence and/or incidence of UI among adult primi- and multigravid women, regardless of weeks of gestation, type of UI, setting and country, were of interest. Outcomes of interest were prevalence and/or incidence of (bothersome) UI. Exclusion criteria were: articles not available in full or not reporting an overall UI prevalence of any frequency and studies examining only twin pregnancies. When articles did not report a prevalence or incidence figure or response rate, an attempt was made for estimation from the information provided. Throughout this article we use the term bother (in relation to UI) as umbrella term for related constructs [impact on daily life or quality of life (QOL)].

Study selection

Titles and/or abstracts of studies retrieved using the search strategy and those from additional sources were screened independently by two reviewers (HM and EB) to identify studies that potentially meet the inclusion criteria. The full text of these potentially eligible studies were retrieved and independently assessed for eligibility by two reviewers. Any disagreement on eligibility was resolved through discussion with a third reviewer (BB). All the included articles were reference checked.

Data extraction and risk of bias

Information on each study was extracted in a standardized data extraction form, based on the Cochrane Public Health Data Extraction and Assessment template[16].

To assess the risk of bias, the Joanna Briggs critical appraisal tool for studies reporting prevalence data was used [17, 18]. The checklist consists of nine questions, with the response options yes, no, unclear or not applicable. Overall risk of study bias was rated as low (defined as 8–9 criteria answered as ‘yes’), moderate (4–7 criteria answered as ‘yes’) or high risk (≤3 criteria answered as ‘yes’). The response option not applicable (occasionally scored in criteria 5) was considered to be a ‘yes’. Two reviewers extracted data independently. Inconsistencies were identified and resolved through discussion including a third author if necessary.

Characteristics regarding measurement instruments for bother were extracted in a separate standardized extraction form. The form contains items such as measurement instrument, related construct and measurement results.

Summary measures, statistical analyses and heterogeneity

We used a random effects model to pool the inverse variance (IV) weighted prevalence of UI in individuals to avoid undue influence on the summary estimate from smaller and less precise studies or studies with a very small prevalence. Pooled prevalence and incidence values were reported with 95% confidence intervals (CI). The degree of heterogeneity was determined by the I2 statistic, with I2 > 75% labelled as considerable heterogeneity [19].

We performed subgroup analyses based on trimester, parity, type and frequency of UI, as these factors may explain why studies show varying prevalence figures. Trimesters 1, 2 and 3 were defined as weeks 1–13, 14–26 and 27 to at term (42 weeks), respectively. STATA Statistical Software, release 15, was used for analysis.

To determine the overall experienced bother in relation to UI across included studies, the total scores of the different measurement instruments for bother were converted to a (standardized) 0 to 100 scale, with 0 indicating no bother and 100 indicating extremely bothered. We classified 1 to 20 as no to mild bother, 20 to 40 as mild to moderate bother, 40 to 60 as moderate to severe, 60 to 80 as severe to very severe and 80 to 100 as extremely severe bother.

Results

Study selection

Among the 1338 papers initially identified, 44 met the eligibility criteria (Fig. 1), resulting in a total of 88,305 participants. All included studies were observational and published between 1998 and January 1, 2019.

Fig. 1
figure 1

Study flow diagram

Risk of bias

The risk of bias items for each study are shown in Table 1. High, moderate and low risks of bias were considered to be present in 3, 34 and 7 studies respectively. Risk of bias items with the lowest ratings were 8 and 9, and risk of bias items with the highest ratings were 1 and 4.

Table 1 Characteristics and outcomes of included studies

Study characteristics

Seventeen studies originated from Asia, 15 from Europe, 8 from the USA, 3 from Africa and 1 from Oceania. The majority of women were included from a (tertiary) hospital. Other studies included women from a civil registration system [20], midwifery area [21], hospital and maternity unit [22] or obstetric/child health clinic [23, 24]. Table 1 summarizes the study characteristics of included studies.

Thirteen studies reported on (measurement instruments for) bother, whereas one study (73) reported on two measurement instruments. The result of only one measurement instrument was reported for this study, as the second one (SF-36) was incomplete. Table 2 provides an overview of the measurement instruments as used in included studies, with the original and the converted (0–100 scale) measurement results.

Table 2 Measurement of bother and results

Six different measurement instruments for bother were used, of which the ICIQ-UI SF was most frequently used. Two studies reported the results of the ICIQ-UI SF as categories [25, 26]. One measurement instrument was self-constructed and non-validated [27].

Synthesis of results

Overall prevalence

Forty-four studies involving a total of 88.305 women were used to calculate the overall prevalence of UI. The weighted average of UI prevalence among pregnant women was 41.0% (CI 95% 34.0–48.0%; I2: 99.77%), regardless of trimester, parity or type of UI (Fig. 2). The lowest prevalence of UI found in the included studies was 9% [28] and the highest prevalence 75% [29]. Prevalence figures for low, moderate and high risk of bias studies were 38% (95% 18.0–58.0), 41% (95% 36.0–46.0) and 47% (95% 39.0–54.0) respectively.

Fig. 2
figure 2

Pooled prevalence of UI during pregnancy

Subcategories trimester of pregnancy, type of UI and parity

Five out of the 44 studies included women from trimester 1 or 2 or two out of three pregnancy trimesters. Fifteen studies recruited women from the third trimester, with an overall UI prevalence of 47% (95% CI: 37.0–58.0%). Twenty-four studies recruited women from trimester 1–3, with an overall UI prevalence of 40% (95% CI: 34.0–45.0%). Based on 24 studies, SUI accounts for 63% of UI cases, whereas UUI, MUI and unexplained UI were 12%, 22% and 3% respectively.

When parity is taken into account, 42% of nulliparous women experience UI (based on 12 studies; 95% CI 33.0–51.0%; I2 = 98.6%), whereas four studies reporting only on primiparous women found an overall UI prevalence of 31% (95% CI 26.0–36.0%; I2 90.6%). Twenty-seven studies included women with any parity, resulting in a pooled prevalence of 42% (95% CI 32.0–53.0%; I2 99.8%).

Based on 12 out of 44 studies, the overall prevalence for UI in trimesters 1, 2 and 3 is 9% (95% CI 6.0–12.0%; I2 97.7%), 19% (95% CI 12.0–25.0%; I2 98.7%) and 34% (95% CI 23.0–46.0%; I2 99.0%) respectively.

Subcategories frequency and amount of UI

Based on ten studies, monthly UI accounts for 40% of UI cases (95% CI 23.0–57.0%; I2 99.0%), weekly UI for 33% (95% CI 23.0–43.0%; I2 94.8%) and daily UI for 26% (95% CI 20.0–32.0%; I2 86.9%).

The majority of studies (n = 9), reporting on the amount of urine loss (n = 14), used the International Consultation on Incontinence Questionnaire-Urinary Incontinence Short Form (ICIQ-UI SF) to assess this parameter (none, small, moderate, large amount) [30]. Three studies reported separately the ICIQ-UI SF amount item, showing that the majority (79.2–86.9%) of UI cases lose a small amount. Other descriptions of amount of urine lost were: drops or just a little, more like a trickle, more than a trickle [31, 32], a few droplets, a stream [33] and drops, small splashes and more [26, 34].

Bother

Thirteen studies reported on impact on daily life, quality of life or bother. It was heterogeneously assessed; however, the ICIQ-UI SF was used in the majority of studies (n = 7). In two studies question 3 of the ICIQ-UI SF on interference in daily life was reported as a measurement instrument for bother. Other measurement instruments that were used only once were the Incontinence Quality of Life (I-QOL), Incontinence Impact Questionnaire (IIQ-7), Wagner’s quality of life questionnaire and a self-constructed non-validated questionnaire. The overall bother of UI during pregnancy, on a 0 to 100 scale, ranges between 9.5 and 34.1, consistent with mild to moderate bother, whereas the experienced bother is higher in the 3rd trimester (between 13.3 and 57.6) (Table 3).

Table 3 Converted results 0–100 by measurement instrument

Case definition

The majority of studies (n = 30) did not specify a case definition for UI. Four studies used as a case definition ‘any leakage’ or used the frequency (n = 5), amount/volume (n = 1), timeframe (n = 2) or UI type (n = 3) criteria in their case definition, or a combination of those (n = 3).

Incidence

Few studies have examined incidence of UI during pregnancy, using different trimesters of pregnancy and case definitions. Therefore, no pooling was done for this outcome. Five studies reported on new-onset UI during pregnancy among women who were continent 12 months before the index pregnancy [31, 32] or who had no UI previous to pregnancy [20, 22, 26]. Daly et al. reported that 21.7% of nulliparous women experienced any new-onset urinary leakage in early pregnancy [32]. The frequency of leakage among new-onset UI was less than once per month in 55% of cases and on a monthly, weekly and daily basis in 26.7%, 13.3% and 5.0% of cases respectively. The majority (83.1%) experienced drops or just a little amount of leakage. Brown et al. [31] reported 146 incident cases for any UI in early pregnancy (≤ 24 weeks of gestation; 16.4%) compared to 561 cases in late pregnancy (31 weeks; 63.2%). It appeared that new cases of SUI accounted for more than two thirds of prevalent cases in early and late pregnancy, 70.4% and 73.9%, respectively. Hvidman et al. concluded that UI incidence during pregnancy was 16.8% among nulliparous and 8.4% among primiparous women [20]. Overall, incidence rates in early pregnancy among nulliparous women range between 16.4% and 21.7% [31, 32]. When considering late pregnancy, the incidence rate increases to 45.6–63.2% [22, 31]. The incidence rate of UI during first pregnancy, regardless of trimester, is 16.8–39.1% [20, 26].

Discussion

The aim of this systematic review was to examine the pooled prevalence and incidence of UI during pregnancy and to provide an overview of measurement instruments, including the measurement results, to assess bother in relation to UI. The results show an overall mean prevalence of UI during pregnancy of 41%, with a range of 9–75%. The prevalence numbers rise with gestational period from 9% in the first trimester to 34% in the third. SUI is the most prevalent type of UI, accounting for 63% of cases. Twenty-six percent of the women reported daily loss, whereas 40% reported loss on a monthly basis. Most of the cases reported a small amount of urine loss.

Incidence/new onset UI in nulliparous women in early pregnancy varied between 16.4% and 21.7%[31, 32]. This variation might be explained by the different case definition used for UI (e.g. any UI [32] in contrast to UI at least once a month [31]). Incidence in late pregnancy increased to 63.2% [31]. Over 70% of new onset UI was SUI. The high prevalence and rising incidence numbers of SUI during pregnancy might be due to several factors such as physiological weight gain, which results in increased intra-abdominal pressure on the bladder and pelvic floor muscles [35]. Additionally, it is known that pregnant women with SUI have significantly less pelvic floor muscle strength and thickness [36] and/or a larger hiatal area at rest and during pelvic floor muscle contraction [37]. But also previous childbirth and high body mass index are risk factors for developing SUI [38, 39].

Most included studies showed a moderate risk of bias. Although several factors influence reported prevalence rates, e.g. case definition, studies with moderate or high risk of bias may distort prevalence and/or incidence rates. The prevalence rate among three studies with high risk of bias is 47% compared to 38% among studies with low risk of bias (in studies with a moderate risk of bias the prevalence is 41%). As studies with a low risk of bias tend to have a slightly lower prevalence, it is likely that the real prevalence of UI in pregnant women is in the range of 38–41%.

Only 13 out of 44 studies reported bother in relation to UI; these studies used a variety of measurement instruments. In an attempt to provide an overall assessment of experienced bother in relation to UI, we (arbitrarily) chose to standardize the measurement results of different bother scales to a 0 to 100 scale. Bother of UI during pregnancy ranges between 9.5 and 34.1 on a (standardized) 0 to 100 scale. The 0 to 100 scale can be regarded as a visual analogue scale (VAS). The VAS is a valid and reproducible method to measure the impact of UI on QOL [40]. No studies are known that report on cut-off points for QOL in pregnant women with UI. Therefore, cut-off points must be interpreted cautiously. One study comparing the VAS with a measure that assesses the impact on functioning in patients with pain identified three classes: class 1, mild interference (score 1–34); class 2, moderate interference (score 35–64); class 3, severe interference with daily life (score 65–100) [41]. Based on these classes the overall bother of UI during pregnancy is mild and in the third trimester mild to moderate. One study reporting on bother of UI in the last 4 weeks of pregnancy reported the highest bother of 57 [42]. This might be due to the rising prevalence over time in pregnancy [29, 34, 42,43,44,45,46].

The ICI provides an overview of (recommended) grade A (high-quality) measurement instruments for bother in relation to UI [3] and advises to report prevalence figures in combination with the experienced bother. The ICIQ-UI SF, IIQ and I-QOL, for example, are rated as grade A measurement instruments. Wagner’s QOL and the VAS are not incorporated in the ICI overview, nor is the separate use of question 3 of the ICIQ-UI SF as a bother measure. A closer look at the measurement instruments shows that there are differences with regard to assessed constructs and domains. The ICIQ-UI SF is a quick way to assess frequency, severity and bother of UI. The IIQ, I-QOL and Wagner’s QOL scale assess bother of UI with a variety of subscales such as psychosocial impact, social embarrassment, relations, and physical activity and provide therefore more in-depth information.

This systematic review revealed that the reporting of prevalence with a measure of bother is not common practice yet. To improve the reporting of UI prevalence, it is recommended that in research projects both prevalence and bother should be measured with high quality measurement instruments in line with the recommendations of ICI. In clinical practice, measurement results of bother support healthcare professionals in the clinical reasoning process as it may provide information on diagnosis or prognosis or may evaluate one’s own actions. At the same time, it standardizes communication with colleagues. Moreover, measurement results can be used to better inform patients about their situation and to involve them more easily in joint therapy decisions (shared decision making).

The construct bother in relation to UI seems difficult to grasp, as included studies used different definitions. The following terms were used: effect on daily activities/everyday life, interference on daily life, health-related quality of life, severity, lifestyle changes, (perceived) impact on quality of life, distress, experienced discomfort and amount of bother. As the degree of bother is related to help-seeking behaviour for UI [47, 48], it is of importance to define the construct bother (what does bother in relation to UI mean for pregnant women) and quantify bother. When bother is well defined and quantified, this will support researchers in selecting the appropriate measurement instrument and interpretation of the results.

Based on the prevalence figures, it would appear that UI in pregnancy is an enormous healthcare problem. However, not everyone seeks (medical) help for UI immediately. Several factors determine help-seeking behaviour of pregnant women, such as awareness of treatment possibilities and the experienced burden of UI [48, 49]. Also the belief that UI will resolve by itself after delivery and the lack of knowledge that UI during pregnancy raises the odds for post-partum UI substantially obstructs help-seeking [50, 51].

Management of UI should be directed to women who seek help for UI, but may also be directed towards women who experience bother or have risk factors for developing UI (prevention). Such uncertainties require further evaluation and data on duration of treatment effects of PFM(G)T [52]. Maternity care workers need to assess women for the presence, severity and bother of UI and, in consultation with them, develop a specially tailored plan of care to meet the women’s needs.

The strength of this systematic review is the large number of included studies, which resulted in the availability of prevalence and incidence numbers for different subpopulations (countries, parity, trimester of pregnancy) and for different purposes (research planning, health care providers and policy makers). To our knowledge, this review is the first one that focused on assessment methods for bother in relation to UI and degree of adherence to the recommendations of ICI with regard to this topic.

The limitations of this systematic review are, first, the presence of substantial clinical heterogeneity of the studies. Clinical heterogeneity is due to differences in: case definition (any UI or different frequencies of UI in a certain period of time), population (primigravida-multigravida) or periods researched (first, second, third trimester or any specific trimester). Second, the considerable statistical heterogeneity of the studies resulted in large CIs. Third, as the Joanna Briggs critical appraisal tool does not recommend cut-off points for high, moderate or low risk of bias, we arbitrarily chose the cut-off points reported in this systematic review to explore possible differences in prevalence numbers if stratified for risk of bias. However, we did not include or exclude studies based on risk of bias.

Conclusion

UI is a very common symptom in pregnancy, and the prevalence rises as weeks of gestation progress. SUI is the most common types and in most of the cases a small amount of urine was lost. The level of bother for UI is heterogeneously assessed and is experienced as mild to moderate by pregnant women.