Introduction

Urinary incontinence (UI) is the complaint of involuntary loss of urine [1]. The main subtypes of UI are stress (S) UI, urgency (U) UI and mixed (M) UI. SUI is leaking urine when coughing or sneezing [1]. SUI is more common in younger women [2]. Pregnancy and vaginal delivery are well-documented risk factors for developing UI [3,4,5]. Seventy-three percent of women with UI 3 months post-partum still report UI at 6 years post-partum [6]. In general, UI prevalence and incidence rise with ageing [7]. Women often experience UI as embarrassing and humiliating, resulting in loss in quality of life [8]. UI also causes considerable socio-economic costs [9, 10].

The prevalence and incidence of UI in the post-partum period are widely studied. However, these prevalence and/or incidence figures vary greatly throughout published reports, depending on local setting, case definitions applied, recruited population (period post-partum and parity) and study methodology [11, 12]. A systematic review on the prevalence of post-partum UI and the relation to the mode of delivery was published in 2010 [13]. At that time, studies hardly reported on bother. In 2017, the International Consultation on Incontinence (ICI) recommended that prevalence numbers should be accompanied by the experienced bother [14], as there are indications that the prevalence of bothersome UI is lower than the crude UI prevalence [14]. As women with bothersome UI tend to seek more help [15], health professionals, policy makers and researchers need reliable prevalence numbers to specify the health problem UI causes and to help set priorities and assist in planning the management of UI.

Therefore, the primary aim of this systematic review and meta-analysis was to examine the pooled overall prevalence and incidence of UI between 6 weeks and 1 year post-partum in the general population of the Western world, specified for relevant subcategories (period post-partum, parity, type of UI, frequency and amount). A secondary aim was to provide an overview of the assessment methods and outcomes for bother in relation to UI as used in included studies.

Methods

The MOOSE statement for reporting systematic reviews and meta-analyses was followed [16]. The research protocol was published in the PROSPERO database (registration number CRD42018111991).

Search strategy

We performed a systematic review and meta-analysis of observational studies (cross-sectional and cohort studies) reporting on the prevalence and incidence [17] of UI after delivery and experienced bother. We searched the electronic databases of PubMed, EMBASE and CINAHL. All included articles were reference checked. Titles and/or abstracts of studies retrieved using the search strategy and those from additional sources were screened independently by two reviewers. Full texts of potentially eligible studies were retrieved and independently assessed for eligibility by two review team members. Any disagreement on eligibility was resolved through discussion with a third reviewer.

We used the following search terms to search all databases: postpartum, post-partum, post partum, peripartum, peri-partum, peri partum, primiparous, multiparous, multigrav*, multipar*, urinary incontinence, urine loss, leaking urine, incontinence, prevalence, incidence, epidemiology, frequency, bothersomeness, bother*, quality of life and hindrance. In the Appendix the complete search strategy for PubMed is provided. This search string was adapted for use in the other databases.

Eligibility criteria

Observational studies published between January 1, 1998, and March 1, 2020, in Dutch, English, German and French were included. All studies examining prevalence and/or incidence of UI from 6 weeks to 12 months post-partum among adult primi- and multiparous women in the Western world, regardless of type of UI and setting, were of interest. Six weeks post-partum was chosen to ensure a large proportion of the sample had recovered physiologically from the delivery. Outcomes of interest were prevalence and/or incidence of (bothersome) UI. Exclusion criteria were: articles not available in full or not reporting an overall UI prevalence and/or incidence of any frequency, studies examining only twin pregnancies and studies originating from non-Western countries. The latter criteria were chosen for the purpose of homogeneity in population characteristics. When articles did not report a prevalence figure or response rate, an attempt was made for estimation from the information provided. Throughout this article we use the term bother (in relation to UI) as an umbrella term for related constructs [impact on daily life or quality of life (QOL)].

Study selection

Titles and/or abstracts of studies retrieved using the search strategy and those from additional sources were screened independently by two reviewers (HM and EB) to identify studies that potentially meet the inclusion criteria outlined above. The full texts of these potentially eligible studies were retrieved and independently assessed for eligibility by these two reviewers. Any disagreement on eligibility was resolved through discussion with a third reviewer (BB). All the included articles were reference checked.

Data extraction and risk of bias

Information on each study was extracted in a standardized data extraction form, based on the Cochrane Public Health Data Extraction and Assessment template [18]. To assess the risk of bias, the Joanna Briggs critical appraisal tool for studies reporting prevalence data was used [19, 20]. The checklist consists of nine questions, with the response options yes, no, unclear or not applicable. Overall risk of study bias was rated as low (defined as 8–9 criteria answered as ‘yes’), moderate (4–7 criteria answered as ‘yes’) or high risk (≤ 3 criteria answered as ‘yes’). The response option not applicable (occasionally scored in criteria 5) was considered to be a ‘yes’. Two reviewers (HM, EB) extracted data independently. Inconsistencies were identified and resolved through discussion including a third author (BB) if necessary.

Characteristics regarding measurement instruments for bother were extracted in a separate standardized extraction form. The form contains items such as measurement instrument, related construct and measurement results.

Summary measures, statistical analyses and heterogeneity

We used a random effects model to pool the inverse variance (IV) weighted prevalence of UI in individuals to avoid undue influence on the summary estimate from smaller and less precise studies or studies with a very small prevalence. Pooled prevalence and incidence values were reported with 95% confidence intervals (CI). The degree of heterogeneity was determined by the I2 statistic, with I2 > 75% labelled as high heterogeneity [21].

Prevalence was studied by subgroup [post-partum period (6 weeks, 3, 6, 9 and 12 months), type and frequency of UI, and parity (primi- and multiparous)] as this might explain why studies show varying prevalence figures. Studies reporting on a post-partum period other than the five established periods are classified in the closest post-partum period. Moreover, studies reporting a period prevalence (e.g. 9–12 months post-partum) are classified in the upper range of the period prevalence (i.e. 12 months), as most women will most likely report on their current status, which is less prone to recall bias. Incidence is reported in two periods: from delivery up to and including 3 months post-partum and from 3 to 12 months post-partum and for primi- and multiparous women. STATA Statistical Software, release 15, was used for analysis.

To determine the overall experienced bother in relation to UI across included studies, the measurement results of the different measurement instruments for bother were converted, where possible, to a (standardized) 0 to 100 scale, with 0 indicating no bother and 100 indicating extremely bothered. We classified 1 to 20 as no to mild bother, 20 to 40 as mild to moderate bother, 40 to 60 as moderate to severe, 60 to 80 as severe to very severe and 80 to 100 as extremely severe bother. We used the following conversion method for the ICIQ-UI SF (range 0-21): converted score = observed original score * 4.76 (the value 4.76 is derived from 100 (upper limit converted score)/21 (upper limit original score). Likewise, question 3 from the ICIQ-UI SF (range 0–10) is calculated as follows: converted score = observed original score * 10.

Results

Study selection

Among the 1063 papers initially identified, 31 met the eligibility criteria (Fig. 1), resulting in a total of 38,209 participants. All included studies were observational (20 cohort studies [5, 11, 12, 22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38] and 11 cross-sectional studies [39,40,41,42,43,44,45,46,47,48,49]) and published between 1998 and March 1, 2020. Studies were excluded based on inadequate study design, study population, non-Western countries, outcome, follow-up or language. Twenty-nine studies reported on prevalence and/or incidence figures and two studies only reported on incidence figures.

Fig. 1
figure 1

Study flow diagram

Risk of bias

The risk of bias for each study is shown in Table 1. High, moderate and low risk of bias was considered to be present in 1, 26 and 4 studies respectively. Risk-of-bias items with the lowest ratings were 8 and 9, and risk-of-bias items with the highest ratings were 1 and 3.

Table 1 Characteristics and outcomes of included studies post-partum

Study characteristics

The studies originated from Europe (n = 17), North America (n = 8) and Australia (n = 5). One study was mixed (Europe/Australia). The majority of women were included from a (tertiary) hospital (n = 26). The remaining studies included women from the community, primary health care service or health care insurance service. Nineteen studies only reported on primiparous women. Twelve studies used validated questionnaires to determine the presence of UI, and 19 studies used self-constructed, non-validated questionnaires. Table 1 summarizes the study characteristics of included studies.

Nine studies reported on (measurement instruments for) bother. Table 2 provides an overview of the measurement instruments as used in the included studies, with the original and the converted (0–100 scale) measurement results. Five different measurement instruments for bother were used, of which the ICIQ-UI SF was most frequently used [27, 29, 37, 39]. One study only reported the results of the ICIQ-UI SF as categories [50], and two studies did not report total scores [24, 45]. One measurement instrument was self-constructed and non-validated [30].

Table 2 Measurement of bother and results

Synthesis of results

Overall prevalence

Twenty-four out of 31 studies contributed to the calculation of the overall prevalence of post-partum UI, involving a total of 35,064 women. The weighted mean of UI prevalence among post-partum women (6 weeks to 12 months) was 31.0% (CI 95% 26.0–36.0%; I2: 99.0%), regardless of parity or type of UI (Fig. 2). The lowest prevalence of UI found in the included studies was 10% [51] and the highest prevalence 63% [52]. Prevalence figures for studies with low (n = 3), moderate (n = 20 studies) and high risk of bias (n = 1) were 28% (95% CI 17.0–39.0), 29% (95% CI 24.0–35.0) and 63%, respectively (Table 1).

Fig. 2
figure 2

Pooled prevalence of UI post-partum

Subcategories post-partum period, type of UI and parity

Figure 3 summarizes the mean UI prevalence at 6 weeks, 3, 6, 9, and 12 months post-partum. From an initial drop in (weighted) prevalence between 6 weeks [24.0%, 95% CI: 17.0-32.0% (1349/5137)] and 3 months post-partum [21.0%, 95% CI: 17.0–25.0% (3677/17,165)], prevalence numbers gradually rise to 32.0% at 12 months post-partum (95% CI: 23.0–41.0% (2997/9220)). The prevalence of UI post-partum is equal among primi- and multiparous women, 31% [11, 12, 22,23,24, 26, 27, 36, 37, 42, 44, 48, 49, 53] and 30% [30, 33,34,35, 40, 43, 45,46,47, 54], respectively.

Fig. 3
figure 3

Prevalence of UI by type and period

Based on nine studies, SUI accounts for 54% UUI and MUI for 26% and 16% of cases respectively, whereas 4% was unexplained UI [23, 24, 27, 34, 36, 37, 44, 47, 48].

Subcategories frequency and amount of UI

Seven out of 31 studies reported on frequency of UI. The most used frequency categories (n = 3) were: less than once a week, less than daily, more than or equal once per week and more than or equal daily leakage. A frequency of less than once a week was most frequently reported (50%–66.3%) [23, 46, 55]. Two studies reported frequency of UI as: less than once per month, a few times a month, a few times a week, every day and/or night [34, 50]. One study reported: occasionally, once per week, several times per week and daily [32] and one study reported the ICIQ-UI SF question on frequency [27].

Four studies reported on the amount of urine loss [5, 27, 34, 50]. One study used the ICIQ-UI SF to assess this parameter (none, small, moderate, large amount) [27]. One study reported the ICIQ-UI SF item ‘amount’ separately, showing that the majority of UI patients lose a small amount (85.3%) [27]. Other descriptions of amount of urine lost used were: drops, small splashes and more [34, 50]. Drops were most frequently reported in 71.6% of cases [50]. The remaining study reported amount as a drop or two, pad or clothing damp, and pad or clothing soaked [5].

Bother

Nine studies reported on impact on daily life or quality of life [24, 27, 30, 31, 37, 39, 45, 50, 53], which was heterogeneously assessed. The ICIQ-UI SF total score was used most frequently (n = 5). Martin-Martin et al. reported the impact on daily life (0–10) based on the ICIQ-UI SF [31]. Other questionnaires used once to assess impact on daily life were: Incontinence Quality of life (I-QOL) [45], King’s Health Questionnaire (KHQ) [24] and a self-constructed non-validated questionnaire [30]. The overall bother of UI post-partum, on a 0 to 100 scale, ranges between 24.3 and 47.6, consistent with mild to moderate bother. At 3 months post-partum, degree of bother ranged between 28.6 and 41.0, at 6 months post-partum between 39.0 and 45.0 and at 12 months post-partum between 24.3 and 28.1 (Table 2).

Case definition

The majority of studies (n = 11) used ‘any leakage’ as a case definition [5, 24, 25, 27, 28, 30, 33, 49, 53, 55, 56]. Eight studies used the ICS definition, which was not mentioned as such in some cases [11, 23, 32, 35, 38, 39, 46, 47]. Six studies did not specify a case definition for UI [12, 31, 37, 43, 48, 57], five used a frequency [22, 26, 34, 40, 50], and one study used the Clinical Classification of Urinary Incontinence (FPSUND) [45].

Incidence

Ten studies have examined the incidence of UI post-partum (Table 1) [11, 25, 31, 34, 36, 38, 39, 49, 54, 56]. Five studies reported incidence up to and including 3 months [25, 31, 34, 50, 54] and six reported from 3 until 12 months [11, 25, 38, 39, 49, 56]. One study reported for both periods [25]. The incidence of UI in primiparous and multiparous women up to 3 months was 9.0–21.9% and 4.4–30.0%, respectively. Incidence up to 1 year was 4.3–34.1% in primiparous women.

Discussion

The aim of this systematic review and meta-analysis was to summarize the pooled prevalence and incidence of UI between 6 weeks and 12 months post-partum, to provide an overview of assessment methods for bother in relation to UI and to assess the degree of bother post-partum. The results show an overall mean prevalence rate of UI up to 1 year post-partum of 31%, with a range of 10% to 63%. The prevalence of 10% was reported in a study on maternal health using a generic questionnaire including only one question on UI [43] in contrast to the other studies using health problem-specific questionnaires. This might have influenced the tendency for women to report UI [58].

The prevalence numbers in the first year post-partum rise from 24% at 6 weeks to 32% at 12 months post-partum after an initial drop between 6 weeks and 3 months. A recently published systematic review and meta-analysis on the prevalence of UI during pregnancy reported a prevalence of UI of 34% in the third trimester [59]. The drop in UI prevalence early post-partum compared to the third trimester of pregnancy might be explained by the natural recovery of the levator ani muscle, which occurs mainly up to 4 to 6 months post-partum [60, 61]. The rise in prevalence from 3 to 12 months post-partum might be due to return to daily activities, such as return to work and starting with sports, with an associated increase in physical activity level and as a consequent loading of the continence system [62, 63]. The prevalence of UI between primi- and multiparous women was nearly equal (31% and 30%). This is in line with the EPINCONT study on 27,900 women, which reported that the first delivery is the largest risk factor for UI, more specifically SUI and MUI, post-partum [64].

Thom et al. published a systematic review with 33 studies on the prevalence of post-partum urinary incontinence. The overall prevalence reported by Thom et al. between 2 and 13 weeks post-partum was 33.3%. As only one included study covered the period of 14 to 52 weeks post-partum, an overall prevalence number could not be calculated [13]; 33.3% is a higher prevalence than the 24% at 6 weeks and 21% at 3 months reported in this study. This might be due to the fact that Thom et al. did not report a weighted prevalence.

When interpreting the prevalence numbers at different time points post-partum, it is important to keep in mind that UI might be a dynamic phenomenon. This means that a woman’s continence status can change both ways over a period of time [33].

The incidence numbers between 6 weeks up to 3 months and 3 months up to 12 months and among primi- and multiparous women varied. The low incidence number of 4.3% in the short term might be explained by the fact that this study only reported on SUI or MUI incidence [49]. Although the study of Thomason et al. claims to report the incidence of total UI, only women who reported UI with a positive (cough) stress test were included. Women who were able to contract their pelvic floor muscles properly and timely during an anticipated known rise in abdominal pressure might therefore be considered continent. However, these women might be incontinent during an unexpected rise in abdominal pressure. Also, the small sample (n = 121) this study is based on might distort the results. If the overall incidence of the up to 3 months post-partum is compared with the up to 12 months post-partum group, the incidence numbers show a small rise in the latter, 4.3–30.0% and 4.4–34.1%, respectively. The rise in incidence follows the pattern of the rise in prevalence of UI between 3 and 12 months post-partum.

Most included studies showed a moderate risk of bias, which influences the possibility to differentiate prevalence between groups regarding risk of methodological quality. The mean prevalence of UI reported by studies with low and moderate risk of bias did not differ. However, the one high risk study reported the highest prevalence of 63% [48]. Because a weighted prevalence number was calculated, this high risk study with only 196 participants and low response rate of 25.6% hardly influences the overall prevalence of UI.

The ICI recommends reporting prevalence numbers along with a measure of experienced bother [14]. Only 9 out of 31 studies (approximately 30%) reported bother in relation to UI with a variety of measurement instruments, which shows that combined assessment is not yet common practice [24, 27, 30, 31, 36, 37, 39, 45, 53]. Eight studies used high-quality measurement instruments, most frequently the ICIQ-UI SF [27, 36, 37, 39, 53]. In an attempt to provide an overall assessment of degree of experienced bother in relation to UI, after studying all available materials, we decided to standardize the measurement results of different bother scales to a 0 to 100 scale. The 0 to 100 scale can be regarded as a visual analogue scale (VAS). The VAS is a valid and reproducible method to quantify the impact of UI on QoL [65], although no studies are known that report on cut-off scores for QoL specifically in post-partum women with UI. Boonstra et al. compared the VAS with a measure that assesses the impact on functioning in patients with pain and identified three classes: class 1, mild interference (score 1–34), class 2, moderate interference (score 35–64) and class 3, severe interference with daily life (score 65–100) [66]. Based on these classes, this systematic review revealed that women experience their post-partum UI as mild to moderate (range 24.3–47.6). Based on two studies, the results show a trend that bother of UI reduces at 12 months post-partum [27, 53]. Women report for instance that UI becomes less of a problem because they get used to it and that they find practical ways to cope by using panty liners and avoiding certain activities [54].

Nevertheless, over half of the women with UI post-partum think that it will improve by itself in time and only 25% of women with post-partum UI actually seek help [67]. However, 73% of women with UI 3 months post-partum still report UI at 6 years post-partum [6]. Reliable information on UI prevalence is thus essential to estimate health care burden, allocation of health care resources and research planning.

Strengths and limitations

The strength of this systematic review and meta-analysis is the large number of included studies, which resulted in the availability of prevalence and incidence numbers for different subpopulations (parity, post-partum period, type of UI) and for different purposes (health care providers, research planning and policy makers). This is the first review to report the prevalence and incidence over the first 12 months post-partum and bother in relation to post-partum UI.

The limitations of this study are, first, the presence of substantial clinical heterogeneity of the studies. Clinical heterogeneity may be due to differences in: case definition (any UI or different frequencies of UI in a certain period of time), population (primiparous and multiparous) or periods researched. Second, the considerable statistical heterogeneity of the studies resulted in large CIs. Third, as the Joanna Briggs critical appraisal tool does not recommend cut-off points for high, moderate or low risk of bias, we arbitrarily chose the cut-off points reported in this systematic review to explore possible differences in prevalence numbers if stratified for risk of bias. However, we did not include or exclude studies based on risk of bias.

Conclusion

After an initial drop in prevalence of UI at 3 months post-partum (21%), at 1 year post-partum, prevalence rises again to 31%. UI prevalence does not differ between primi- and multiparous women. Bother of UI is heterogeneously assessed and is reported as mild to moderate.