Introduction

Perinatal depression is defined as a non-psychotic depressive episode with various degrees of severity, from mild to major, occurring during pregnancy and up to one year following delivery [1,2,3,4,5], which has been shown to exert harmful effects on both mother and child [6,7,8,9]. Mothers may suffer from spontaneous abortion, bleeding during the gestational period, increased resistance of the uterine artery, preterm deliveries, and assisted deliveries, such as Caesarean section (CS), or the use of vaginal instruments [7]. Babies may have lower APGAR (Appearance Pulse Grimace Activity Respiration) scores, increased admission to neonatal care, a higher risk of poor growth development, and increased episodes of diarrheal infection [7, 9]. During the postpartum period, maternal depression has been linked with negative impacts on breastfeeding [3] and mother–infant interactions [1, 10].

Systematic reviews to establish the prevalence and correlates of perinatal depression are an important step in developing effective strategies and interventions, and several systematic reviews have been conducted to date [2, 11,12,13]. The number of systematic reviews addressing the prevalence and correlates of perinatal depression has been on the rise (a PubMed search found two reviews in the 1990s, 11 reviews in the 2000s, and 95 reviews in the 2010s). Systematic reviews of systematic reviews have been used to provide a broad overview and an up- to -date summary of the research in a particular field, which cannot be provided in more focused systematic reviews, highlighting commonalities and discrepancies, and hopefully identifying new promising approaches [14, 15].

The aim of the present research was to systematically review previous systematic reviews of perinatal depression to provide a global estimate of the prevalence rates and explore potential influences on those prevalence rates, including population characteristics and the methodological quality of the systematic reviews undertaken. Knowledge of the prevalence of perinatal depression and who is at risk will help inform public health policies to prevent and treat perinatal depression. We aimed to answer three research questions:

  1. 1.

    What is the global prevalence of perinatal depression?

  2. 2.

    What are the risk factors associated with developing perinatal depression?

  3. 3.

    What features of the systematic review and/or population under investigation might account for the prevalence and/or correlates of perinatal depression?

Materials and methods

Search strategy

The review followed guidelines for Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) [16]. The research questions, inclusion/exclusion criteria and data synthesis approaches were specified in advance and published in PROSPERO (CRD42019135838). We systematically reviewed materials published from the inception of the database until April 2022. The search for systematic reviews was performed using two strategies: (1) systematic searches of electronic databases (Embase, MEDLINE, PsycINFO, CINAHL Plus and Maternity and Infant Care) covering every source related to perinatal depression and grey literature (using PROSPERO and Scopus); and (2) hand searches, searching lists of references for cited systematic reviews. We developed search terms filtered by a combination of different key words and subject medical headings (MeSH), as shown in Fig. 1.

Fig. 1
figure 1

Search terms used (for MeSH terms and keywords in abstract and title)

Inclusion and exclusion criteria

To be considered “systematic”, a review must: (a) describe systematic searches of multiple databases to identify studies that match their research questions; (b) include details of inclusion and exclusion criteria; and (c) make some attempt at data synthesis, including meta-analysis. We included systematic reviews of all cohort, case–control, or cross-sectional studies that investigated and reported the prevalence of perinatal depression or its correlates (risk factors). In addition, all systematic reviews with interventions or experimental studies were included if their control groups provided any outcomes relevant to the prevalence and correlates of perinatal depression. Systematic reviews were included if they assessed the risk factors and prevalence of perinatal depression among women of all ages, ethnicity, country of origin or language spoken. No publication date or language restrictions were applied.

Systematic reviews were excluded if they were duplicates and if they did not report the prevalence of, or factors associated with, perinatal depression.

Data extraction

The initial search was performed by the first author using pre-determined research terms and strategies for searching databases. All search results were exported to EndNote (version 9) and duplicates were removed. Two reviewers (the first author and the doctoral candidate) then screened all of the titles and abstracts for eligibility, with an agreement level of 99%. Following this, the two reviewers read the full text of potentially relevant systematic reviews to determine whether they met the criteria of eligibility for final inclusion (96% agreement). In cases of disagreement between the reviewers, a third reviewer (a co-author) made the final decision. The main reviewer extracted data according to a pre-designated extraction form. The data extraction form included details about the first author of the systematic review, publication year, title of review, review objective or aims, number of included studies, type of studies, characteristics of population and sample size, countries, time of measurements, type of depression measurements and key findings (prevalence and/or risk factors).

Assessment of methodological quality

The validated Assessment of Multiple Systematic Reviews (AMSTAR) tool was used to assess the quality of the methodologies in the systematic reviews [17, 18]. AMSTAR contains 11 items with binary scoring; each item has a score of 1 if the criterion is met and 0 if it is not met, is unclear or inapplicable [19]. The total score of the review quality is then calculated by totaling the scores of each item [19]. The AMSTAR score has three levels of review quality: 8–11 indicates high quality, 4–7 indicates medium quality, and 0–3 indicates low quality [19].

Results

The review selection process in this study is presented in Fig. 2 [16]. We obtained 2550 potentially relevant records (comprising 2165 records identified through database searching and 385 identified from other sources, such as grey literature). Of the 2550 identified records, 1081 were duplicates (indexed in more than one database) and were excluded. The remaining 1469 records were included for initial screening of their titles and abstracts, as a result of which 1207 did not meet the inclusion criteria and were excluded. Full-text reviews were then carried out on the remaining 262 reviews, which identified 128 studies that met the inclusion criteria.

Fig. 2
figure 2

PRISMA diagram of search strategy

Description of the included systematic reviews

The 128 included systematic reviews described 4242 primary studies, of which 54 (42.2%) reported the prevalence of perinatal depression (supplementary Table 1) and 74 (57.8%) examined risk factors associated with developing perinatal depression (supplementary Table 2). All systematic reviews were examined for features that might account for their apparent discrepancies in the prevalence and/or correlates of perinatal depression.

Seventy-seven (60.2%) of the systematic reviews achieved high quality AMSTAR scores (8–11), while the remaining 51 (39.8%) achieved a medium quality score (4–7) (supplementary Table 3).

Global prevalence of perinatal depression

The overall mean prevalence of perinatal depression among the included reviews was 26.3%, (SD = 11.6, median = 23.8%). The mean prevalence of antenatal depression (from 11 of the 54 reviews, 22%) was 28.5% (SD = 18.2) versus the mean prevalence of postnatal depression (from 24 of the 54 reviews, 44%) which was 27.6% (SD = 8.0). The effect size for the difference was d = 0.1, which is considered a small effect [20].

Prevalence of difference between using self-reported measures and diagnostic assessment

There was a difference between the mean perinatal prevalence of depression identified using self-reported measures (27.4%, SD = 12.6) compared to those identified using a structured interview (17.0%, SD = 4.5; d = 1.0) and those identified using both self-reported measures and structured interviews (25.7%, SD = 9.0; d = 0.8). Out of the included systematic reviews, 48.1% were based only on self-reported measures, 50.0% were based on both self-reported and diagnostic assessments, and 1.9% of the included systematic reviews were based only on diagnostic assessment.

Prevalence in potentially vulnerable populations

The prevalence of perinatal depression was markedly higher (32.5%; SD = 16.7) among potentially vulnerable populations, such as immigrants [21,22,23,24,25], HIV-infected African women [26], women who gave birth prematurely or whose infants had a very low birth weight [27], mothers who abused substances during pregnancy [28], military personnel [29], women who had experienced earthquakes [30, 31] and pregnant inmates in correctional facilities [32], compared to the general population (excluding data from vulnerable groups), with an estimated mean prevalence of 24.5% (SD = 8.1; d = 0.6). This may indicate that potentially vulnerable populations are at a higher risk of developing perinatal depression than the general population.

The Prevalence of depression varied widely (SD = 16.7, Table 1), which was largely attributable to Mukherjee et al. [32], who reported 80% prevalence among pregnant inmates in their last trimester. Mukherjee et al. [32] interpreted their findings as being due to an absence of satisfactory medical care, isolation, stress, anxiety, maternal role transition, and parenting worries. This review uniquely considered depression among incarcerated pregnant women. Further research is required to address mental health issues among what might be a particularly vulnerable group.

Table 1 The mean prevalence of perinatal depression among vulnerable groups

Prevalence of perinatal depression during the COVID-19 pandemic

The overall mean prevalence of perinatal depression among the included three updated systematic reviews and meta-analyses [33,34,35], with a total of 43 studies evaluating perinatal depression during the COVID-19 pandemic, was 28% (SD = 6).

Factors associated with perinatal depression

A total of 2123 studies reported correlates of perinatal depression in 74 systematic reviews. The correlates were routinely classified as ‘major’ or ‘minor’.

Major correlates

Major correlates of perinatal depression were defined as statistically significant predictors in multivariable/logistic regression analyses or with a medium-to-large significant effect size (rs > 0.30). Other correlates, which were classed as minor or inconsistent, can be found in supplementary Table 2.

Medium to large significant effect sizes were reported in meta-analyses between perinatal depression and the following risk factors: personal history of mental illness (e.g., anxiety, depression, etc.), with mean r ranging from 0.30 to 0.51; childcare stress (r = 0.48–0.49) or infant temperament (0.33–0.34); experiencing stressful life events (r = 0.36–0.40); lack of social support (r = 0.37–0.45); maternity blues (r = 0.35–0.37); and marital conflicts or dissatisfaction (r = 0.37–0.39) [6, 11, 36, 37].

Lifetime history of abuse (e.g., childhood/adult abuse, maternal violence, or intimate partner violence) was found to conclusively and consistently lead to a developed risk of perinatal depression, sourced from nine systematic reviews and five meta-analyses [36,37,38,39,40,41,42,43,44,45,46,47,48,49]. Women who had experienced intimate partner violence during pregnancy had increased odds of both antenatal and postnatal depression, 1.69–3.76 and 1.46–7.04, respectively [41]. A history of abuse was significantly associated with antenatal depression, with moderate effect sizes for: any abuse (r = 0.29; 95% CI = 0.22–0.35; P < 0.001); physical abuse (r = 0.27; 95% CI = 0.24–0.30; P < 0.001); sexual abuse (r = 0.26; 95% CI = 0.22–0.30; P < 0.001); and emotional abuse (r = 0.34; 95% CI = 0.23–0.44; P < 0.001) [46]. A recent meta-analysis found that women who had experienced intimate partner violence had a 5.46-fold increased risk of postnatal depression (OR = 5.46 (95%CI: 3.94- 7.56) [36]. Another meta-analysis revealed that women who had experienced maternal violence were noted to be at a higher risk of postnatal depression (OR = 2.04; 95% CI 1.72–2.41) [49]. This risk was increased in women with a history of having experienced sexual violence (OR = 1.56; 95% CI 1.35–1.81), emotional violence (OR = 1.75; 95% CI 1.61–1.89), physical violence (OR = 1.90; 95% CI 1.36–2.67), and domestic violence (OR = 2.05; 95% CI 1.50–2.80) or childhood violence (OR = 1.59; 95% CI 1.34–1.88) [49].

In three of the included meta-analyses, gestational diabetes mellitus (GDM) was reported to lead to a significantly increased risk of postnatal depression (relative risk (RR) = 1.32, 95% CI 1.09–1.60, P = 0.001) [50] (RR = 1.59 (95% CI 1.22–2.07, P = 0.001) [51] compared to women without GDM. This finding was echoed by a US-based systematic review, which noted a strong relationship between GDM and postnatal depression among ethnically diverse women from LICs [52].

Chronic medical conditions were at increased risk for perinatal depression (adjusted pooled odds ratios [aPOR] = 1.45; 95% CI 1.25–1.67), specifically: diabetes (aPOR = 1.34; 95% CI 1.07–1.69); hypertension/heart disease (aPOR = 1.60; 95% CI 1.05–2.45); migraine (aPOR = 1.75; 95% CI 1.20–2.54); and other neurological disorders (aPOR = 1.45; 95% CI 1.19–1.77) [53]. The risk of perinatal depression was also noted to increase in severity among women with preeclampsia and preterm deliveries (severity score of 2.7) compared to healthy preterm mothers (severity score of 1.2) [54].

Other factors, such as exposure to second-hand smoke, also significantly increased the risk of developing perinatal depression (ORs = 1.77; 95% CI = 1.12–2.79; P = 0.01) [55]. Sleep disturbance as a predictor of postnatal depression was examined in one systematic review [56], with effect size being used as an indicator for the strength of the relationship between postnatal depression and postpartum sleep disturbance, categorised as: small (0.2–0.4), medium (0.5–0.7), large (0.8–1) and very large (1.1–2). That particular systematic review showed that the effect sizes ranged from moderate to large and very large (0.6–1.7), indicating a strong relationship between sleep disturbance and postnatal depression among its included studies, bar one study with a small effect size (0.4) [56].

Discussion

To the best of our knowledge, this is the first systematic review of systematic reviews focusing on perinatal depression and its correlates, providing an overall, global viewpoint. The primary aim of this study was to estimate the mean global prevalence of perinatal depression, which was found to be 26.3% (SD = 11.6) with a median prevalence of 24.3%. Notably, the absolute difference in perinatal depression prevalence estimates between antenatal (28.5%) and postnatal (27.6%) depression, though significant, was relatively small. Evidence shows that antenatal depression leads to an increased risk of developing postnatal depression [3, 57]. Therefore, early screening for depressive symptoms and diagnosis by an allied health professional during pregnancy is crucial to prevent postnatal depression, by providing early intervention and support. Perinatal depression rates were also noted to be higher among potentially vulnerable populations compared to the general population, thus developing protocols for the early identification and management of perinatal depression is essential for vulnerable groups.

Another aim of this study was to examine the correlates of risk factors for developing perinatal depression. Medium to large significant effect sizes were reported in meta-analyses between perinatal depression and the following of factors: personal history of mental illness (e.g., anxiety, depression, etc.); child care stress or infant temperament; experiencing stressful life events; lack of social support; maternity blues; marital conflicts or dissatisfaction and history of lifetime abuse (e.g., childhood/adult abuse, maternal violence, or intimate partner violence).

The third aim of this study was to report on the features of the included systematic reviews; variations in how the included systematic reviews were conducted could have potentially affected the findings of this systematic review of systematic reviews. Notably, the systematic reviews were restricted to a local community or geographical region [58,59,60]. Of the 128 included systematic reviews, 54 reported the prevalence of perinatal depression, 39 (72.2%) of which were country-specific, and only 10 (18.5%) reported the prevalence across more than one country. Populations present in a specific country may not be necessarily representative of the general population across other countries and there are even differences within countries due to a variety of differences, such as ethnicity and culture. There is currently no evidence of how these variations could potentially affect the findings, and so primary, cross-cultural research is needed to help understand this issue.

Various risk factors, such as exposure to second-hand smoke [55], GDM

[50, 51, 61] and sleep disturbance [62] were found to significantly increase the risk of postnatal depression; however, these risk factors were examined during the postnatal period and there was a lack of evidence regarding their association with antenatal depression. Therefore, separating the assessment of risk factors of both antenatal and postnatal depression is an important first step in clarifying the cause and effects among the many potential risk factors. Additionally, periodic assessment of the risk factors is needed as these correlates, such as social support and experiencing stressful life events, may fluctuate over time. Further research is needed to investigate these correlates both after delivery and during the pregnancy.

We also observed that each systematic review used different time periods to assess or identify the symptoms of depression. For example, some studies measured depression in the first trimester of pregnancy [63], while others measured depression during the pregnancy or 6 months after the birth, with some even including participants up to 1 year postnatally [1]. The time period during which the research is being carried out should be assessed more thoroughly prior to the start of the study, there is a need for further epidemiology studies to confirm prevalence rates of depression at different time points, as well as the need to confirm which specific time point(s) would allow for the best rate of diagnosis of depressed women [64].

Another noted discrepancy among the included systematic reviews was the assessment method used to identify depressive symptoms, there was a larger number of reviews using self-reported measures as compared to diagnostic assessment. This might have affected the estimates of perinatal depression due to self-reported measures potentially overestimating the findings. It was also observed that the self-reported measures among the included systematic reviews used different cut-off points for the indication of any depressive disorder or major depressive disorder: different Edinburgh Postnatal Depression Scale cut-off points of 9/10 (or more) to 12/13 (or more) out of 30; Beck’s Depression Inventory with 15 or 16 (or more) out of 63; and CESD between 16 (or more) to 24 (or more) out of 60. Despite this variation, there was no clear correlation between cut-off points and prevalence estimates for either measurement tool, suggesting that using lower scores may increase the number of false positives but may not necessarily increase the estimate.

Another feature among the systematic reviews was a lack of, or limited, adjustment for confounding factors which may affect the consistency of some of the risk factors of perinatal depression. For example, there were inconsistent findings in the relationship between CS and developing perinatal depression [65,66,67,68]. This may be explained due to the fact that CS seems to present as a risk factor only if women are vulnerable to perinatal depression, brought on by another factor, such as lack of experience with children and/or a negative birth experience. The impact of CS on maternal mood may also depend on the context in which the CS arose (e.g., cultural rules, poor mental health, previous negative birth experiences, preparedness and the social support available to the mother) [65, 67].

Although the systematic reviews were all of medium–high quality based on the AMSTAR tool, the quality of the primary studies needs further improvement in the following areas: publication bias, fewer longitudinal studies and more cross-sectional studies, different measures of predictor(s) and outcome(s), and sample size. These features may have led to low methodological quality, which in turn may increase confusion regarding the certainty of the estimates of the prevalence of perinatal depression and its risk factors.

Despite the aforementioned limitations, to the best of our knowledge, this is the first systematic review of systematic reviews summarising the global prevalence of perinatal depression and its associated risk factors. No publication date or language restrictions were applied and this helped to include and review all systematic reviews focusing on the prevalence and risk factors of perinatal depression. The main strength of this systematic review of systematic reviews is that it brings together reviews of differing quality, in different languages, summarises them and discusses them critically. The results of this study offer a concise and comprehensive review that covers both the estimation of perinatal depression and its common correlates.

Healthcare professionals, including nurses, midwives, and physicians, should be aware of the prevalence and risk factors of perinatal depression globally. Perinatal depression can lead to negative consequences [1, 3, 7,8,9,10], therefore the identification of effective and culturally applicable interventions to prevent and treat perinatal depression is crucial. The use of antidepressants and telemedicine was noted to be the most effective therapeutic interventions for treating perinatal depression based on recent research [69]. The early detection of perinatal depression is known to be important regarding the healthcare outcomes of the mothers and their babies. Screening programmes, carried out by well-trained healthcare professionals, are believed to lead to the most effective outcomes [13]. Special attention is needed in assessing and screening for depressive symptoms during both the antenatal and postnatal periods. As the perinatal period is the most common time in which women have contact with their healthcare providers, this is when at-risk women should be identified and supported. It is essential that all healthcare and allied professionals ensure that pregnant patients are appropriately screened for antenatal depression, in order to identify any symptoms of mental illness, such as depression, early and allow for appropriate support during pregnancy, preventing the risk of developing postnatal depression. The development of policies and the integration of mental health services into the primacy care system must be a high priority. To support this, research focusing on understanding the experiences of healthcare professionals, as well as perinatal depression in the primary care system, needs to be expanded upon, to clarify a clear pathway for policies addressing mental health needs in primary healthcare systems.

Conclusion

This systematic review of systematic reviews found that perinatal depression is highly prevalent and strongly associated with risk factors for depression in the general population, rather than factors associated with pregnancy. Improvements with regards to the quality of the primary research are needed and should aim to be more consistent, namely in: the design of longitudinal studies, representative study samples, standard measurements or scales with standard cut-off points, standard definitions of depression and confounding variables controlled. To align with these recommendations, more consideration is also needed with regards to different cultural expressions of distress and some individuals lack of access to mental health care in some countries.