Abstract
The objective of this study is to explore the associations between the patient-reported Edinburgh Postnatal Depression Scale (EPDS) and Patient Health Questionnaire (PHQ)-9 and clinician-reported 17-item Hamilton Depression Rating Scale (HAMD-17) in order to facilitate clinical decision-making. An integrated efficacy dataset of three randomized placebo-controlled trials (NCT02614547, NCT02942004, and NCT02942017) evaluating brexanolone injection, a neuroactive steroid chemically identical to allopregnanolone, in women with postpartum depression was used for this post hoc analysis. Data were pooled across treatment arms. Associations were assessed at day 30 (end-of-trial follow-up). Pearson correlation assessed the relationship between EPDS and PHQ-9 item and total scores and HAMD-17 total score. Cohen’s kappa assessed agreement of EPDS remission (score < 10) and PHQ-9 remission (score < 5) with HAMD-17 remission (score ≤ 7). Ordinary least squares (OLS) regression models were used to develop equations estimating HAMD-17 total scores from EPDS and PHQ-9 scores, respectively. The total scores showed large correlations (HAMD-17/EPDS: r = 0.71, p < 0.001; HAMD-17/PHQ-9: r = 0.75, p < 0.001). Individual EPDS and PHQ-9 items significantly correlated (r= 0.35 to 0.67, all p < 0.001) with HAMD-17 total score. EPDS had 79% sensitivity and 67% specificity to detect HAMD-17 remission; corresponding estimates for PHQ-9 were 76% and 78%. OLS models yielded the following equations: HAMD-17 total = 2.66 + (EPDS total × 0.87) and HAMD-17 total = 3.99 + (PHQ-9 total × 0.97). There were large and statistically significant associations between patient-reported outcomes (EPDS, PHQ-9) and clinician-reported outcomes (HAMD-17) as clinical improvements were associated with patient-reported symptom improvement. These results provide tools to help translate clinical trial data to clinical practice, thus aiding shared decision-making for this critical population.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Postpartum depression (PPD) is one of the most common medical complications during and after pregnancy, resulting in a substantial health-related quality of life burden for mothers, children, and partners (Callaghan et al. 2010; Centers for Disease Control and Prevention 2018; Centers for Disease Control and Prevention 2019; De Sisto et al. 2014; Ko et al. 2017; Martin et al. 2019; Moore Simas et al. 2019; Roberts et al. 2013; Vismara et al. 2016). In the USA, an estimated 11.5% of new mothers experience symptoms of PPD, with global estimates of 17.7% (Hahn-Holbrook et al. 2017; Ko et al. 2017).
Clinicians treating perinatal women commonly use the Edinburgh Postnatal Depression Scale (EPDS) or Patient Health Questionnaire (PHQ)-9 as PPD screening tools. Both the EPDS and PHQ-9 are patient-reported instruments and therefore provide unique perspective into the patient’s experience of their symptoms. Given the increasing focus on patient-centred care, understanding and incorporating patient’s voice into the assessment of health outcomes can facilitate shared decision-making, early diagnosis, individualized care, and patient satisfaction with treatment. The EPDS is a validated screening tool for PPD widely used to assess the likelihood of clinical depression and recently recommended by the American College of Obstetricians and Gynecologists (ACOG Committee Opinion No. 757 2018; Cox et al. 1987; Smith et al. 2016). Although the EPDS was developed as a screening tool, it is often used as an outcome measure in PPD studies and clinical practice (Appleby et al. 1997; McCabe-Beane et al. 2016; Sharp et al. 2010). The EPDS has been shown to perform well when compared to clinician-reported tools such as structured clinical interviews and more consistently than other patient-reported tools such as the Beck Depression Inventory (Beck et al. 1988; Bennett et al. 2004; Ji et al. 2011; Spinelli et al. 2013). The PHQ-9 is a general depression screening questionnaire that is also commonly used to screen for PPD, particularly in primary care settings, has been found to be highly specific for identifying PPD, and has comparable psychometrics, sensitivity, and specificity to the EPDS (Flynn et al. 2011; Gjerdingen et al. 2009; Kroenke et al. 2001; O'Connor et al. 2016; Sidebottom et al. 2012; Sit and Wisner 2009; Zhong et al. 2014).
The 17-item Hamilton Rating Scale for Depression (HAMD-17) is widely used as a primary endpoint in major depressive disorder (MDD) and PPD clinical trials and is commonly required by regulatory bodies (e.g. Clayton et al. 2015; Hantsoo et al. 2014; Papakostas et al. 2012). As an established clinician-reported outcome measure, it is regarded as the gold standard assessment in clinical trials, demonstrating stability over time and allowing for comparison of treatment effects between studies (Ji et al. 2011; Khan et al. 2002; Kim et al. 2014). However, given its complexity and length of time to complete, the HAMD-17 is less commonly used to assess depression in clinical practice. While EPDS and PHQ-9 severity group cut-off scores have been reported (Ji et al. 2011; Kroenke et al. 2001; McCabe-Beane et al. 2016), no mapping of EPDS or PHQ-9 scores to predict HAMD-17 scores has previously been reported. Understanding the direct relationship between patient-reported outcomes as measured either by the EPDS or by the PHQ-9 and the clinician-reported HAMD-17, based on evidence from a PPD-specific population, could further validate these screening tools in the identification of patients in need of treatment given symptom severity and enhance early diagnosis in this vulnerable population.
The present post hoc analysis aimed to bridge the existing interpretation gap and examine the relationship between the EPDS, PHQ-9, and HAMD-17 instruments. These post hoc analyses used an integrated dataset combining the data from three randomized, double-blind, trials of intravenous brexanolone compared to intravenous placebo in women with moderate and severe PPD (Kanes et al. 2017; Meltzer-Brody et al. 2018a). More specifically, using the HAMD-17 as the criterion measure, this study aimed to (1) assess the association between the current EPDS and PHQ-9 definitions of remission and the HAMD-17 definition of remission, (2) assess the association between the EPDS and PHQ-9 item and total scores with the HAMD-17 total score, (3) develop a mapping equation to permit the estimation of HAMD-17 scores from EPDS and PHQ-9 scores.
Methods
Study design and participants
The current report utilizes data from phase 2 and phase 3 clinical trials, which examined the safety and efficacy of intravenous brexanolone compared with intravenous placebo in patients with moderate and severe PPD (clinicaltrials.gov identifiers NCT02614547, NCT02942004, and NCT02942017). Full descriptions of the trial designs and inclusion and exclusion criteria have been published previously (Kanes et al. 2017; Meltzer-Brody et al. 2018a). Briefly, one phase 2 (study A) and two phase 3 (studies B and C) multicentre, randomized, double-blind, placebo-controlled trials were conducted across the USA under an umbrella protocol. All trials enrolled female participants; aged 18–45 years; in good physical health; had a major depressive episode with onset no earlier than the third trimester and no later than 4 weeks after delivery, as determined by the Structured Clinical Interview for Diagnostic and Statistical Manual of Mental Disorders-IV Axis I Disorders (First et al. 1996); had a HAMD-17 total score ≥ 26 (severe PPD, studies A and B) or 20–25 (moderate PPD, study C); and were 6 months postpartum or less at screening. Patients were either randomized 1:1:1 to receive 60-h infusion of brexanolone 90 μg/kg/h, brexanolone 60 μg/kg/h, or placebo (study B), or 1:1 to receive 90 μg/kg/h brexanolone or placebo (studies A and C). Patients who received the recommended target dose for brexanolone, 90 μg/kg/h, or placebo were included in the integrated efficacy dataset, employed for these analyses.
Patients were treated in a medically supervised setting for 72 h: 60 h of study drug infusion and 12 h for completion of assessments. Patients were followed until day 30, with clinical and safety assessments at days 7 and 30. The EPDS, PHQ-9, and HAMD-17 were administered at baseline, hour 60, day 7, and day 30 for all patients. The primary endpoint in each trial was the least-squares mean difference in change from baseline in HAMD-17 total score at hour 60.
Measures
The present analysis included three clinical outcome assessments: the patient-reported EPDS, patient-reported PHQ-9, and clinician-reported HAMD-17. The psychometric properties of these scales have been extensively described in the literature (Bagby et al. 2004; Flynn et al. 2011; Hamilton 1960; Kroenke et al. 2001; McCabe-Beane et al. 2016; Zhong et al. 2014).
The EPDS is a 10-item patient-reported outcome measure of depressive symptom severity specific to the perinatal period, originally developed as a screening tool. The EPDS has a recall period of 7 days. Items are scored on a 0–3 scale and summed to compute a total score, which ranges from 0 to 30, with higher scores indicating more severe depression. A score of 10 has been recommended as a cut-off point for the presence of minor PPD, and score of 13 and above for major PPD in English-speaking women (Cox et al. 1987; Harris et al. 1989; Murray and Carothers 1990). Remission was defined as an EPDS score of < 10 in the present analysis.
The PHQ-9 is a 9-item patient-reported outcome and depression screening tool. The PHQ-9 has a recall period of 14 days. Items are scored on a 0 (not at all) to 3 (nearly every day) scale and summed for a total score ranging from 0 to 27. Remission or minimal symptom severity has been defined as a score of < 5 (Kroenke et al. 2001).
The HAMD-17 is clinician-reported scale that evaluates core symptoms of depression. Items are scored on a 0 (none/absent) to 4 (most severe) or 0 (none/absent) to 2 (severe) scale (Hamilton 1960). Individual item scores are summed to compute the total score, which ranges from 0 to 52, with higher scores indicating more severe depression. Clinical remission is defined as a HAMD-17 total score of ≤ 7.
Statistical analysis
The analysis was conducted on the integrated efficacy dataset. The efficacy dataset included all randomized patients who started the infusion of study drug and who had a valid baseline HAMD-17 assessment and at least one post baseline HAMD-17 assessment. As the objective of this analysis was to examine the relationship between the EPDS, PHQ-9, and HAMD-17, data was pooled across treatment arms. Furthermore, only the day 30 assessment point data (end of study) was utilized. Due to study inclusion criteria, the data at baseline and earlier assessment points showed less variability, and variability is required when trying to establish a relationship between measures (Salkind 2010).
Patients with day 30 EPDS, PHQ-9, and HAMD-17 data were included in the analytical sample for the present analysis. The sample characteristics were summarized descriptively.
To examine the association EPDS remission (score < 10) and PHQ-9 remission (score < 5) with HAMD-17 remission (score ≤ 7), the proportions of EPDS and PHQ-9 remitters were compared with the proportion of HAMD-17 remitters using Fisher’s exact test; concordance was tested using the Kappa coefficient. Kappa were interpreted as poor (< 0), slight (0–0.2), fair (> 0.2–0.4), moderate (> 0.4–0.6), or substantial (> 0.6–0.8) (Landis and Koch 1977).
Pearson correlations were calculated to examine the association of EPDS and PHQ-9 total scores with HAMD-17 total score. As a validity check, as well as to inform multivariable modelling, the correlations between the EPDS and PHQ-9 items and the HAMD-17 total score were assessed for inconsistent or disproportionate values. The correlations were interpreted as small (0.3–< 0.5), moderate (0.5–< 0.7), or large (≥ 0.7) (Hinkle et al. 2003).
Ordinary least squares regression models were used to develop equations to estimate HAMD-17 total score from EPDS and PHQ-9 total score, respectively. For each screening tool, three models were applied: (1) a simple regression of the HAMD-17 and EPDS/PHQ-9 total score (model 1), (2) model 1 with a quadratic term added (EPDS/PHQ-9 total score squared; model 2), (3) a stepwise fivefold cross-validation predicted residual error sum of squares (CV PRESS) model using EPDS/PHQ-9 items to predict HAMD-17 total score (model 3). For model 3 using cross-validation methods as a means of preventing overfitting, cross-validated covariates were selected from the 10 EPDS items and 9 PHQ-9 items, using CV PRESS as the stepwise selection and stop criteria. Age and a combined PPD history/nulliparity variable (first pregnancy; pregnancy history with PPD, pregnancy history without PPD) were initially included as covariates but were removed as found non-significant to keep the models as simple as possible for potential use in a clinical practice setting.
Two-sided tests were used and p values of ≤ 0.05 were considered statistically significant. Statistical significance was not adjusted for multiple comparisons. SAS version 9.4 (SAS Institute Inc., Cary, NC, USA) was used for all statistical analysis.
Results
In total, 199 patients provided EPDS, PHQ-9, and HAMD-17 data at day 30 and were thus eligible for the present analysis. While all analyses were conducted on the pooled sample, the patient demographics and baseline characteristics are presented overall and by treatment arm for information (Table 1). Of the 199 patients included in these analyses at baseline, most patients were white (121, 61%), 74 (37%) were black or African-American, and 35 (18%) were Hispanic or Latino. A higher proportion of patients had onset of PPD within 4 weeks of delivery than the third trimester. The mean EPDS total score at day 30 was 9.04 (SD = 6.84), the mean PHQ-9 total score was 6.76 (SD = 6.52), and the mean HAMD-17 total score was 10.56 (SD = 8.47), with minimum and maximum scores 0–27, 0–25, and 0–31, respectively.
Association EPDS and PHQ-9 remission with HAMD-17 remission
There were statistically significant associations between EPDS and PHQ-9 definitions of remission with HAMD-17 remission (Table 2), both with moderate Kappa agreement. Among patients classified as remitters according to the HAMD-17 definition, 79% and 76% were also classified as remitters by the EPDS and PHQ-9 definitions, respectively (sensitivity). Among patients classified as a non-remitter on the HAMD-17, 67% and 78% were also classified as such by the EPDS and PHQ-9, respectively (specificity).
Association of EPDS and PHQ-9 total and item scores with HAMD-17 total score
All correlations of EPDS and PHQ-9 total and items scores with HAMD-17 total score were statistically significant (ps < 0.001; Table 3). The total scores showed large correlations (EPDS/HAMD-17: r = 0.71; PHQ-9/HAMD-17: r = 0.75). Item scores were moderately correlated with HAMD-17 total score except ‘thought of self-harm’ for both EPDS and PHQ-9 and ‘able to laugh’ and ‘looked forward with enjoyment’ for EPDS.
Estimating HAMD-17 total score from EPDS and PHQ-9
As shown in Table 4, the additional complexity of the quadratic term (model 2) and CV PRESS item-level model (model 3) did little to improve the variance explained statistics or reduce the prediction error, making model 1 the most parsimonious and recommended model to estimate HAMD-17 total score for both the EPDS and PHQ-9. Age and PPD/nulliparity were not significant predictors in any models (p > 0.05) and were thus removed. Based on model 1, the following equation can be used to estimate HAMD-17 total score from EPDS total score: HAMD-17 total = 2.66 + (EPDS total × 0.87). HAMD-17 total score can be estimated from PHQ-9 total score with a similar equation: HAMD-17 total = 3.99 + (PHQ-9 total × 0.97). This results in an integer range of estimated HAMD-17 total scores of 3–29 from the EPDS [2.66 + (0 / 30 × 0.87)] and 4–30 from the PHQ-9 [3.99 + (0 / 27 × 0.97)], very much aligned with the observed range of HAMD-17 total score (0–31).
Discussion
PPD is frequently unrecognized, undiagnosed, and inadequately treated (Cox et al. 2016). A more nuanced understanding of how commonly used screening tools such as the EPDS and PHQ-9 relate to clinical trial outcomes, such as the HAMD-17, may help address failures in diagnosis and treatment. The observed large associations between EPDS and PHQ-9 total score and HAMD-17 total score, as well as moderate associations of remission definitions, suggest that patients’ self-reports of their symptoms generally align with clinician-ratings, such as the HAMD-17, that may be prohibitively time and training intensive to obtain and further validate the ability of these screening tools to identify patients with PPD symptoms. Once a patient is diagnosed, management generally follows a stepped care approach, wherein patients with mild symptoms are treated through low-intensity interventions and those with symptoms that do not respond, are more severe, or present acutely are typically offered pharmacologic interventions, either alone or adjunctive to low-intensity interventions (Meltzer-Brody et al. 2018b). The observed pattern of results in these analyses, wherein higher scores on EPDS and PHQ-9 are associated with more severe symptoms as assessed by HAMD-17, may help clinicians who have identified and diagnosed PPD patients to understand and discuss with the patients what treatments are likely appropriate. Primary care physicians, obstetricians, and gynaecologists and even paediatricians who administer an EPDS or PHQ-9 may have a better ability to recognize which women are likely to receive benefit from low-intensity interventions, such as psychosocial support, and those that may warrant prompt referral for more intensive care. Clinicians seeking to practice shared decision-making regularly need to draw on and interpret a high volume of new clinical evidence in order to describe and discuss available options, alongside possible benefits or harms (Price 2005). Being able to understand and translate the relationship between the HAMD-17 data commonly reported as evidence of treatment efficacy in clinical trials and the EPDS or PHQ-9 data being captured in clinic will hopefully assist clinicians in more confidently and readily interpreting trial evidence to present during shared decision discussions. Given limitations in healthcare resources and availability of specialists, confidence in the use of these tools for valid screening and treatment decision-making may also help ameliorate stresses to the healthcare system.
The large associations observed between total scores are notable given that the measures differ in terms of patient versus clinician-rating, as well as in content, with EPDS designed specifically for the perinatal period, PHQ-9 initially designed for general depression, and HAMD-17 designed to assess changes in symptom severity. The moderate associations observed between EPDS and PHQ-9 item scores and HAMD-17 total score are in line with expectations, as individual item scores are expected to show more variability than summed total scores. Furthermore, as an advance to the current reliance upon individual outcome measure cut-off values to interpret scores, the current study also conducted regression analysis to provide two simple mapping equations that can easily be applied in any clinical setting.
Limitations
While the present analysis advances current understanding and interpretation of EPDS, PHQ-9, and HAMD-17, these analyses are based on a clinical trial population, and the generalizability to the broader population should be examined through validation of this work in an external dataset. For example, it may be of interest to validate the mapping equations on a sample including a broader range of HAMD-17 total score. While the current sample included EPDS and PHQ-9 total score from almost the full range, 0–25/27 and 0–27/30, respectively, the HAMD-17 total score range was 0–31/52. However, as any HAMD-17 total score higher than 25 indicates severe depression, the full range of depression severity required for clinical decision-making was covered in the present analysis. Indeed, the maximum baseline HAMD-17 total score in the severe PPD patients included in this analysis was 38, as has been reported in severe PPD elsewhere (Nonacs et al. 2005), suggesting that the higher values may be less common in PPD. The baseline data was not included in the analysis reported here due to the restriction in range of PHQ-9, EPDS, and HAMD-17 total score as a result of the trial inclusion criteria for moderate/severe PPD.
Conclusions
The present post hoc analysis demonstrates large and statistically significant associations between patient-reported EPDS and PHQ-9 scores and the clinician-reported HAMD-17 and in an integrated efficacy dataset of PPD trials and provides the first mapping equations to permit the estimation of HAMD-17 total score from known EPDS or PHQ-9 scores. Given the role of the HAMD-17 as the gold standard criterion for evaluating the efficacy of treatments in clinical trials, the results presented here provide tools to aid interpretation of clinical trial data in clinical practice, thus aiding informed decision-making for multiple stakeholders in regulatory and clinical settings and shared decision-making in clinical practice for this critical population of women with PPD.
References
ACOG Committee Opinion No. 757 (2018) Obstet Gynecol 132:1314–1316. https://doi.org/10.1097/AOG.0000000000002928
Appleby L, Warner R, Whitton A, Faragher B (1997) A controlled study of fluoxetine and cognitive-behavioural counselling in the treatment of postnatal depression. BMJ 314:932–936. https://doi.org/10.1136/bmj.314.7085.932
Bagby RM, Ryder AG, Schuller DR, Marshall MB (2004) The Hamilton Depression Rating Scale: has the gold standard become a lead weight? Am J Psychiatry 161:2163–2177. https://doi.org/10.1176/appi.ajp.161.12.2163
Beck AT, Steer RA, Carbin MG (1988) Psychometric properties of the Beck Depression Inventory: twenty-five years of evaluation. Clin Psychol Rev 8:77–100. https://doi.org/10.1016/0272-7358(88)90050-5
Bennett HA, Einarson A, Taddio A, Koren G, Einarson TR (2004) Prevalence of depression during pregnancy: systematic review. Obstet Gynecol 103:698–709. https://doi.org/10.1097/01.AOG.0000116689.75396.5f
Callaghan WM, Kuklina EV, Berg CJ (2010) Trends in postpartum hemorrhage: United States, 1994-2006. Am J Obstet Gynecol 202(353):e351–e356. https://doi.org/10.1016/j.ajog.2010.01.011
Centers for Disease Control and Prevention (2018) Diabetes during pregnancy. https://www.cdc.gov/reproductivehealth/maternalinfanthealth/diabetes-during-pregnancy.htm. Accessed March 31 2020
Centers for Disease Control and Prevention (2019) Data on selected pregnancy complications in the United States. https://www.cdc.gov/reproductivehealth/maternalinfanthealth/pregnancy-complications-data.htm. Accessed March 31 2020
Clayton AH, Tourian KA, Focht K, Hwang E, Cheng RF, Thase ME (2015) Desvenlafaxine 50 and 100 mg/d versus placebo for the treatment of major depressive disorder: a phase 4, randomized controlled trial. J Clin Psychiatry 76:562–569. https://doi.org/10.4088/JCP.13m08978
Cox JL, Holden JM, Sagovsky R (1987) Detection of postnatal depression. Development of the 10-item Edinburgh Postnatal Depression Scale. Br J Psychiatry 150:782–786. https://doi.org/10.1192/bjp.150.6.782
Cox EQ, Sowa NA, Meltzer-Brody SE, Gaynes BN (2016) The perinatal depression treatment cascade: baby steps toward improving outcomes. J Clin Psychiatry 77:1189–1200. https://doi.org/10.4088/JCP.15r10174
De Sisto CL, Kim SY, Sharma AJ (2014) Prevalence estimates of gestational diabetes mellitus in the United States, Pregnancy Risk Assessment Monitoring System (PRAMS), 2007–2010. Prev Chronic Dis 11. https://doi.org/10.5888/pcd11.130415
First MB, Spitzer RL, Gibbon M, Williams JBW (1996) Structured clinical interview for DSM-IV axis I disorders, clinician version (SCID-CV). American Psychiatric Press, Washington, DC
Flynn HA, Sexton M, Ratliff S, Porter K, Zivin K (2011) Comparative performance of the Edinburgh Postnatal Depression Scale and the Patient Health Questionnaire-9 in pregnant and postpartum women seeking psychiatric services. Psychiatry Res 187:130–134. https://doi.org/10.1016/j.psychres.2010.10.022
Gjerdingen D, Crow S, McGovern P, Miner M, Center B (2009) Postpartum depression screening at well-child visits: validity of a 2-question screen and the PHQ-9. Ann Fam Med 7:63–70. https://doi.org/10.1370/afm.933
Hahn-Holbrook J, Cornwell-Hinrichs T, Anaya I (2017) Economic and health predictors of national postpartum depression prevalence: a systematic review, meta-analysis, and meta-regression of 291 studies from 56 countries. Front Psychiatry 8:248. https://doi.org/10.3389/fpsyt.2017.00248
Hamilton M (1960) A rating scale for depression. J Neurol Neurosurg Psychiatry 23:56–62
Hantsoo L, Ward-O'Brien D, Czarkowski KA, Gueorguieva R, Price LH, Epperson CN (2014) A randomized, placebo-controlled, double-blind trial of sertraline for postpartum depression. Psychopharmacology 231:939–948. https://doi.org/10.1007/s00213-013-3316-1 Epub 2013 Oct 31
Harris B, Huckle P, Thomas R, Johns S, Fung H (1989) The use of rating scales to identify post-natal depression. Br J Psychiatry 154:813–817. https://doi.org/10.1192/bjp.154.6.813
Hinkle DE, Wiersma W, Jurs SG (2003) Applied statistics for the behavioral sciences, 5th edn. Houghton Mifflin, Boston
Ji S, Long Q, Jeffrey Newport D, Na H, Knight B, Zach EB, Morris NJ, Kutner M, Stowe ZN (2011) Validity of depression rating scales during pregnancy and the postpartum period: impact of trimester and parity. J Psychiatr Res 45:213–219. https://doi.org/10.1016/j.jpsychires.2010.05.017
Kanes S, Colquhoun H, Gunduz-Bruce H, Raines S, Arnold R, Schacterle A, Doherty J, Epperson CN, Deligiannidis KM, Riesenberg R, Hoffmann E, Rubinow D, Jonas J, Paul S, Meltzer-Brody S (2017) Brexanolone (SAGE-547 injection) in post-partum depression: a randomised controlled trial. Lancet 390:480–489. https://doi.org/10.1016/s0140-6736(17)31264-3 Epub 2017 Jun 12
Khan A, Khan SR, Shankles EB, Polissar NL (2002) Relative sensitivity of the Montgomery-Asberg Depression Rating Scale, the Hamilton Depression rating scale and the Clinical Global Impressions rating scale in antidepressant clinical trials. Int Clin Psychopharmacol 17:281–285
Kim DR, Epperson CN, Weiss AR, Wisner KL (2014) Pharmacotherapy of postpartum depression: an update. Expert Opin Pharmacother 15:1223–1234. https://doi.org/10.1517/14656566.2014.911842 Epub 2014 Apr 29
Ko JY, Rockhill KM, Tong VT, Morrow B, Farr SL (2017) Trends in postpartum depressive symptoms - 27 states, 2004, 2008, and 2012. MMWR Morb Mortal Wkly Rep 66:153–158. https://doi.org/10.15585/mmwr.mm6606a1
Kroenke K, Spitzer RL, Williams JB (2001) The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med 16:606–613. https://doi.org/10.1046/j.1525-1497.2001.016009606.x
Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33:159–174. https://doi.org/10.2307/2529310
Martin JA, Hamilton BE, Osterman MJK, Driscoll AK (2019) Births: final data for 2018. Natl Vital Stat Rep Volume 68, Number 13. https://www.cdc.gov/nchs/data/nvsr/nvsr68/nvsr68_13-508.pdf. Accessed March 31 2020
McCabe-Beane JE, Segre LS, Perkhounkova Y, Stuart S, O’Hara MW (2016) The identification of severity ranges for the Edinburgh Postnatal Depression Scale. Journal of Reproductive and Infant Psychology 34:293–303. https://doi.org/10.1080/02646838.2016.1141346
Meltzer-Brody S, Colquhoun H, Riesenberg R, Epperson CN, Deligiannidis KM, Rubinow DR, Li H, Sankoh AJ, Clemson C, Schacterle A, Jonas J, Kanes S (2018a) Brexanolone injection in post-partum depression: two multicentre, double-blind, randomised, placebo-controlled, phase 3 trials. Lancet 392:1058–1070. https://doi.org/10.1016/s0140-6736(18)31551-4 Epub 2018 Aug 31
Meltzer-Brody S, Howard LM, Bergink V, Vigod S, Jones I, Munk-Olsen T, Honikman S, Milgrom J (2018b) Postpartum psychiatric disorders. Nat Rev Dis Primers 4:18022. https://doi.org/10.1038/nrdp.2018.22
Moore Simas TA, Huang M-Y, Patton C, Reinhart M, Chawla AJ, Clemson C, Eldar-Lissai A (2019) The humanistic burden of postpartum depression: a systematic literature review. Curr Med Res Opin 35:383–393. https://doi.org/10.1080/03007995.2018.1552039
Murray L, Carothers AD (1990) The validation of the Edinburgh Post-natal Depression Scale on a community sample. Br J Psychiatry 157:288–290. https://doi.org/10.1192/bjp.157.2.288
Nonacs RM, Soares CN, Viguera AC, Pearson K, Poitras JR, Cohen LS (2005) Bupropion SR for the treatment of postpartum depression: a pilot study. Int J Neuropsychopharmacol 8:445–449. https://doi.org/10.1017/S1461145705005079
O'Connor E, Rossom RC, Henninger M, Groom HC, Burda BU (2016) Primary care screening for and treatment of depression in pregnant and postpartum women: evidence report and systematic review for the US Preventive Services Task Force. JAMA 315:388–406. https://doi.org/10.1001/jama.2015.18948
Papakostas GI, Shelton RC, Zajecka JM, Etemad B, Rickels K, Clain A, Baer L, Dalton ED, Sacco GR, Schoenfeld D, Pencina M, Meisner A, Bottiglieri T, Nelson E, Mischoulon D, Alpert JE, Barbee JG, Zisook S, Fava M (2012) L-methylfolate as adjunctive therapy for SSRI-resistant major depression: results of two randomized, double-blind, parallel-sequential trials. Am J Psychiatry 169:1267–1274. https://doi.org/10.1176/appi.ajp.2012.11071114
Price D (2005) Sharing clinical decisions by discussing evidence with patients. Perm J 9:70–73. https://doi.org/10.7812/tpp/05-006
Roberts JM et al (2013) Hypertension in pregnancy. Report of the American College of Obstetricians and Gynecologists’ Task Force on Hypertension in Pregnancy. Obstet Gynecol 122:1122–1131. https://doi.org/10.1097/01.AOG.0000437382.03963.88
Salkind N (2010) Restriction of range. In: Encyclopedia of research design. SAGE Publications, Inc., Thousand Oaks, California. doi:https://doi.org/10.4135/9781412961288.n388, Restriction of Range
Sharp DJ et al (2010) A pragmatic randomised controlled trial to compare antidepressants with a community-based psychosocial intervention for the treatment of women with postnatal depression: the RESPOND trial. Health Technol Assess 14(iii-iv, ix-xi):1–153. https://doi.org/10.3310/hta14430
Sidebottom AC, Harrison PA, Godecker A, Kim H (2012) Validation of the Patient Health Questionnaire (PHQ)-9 for prenatal depression screening. Arch Womens Ment Health 15:367–374. https://doi.org/10.1007/s00737-012-0295-x
Sit DK, Wisner KL (2009) Identification of postpartum depression. Clin Obstet Gynecol 52:456–468. https://doi.org/10.1097/GRF.0b013e3181b5a57c
Smith EK, Gopalan P, Glance JB, Azzam PN (2016) Postpartum depression screening: a review for psychiatrists. Harv Rev Psychiatry 24:173–187. https://doi.org/10.1097/HRP.0000000000000103
Spinelli MG, Endicott J, Leon AC, Goetz RR, Kalish RB, Brustman LE, Carmona YR, Meyreles Q, Vega M, Schulick JL (2013) A controlled clinical treatment trial of interpersonal psychotherapy for depressed pregnant women at 3 New York City sites. J Clin Psychiatry 74:393–399. https://doi.org/10.4088/JCP.12m07909
Vismara L, Rollè L, Agostini F, Sechi C, Fenaroli V, Molgora S, Neri E, Prino LE, Odorisio F, Trovato A, Polizzi C, Brustia P, Lucarelli L, Monti F, Saita E, Tambelli R (2016) Perinatal parenting stress, anxiety, and depression outcomes in first-time mothers and fathers: a 3- to 6-months postpartum follow-up study. Front Psychol 7:938. https://doi.org/10.3389/fpsyg.2016.00938
Zhong Q, Gelaye B, Rondon M, E. Sánchez S, J. García P, Sánchez E, V. Barrios Y, E. Simon G, C. Henderson D, May Cripe S, A. Williams M (2014) Comparative performance of Patient Health Questionnaire-9 and Edinburgh Postnatal Depression Scale for screening antepartum depression. J Affect Disord 162:1–7. https://doi.org/10.1016/j.jad.2014.03.028
Funding
This study was funded by Sage Therapeutics, Inc., Cambridge, MA.
Author information
Authors and Affiliations
Contributions
Margaret E. Gerbasi: conception and design of the analysis, data interpretation and contextualization, generation of the first manuscript draft, and manuscript preparation. Samantha Meltzer-Brody: data collection, data interpretation and contextualization, and manuscript preparation. Sarah Acaster: conception and design of the analysis, data interpretation and contextualization, generation of the first draft, and manuscript preparation. Moshe Fridman: conception and design of the analysis, statistical analysis, data interpretation and contextualization, generation of the first draft, and manuscript preparation. Vijayveer Bonthapally: conception and design of the analysis, data interpretation and contextualization, and manuscript preparation. Paul Hodgkins: conception and design of the analysis, data interpretation and contextualization, and manuscript preparation. Stephen J. Kanes: data collection, data interpretation and contextualization, and manuscript preparation. Adi Eldar-Lissai: conception and design of the analysis, data interpretation and contextualization, generation of the first draft, and manuscript preparation. All authors have contributed to and approved of the final manuscript.
Corresponding author
Ethics declarations
Disclaimer
The funding source was involved in the design and conduct of the study; collection, management, analysis, and interpretation of the data; and preparation, review, and approval of the manuscript.
Ethical approval
All procedures performed in studies involving human participants were in accordance and compliance with the ethical standards of the relevant institutional review boards or independent ethics committees and with the 1964 Helsinki declaration and its later amendments as well as Good Clinical Practice guidelines.
Informed consent
Informed consent was obtained from all individual participants included in the studies.
Conflict of interest
MEG, VB, PH, SJK, and AE-L are employees of Sage Therapeutics, Inc. and own stock or stock options in the company. SM-B reports personal fees from MedScape and grants from Sage Therapeutics, Inc., awarded to the University of Carolina (Chapel Hill, NC, USA) during the conduct of the brexanolone injection clinical trials and grants from Janssen, PCORI, and the NIH outside the submitted work. SA and MF are employees of Acaster Lloyd Consulting, Ltd. and AMF Consulting, respectively, which were paid by Sage Therapeutics to conduct the research reported in this manuscript.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Gerbasi, M.E., Eldar-Lissai, A., Acaster, S. et al. Associations between commonly used patient-reported outcome tools in postpartum depression clinical practice and the Hamilton Rating Scale for Depression. Arch Womens Ment Health 23, 727–735 (2020). https://doi.org/10.1007/s00737-020-01042-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00737-020-01042-y