Introduction

Depressive episodes in uni- and bipolar-affective disorders are multifactorial comprising (neuro-)biological and psychosocial factors. Such episodes can affect people during their entire life span compromising life quality and expectancy from early on (Otte et al. 2016; Vieta et al. 2018; Solmi et al. 2022). While the clinical phenomenology in depressed patients varies within a known framework of symptoms and may turn into a chronic, treatment-resistant condition in about 20–30% of the individuals (Fava and Davidson 1996), the neurobiological analogues of depression are heterogeneous and less well defined. Regarding personalized antidepressant therapy, the translation into clinical biomarkers for (deep) phenotyping of depressed patients has remained challenging.

The investigation of genetic heritability and gene–environment interactions (Karg and Sen 2012) based on candidate genes of neurotransmitter systems (Caspi et al. 2003; Baune et al. 2008) has contributed to the development of concepts on interacting risk and disease-modifying factors of depression. The read-out of epigenetic modifications ‘picturing’ a patient’s (adverse) life events has allowed to study pharmacoepigenetics patterns for the prediction of an impaired response to treatment, e.g. with selective serotonin reuptake inhibitors (SSRIs) (Schiele et al. 2021). (Epi-)genome-wide association studies (E-/GWAS) in depression, e.g. (Major Depressive Disorder Working Group of the Psychiatric Genomic Consortium et al. 2013; Menke et al. 2012a, b; Okbay et al. 2016; Story Jovanova et al. 2018) have upscaled single nucleotide polymorphisms (SNP) investigations to larger cohorts, thereby defining further genetic regions of interest and facilitating novel (predictive) measurements such as polygenic risk scores (Fanelli et al. 2022). Recent GWAS for depression have essentially emphasized the importance of genes in synaptic structure and function (Howard et al. 2019). Most of these studies, however, focused on categorical definitions of diseases not considering that depression is a heterogenous, dimensional, and systemic condition affecting multiple organs of the body (Sotelo and Nemeroff 2017).

To define prospective, reproducible biomarkers for depression subtypes based upon whole body disease concepts, a standardized organ and/or body system-tailored deep phenotyping for large cohorts of depressed patients is instrumental.

Essential stress-based pathophysiological mechanisms of depression comprise an impairment of the hypothalamus–pituitary–adrenal (HPA) axis, the adrenergic autonomic nervous system (ANS), and the immune system (Carney et al. 2005; Pariante and Lightman 2008; McEwen and Akil 2020; Beurel et al. 2020). In the pathogenesis of depression, a modulated heart–brain interaction may be responsible for the elevated risk of cardiovascular disease and cardiac mortality in individuals with major depression (Nemeroff and Goldschmidt-Clermont 2012; Hare et al. 2014; Nielsen et al. 2021). Previous studies showed that dysregulation of the HPA axis in depression is linked to an impaired sensitivity of glucocorticoid receptors (GR) (Pariante and Lightman 2008; Menke et al. 2012a, b). Depressed patients are at higher risk for cardiovascular diseases (CVD) including heart failure (Gustad et al. 2014), possibly due to a dysfunctional adrenergic ANS. This may result in increased heart rate, hypertension, and a reduced heart rate variability (HRV) (Koch et al. 2019), i.e. symptoms shared with patients suffering from heart failure (Parati and Esler 2012). Even though the insular cortex—a brain region associated with interoceptive attention (Wang et al. 2019) which is compromised in depressed patients (Eggart et al. 2019)—and its functional networks are considered the neuronal representations of ANS activity (Beissner et al. 2013), it remains unclear whether and how these neuronal substrates may affect the course of patients with depression and heart failure, respectively. Genotyping in patients with heart failure and comorbid depression revealed that genetic variants implicated in anxious behaviour (NPSR1) (Angermann et al. 2017) as well as inflammation (C-reactive protein (CRP), interleukin 6 (IL-6)) (Kittel-Schneider et al. 2018) modify the risk of progression and mortality. Patients with heart failure, on the other hand, not only face a higher risk for depression (Rutledge et al. 2006), but also an increased mortality caused by depression itself (Penninx et al. 2001; Zambrano et al. 2020). However, patients with heart failure and comorbid depression experience no prognostic benefit from antidepressive pharmacological treatment, indicating heterogeneity of the depression phenotype (Angermann et al. 2016).

Here, we introduce the design of the observational GEParD (Genetics and Epigenetics of Pharmaco- and Psychotherapy in acute and recurrent Depression) cohort and the DaCFail (Depression associated Cardiac Failure) protocol, the latter designed as a tailored protocol on the role of stress systems in heart–brain interaction. The main intention was to monitor a naturalistic antidepressant therapy response within an inpatient setting applying a broad repertoire of psychometric and neuropsychological testing, routine blood diagnostics including endocrine and inflammatory biomarkers, established genetic and epigenetic testing, and finally a comprehensive cardiac phenotyping including parameters of the ANS. We finally implemented our ideas in an exploratory approach to generate hypotheses for follow-up studies dealing among others with the following questions:

  1. 1.

    Which psychometric parameters and stress-related biomarkers (endocrine, inflammatory, and ANS) can be defined as biomarkers of depression symptoms and antidepressant treatment response?

  2. 2.

    Which (epi-)genetic biomarkers of depression symptoms and antidepressant treatment response can be defined in interaction with proximal and/or distal life events?

  3. 3.

    Which parameters of the heart–brain interaction differentiate depression with and without heart failure as compared to healthy controls and/or are relevant for antidepressant treatment response?

  4. 4.

    How does the function of the insula and its networks differ in depressed patients with and without heart failure as compared to healthy controls?

Methods

The protocols

The GEParD protocol was initiated in 2012 and carried out as a prospective observational cohort study of hospitalized, depressed uni- and bipolar patients with weekly measurements for up to 7 weeks (Fig. 1). In a second phase (2016–2020), the DaCFail protocol was initiated complementing the existing study concept by parameters focusing on the stress and the cardiac system (Fig. 1). The DaCFail protocol was designed as a refined case–control protocol for 200 patients and methodologically based on GEParD. For DaCFail, patients were enclosed in four study groups: Group 1, patients with depression and heart failure; Group 2, depressed patients; Group 3, patients with heart failure; Group 4, healthy subjects (control group, Table 1). Measurements were carried out weekly for a maximum of four weeks duration (Fig. 1). Both functional magnetic resonance imaging (fMRI) of the insular cortex and 123I-meta-iodobenzylguanidine (MIBG) scintigraphy were integrated as quantitative imaging modalities within the DaCFail study protocol (Fig. 1). The procedures were approved by the local ethics committee of the University Hospital of Würzburg (GEParD: vote no 104/12 and 128/15, DaCFail: vote no. 285/14) and were carried out in accordance with the ethical standards of the Declaration of Helsinki and its later amendments (Williams 2013).

Fig. 1
figure 1

Study designs. After study enrolment, patients were scheduled for weekly blood sample collection in both study protocols. Psychometry was carried out within the clinical routine, with exemptions. In selected study weeks, patients were scheduled for cardiac assessment, modified dexamethasone-suppression test (mDST), and blood sample collection for (epi-)genetic analyses. Patients of DaCFail group 1 and 3 underwent MIBG scintigraphy. Abbreviations: diff = differential, MIBG = 123I-meta-iodobenzylguanidine, RR = blood pressure, TDM = therapeutic drug monitoring, TTE = transthoracic echocardiography, w = week, w/ = with. For psychometry and multimodal biomarkers including functional imaging in detail, please see Tables 2 and 3

Table 1 Inclusion and exclusion criteria for the GEParD and DaCFail protocols

Participant selection criteria

The GEParD protocol recruited patients aged 18–80 years (Table 1) presenting with a unipolar and bipolar depressive disorder diagnosed by a validated, standardized interview according to DSM-IV criteria (Structured Clinical Interview for DSM, SCID-I (First and Gibbon 2004)). Antidepressant treatment was carried out according to the treating physician’s choice within an inpatient setting using psychopharmacology, psychotherapy, and electroconvulsive therapy (ECT).

Within the DaCFail protocol, subjects beyond an age of 50 years were eligible for one of the following study arms applying the selection criteria detailed in Table 1. Healthy controls without medical and mental illness precondition were asked for participation from the population-based “Characteristics and Course of Heart Failure Stages A–B and Determinants of Progression (STAAB)” cohort study, comprising a representative age-stratified sample of about 5000 Würzburg residents aged 30–79 years at baseline assessment (Morbach et al. 2021; Wagner et al. 2017). Healthy subjects were matched in age (≥ 50 years) and gender to the other study groups. For both the GEParD and/or DaCFail study protocol, the exclusion criteria listed in Table 1 were applied.

Recruitment strategy

For both study protocols, hospitalized depressed patients were recruited on-site by specialized staff within the inpatient setting of the Department of Psychiatry, Psychotherapy and Psychosomatic Medicine, University Hospital Würzburg, Germany. Here, four wards with a therapy focus on affective disorders and intensive psychotherapy as well as one neuropsychiatric day-care unit specialized for geriatric patients offer a naturalistic study environment (Fig. 2). Newly admitted patients fulfilling the inclusion criteria of the study protocols were informed about a possible participation by their attending physician. Detailed information sheets of the study protocols were provided with the aim of informing about the study course, study criteria, and applied procedures and measurements. For the patients diagnosed with heart failure (DaCFail Group 3) and the healthy controls (DaCFail Group 4), the recruitment was performed in cooperation with the Department of Internal Medicine I and the Comprehensive Heart Failure Center Würzburg (CHFC) (Fig. 2). For both groups, study assessments were carried out at the Center of Mental Health.

Fig. 2
figure 2

Consort chart for the GEParD and DaCFail protocols (2012–2020). Overview of screening and enrolment process in the GEParD and DaCFail study protocols

Minimal recruitment number

The GEParD study protocol as a prospective observational study aimed to recruit 500 depressed patients. For the DaCFail study protocol, the minimum recruitment number of participants was defined by a power analysis for expected HRV differences between groups as the original primary outcome of the case–control design. To test on HRV differences in depressed patients and controls with an analysis power of 0.8 on a significance level of α = 0.05 and an effect size of Cohen’s d = 1, a minimum recruitment number of N = 24 participants per group was calculated. It was assumed that investigated effects differ in the range of one standard deviation (SD) in the cohort of depressed patients versus controls. Given the four study groups, variances were doubled leading to a minimal recruitment number of N = 48 participants for each of the four DaCFail study groups under the defined test conditions.

Psychometric and neuropsychological testing

In the process of and after study enrolment, psychometric evaluation of patients was carried out in scheduled study rounds within the inpatient setting or single appointments at the outpatient clinic (DaCFail Group 3, 4) (Fig. 1, Table 2).

Table 2 Psychometry, life events, and coping style

As screening tool for depressive and anxious symptoms, the SCID-I, the depression module of Patient Health Questionnaire-9 (PHQ-9) and the 7-item anxiety scale for generalized anxiety disorder (GAD-7) (Spitzer et al. 2006) were used (Table 2).

Depression severity and dimension were assessed by the Beck Depression Inventory (BDI-II) (Beck et al. 1961), Hamilton Depression Rating Scale (HDRS21, 21 items in total, 17 of 21 items were used for evaluation) (Hamilton 1960), and the Montgomery Ǻsberg Depression Rating Scale (MADRS) (Montgomery and Asberg 1979), further, particularly focusing on the entity of anxious depression, the Anxiety Sensitivity Index (ASI) (Reiss et al. 1986) and the State-Trait Anxiety Inventory (STAI-T) were used. Emotional regulation in patients was evaluated by the Cognitive Emotion Regulation Questionnaire (CERQ) (Garnefski et al. 2001). The Montreal Cognitive Assessment (MOCA) (Nasreddine et al. 2005) was used to assess possible neurocognitive impairments.

To contextualize gene–environment interactions, distal and proximal life events of patients were evaluated by the Childhood Trauma Questionnaire (CTQ) (Bernstein et al. 2003) and the List of Threatening Experiences (LTE) (Brugha et al. 1985). The patient’s coping strategies were assessed by the Coping Orientation to Problems Experienced Inventory (COPE) (Carver et al. 1989). The Columbia-Suicide Severity Rating Scale (C-SSRS) (Posner et al. 2011) was used to evaluate suicidal ideation (Menke et al. 2012a, b). The Short-Form-36 Health Survey (SF-36) (Bullinger et al. 1995) assessed the life quality of patients with the Arizona Sexual Experience Scale (ASEX) (McGahuey et al. 2000) monitoring sexual dysfunction.

Physical activity

To monitor (altered) locomotor activity in depressed patients (Wuthrich et al. 2022) and to investigate possible interactions with the HPA axis (Menke et al. 2014), step counts were quantified in a sub-cohort of depressed patients using actigraphy (ActiGraph GT9X Link, ActiGraph LLC, Pensacola, US).

Blood sampling

All enrolled patients were scheduled for blood sampling at defined time points to perform routine blood analysis including established therapeutic drug monitoring (TDM) of psychopharmaceutical medication, analysis of endocrine and inflammation markers, as well as genetic and epigenetic analysis (Fig. 1, Table 3). Blood sample collection was carried out by trained staff.

Table 3 Laboratory, cardiological phenotype, and imaging

On a weekly basis until study week 4 (DaCFail) and study week 7 (GEParD), respectively, a routine blood sample of each patient was collected for analysis of the complete blood count (CBC) with differential blood cell count, transaminases (glutamic oxaloacetic transaminase (GOT), glutamate-pyruvate transaminase (GPT), γ-glutamyltransferase (γ-GT)), and CRP. In study week 1, thyroid markers (thyroid-stimulating hormone (TSH), free triiodothyronine (fT3), and free thyroxine (fT4)) were collected. In study week 1 and 4 (DaCFail), blood samples to investigate the inflammatory parameters IL-6, interleukin 1 (IL-1), and tumour necrosis factor alpha (TNF-α) were additionally collected. For analyses of (epi-)genetic parameters and the HPA axis, please see the respective sections.

(Epi-)genetic analyses

In study week 1, 4 and 7 as well as parallel to the modified dexamethasone-suppression test (mDST), PAXgene blood test tubes (Qiagen, Hilden, Germany) for analysis of mRNA expression and ethylene diamine tetra-acetic acid (EDTA) tubes for analyses of DNA methylation and analysis of genetic variants were taken.

Modified dexamethasone-suppression test (mDST)

An mDST, as previously described (Menke et al. 2021; Leistner and Menke 2018), was applied at week 1 and 4 in inpatients and once in the above described outpatient setting (Fig. 1, Table 3). Before oral administration of 1.5 mg dexamethasone, blood was drawn by trained staff at 6 p.m. for the analysis of a CBC with differential blood cell count, cortisol, and adrenocorticotropic hormone (ACTH). PAXgene blood test tubes (Qiagen, Hilden, Germany) for analysis of mRNA expression, and EDTA tubes for analysis of DNA methylation and genetic variants were also used. Three hours post-medication, a second blood sample was collected for analysis of the above-mentioned parameters.

Assessment of cardiovascular parameters

Comprehensive cardiological diagnostics were performed for all patients enclosed in the DaCFail study protocol. This included 24 h electrocardiogram (ECG), 24 h blood pressure measurement (Riva-Rocci, RR), and transthoracic echocardiography (TTE) for patients ≥ 50 yrs (Table 3). In study week 1 and 4 of the DaCFail study protocol (Fig. 1), amino-terminal pro-hormone brain natriuretic peptide (NTproBNP), and troponin T/I (trop T/I) were collected as cardiac blood biomarkers.

123I-meta-iodobenzylguanidine scintigraphy

MIBG scintigraphy (Chirumamilla and Travin 2011) was used to monitor synaptic activity of cardiac sympathetic neurons in HF patients (DaCFail Group 1, 3) and carried out at the Department for Nuclear Medicine at University Hospital of Würzburg (Werner et al. 2018). Increased uptake of the tracer 123I-MIBG in synaptic vesicles was used as a marker of synaptic transmission and plasticity in sympathetic neurons and quantified via the heart mediastinum ratio (HMR). The imaging protocol set two time points of scanning (early scan 15 min post-injection, delayed scan 4 h post-injection). For single- and three-dimensional imaging, a planar scintigraphy protocol as well as single-photon emission computed tomography (SPECT) was used.

Functional MRI and interoceptive paradigm

For an investigation of the insular cortex–heart axis in depression (Fig. 1, Table 3), functional MRI measurements in combination with interoceptive paradigms were applied. The insular cortices are considered as neuronal substrates for the perception of internal sensory information (interoception), which is discussed to be compromised in patients with depression (Eggart et al. 2019), as well as of ANS activity (Beissner et al. 2013). Patients of DaCFail groups suffering from depression (groups 1 + 2) and controls (group 4), respectively, underwent the following measurements in one single session:

  1. (i)

    Based on the concept of interoceptive accuracy (Garfinkel et al. 2015), a modified version of the Schandry task (Schandry 1981) as heartbeat perception task was used to objectively measure perception of internal sensory information.

  2. (ii)

    HRV recordings were established to evaluate possible confounder phenomena and analysed for a defined time (300 s) using the RMSSD (Root Mean Sum of Squared Distance, Unit: ms).

  3. (iii)

    For morphometry of defined regions of interest in the insular cortex of both hemispheres (ventral and dorsal anterior insula, posterior insula) and for functional connectivity analyses, MRI data were acquired on a 3 Tesla scanner (MAGNETOM Skyra, Siemens, Erlangen, Germany). A 64-channel head coil was used. Structural images were acquired with a high-resolution T1-weighted magnetization prepared rapid gradient echo (MPRAGE). Resting-state recordings were acquired for a total duration of 10 min using T2*-weighted blood oxygen level-dependent (BOLD) images as echo-planar imaging (EPI) sequence. Participants were instructed to stay awake and keep their eyes open.

Study end point

For both study protocols, at the study end point after 7 weeks or after 4 weeks of antidepressant therapy in the inpatient setting, the difference of the individual HDRS scores was measured (Fig. 1). This therapy outcome parameter in both study protocols was identical because the second case-controlled study protocol (DaCFail) had been designed on the initial prospective, observational GEParD cohort protocol. Demographic characterization, psychometric parameters, somatic, laboratory, and (epi-)genetic biomarkers were used for evaluation of and/or correlation with therapy response. In the DaCFail study protocol, patients in addition were evaluated by HRV analyses (24 h ECG) using the RMSSD as the original primary outcome parameter and HMR analyses using MIBG scintigraphy in correlation with psychometric measurements of depression.

Statistics

All statistical calculations were and will be carried out with the IBM SPSS software package 26 (SPSS Inc., Chicago, USA) and SigmaPlot 14 (Systat Software, Düsseldorf, Germany). Further calculations may implement custom-written programming scripts for individual deep phenotyping based on (un-)supervised machine leaning algorithms.

Results

Recruitment

In total, N = 573 patients were recruited to the GEParD study protocol between October 2012 and December 2020 (N = 458 patients) and to the DaCFail study protocol between March 2016 and December 2020 (N = 268 patients). For the parallel inclusion in both protocols, N = 153 patients were initially evaluated. In the process of enrolment in one or both protocols, N = 127 patients had to be excluded due to somatic conditions or unforeseen disease courses (N = 94), changes in the patients’ personal motivation for study participation (N = 16), and unanticipated wishes for discharge (N = 15). In most cases, somatic or mental exclusion criteria (manic or psychotic episodes) had been unknown and/or unfolded within the first study week based upon the detailed diagnostic pipeline. Of the 120 participating depressed patients in the DacFail protocol, N = 115 completed the GEParD study protocol in parallel. In the process of and after enrolment, five depressed patients decided to solely participate in the DaCFail protocol. The number of patients completing one or both protocols thus consisted of 446 probands, of which 351 patients suffered from depression and 95 probands, recruited for the DaCFail protocol, were considered as non-depressed controls (patients with heart failure, healthy probands). Sizes of study groups are further detailed in Fig. 2 and Tables 4 and 5.

Table 4 Basic demographic data and baseline depressive phenotype for the entire cohort of depressed patients (N = 351)
Table 5 Basic demographic data and baseline depressive phenotype for the DaCFail study groups (N = 215)

Demographics

The mean age of all 351 depressed subjects was 45.8 years (standard deviation (SD) 15.3, range 18–80), N = 202 depressed participants (57.5%) were male. N = 200 depressed patients (57.0%) were married or engaged in a relationship. The control cohort of 95 non-depressed subjects had a mean age of 66.1 years (SD 8.0, range 51–84), of which 66 participants (69.5%) were male. N = 72 control patients (75.8%) were married or engaged in a relationship. Approximately, one-third of the depressive participants (N = 123, 35.0%) were active smokers. Basic demographic data of the different study groups including the DaCFail subgroups are further detailed in Tables 4 and 5.

Baseline depressive phenotype

The cohort of depressed patients (N = 351) consisted of 305 unipolar depressed patients, of which 49 patients (14.0%) suffered from their first depressive episode and 259 patients (73.8%) from a recurrent depressive episode (basic characteristics of all depressed patients are stated in Table 4). N = 43 patients (12.3%) had the diagnosis of a depressive episode within a bipolar-affective disorder. Regarding the disease load of depressive episodes, N = 120 patients (34.2%) stated an experience of more than three depressive episodes before study enrolment. In the total cohort, two peaks of first depression onset could be differentiated in patients retrospectively based on the previous medical history at the time point of study enrolment: 1. youth and adolescence (< 18 years) and young adulthood (19–29 years) in 170 patients (48.4%); 2. fifth decade of life (40–49 years) in 73 patients (20.8%). Before admission, 286 patients (81.5%) had received treatment with antidepressants. Nearly 25% of all depressed patients had a history of suicide attempts. Most depressed patients (71.5%) had suffered from various sleep disturbances. Regarding the initial quantitative psychometry in the first week of study, depressed patients were assessed by researchers with a mean of 22 (SD 6.6) points in the HDRS (17 of 21 items used for evaluation) and a median of 32 (27–36) points in the MADRS. Using the BDI-II, patients self-rated the severity of their depressive episode with a median of 24 points (16–32). 70.7% of the depressed patients suffered from anxious symptoms and rated a median of 12 points (SD 8–15) on the GAD-7. Further data on the basic depressive phenotype of the DaCFail groups are detailed in Table 5.

Regarding their baseline somatic phenotype, the majority (N = 212, 60.4%) of depressive participants were classified as overweight or obese (body mass index (BMI) ≥ 25). In the 24 h RR analyses, nearly a third of all depressed patients suffered from either a stage of prehypertension with elevated measurements in total (RRsys 130–139 mmHg and/or RRdia 85–89) or from arterial hypertension at a minimum stage of 1 (stage 1: RRsys 140–159 mmHg and/or RRdia 90–99 mmHg, stage 2: RRsys 160–179 mmHg and/or RRdia 100–109 mmHg).

Discussion

The GEParD cohort study as prospective observational study protocol and the DaCFail study as a case–control study protocol are an example for deep phenotyping of depressed patients in an inpatient setting, particularly emphasizing the stress system and the heart–brain axis. The present repertoire of psychometry, tailored quantification of somatic and cardiac parameters, as well as laboratory parameters including (epi-)genetic analyses allows to evaluate response or resistance to antidepressant therapy during hospitalization using biological and psychological markers.

A recent GWAS study for major depressive disorder has accentuated the need of standardized quantification of disease parameters to impede biased views on genetic architecture and underlying pathogenesis by minimal phenotyping (Cai et al. 2020). Study protocols such as the Biological Classification of Mental Disorders (BeCOME) study (Bruckl et al. 2020) focus on in depth phenotyping of patients suffering from affective, anxiety and stress-related mental disorders focusing on mental phenotypes. The present protocol complements this approach by adding somatic phenotypes under the concept of depression as a systemic disorder with a focus on cardiac phenotypes.

Preliminary analyses based on selected phenotypes in subcohorts of the GEParD and DaCFail protocols have facilitated further understanding of the disease course and risk factors in depressive episodes which will be used for comprehensive analyses of the overall study cohort and additional subgroup analyses (e.g. regarding different age groups, onset of disease manifestations):

  1. (i)

    Anxious depression and traumatic events in the childhood of patients are associated with increased sensitivity of the HPA axis and the immune system using FKBP5 mRNA expression and the CTQ as a phenotypic marker (Menke et al. 2018).

  2. (ii)

    Severe life events occurring prior to depressive episodes may impair psychopharmaceutical treatment assessed by FKBP5, SGK1, and NR3C1 mRNA-expression levels (Menke et al. 2021).

  3. (iii)

    Covariation bias, an overestimation of the relationship between fear-relevant stimuli and aversive consequences, has been revealed as possible characterization of non-responders for antidepressant treatment. It may serve as a possible neurocognitive marker for emotional information processing in depressive episodes (Stonawski et al. 2019).

  4. (iv)

    Psychological paradigms show that fear acquisition and extinction may be impaired in patients suffering from severe depressive episodes. This may underpin the importance of future studies addressing extinction learning elements in antidepressant treatment (Wurst et al. 2021).

The present two-level study design faces limitations of which the foremost are: the naturalistic and observational inpatient setting does not allow for controlled interventions with randomized and matched treatment and control samples. Comprehensive diagnostics results in a high number of single procedures. This may challenge depressed patients who are treated in an inpatient setting because of severe depression and cause a relevant dropout of patients due to motivation, distress, and wishes for discharge (Figs. 1 and 2, Tables 3, 4 and 5). In addition, this contributes to a fragmentary mosaic of quantified biomarkers with missing data, which counteracts the effort for standardized endo-phenotyping of depressive episodes in a cohort representative for the inpatient population. The Department of Psychiatry, Psychotherapy and Psychosomatic Medicine of the University Hospital of Würzburg provides specialized and acute psychiatric care for Würzburg, a medium-sized German university town, which in a monocentric approach limits the recruitment of rare, however, severely ill patient groups, such as DaCFail study group 1 (depression and HF, Fig. 2).

Overall, the described concept for deep phenotyping of depressive episodes illustrates a promising approach to unravel predictive measurements for the onset, course, and treatment response of depressive episodes with a focus on heart comorbidity using a broad repertoire of established psychometric, somatic, and laboratory including (epi-)genetic markers. Experiences with these mainly naturalistic protocols will contribute to successful multicentre studies investigating personalized antidepressant therapies.