Historically, mindfulness can be traced back to 2500-year-old Buddhist teachings and represents an essential internal attitude in Eastern meditation practice (Bodhi, 2011). In Western psychological literature, various definitions of mindfulness exist focusing on different aspects (for an overview, see Medvedev et al., 2022). One of the most common definitions was introduced by Kabat-Zinn (1994), who describes mindfulness as a non-judgmental, nonreactive, and open-hearted moment-by-moment focus of attention. Mindfulness seems to be an important beneficial factor in the context of mental health (Tomlinson et al., 2018). Therefore, it has become more popular in both psychological practice and research and has evolved into a broad concept during the last decades (Bravo et al., 2022). This development has resulted, on the one hand, in a critical debate about discrepancies between the Western understandings of mindfulness and its original Buddhist conceptualization (e.g., Baer, 2019; Feng et al., 2018) and, on the other hand, in a lack of consensus regarding the conceptualizations of mindfulness in general (van Dam et al., 2018).

The distinct conceptual understandings are also reflected in different approaches to operationalize mindfulness. As a result, a large number of mindfulness questionnaires were published that are based on a different idea of dimensionality of the construct (for an overview, see Bergomi et al., 2013; Sauer et al., 2013). Two of the most commonly used questionnaires are the Mindful Attention and Awareness Scale (MAAS; Brown & Ryan, 2003) and the Five Facet Mindfulness Questionnaire (FFMQ; Baer et al., 2006). While the MAAS is based on a unidimensional understanding of mindfulness, Baer et al. (2006) considered mindfulness to be a multidimensional construct. This is based on the factor analytical development of the FFMQ which comprises the following facets: “observing” (tendency to notice or attend to internal and external experiences), “describing” (tendency to label internal and external experiences with words), “acting with awareness” (full awareness and undivided attention of current experiences), “nonjudging of experience” (viewing internal experiences with a non-evaluative stance), and “nonreactivity to inner experience” (tendency to allow thoughts, emotions, and bodily sensations to come and go).

Besides the different operationalizations and conflicting underlying definitions of mindfulness, further methodological concerns regarding the validity of mindfulness assessed by questionnaires need to be considered (van Dam et al., 2018). Firstly, empirical findings suggested an unstable factor structure of the FFMQ depending on the general experience with mindfulness or meditation (Baer et al., 2006; Baer et al., 2008; Williams et al., 2014), the participation in a mindfulness-based intervention (Gu et al., 2016), or the psychopathology of the examined sample (Curtiss & Klemanski, 2014). Secondly, recent findings stated that FFMQ scores did not differ between groups receiving a mindfulness intervention compared to control conditions (Goldberg et al., 2016; Isbel et al., 2020), and showed no significant association with duration or frequency of actual mindfulness practice (Falkenström, 2010; Manuel et al., 2017). These results indicate discrepancies between how mindful individuals consider themselves in their ratings in questionnaires and how mindful they really seem to be. One main explanation is that the semantic understanding of mindfulness items (e.g., “I rush through activities without being really attentive to them.” from the FFMQ) seems to depend on one’s individual experience with and understanding of mindfulness practices (Grossman, 2008; van Dam et al., 2018). As mindfulness practice enhances self-awareness of one’s actual mindlessness, it may then result in lower mindfulness self-reports (van Dam et al., 2018).

In a recent psychometric review, Lecuona et al. (2020) recommended the development and evaluation of alternative assessment tools measuring mindfulness. Common alternative approaches to mindfulness questionnaires are experience-sampling methods during a meditative practice: the Mindful-Breathing Exercise (MBE; Burg & Michalak, 2011), the Breath-Counting Test (BCT; Levinson et al., 2014), and the Meditation Breath Attention Scores (MBAS; Frewen et al., 2008). They are also based on self-reports but do not use a semantic approach. Instead, they assess the ability to maintain one’s attention to the breath (e.g., without mind wandering) during a 15–20-min sitting meditation.

Another major aspect in terms of the conceptionalization and operationalization of mindfulness is the distinction between trait and state: While most definitions consider mindfulness to be a dispositional long-lasting trait (Brown & Ryan, 2003), it can also be understood as a current state (Lau et al., 2006). Trait mindfulness is defined by a “general tendency of being mindful” in daily life, whereas state mindfulness rather comprises “the degree of mindfulness at any particular point in time or context” (Bravo et al., 2022, p. 4). The majority of the established questionnaires conceptualized mindfulness as a trait only (Lau et al., 2006). Therefore, experience-sampling approaches have the potential to become an important addition in the assessment of mindfulness as they represent more state-like qualities of mindfulness, i.e., the capacity to cultivate a state of mindfulness during meditative practice.

The MBE seems to be particularly suitable as an experience-sampling method because it follows a naturalistic approach of just mindfully observing the breathing. Burg and Michalak (2011) examined the convergent validity of the MBE in a sample of undergraduates and found positive correlations with mindfulness questionnaire facets of “acting with awareness” and “accepting without judgment.” Further results indicated positive correlations of the MBE with self-regulation (Burg et al., 2012) and self-esteem (Burg & Michalak, 2012).

Just as mindfulness has been shown to be related to aspects of positive mental health, it also seems to constitute an essential protective and psychological resilience factor for mental disorders like posttraumatic stress disorder (PTSD; R. W. Thompson et al., 2011) and depression (e.g., Schut & Boelen, 2017). Concerning PTSD, chronic avoidance of trauma-related memories, thoughts, and feelings highly contributes to the etiology and maintenance of the disorder (e.g., Orsillo & Batten, 2005; Walser & Hayes, 2006). In contrast, a mindful, non-judgmental awareness of painful memories and trauma-related thoughts and feelings counteracts avoidance behaviors and diminishes the risk of PTSD after a traumatic experience (Thompson et al., 2011). In the context of depression, one explanation is that mindfulness promotes a detached, decentered relationship with negative feelings and thoughts, which, in turn, leads to increased disengagement from ruminative processing (Teasdale et al., 2002; van der Velden et al., 2015). Hence, mindfulness is essential in preventing the exacerbation of negative thinking patterns (Ma & Teasdale, 2004).

Various correlational findings and group comparisons provide empirical evidence for the above-mentioned theoretical considerations regarding lower levels of self-reported mindfulness in patients diagnosed with PTSD or depression. In terms of PTSD, questionnaire-based measurements of mindfulness were negatively associated with overall PTSD symptomology (Carpenter et al., 2019), with its symptom clusters of intrusions and hyperarousal (Martin et al., 2018), as well as with that of avoidance (Thompson & Waltz, 2010). Additionally, Boughner et al. (2016) found that lower levels of self-reported mindfulness partially mediated the association between increased traumatic experiences and PTSD overall symptomology. Questionnaire-based measurements of mindfulness have also shown negative associations with depression symptom severity in trauma-exposed patients (Barr et al., 2019), and patients with PTSD (Schoorl et al., 2015) and with depressive disorders (Argus & Thompson, 2008; Solem et al., 2015), as well as in non-clinical samples (e.g., Freudenthaler et al., 2017; López et al., 2016). Regarding experience-sampling methods, Burg and Michalak (2011) found a negative association of the MBE with depressive symptoms, rumination, and repetitive negative thinking in a non-clinical sample.

As known so far, results of group comparisons indicated that traumatized patients with PTSD have lower mindfulness scores in questionnaires compared to traumatized individuals without PTSD and healthy controls (Basharpoor et al., 2015; Wahbeh et al., 2011). Nevertheless, the amount of group comparisons in relation to PTSD is sparse, and experience-sampling approaches have not yet been used with PTSD patients. In terms of depression, patients diagnosed with a current depressive episode had significantly lower mindfulness scores in questionnaires than patients in remission (Solem et al., 2015) and healthy controls (Crowe & McKay, 2016; Didonna et al., 2019). Rohde et al. (2014) showed that patients with depression had lower MBE scores in comparison to never-depressed controls and had significantly higher self-reported self-criticism after mind wandering during the MBE.

In the light of these findings, an ever-growing number of mindfulness-based programs has been developed in the clinical context (Baminiwatta & Solangaarachchi, 2021). However, valid and reliable measurements of mindfulness are required in order to assess the effectiveness of these programs (Isbel et al., 2020; Medvedev et al., 2022). Against the background of the described methodological limitations of mindfulness questionnaires (van Dam et al., 2018), it seems important to implement psychometrically solid additional measurement approaches of mindfulness. Various studies delivered promising results using self-report-based experience-sampling approaches to assess more state-like aspects of mindfulness during a meditation task (e.g., Burg & Michalak, 2011; Frewen et al., 2008; Levinson et al., 2014). In the context of depressive symptoms, experience-sampling methods for assessing mindfulness have been examined in non-clinical samples (Burg & Michalak, 2011; Frewen et al., 2011) and in patients diagnosed with depression (Rohde et al., 2014). However, apart from that, these approaches are not well established in clinical research yet. Up until now, only few studies have used them to measure mindfulness as an outcome variable in the evaluation of mindfulness-based interventions, e.g., one with healthy individuals (Isbel et al., 2020) and one with PTSD patients (Müller-Engelmann et al., 2019). Furthermore, systematic studies examining experience-sampling methods for mindfulness in the context of PTSD seem to be lacking entirely, which we consider a major issue, given the recently increasing presence of mindfulness in traumatic stress research (e.g., Hopwood & Schutte, 2017; Taylor et al., 2020).

Therefore, the present study pursued the main goal of evaluating the added value of an experience-sampling method as an additional measurement approach to mindfulness questionnaires. The combination of these two self-report-based operationalizations of mindfulness is ideally suited to triangulate these assessments in the clinical context. In the context of depressive symptoms, a lot of research has focused on mindfulness measured by questionnaires and first results exist with regard to experience-sampling. Therefore, it seems particularly promising to examine both measurement approaches in PTSD patients not only in reference to healthy individuals, but also in reference to patients diagnosed with depression. Until now, studies comparing different clinical groups with respect to mindfulness were based on questionnaires only (e.g., Crowe & McKay, 2016; Didonna et al., 2019). In contrast, the present study followed a multi-measurement approach using the MBE as an experience-sampling method to assess state aspects of mindfulness in combination with questionnaire-based measurements of mindfulness covering trait aspects.

As a first step, we examined associations of the two measurement approaches separately in three different samples: PTSD group, depression group, and control group (Hypothesis I). Next, we examined the relationship of experience-sampling and questionnaire-based measurements of mindfulness with clinical variables (Hypothesis II). Last, we performed group comparisons between the three groups (Hypothesis III). Our hypotheses were as follows: (I) We expected to find positive correlations of experience-sampling and questionnaire-based measurements of mindfulness in all three samples. (II) We expected both measurement approaches to be negatively associated with all three PTSD symptom clusters (intrusions, avoidance, and hyperarousal) in the PTSD group. Furthermore, we predicted negative associations with depressive symptoms in all three groups. In addition, we examined to which extent mindfulness, assessed by experience-sampling, predicts unique variance above and beyond questionnaire-based measurements of mindfulness in the PTSD symptom clusters (only the PTSD group) and in the depressive symptoms (the depression and control group). (III) We expected the PTSD and depression groups to show lower levels of mindfulness in both measurement approaches compared to the control group. We further compared both clinical groups, which might allow the discovery of disorder-specific characteristics in mindfulness.

Method

Participants

The overall sample (N = 99) in this study was made up of three samples (PTSD group, depression group, and control group; each sample n = 33). The PTSD and depression group were recruited from the waiting list of our outpatient treatment center at Goethe-University Frankfurt, and half of the data of the PTSD group (n = 16) were collected from another associated therapy study (Müller-Engelmann et al., 2019). The control group was recruited via different public channels.

For all participants, inclusion criteria were an age between 18 and 65 years and informed consent. In the PTSD group, we included patients having a primary diagnosis of PTSD and allowed for additional Axis I diagnoses, as in most patients there are high comorbidities in clinical practice (e.g., Nichter et al., 2019; Nickerson et al., 2017). Consequently, the PTSD group also comprised PTSD patients suffering from comorbid depression. In contrast, in the depression group, we only included patients with a single major depressive episode, recurrent depressive disorders, or persistent depressive disorders while not having any further Axis I diagnosis. We consider the two clinical samples (PTSD group vs. depression group) as conceptually distinct even though depression was not an exclusion criterion in the PTSD group since groups did differ in terms of their primary diagnosis and their potential focus of psychopathology. An advantage of the described inclusion criteria is that we recruited the patients in a naturalistic setting, and that the samples are close to clinical reality. In the control group, we only included participants who were characterized by an absence of any current mental disorder.

In all samples, the following factors were defined as exclusion criteria: current psychotherapeutic treatment, lifetime diagnosis of schizophrenia or bipolar disorder, intellectual disability, current substance addiction, suicide attempt in the last 12 months or current high suicide risk, and a body mass index lower than 16.

Procedures

After a brief telephone screening, suitable participants received an invitation to our laboratory where they gave written informed consent and sociodemographic data were collected. Inclusion and exclusion criteria were assessed with the Structured Clinical Interview for DSM-IV (SCID-I; German version: Wittchen et al., 1997). Then, participants completed the MBE (Burg & Michalak, 2011) and filled out the questionnaires electronically using Unipark (QuestBack GmbH). The whole procedure lasted about 3 hr including a half-hour break, and all participants received monetary compensation. The samples were matched to achieve comparable groups in terms of gender and age. In total, we screened 184 participants on the phone (72 for the PTSD group, 60 for the depression group, 52 for the control group), conducted the SCID-I with 119 participants (44 for the PTSD group, 41 for the depression group, 34 for the control group), and included 99 participants (33 in each group) in the final sample.

Measures

The SCID-1 (Wittchen et al., 1997) was used to assess present Axis I diagnoses. As no German DSM-5-based structured clinical interviews existed at the start of the study, we assessed Axis I diagnoses according to the DSM-IV. In all groups, traumatic events were recorded during the SCID-I interview and if present, the occurrence of posttraumatic stress symptoms was assessed. In the PTSD group, traumatic events were additionally identified with the Life Events Checklist (LEC-5; Weathers et al., 2013), which is a self-report measure that assesses exposure to 17 different traumatic events.

Depression symptoms over the past 2 weeks were assessed using the Beck Depression Inventory (BDI-II; Beck et al., 1996; German version: Hautzinger et al., 2006) in the two clinical groups. The BDI-II consists of 21 items that are answered on a 4-point scale with at least four options of increasing intensity to choose from. The German version of the BDI-II is a reliable and valid measure for depression (Hautzinger et al., 2006). In the present study, internal consistency for the BDI-II was good (Cronbach’s α = 0.93 and McDonald’s ω = 0.93).

The Davidson Trauma Scale (DTS; Davidson et al., 1997) was used as a self-rating scale to assess the frequency and severity of posttraumatic stress symptoms in the PTSD group. It consists of 17 items that are rated on a 5-point scale ranging from 0 (not at all) to 4 (every day) for frequency and from 0 (not at all distressing) to 4 (extremely distressing) for severity. In the present study, results are reported for the three subscales (intrusions, avoidance, and hyperarousal). The DTS shows good reliability and validity (Davidson et al., 1997). In the present study, internal consistency was good for all three subscales (intrusions: Cronbach’s α = 0.89 and McDonald’s ω = 0.87; avoidance: Cronbach’s α = 0.88 and McDonald’s ω = 0.86; hyperarousal: Cronbach’s α = 0.89 and McDonald’s ω = 0.87).

We used the Five Facet Mindfulness Questionnaire (FFMQ; Baer et al., 2006; German Version: Michalak et al., 2016) to measure mindfulness by questionnaire. The FFMQ consists of 39 items, which assess five facets of mindfulness (see above). Items are rated on a 5-point scale ranging from 1 (never or very rarely true) to 5 (very often or always true). The psychometric properties and the construct validity of the FFMQ were found to be good for both the English (Baer et al., 2008) and the German version (Michalak et al., 2016). However, factor analyses suggest that the “observing” subscale should not be part of the overarching mindfulness construct among non-meditators (Baer et al., 2006; Gu et al., 2016; Williams et al., 2014) and in the context of clinical samples (Curtiss & Klemanski, 2014; Williams et al., 2014). Thus, we report results for a FFMQ total score without the “observing” scale and internal consistency was good in the present study (Cronbach’s α = 0.92 and McDonald’s ω = 0.93).

As an experience-sampling measure, we used the Mindful-Breathing Exercise (MBE; Burg & Michalak, 2011). In this exercise, participants were asked to stay in contact with their breath for 18 min. Acoustic signals were given at irregular intervals, and participants were asked to indicate by mouse click whether they are still in contact with their breath or whether their mind is wandering. Additionally, they were asked to indicate whenever they notice that their mind is wandering during the intervals between the signals. The MBE consisted of 22 phases of different lengths (20 to 80 s) that end with a signal tone. A score for the MBE was calculated by considering all phases of the MBE without mind wandering indicated during and at the end of the phase. A high score corresponds to many phases in the MBE without mind wandering.

Data Analyses

There were no missing values. To avoid confounding of the results by age and gender, we used a matched group design. Simple ANOVAs and chi-square tests were performed to detect group differences regarding sociodemographic data.

To test for associations between the MBE and the FFMQ total score (without “observing”) and FFMQ facets (Hypothesis I), we computed separate Pearson product-moment correlations (one-tailed) for all three groups.

To test for associations of the MBE and the FFMQ total score (without “observing”) with clinical variables (Hypothesis II), in a first step we computed Pearson product-moment correlations (one-tailed) of the MBE and the FFMQ with the DTS symptom clusters in the PTSD group, and with the BDI-II in the depression and control group. In order to assess incremental validity of the MBE, we performed multiple linear regressions entering both the MBE and the FFMQ simultaneously into the model predicting all three DTS symptom clusters in the PTSD group and the BDI-II in the depression and control group. As 81.9% of the PTSD group suffered from a comorbid depressive disorder, depression could function as a confounding variable. Therefore, we computed Pearson product-moment correlations of the MBE and the FFMQ with the BDI-II in the PTSD group. Furthermore, in order to control statistically for comorbid depressive symptoms in the PTSD group, we added the BDI-II as control variable into the model. We applied Cohen’s conventions (1988) to interpret the size of the correlations (r ≥ 0.10 small association; r ≥ 0.30 moderate association; r ≥ 0.50 large association). We found no violations of the assumptions regarding Pearson product-moment correlations for Hypotheses I and II. Regarding the performed regression models (Hypothesis II), we did not detect any violations in terms of outlying values, nonlinearity, heteroscedasticity, auto-correlations among predictors, or multicollinearity. We found one violation to the normality assumption in the prediction of the BDI-II in the control group. However, in line with recommendations by Schmidt and Finan (2018), we decided against any outcome transformations, especially as regression models seem to be relatively robust to violations of the normality assumption.

To test for group differences (Hypothesis III), we conducted two separate ANOVAs (in-between design) for the MBE and the FFMQ total score (without “observing”) as dependent variables and the three groups as independent variables. Simple contrasts were used for pairwise comparisons. To counteract the problem of multiple comparisons, Bonferroni correction was used. Effect sizes were assessed via Cohen’s d (Cohen, 1988; d = 0.20 small effect, d = 0.50 medium effect, d = 0.80 large effect). No violation of ANOVA assumptions was found in the Shapiro-Wilk test and Levene’s test.

Results

Sample Characteristics

Sociodemographic data for the three groups are presented in Table 1. In terms of group differences, the control group had higher levels of education than the two clinical groups. Furthermore, the control group had lower levels of depressive symptoms (BDI-II) than the two clinical groups, while the two clinical groups did not differ significantly. Compared to the PTSD group, the proportion of those who experienced a traumatic event in the past was lower in the depression group and in the control group. For prior experiences of mindfulness, results showed significant differences between the three groups with the highest percentage of prior mindfulness experience in the PTSD group and the lowest in the control group.

Table 1 Sample characteristics

PTSD patients showed different index traumas: childhood sexual and/or physical abuse (14 patients, 42.4%), physical and/or sexual violence in adulthood (14 patients, 42.2%), witnessing a violent assault or sudden death of a close relative (3 patients, 6.1%), severe car or work accident (2 patients, 6.1%). The average number of traumatic event types experienced, as reported in the LEC, was 5.50 (SD = 2.93, range 1–10). The average duration of PTSD was 9.29 years (SD = 11.02, range 0.3–42). Thirty-one patients (93.93%) fulfilled the criteria for comorbid disorders according to the SCID-I interview. The average number of comorbid Axis I disorders was 1.94 (SD = 1.39, range 0–7). The most frequent Axis I diagnoses in addition to PTSD were depressive disorders (a total of 81.9% of patients, which is divided as follows: current episode/dysthymia/double depression: 60.7%; recurrent depressive disorder, currently in remission: 21.2%).

In the depression group, seven patients (21.2%) suffered from a first major depressive episode, 24 patients (72.7%) had a recurrent depressive disorder with an average of 5.12 episodes in the past (SD = 4.98, range 2–20), and two patients (6.1%) suffered from a double depression. According to SCID-I diagnostic, six patients (18.2%) were classified as having currently a mild episode, 21 patients (63.6%) as having a moderate episode, and six patients (18.2%) as having a severe episode. The average duration of the current depressive episode was 4.76 months (SD = 4.02, range 1–14).

Hypothesis I: Associations Between MBE and FFMQ

Table 2 shows correlations of the MBE with the FFMQ total score (without “observing”) and the FFMQ facets separately for all three groups. Regarding associations between the MBE and the FFMQ total score, analyses revealed no significant result in the PTSD group, a marginal significant moderate positive correlation in the depression group, and a highly significant moderate positive correlation in the control group. With regard to the FFMQ facets, in the PTSD group we found a significant moderate negative association with “nonjudging” and a significant moderate positive association with “nonreactivity.” In the depression group, marginal significant positive associations emerged for “acting with awareness” and “nonjudging.” In the control group, the MBE showed significant moderate positive correlations with all FFMQ facets except for “observing.”

Table 2 Pearson product-moment correlations of the MBE with the FFMQ total score and the FFMQ facets

Hypothesis II: Associations of MBE and FFMQ with Clinical Variables

Table 3 displays correlations of the MBE and the FFMQ total score (without “observing”) with clinical variables in all three groups. In the PTSD group, the MBE and the FFMQ showed a significant moderate negative correlation with DTS hyperarousal and a marginal significant small to moderate negative correlation with DTS intrusions while no significant correlations emerged with DTS avoidance. Furthermore, we found a moderate significant negative correlation of the BDI-II with the FFMQ in the PTSD group, while no association emerged with the MBE. In the depression group, analyses revealed significant moderate negative correlations of the MBE and the FFMQ with the BDI-II. In the control group, we found significant moderate to large negative correlations of both the MBE and the FFMQ with the BDI-II.

Table 3 Pearson product-moment correlations of the MBE and the FFMQ total score with clinical variables

In a next step, we performed linear multiple regressions to predict the clinical variables based on the MBE and the FFMQ. Results are shown in Table 4. In the PTSD group, we performed regressions on the DTS symptom clusters of intrusions, avoidance, and hyperarousal and controlled statistically for the BDI-II since comorbidity of depression was high in the PTSD group. In the PTSD group, all three regression models were significant (DTS intrusions: F (3, 29) = 4.09, p = 0.015; DTS avoidance: F (3, 29) = 5.08, p = 0.006; DTS hyperarousal: F (3, 29) = 9.80, p < 0.001). The BDI-II as a control variable significantly contributed to the prediction of all three DTS symptom clusters. The MBE was a marginally significant predictor of DTS intrusions and a significant predictor of DTS hyperarousal, while the FFMQ was not significantly associated with these outcomes. In the depression and control group, we performed regressions on the BDI-II. Significant overall models emerged in both groups (depression group: F (2, 30) = 5.06, p = 0.013; control group: F (2, 30) = 14.39, p < 0.001) with the FFMQ as the only significant predictor.

Table 4 Regressions of the MBE and the FFMQ total score on clinical variables

Hypothesis III: MBE and FFMQ Group Differences

Means and standard deviations of the MBE and the FFMQ for the three groups are presented in Table 5. We found significant univariate group effects for the MBE (F (2, 96) = 3.21, p = 0.045) and the FFMQ (F (2, 96) = 38.22, p < 0.001). Table 5 presents the results of the pairwise comparisons and the corresponding effect sizes. Post hoc tests revealed that the PTSD and depression group had marginal significantly lower MBE and FFMQ scores compared to the control group. No significant differences were found between the two clinical groups in the MBE and the FFMQ.

Table 5 Means and standard deviations of the MBE and the FFMQ total score and effect sizes of group differences

Discussion

Various methodological concerns challenge the validity of questionnaire-based assessments of mindfulness and hint at the necessity of alternative assessment approaches, e.g., other self-report measures such as experience-sampling or behavioral assessments. Hence, the main goal of this study was to evaluate the potential added value of an experience-sampling approach in the clinical context of PTSD and depression. Therefore, we used the MBE (Burg & Michalak, 2011), which measures the capacity to cultivate a state of mindful awareness during a breathing meditation exercise. The MBE covers rather state-like aspects of mindfulness while the FFMQ assesses trait aspects and comprises multiple facets of mindfulness, e.g., “nonjudging” or “acting with awareness” (Baer et al., 2006).

We separately examined the associations between the MBE and the FFMQ for patients with PTSD and depression as well as for healthy individuals. The fact that we did not find a significant association between the MBE and the FFMQ total score in the PTSD group might be explained by PTSD-typical symptoms such as hyperarousal and intrusions, which might affect performance more during a meditative exercise like the MBE than in a retrospective self-report like the FFMQ. In contrast, in the depression group, the MBE showed a moderate correlation with the FFMQ total score. In the control group, this association turned out to be even moderate to large. Assuming that the MBE assesses rather state-like aspects and the FFMQ trait-like aspects of mindfulness, the results of the present study make an important contribution to the respective literature as previous findings actually indicated that state and trait mindfulness are not strongly related (Bravo et al., 2018; Tanay & Bernstein, 2013; Thompson & Waltz, 2007). However, these results were obtained in non-clinical samples only, and state mindfulness was assessed with retrospective questionnaires.

Regarding convergent validity, in the PTSD group, the MBE seems to assess unique aspects of mindfulness, which are not fully covered by the FFMQ. However, we did find a significant moderate positive correlation between the MBE and the FFMQ facet “nonreactivity.” Nonetheless, the FFMQ facet “nonjudging” showed a negative moderate association with the MBE which was unexpected. We attribute this paradoxical finding to our small sample size. Although we did not find any statistically relevant outliers, scatterplots revealed extreme values in the MBE for few PTSD patients which led to considerable changes in the correlation found with the FFMQ. Therefore, one alternative explanation for the non-significant association between the MBE and the FFMQ could be that self-assessments and self-perceptions of PTSD patients might be biased due to their symptomology.

In contrast to the correlational results obtained in the PTSD group, in the depression and in the control group, the positive associations between the MBE and the FFMQ total score indicate a certain conceptual overlap. In the depression group, “acting with awareness” and “nonjudging” seemed to be crucial for this finding as they were the only FFMQ facets which showed a marginal significant moderate negative association with the MBE. In the control group, all FFMQ facets except “observing” showed significant moderate positive correlations with the MBE. In previous studies with non-clinical samples, Burg and Michalak (2011) only found “acting with awareness” and “accepting without judgment” to positively correlate with the MBE, while results of Frewen et al. (2014) indicate that only “acting with awareness” reliably correlates with the MBAS. However, in line with our findings, a study by Levinson et al. (2014) delivered moderate positive correlations of the BCT with all mindfulness facets except “observing” in two different non-clinical samples as well.

Overall, the results of the present study make an important contribution to former results regarding associations between experience-sampling and questionnaire-based approaches of mindfulness. Nevertheless, the findings are quite heterogeneous. Additionally, despite certain similarities between MBE, MBAS, and BCT, these measurement approaches are not equivalent as they differ in terms of length, setting, and instructions given. Therefore, future research is needed to draw solid conclusions.

A further step of the study was to look at the association of the MBE and the FFMQ with clinical variables. In the PTSD group, no significant correlation emerged for the MBE and the BDI-II. However, in line with previous results (Barr et al., 2019; Schoorl et al., 2015), we found a significant negative moderate correlation of the FFMQ with the BDI-II. Furthermore, both the MBE and the FFMQ showed a moderate negative correlation with hyperarousal, which supports previous results regarding the FFMQ (Martin et al., 2018). Remarkably, in the regression analysis predicting hyperarousal while controlling for depression, the MBE remained significant, whereas the FFMQ did not significantly contribute to the prediction of hyperarousal. For intrusions, we found a marginally significant small to moderate negative correlation with the MBE. In contrast to previous results (Martin et al., 2018), the correlation with the FFMQ was not significant. Regression analysis predicting intrusions showed that when controlling for depression, again only the MBE was a marginally significant predictor. To sum up, the MBE seemed to be superior to the FFMQ in terms of its predictive value for hyperarousal and intrusions.

Compared to healthy controls, PTSD patients in our study showed reduced levels of mindfulness in both the MBE and the FFMQ, which is in line with previous research assessing mindfulness with questionnaires (e.g., Basharpoor et al., 2015). We extended these findings following a multi-measurement approach using both the MBE and the FFMQ, which allowed us to gain more specific insights: Whereas impairments in the FFMQ in the PTSD group seem to be associated with comorbid depressive symptoms, impairments in the MBE might be associated with PTSD-specific symptomology. Thus, an elevated level of arousal and reactivity (e.g., hypervigilance, problems with concentration, and sleep disturbances; American Psychiatric Association, 2013) seems to impair patients’ abilities in a naturalistic meditative exercise such as the MBE. In this context, increased intrusions might be associated with difficulties focusing on one’s breathing due to more mind wandering and distractions caused by distressing trauma-related thoughts, feelings, and memories.

In contrast to our hypothesis, we found no association of the MBE and the FFMQ with avoidance in the PTSD group. This is surprising as previous studies indicated associations of several mindfulness facets with avoidance (Thompson & Waltz, 2010), and also from a theoretical point of view, avoiding trauma-related feelings, memories, and thoughts seems to be opposed to a mindful coping strategy. Further research is needed to clarify potential associations between PTSD-specific avoidance and mindfulness.

In the depression and control group, analyses yielded negative moderate associations of the MBE with the BDI-II. With this, we replicated findings of Burg and Michalak (2011) showing a similar association in a non-clinical sample, and extended them to a clinical sample of patients diagnosed with depression. Additionally, the FFMQ and the BDI-II showed negative correlations in both groups, which is in accordance with previous findings in samples of patients with depression (Argus & Thompson, 2008; Solem et al., 2015) and in samples of healthy individuals (e.g., Freudenthaler et al., 2017; López et al., 2016). Surprisingly, in regression models for both groups, the FFMQ turned out to be the only significant predictor. Thus, in these two groups, the MBE did not predict any unique variance of depressive symptoms above and beyond the FFMQ. The significant group differences in both measurement approaches between the depression and the control group are in line with previous findings concerning the FFMQ (e.g., Didonna et al., 2019) and the MBE (Rohde et al., 2014). The larger effect size we found for the FFMQ in this group comparison suggests that impairments in depressed patients might be more relevant in trait qualities of mindfulness assessed via questionnaires than in state aspects assessed with experience-sampling. One explanation might be that depressed patients’ evaluations of their abilities are negatively biased (e.g., Petersen et al., 2019; Serra-Blasco et al., 2019), which might result in an underestimation of their mindfulness skills in questionnaires.

Lastly, although we found distinct association patterns of the MBE and the FFMQ with psychopathology in the PTSD and depression group, we did not find differences between the two clinical groups in terms of their levels of mindfulness in both measurement approaches. The result with respect to the FFMQ might be explained by the high percentage of PTSD patients with comorbid depression in our sample. The results suggest that both clinical groups could benefit from mindfulness-based intervention, and we consider the findings of the present study to be quite notable for future evaluations of such interventions. Our findings emphasize the added value of the MBE to questionnaire-based assessments of mindfulness with respect to its predictive value for hyperarousal and intrusions. Future studies addressing mechanisms of change in PTSD symptomology might thus benefit from including the MBE in addition to questionnaire-based assessments of mindfulness. Nevertheless, the regression coefficients in the present study were still quite small and other factors explaining PTSD symptomology should be considered as well.

Limitations and Future Research

Our study has some limitations. Firstly, all our statistical analyses would have gained from a larger sample size; therefore, the results must be interpreted with due caution. In addition, Pearson product-moment correlations are relatively prone to outliers, which can be particularly relevant in the case of our small sample size leading to over- or underestimations of the actual associations.

Another limitation is the high comorbidity with depressive disorders in the PTSD group and the high self-reported level of depression in the BDI-II. However, the PTSD group we studied is representative of clinical reality where comorbidity with depression is particularly frequent (e.g., Nichter et al., 2019; Nickerson et al., 2017). In addition, 28.8% of the depression group and 12.1% of the control group had experienced past trauma, which is in line with the prevalence rate of traumatic life events in the general population (e.g., around 20%, Hauffa et al., 2011). Furthermore, the majority of the PTSD and depression group had prior experience with mindfulness, mostly due to participating in mindfulness group sessions during inpatient treatment. In contrast, in the control group only about 20% reported previous experience in mindfulness. This may have impaired the validity of the results of the group comparison: The differences between the two clinical groups and the control group might have turned out to be larger due to the patients’ prior experiences with mindfulness practice. Following van Dam et al. (2018), deficits could be perceived more strongly with increasing prior experience in mindfulness and thus lead to a lower FFMQ score. In contrast, inexperienced beginners, like the majority of the control group, might have a more naïve comprehension of the concept and therefore be prone to overestimating their mindfulness skills (Sauer et al., 2013).

A very important limitation of the present study is the absence of a questionnaire-based measurement approach of state mindfulness such as the Toronto Mindfulness Scale (TMS; Lau et al., 2006) or the State Mindfulness Scale (SMS; Tanay & Bernstein, 2013). Various previous studies followed a two-step protocol to measure state mindfulness: first, the completion of a short meditation exercise, and afterwards, the assessment of state mindfulness during the exercise via questionnaire-based self-reports (e.g., Bravo et al., 2018; Thompson & Waltz, 2007). In contrast to this procedure, we used the MBE, where participants are instructed to rate their current state of mindfulness directly during the meditation exercise, which avoids retrospective biases. Nevertheless, the present study would have benefitted from the inclusion of a questionnaire-based measure of state mindfulness additional to the MBE. This would have allowed us to empirically test our theoretical assumptions that the MBE captures rather state than trait-like aspects of mindfulness.

With respect to experience-sampling approaches, one major shortcoming lies in the fact that reporting one’s own experience during a meditation exercise is still based on self-reports (Csikszentmihalyi & Larson, 2014), which can lead to common biases (e.g., social desirability; Tracey, 2016). Furthermore, just like the FFMQ, the MBE is also prone to the contradictory effect of increased self-awareness leading to decreased mindfulness ratings: Being more aware of one’s absent-mindedness and more sensitive to mind wandering can still result in lower MBE scores. In contrast to the FFMQ, the MBE avoids any kind of semantic understanding of mindfulness concepts since participants only rate their current state of mindfulness by a simple mouse click. Therefore, self-ratings during the MBE might depend less on previous individual experience with mindfulness than self-ratings in questionnaires. In addition, the MBE avoids retrospective biases due to an immediate assessment of mindfulness during a meditation exercise in a task environment. Therefore, the MBE seems to be reliable and sensitive to measure attention-related aspects of mindfulness. Nevertheless, one disadvantage of the MBE is that it does not assess the attitudes of nonjudging, nonreactivity, and friendly curiosity that are often described as essential to mindfulness (Kabat-Zinn, 2015) and are assessed with the FFMQ. To sum up, the MBE and the FFMQ both have their specific benefits and shortcomings.

For future research, it would be of great interest to compare other existing experience-sampling approaches for mindfulness such as the BCT (Levinson et al., 2014) and the MBAS (Frewen et al., 2008), with the MBE (Burg & Michalak, 2011). Furthermore, future studies might examine the association of mindfulness based on experience-sampling with questionnaire-based measurements of both state (e.g., TMS; Lau et al., 2006) and trait mindfulness (such as the FFMQ). In the present study, we conducted analyses based on cross-sectional data, which do not allow any conclusions in terms of causality. Therefore, longitudinal analysis methods are required to interpret the direction of the association between mindfulness and psychopathology. Lastly, the present study has demonstrated different correlational patterns of the MBE and the FFMQ in the three groups, thus showing that results based on non-clinical samples cannot be generalized to patient groups. Therefore, future research on different measurement approaches of mindfulness might be advised to shift its focus more to clinical samples.