Positive psychology is a relatively new field that focuses on enhancing well-being and optimal functioning rather than ameliorating symptoms, and complements rather than replaces traditional psychology [1]. Common themes in positive psychology include savoring, gratitude, kindness, promoting positive relationships, and pursuing hope and meaning [2].

Now that it has been repeatedly shown that well-being and psychopathology are two moderately correlated yet independent constructs of mental health [3,4,5,6], well-being receives growing attention in clinical research and practice. Even after successful treatment of psychopathology, low levels of well-being may persist in individuals, which, in turn, form a substantial risk factor for psychological distress [7]. In the light of a substantial body of evidence demonstrating that high levels of well-being buffer against psychological symptomatology, including relapse or recurrence of symptoms, besides enhancing quality of life and longevity [5, 7,8,9,10,11,12,13,14], we anticipate that clinical samples could greatly benefit from positive psychological interventions (PPIs) which explicitly aim to enhance well-being, that is, positive feelings, cognitions or behaviors [15].

Although PPIs have been mostly examined in non-clinical samples [16], some preliminary evidence exist for their efficacy in clinical samples [16, 17]. Independent lines of research have shown that PPIs improved well-being and decreased psychological distress in mildly depressed individuals [18], in patients with mood and depressive disorders [19, 20], in patients with psychotic disorders [21] and improving quality of life and well-being in breast cancer patients [22]. Thus, PPIs may have the potential to be of value to clinical samples but their effectiveness in these samples is not well established.

To date, two meta-analyses have been published that examined the effectiveness of PPIs in predominantly non-clinical samples. First, Sin and Lyubomirsky [17] included 49 controlled studies with 4235 individuals examining the effectiveness of PPIs on well-being and depression. They found that PPIs were significantly more effective than comparators (i.e. active control or treatment as usual) for enhancing well-being (r = .29) and decreasing depression (r = .31). Second, to address several methodological issues in Sin and Lyubomirsky’s meta-analysis [17] such as lack of methodological quality assessment of the included studies, Bolier and colleagues [16] re-examined the literature. Using more stringent methodological and inclusion criteria, they systematically collected and synthesized the findings of 39 randomized controlled studies with 6139 individuals. Small but significant effects of PPIs on subjective well-being, psychological well-being and depression were found, with Cohen’s d effect sizes of 0.34, 0.20 and 0.23, respectively.

However, these previously published meta-analyses are inconclusive regarding the effectiveness of PPIs in improving well-being and alleviating psychological distress in clinical samples. Although both meta-analyses included a number of studies with clinical samples, 12 out of 49 studies [17] and 4 out of 39 studies [16], respectively, these were limited to psychiatric samples with depressive or anxiety symptoms. To our knowledge, no attempt has been made to systematically examine the effects of PPIs in samples with somatic disorders who may benefit from improvements in well-being [23].

Since there is growing interest in the application of PPIs targeting clinical samples, the aim of the study was to add to the existing literature on the effectiveness of PPIs in primarily non-clinical samples [16, 17] through meta-analytically testing the effects of PPIs on well-being and distress across a broad range of clinical samples with psychiatric and somatic disorders.


This study was prepared and conducted according to the preferred reporting items for systematic reviews and meta-analyses (PRISMA) guidelines [24] and registered on April 29, 2016 in PROSPERO (#CRD42016037451), an international prospective register for systematic reviews.

Search strategy

The electronic databases PsycINFO, PubMed, and Scopus were searched from 1998 (the start of the positive psychology movement) to March 31, 2016, and an update of the search was conducted on May 31, 2017. For each database, text word search terms, medical subject headings (PubMed) or thesaurus terms (PsycINFO) were used relating to ‘well-being’ and ‘positive psychology’, in combination with terms related to ‘interventions’, ‘disorders and illness’ and ‘outcome’ (see Additional file 1 for more detailed information on the search terms). Studies cited in the previously published meta-analytic reviews [16, 17, 22] were cross-checked. Additionally, three clinical trial registers (,, were searched on August 31, 2016, to detect trials with unpublished results available.

Selection of studies

Potentially eligible studies were screened on title in the first phase, on abstract in the second phase, and on full paper in the third phase. Studies were included in the meta-analysis if they: 1) examined the effects of an intervention developed in line with the theoretical tradition of positive psychology cfm. Sin and Lyubomirsky (2009), that is, a psychological intervention (i.e. training, exercise, therapy) aimed at raising positive feelings, cognitions or behaviors; 2) included adult participants (18 years or older) with clinical psychiatric or somatic disorders [according to the International Classification of Diseases and Related Health Problems; [25]; 3) used an outcome measure of social, emotional or psychological well-being; 4) used a control condition; and 5) provided an effect size or sufficient information to calculate an effect size. Studies were excluded if they: 1) were not published in an English language peer-reviewed journal; 2) examined physical exercises aimed at improving well-being; or 3) used an intervention that is primarily based on reminiscence, mindfulness and/or meditation. With regard to the third exclusion criterion, extensive meta-analyses have already been published for these types of interventions [26,27,28,29,30]. Published abstracts and/or study protocols were also excluded.

The first (FC) and second author (JTK) independently conducted the screening of titles. The interrater reliability was high (kappa = 0.84; n = 1000). Disagreements between raters during the screening of abstracts and full texts were discussed until consensus was reached. Any remaining ambiguity was resolved with the third (MSS) and fourth author (ETB).

Data extraction

Data were collected on: 1) population characteristics, including age, gender, disorder, and sample size (per condition); 2) intervention characteristics, including type of PPI, delivery mode, number of sessions, duration in weeks, retention rate, and guidance (i.e. with or without therapist); 3) methodological characteristics, including study design, type of control group, assessment points (i.e. pre, post and/or follow up), and outcome measures. Eight authors were contacted because information regarding study characteristics or to calculate effect sizes was lacking, of whom six provided additional data on request.

Quality assessment

All studies were rated on methodological quality using criteria based on the Cochrane Collaboration’s tool for assessing risk of bias [31] and the Jadad scale [32]. This rating consists of seven items that are rated as 0 (“absent”) or 1 (“present”), resulting in a maximum quality score of 7 points. Studies were identified as “good” when all seven criteria were met, “fair” when five or six criteria were met, and “poor” when four or less criteria were met [33]. The included items cover: 1) random sequence generation and allocation concealment (i.e. sufficient description of the method used to generate and conceal the allocation sequence); 2) blinding of outcome assessments (i.e. outcome assessments are either administered online or by an independent person who is not involved in the study), 3) reporting incomplete outcome data (i.e. dropout analysis is conducted or reasons for drop-out are reported), 4) using intention-to-treat analysis, 5) group similarity at baseline regarding prognostic factors (e.g. demographics) or adjustments were made to correct for baseline imbalance, 6) adequate sample size/power analysis (i.e. an adequate power analysis was conducted or the study included 50 or more persons in the analysis), and 7) reliability of the diagnostic assessment (i.e. assessment was conducted by a professional and not based on self-report or screening or there were no diagnostic assessments). The first (FC) and second author (JTK) independently conducted the quality assessment, whereby disagreements were discussed until consensus was reached.

Primary and secondary outcomes

The primary outcome was the mean well-being score at the end of the intervention, assessed with validated measures of social, emotional, and/or psychological well-being. In the absence of well-being measures, measures, constructs related to well-being such as hope, happiness, life satisfaction, personal growth, optimism or positive affect were included if available. If more than one measure for well-being was used, we used the most validated measure, to ensure each study had one primary outcome for the analysis. Secondary outcomes included depression, anxiety and stress.

Statistical analysis

For each study, means and standard deviations were extracted, where possible based on the intention-to-treat method; otherwise, the reported means and standard deviations for the patients that completed the interventions were used. Effect sizes were calculated in three steps. First, standardized pre-post effect sizes were calculated per condition (i.e. PPI or control condition) by subtracting the average pre-intervention score from the average post-intervention score and subsequently dividing this score by the pooled standard deviation. Second, the difference in effect size (Δd) between PPI condition and control condition was computed. Third, Δd was adjusted for small sample bias, indicated as Hedges’ g. Where possible, pre-to-follow-up effect sizes were calculated in a similar manner, thereby only using studies with a follow-up period between 8 and 12 weeks.

Using Comprehensive Meta-Analysis version 2.2.064, separate meta-analyses were performed for 1) well-being, 2) depression, 3) anxiety, and 4) stress in which data were pooled using the random-effects model accounting for diversity across studies (e.g. in terms of populations, types of PPIs and outcome measures). Effect sizes of 0.56 to 1.2 can be considered large, effect sizes of 0.33 to 0.55 moderate, and effect sizes of 0 to 0.32 small [34].

Heterogeneity of effect sizes was examined using Q and I2 statistics. The Q-test assesses whether the observed effect sizes are significantly more different from one another than would be expected based on chance alone. A significant Q-statistic indicates heterogeneity. The I2 statistic captures the percentage of the total variance across the included studies attributable to heterogeneity. A value of zero indicates true homogeneity, while values of 25, 50, and 75% indicate low, moderate, and high levels of heterogeneity, respectively [35].

Publication bias was assessed using funnel plots, Egger’s Test, Duval and Tweedie’s trim-and-fill procedure, and fail-safe N. First, a funnel plot was created by plotting the overall mean effect size against study size. Whereas a symmetric distribution of studies around the effect size indicates the absence of publication bias, a higher concentration of studies on one side of the effect size than on the other indicates publication bias [36]. Second, Egger’s test [37] was used to examine the symmetric distribution of studies around the effect size with a quantitative test statistic (considered significant funnel plot asymmetry if p < 0.05). Third, Duval and Tweedie’s [38] trim-and-fill procedure was applied. This procedure imputes the effect sizes of missing studies and produces an adjusted effect size accounting for these missing studies. Adjusted values were only reported for pooled effect sizes when these were statistically significant. Finally, a fail-safe N, a test of funnel plot asymmetry, was calculated for each analysis. The fail-safe N indicates the number of unpublished non-significant studies that would be required to lower the overall effect size below significance [37]. The findings were considered robust if the fail-safe N ≥ 5n + 10, where n is the number of comparisons [39].

Pre-specified exploratory subgroup analyses were performed to examine differences in effect sizes based on: 1) population type: psychiatric vs somatic disorders; 2) intervention type: individual vs. group format, with vs. without therapist guidance; and 3) duration of the intervention: short (≤ 8 weeks) vs long (> 8 weeks). Mixed effects analysis was used to tests for differences between subgroups. Additional ad hoc analyses were performed to explore differences in effect sizes based on: 1) type of PPI: PPI therapy programs (e.g. meaning-centered group approach, well-being therapy) vs single PPIs (e.g. three good things/signature strengths); and 2) control group: no intervention (i.e. did not receive any intervention at all)/waitlist (i.e. did receive the intervention after the experimental group) vs. active/treatment-as-usual. Finally, meta-regression analysis was performed to investigate if effect sizes were moderated by study quality.


Selection of studies

A total of 10,886 studies were produced in the electronic database searches. After the exclusion of duplicates (n = 1578) and the removal of studies at the title screening phase (n = 9069), 239 abstracts were reviewed (Fig. 1). Of the 101 articles identified for full text review, 30 controlled studies were included. The 30 studies comprised 33 comparisons for well-being, 26 comparisons for depression, 14 comparisons for anxiety and 6 comparisons for stress [40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69]. Fourteen studies were conducted in the United States, three each in Iran, Canada, and Spain, two each in the United Kingdom and Italy, and one each in Australia, Germany, and Taiwan. The characteristics of the included studies are presented in Table 1.

Fig. 1
figure 1

Flowchart of the study selection process

Table 1 Characteristics of studies included in the systematic review and meta-analysis

Population characteristics

The included studies comprised 1864 adult participants, 960 in the PPI conditions and 904 in the control conditions. The mean age of the participants at pre-intervention was 47.8 years (SD = 11.5, range 26.4–68.9), and more than half were women (61.5%). In 16 studies, clinical samples with somatic disorders were included, with cancer being the most prevalent disorder (8 out of 16 studies). Other somatic disorders included cardiac diseases (n = 4), HIV (n = 1), brain injuries (n = 1), diabetes (n = 1) and chronic pain (n = 1). The remaining 14 studies included samples with psychiatric disorders, with depressive disorder as the most prevalent disorder (7 out of 13 studies), followed by anxiety disorders (n = 2), severe emotion dysregulation (n = 1), psychotic disorders (n = 1), post-traumatic stress syndrome (n = 1), and various mental health problems (n = 2).

Intervention, comparison and outcome characteristics

In 20 studies, PPIs were compared to treatment as usual or an active control condition, such as supportive psychotherapy [44], cognitive behavioral therapy [47], dialectical behavior therapy [69] or mood monitoring [57]. Ten studies compared PPIs to a no intervention/waitlist condition. The names of the PPIs as provided by the authors of the studies are also displayed in Table 1. All interventions were explicitly aimed at raising positive feelings, cognitions or behaviors. The 24 studies used empirically validated PPIs (see [2, 18]) or programs that have incorporated PPIs such as positive psychotherapy [67] or well-being therapy [51]. In 24 studies, therapist guidance was part of the PPI. The intervention duration varied from 3 days to 16 weeks. The mean retention rate, based on dropouts at post-intervention, was 81.4% (available for 26 studies). For the PPI conditions, the mean retention rate was 81.0% and for the control conditions 81.8%. For the 12 studies that included follow-up measurements, the average follow-up time was 12.9 weeks after post-intervention.

Quality of studies

The quality scores of the studies are displayed in Table 2. If a criterion was not reported in the paper, it was labeled “unclear”, and the criterion was rated as not met. All studies were either of medium quality (n = 12) or of low quality (n = 18). None of the included studies met all quality criteria. The use of intention-to-treat analyses was the most poorly rated, with only 11 studies meeting this criterion.

Table 2 Methodological quality of studies included in the meta-analysis


Table 3 summarizes findings from the meta-analyses per outcome, i.e. well-being, depression, anxiety, and stress. The meta-analyses were run separately for all studies at post-intervention with the outliers included, with the outliers excluded and with the low quality omitted. The meta-analyses at follow-up were run including outliers and low quality studies. The effect sizes of the individual studies at post-intervention are plotted in Figs. 2, 3, 4 and 5.

Table 3 Between-group effects
Fig. 2
figure 2

Post-intervention effects of positive psychology interventions on well-being. The square boxes show Hedges’ g effect size in each study, and the line the 95% confidence interval. The diamond reflects the pooled effect size and the line the width of the 95% confidence interval

Fig. 3
figure 3

Post-intervention effects of positive psychology interventions on depression. The square boxes show Hedges’ g effect size in each study, and the line the 95% confidence interval. The diamond reflects the pooled effect size and the line the width of the 95% confidence interval

Fig. 4
figure 4

Post-intervention effects of positive psychology interventions on anxiety. The square boxes show Hedges’ g effect size in each study, and the line the 95% confidence interval. The diamond reflects the pooled effect size and the width of the 95% confidence interval

Fig. 5
figure 5

Post-intervention effects of positive psychology interventions on stress. The square boxes show Hedges’ g effect size in each study, and the line the 95% confidence interval. The diamond reflects the pooled effect size and the width of the 95% confidence interval

Post-intervention effects on well-being

For well-being (33 comparisons), a significant, small effect was observed (g = 0.28, 95% CI: 0.07 to 0.48, p = 0.008) at post-intervention. The level of heterogeneity was high (I2 = 78.20). Four outliers were detected [41, 46, 52, 61]. After omitting these studies from the analysis, we found a similar effect, with g = 0.24 (95% CI: 0.13 to 0.35, p < 0.001), and heterogeneity reduced substantially (I2 = 20.29). When studies scored as low quality were excluded from the analysis (including outliers), again a small significant effect size was observed (g = 0.19, 95% CI: 0.02 to 0.37, p = 0.030), with a moderate level of heterogeneity (I2 = 40.88).

Post-intervention effects on depression

Based on 26 comparisons, we found a significant, small effect of PPIs on depression, with g = 0.27 (95% CI: 0.09 to 0.45, p = 0.003) at post-intervention. The level of heterogeneity was moderate (I2 = 62.34). Five outliers were detected [45, 51, 56, 68, 69]. After removal of the outliers, a small effect size was observed (g = 0.23, 95% CI: 0.11 to 0.34, p < 0.001). The level of heterogeneity was low (I2 = 10.16). After removal of low quality studies, the effect size for depression was not significant with g = 0.07 (95% CI: -0.19 to 0.32, p = 0.598), and heterogeneity was moderate (I2 = 66.08).

Post-intervention effects on anxiety

For anxiety (14 comparisons), a significant, moderate effect was found (g = 0.47, 95% CI: 0.23 to 0.71, p < 0.001) at post-intervention. Heterogeneity was moderate (I2 = 62.34), and one outlier was detected [52]. After removal of the outlier, the effect size dropped to g = 0.36 (95% CI: 0.20 to 0.53, p < 0.001), but still remained in the moderate range, and the level of heterogeneity was low (I2 = 10.16). After removal of low quality studies from the analysis, the effect size for anxiety was small and not significant (g = 0.22, 95% CI: -0.05 to 0.49, p = 0.233), with moderate heterogeneity (I2 = 40.39).

Post-intervention effects on stress

The overall mean effect size for 5 comparisons on stress was not significant (g = 0.00; 95% CI: -0.62 to 0.62, p = 0.999) at post-intervention. After the removal of one outlier [41], the effect size increased to the small range (g = 0.27; 95% CI: -0.19 to 0.73, I2 = 43.89) but remained non-significant (p = .247). Only 1 study that included stress as an outcome had a medium quality rating (see Table 1).

Effects at follow-up

At follow-up, a significant, moderate effect was observed for well-being (g = 0.41, 95% CI: 0.08 to 0.74, p = 0.014), a significant, small effect for depression (g = 0.21, 95% CI: 0.05 to 0.37, p = 0.011), and a significant, moderate effect for anxiety (g = 0.35, 95% CI: 0.12 to 0.59, p = 0.004). There were no follow-up assessments conducted between 8 to 12 weeks with stress as outcome.

Subgroup analyses

Exploratory subgroup analyses are presented in Table 4. For well-being (Q = 6.412, df = 1, p = 0.011) a significantly higher effect size was found for PPIs with therapist guidance (g = 0.39) than for PPIs without therapist guidance (g = − 0.12). For stress, PPIs were found significantly more effective in studies using a no intervention/waitlist control condition (g = 1.12 vs g = − 0.21; Q = 8.283, df = 1, p = 0.004) than in studies using an active or treatment-as-usual control condition. Effect sizes did not significantly vary based on population type (i.e. psychiatric vs somatic disorders), intervention format (i.e. individual vs group), intervention duration (i.e. shorter vs longer than 8 weeks) and/or type of PPI (i.e. PPI therapy programs vs single PPIs). For depression and anxiety, no significant differences between subgroups were found.

Table 4 Subgroup analyses (including outliers)

Meta-regression analysis

Using meta-regression analysis, we found no evidence that effect sizes for well-being and stress were moderated by study quality. The study quality had a significant negative influence on the effect size for depression and anxiety, with lower study quality scores resulting in lower effect sizes for depression (slope: -0.17, Z = − 3.23, p = 0.001) and anxiety (slope: -0.28, Z = − 3.25, p = 0.001).

Publication bias

First, inspection of the funnel plots showed that only for stress the funnel plot was skewed in favor of studies with a positive outcome at post-intervention. Second, Egger’s test statistic showed no significant funnel plot asymmetry for all analyses (all p-values > .05). Third, after adjusting for potential publication bias with Duval and Tweedie’s trim-and-fill procedure, the effect sizes for well-being and stress remained the same. However, for depression, four studies were trimmed and the adjusted effect size was g = 0.15 (95% CI: 0.05 to 0.25). Also for anxiety, four studies were trimmed and the adjusted effect size was g = 0.27 (95% CI: 0.14 to 0.39). Finally, the fail-safe N indicated that the findings for well-being and anxiety were robust, whereas the fail-safe numbers for depression (132) and stress (0) were lower than required (140 and 35, respectively). After omitting outliers, the findings for anxiety remained robust. The fail-safe N for well-being (137), depression (66) and stress (0) were lower than required (respectively 155, 115 and 35). At follow-up, the fail-safe N for well-being (28), depression (4) and anxiety (10) were lower than required (respectively 45, 35, and 30).


To our knowledge, this is the first meta-analysis examining the effects of PPIs on well-being and distress in clinical samples with psychiatric and somatic disorders. When excluding outliers, our analyses suggest that PPIs have a small but significant effect on well-being compared to control conditions. At follow-up, a significant moderate effect size of PPIs on well-being was observed. For the secondary outcomes, a small but significant effect size was found for depression at post-intervention and follow-up and moderate significant effect sizes for anxiety at post-intervention and follow-up. Effect sizes for stress were not significant. These findings suggest that PPIs not only have the potential to improve well-being, but can also reduce distress in populations with clinical disorders.

The effect sizes at post-intervention and follow-up for well-being and distress were comparable with those found in Bolier et al's meta-analysis of controlled PPIs studies in predominantly non-clinical samples [16], but were lower than those in the earlier meta-analysis of Sin and Lyubomirsky [17]. However, in the meta-analysis conducted by Sin and Lyubomirsky [17] less stringent inclusion criteria were used and other interventions such as mindfulness and life-review were included that are commonly not regarded as PPIs [2, 16]. Nonetheless, our findings show promise for PPIs in samples with psychiatric and somatic disorders, and suggest that PPIs, wherein the focus is on eliciting positive feelings, cognitions or behaviors, may also be relevant for clinical populations.

In the field of psychology, especially clinical psychology, the focus lies primarily on examining distress-reducing treatment approaches. As PPIs explicitly aim to improve well-being, the findings of the current study are important because well-being is often impaired in individuals with clinical disorders [23] and low levels of well-being form a substantial risk for relapse or recurrence of symptoms [5, 7]. More importantly, recent studies suggest that well-being and psychological distress are two separate constructs, and that the treatment of symptoms does not necessarily result in improved well-being (e.g., [6, 14]). In the light of these findings, we encourage researchers to further establish the effectiveness of well-being enhancing approaches including PPIs.

Explorative subgroup analyses suggest that guided PPIs are more effective in improving well-being compared to unguided PPIs, such as self-help. Similar findings were found in earlier meta-analytic reviews [16, 17] regarding PPIs in predominantly nonclinical samples, where larger effect sizes were found in therapist-guided interventions (compared with unguided self-help), when the interventions were offered to people with mental health problems. This is also in line with findings regarding supported versus unsupported conventional psychological treatments, such as cognitive behavioral therapy (e.g., [70, 71]) where significant larger effect sizes are observed for supported psychological treatments. Therapist guidance may potentially improve outcomes of PPIs on well-being in samples with psychiatric and/or somatic disorders. However, based on the explorative nature of the subgroup analyses, these findings should be treated with caution and future research should examine the effect of therapist guidance compared to self-help in controlled studies.

No other significant pre-specified moderators of outcome were observed. There was no significant effect of disorder type (i.e. psychiatric vs somatic disorders) and intervention format (i.e. individual vs group). Although, the moderating effect of intervention duration was not significant, the results showed that PPIs with a shorter duration than 8 weeks did not have a significant effect on well-being whereas PPIs with a longer duration had a significant effect on well-being. This finding is in line with earlier meta-analytic reviews [4, 5] and suggests that PPIs are more effective when offered during a longer period of time (more than 8 weeks). In the additional moderator analyses, no significant differences in effect sizes were found for empirically validated PPIs vs other PPIs. For stress, a significantly higher effect size was found for PPIs that had no intervention/waitlist as control condition than for PPIs that had an active control condition or treatment-as-usual as control condition. However, the sample sizes were relatively small in the exploratory subgroup analyses, which limits the interpretation of the differences between groups, and the results should therefore be considered with caution.

The current systematic review and meta-analysis highlights the need to improve the research methodology and reporting within the field of PPIs. The quality of the included studies was low to medium. Although the quality of the studies may have been underestimated since we rated a criterion as not met if it was not reported in the paper, it seems that the methodological quality of studies in this field could be considerably improved if authors routinely report on sequence generation, allocation concealment and blinding of assessors. Furthermore, only one third of the included studies reported using the intention-to-treat principle to analyze the results and almost half of the included studies did not report using a power analysis to determine the sample size. Inadequate statistical power and not adhering to the intention-to-treat principle introduces bias into the results of individual studies, and distorts the results from meta-analyses [72]. This was reflected in the meta-regression analysis which indicated that the effects of PPIs on depression and anxiety were moderated by the methodological quality of the studies, with a lower study quality resulting in smaller effect sizes. Therefore, we recommend researchers conducting studies on PPIs in clinical samples to comply with the quality criteria when designing studies, in order to perform more high quality research to accurately determine the effectiveness of PPIs in clinical samples with psychiatric and somatic disorders. Moreover, the number of studies including post-treatment follow-up measures is relatively low (12 out of 30). We encourage researchers in the field to include follow-up measurements as to determine whether possible favorable effects of PPIs can be sustained in the long run.

Our systematic review and meta-analysis focused on controlled studies of PPIs in clinical samples. We identified a number of studies in different clinical disorders, age groups and settings. Drawing upon these findings in one place has generated the first evidence-based overview of the effectiveness of PPIs in clinical populations. However, several limitations should be noted. One important limitation is that well-being was not always the primary outcome in the included studies. Also, different definitions of well-being were used across the included studies. Incorporating validated measures of well-being, preferably ones that encompass emotional, psychological, and social dimensions of well-being [73], in future studies of PPIs is recommended. Second, the effects of the PPIs may also have been overestimated due to publication bias. Although the results of this meta-analysis point at significant but small effects of PPIs, after adjustment for publication bias, caution is needed. Third, our conclusions are based on the overall effect after the exclusion of outliers, including studies of low quality. When considering only studies of at least medium quality, the effects of PPIs are substantially lower but the sample size of the studies also decreases substantially. Since this is the first study meta-analyzing the effects of PPIs in clinical samples, we based our conclusions on the analyses (i.e. after excluding outliers) with the largest sample size to present a more comprehensive representation of the field. Fourth, we observed a broad range of PPIs in our meta-analysis that varied in delivery mode and intensity. Future research should examine which clinical populations may benefit from PPIs, in terms of type, delivery mode and intensity, and whether there are differential mediators of outcome. Still, this is one of the first meta-analyses in this field providing an overview of PPIs in clinical samples.


In conclusion, this systematic review and meta-analysis provides evidence that PPIs are effective in improving well-being as well as in alleviating common psychological symptoms, including depression and anxiety, in clinical samples with psychiatric and somatic disorders. At present, the most promising PPIs seem to be those that are guided. Given the growing interest for PPIs in clinical settings [15, 16], it is timely and important to further establish the potential of PPIs in the context of clinical populations using large-scale and methodologically sound trials.