Introduction

Schools are expected to foster not only their students’ cognitive development, but also their students’ wellbeing. Schools should implement policies and practices striving to improve students’ attitudes, values, and social support (Langford et al. 2014; World Health Organization 1995). To this end, a range of universal school-based interventions have been developed to enhance students’ intrapersonal and interpersonal domains. The intrapersonal domain refers to managing one’s own feelings, emotions, and attitudes pertained to the individual self (Barber 2005). The interpersonal domain refers to the ability to build and maintain positive relationships with others and to understand social situations, roles, and norms, and respond appropriately (Pellegrino and Hilton 2012; Shek and Leung 2016). Both domains are intertwined as the way individuals view themselves can influence how they approach social interactions and vice versa (Finkel and Vohs 2006).

Even though the two domains are related, they are also meaningfully distinct. While the intrapersonal domain represents an individual’s subjective psychological functioning, the interpersonal domain represents an individual’s social functioning (Dufner et al. 2019). This distinction is empirically supported by factor and profile analyses (e.g., Gilman and Anderman 2006; Park et al. 2017), and by relating the two domains to various developmental outcomes. For instance, competencies in the intrapersonal domain predict better academic achievement and competencies in the interpersonal domain predict better peer relations (Park et al. 2017). While students can acquire competencies in both domains by mastering relevant cognitive, affective, and social skills (Durlak et al. 2011), difficulties with these skills can set students at increased risk of developing problems in the intrapersonal domain, such as internalizing behavior, or in the interpersonal domain, such as aggression (Modecki et al. 2017; White et al. 2013). Given that competencies in both domains are markers of a healthy psychosocial development, whereas problems in these domains increase the likelihood of developing psychopathology later in life, it is important to stimulate youth’s development in these domains (Van Order et al. 2005).

Children’s intra- and interpersonal domains develop throughout their youth, but the importance of these skills becomes particularly pronounced in adolescence when they consolidate their own identity and peer relationships become increasingly important. Adolescents spend less time at home and longer hours at school which provides them increasing opportunities and requirements to interact with others, such as peers, teachers, and romantic partners (Barber 2005). This makes secondary school a potentially good target for interventions to foster youth’s intra- and interpersonal domains. In the present meta-analysis, we therefore examined the effects of universal secondary school-based interventions on students’ intrapersonal and interpersonal domains.

School-based interventions addressing adolescents’ intra- and interpersonal domains typically show small positive effects (e.g., effect sizes (Cohen’s d) ranging from 0.03 to 0.24; Dray et al. 2017; Durlak et al. 2011; Jiménez-Barbero et al. 2016). One way to increase our understanding of when interventions are most effective is by studying which components are related to intervention effects. If we can identify components associated with stronger (or weaker) intervention effects, this could help generate hypotheses about the components that drive intervention effects, and thus about how interventions could be improved. In addition, schools can make informed decisions about which intervention to implement, by selecting interventions based on the evidence base for the components. As a first step towards generating hypotheses about components that drive intervention effects, Boustani et al. (2015) listed components that are most frequently included in effective school-based interventions (e.g., problem solving, psychoeducation). Although such a frequency count provides a useful overview, it does not show whether the effectiveness of interventions relates to the presence of the components. Furthermore, due to the focus on effective interventions, the overview cannot identify components related to weaker intervention effects. The present meta-analysis statistically tested which components are related to stronger or weaker intervention effects.

In the literature, typically three types of components are distinguished: Content, instructional, and structural components. Content components are specific skills adolescents learn to promote positive outcomes, such as emotion regulation and problem solving (Boustani et al. 2015), i.e., “what they learn.” Instructional components are techniques and methods of information delivery used by the intervention facilitator, such as cognitive restructuring and modeling (Boustani et al. 2015), i.e., “how they learn it.” Structural components describe the structure of the intervention that might impact results, such as the number of sessions and whether or not parents are included in the intervention (Lee et al. 2014), i.e., “how the intervention is set up.” By examining all three types of components, we strive to improve our understanding of whether a certain type of component is particularly associated with intervention effects.

Various meta-analyses have successfully identified intervention components that predict intervention effects (e.g., De Mooij et al. 2020; De Vries et al. 2015; Kaminski et al. 2008; Van der Put et al. 2018), but few meta-analyses have focused on components of school-based interventions. Prior meta-analyses that did examine components of school-based interventions focused on substance use, sexual risk behaviors (e.g., pregnancy, STD/HIV) and/or nutrition (see for a review of reviews Peters et al. 2009). For example, Onrust et al. (2016), focusing on substance use, found that components that sought to stimulate students’ self-control and problem solving, and components that included cognitive restructuring, adjusting social norms (e.g., peer education), and parental involvement predicted stronger substance use reductions. Hennessy and Tanner-Smith (2015), focusing on alcohol use, found that components that included an individual and motivational enhancement approach predicted stronger alcohol use reduction.

In the present meta-analysis, we examined which components are related to stronger (or weaker) school-based intervention effects on students’ intra- and interpersonal domains. We focused on a broad range of outcomes for two reasons. First, many school-based interventions aim to enhance multiple aspects of students’ development (e.g., promoting self-efficacy, psychological wellbeing, and life satisfaction: Gigantesco et al. 2015; bullying, aggression, and wellbeing across various domains: Bonell et al. 2018). We wanted to align with this approach in our meta-analysis. Second, some components may be related to intervention effects on some outcomes, but not on others. Unraveling these differential associations may provide insight in the extent to which interventions need to be matched to specific problems.

We studied relations between components and intervention effects across different populations (e.g., socio-economically advantaged students, predominately ethnic minority students). Although the effectiveness of components may depend on participant characteristics, any moderator effects by participant characteristics were beyond the scope of this study—our goal was to provide a first overview of which components are related to intervention effects. We analyzed all three types of components (i.e., content, instructional, and structural) and tested whether interventions with a specific component showed larger (or smaller) effect sizes than interventions without that component, using multilevel meta-regression. This enabled us to identify not only which components were associated with stronger effects, suggesting potential effective components, but also components associated with weaker effects, suggesting potential ineffective components. Knowing what does not work is equally important as knowing what does work (e.g., Poulin et al. 2001; Werch and Owen 2002).

Concerning content components, based on the results of Onrust et al. (2016) and Boustani et al. (2015), we hypothesized that basic life skills and self-awareness would be related to stronger intervention effects on students’ intra- and interpersonal domains. Basic life skills refers to abilities for adaptive and positive behavior to deal with demands and challenges of everyday life (World Health Organization 1997). Several reviews suggest the importance of basic life skills, such as problem solving, assertiveness, and social skills, for a range of outcomes of effective school-based interventions (e.g., intra- and interpersonal domains, Boustani et al. 2015; drug use, Cuijpers 2002). Self-awareness indicates a realistic and accurate assessment of one’s strengths and norms, and is related to improvements on the interpersonal domain (e.g., Shek and Leung 2016). Raising self-awareness, such as insight building and self-efficacy, is often used in effective interventions targeting the intrapersonal and interpersonal domains (Boustani et al. 2015).

For instructional components, we hypothesized that components using a more active learning approach, in which students interact with each other and perform tasks (e.g., practicing through role-play), would be related to stronger intervention effects. Active learning approaches have consistently been related to stronger effects. For instance, Kaminski et al. (2008) found in their meta-analysis that parenting interventions in which parents practiced the learned skills were more effective than interventions that did not include practice. Similarly, Cuijpers (2002) concluded in his review of school-based drug interventions that interventions using more active methods (e.g., discussion) were more effective than interventions using more passive methods (e.g., didactic instruction).

Regarding structural components, the general assumption is that longer and extensive interventions are more effective than briefer and less extensive interventions (Yeager and Walton 2011). The evidence, however, is conflicted. Some meta-analyses showed that longer and extensive interventions are indeed more effective. For instance, interventions showed stronger effects as the time span, number of sessions, and involved persons (i.e., whole school, parents) increased (De Vries et al. 2015; Ttofi and Farrington 2011). Other meta-analyses, on the other hand, showed that briefer and less extensive interventions are more effective (i.e., “less is more”). For instance, interventions showed stronger effects when the time span was short, the number of sessions limited, and no additional services were provided (Cuijpers 2002; Kaminski et al. 2008; Van der Put et al. 2018). Longer and extensive interventions require more time and effort to implement with fidelity (Bakermans-Kranenburg et al. 2003); resources that may lack in many schools. Given that findings of previous research concerning structural components are inconclusive, examining the relations between these components and intervention effects was explorative.

In summary, identifying components related to stronger or weaker intervention effects has both theoretical and practical implications. First, it expands our knowledge concerning interventions. We begin to unravel, based on associations between components and intervention effects, what is more important to change students’ intra- and interpersonal domains: What they learn, how they learn it, or how the intervention is set up? Second, it helps schools to make informed decisions about which intervention to implement and catalyzes hypotheses generation about how interventions may be optimized. We hope this knowledge can be used as a first step towards improving the effectiveness of school-based interventions addressing students’ intra- and interpersonal domains.

Method

Inclusion and Exclusion Criteria

We sought to include evaluations of universal secondary school-based interventions addressing students’ intrapersonal and interpersonal domains. Universal secondary school-based interventions were defined as interventions delivered to students during regular school hours, targeting all students (Mychailyszyn et al. 2012; Peters et al. 2009). The intrapersonal domain was defined as managing one’s own feelings, emotions, and attitudes pertained to the individual self (Barber 2005) in which one can experience competencies (e.g., resilience, self-esteem, self-regulation, general wellbeing) and problems (e.g., internalizing behavior). The interpersonal domain was defined as the ability of an individual to build and maintain positive relationships with others and understanding social situations, roles and norms, and respond appropriately (Pellegrino and Hilton 2012; Shek and Leung 2016) in which one can experience competencies (e.g., sexual health, social competence, positive school climate) and problems (e.g., aggression, bullying). Although one’s development in these two domains has been associated with psychopathology, the domains and psychopathology are not opposite ends of the same continuum (e.g., Girard et al. 2017). In the current meta-analysis, we focused on interventions stimulating the development in the intra- and interpersonal domains rather than preventing psychopathology.

Studies were eligible for review when (1) the intervention was implemented in a regular school (i.e., not in special education), (2) the intervention was implemented during regular school hours in a group setting, (3) the intervention was aimed at improving (subdomains of) the intra- and/or interpersonal domain (i.e., interventions primarily aiming to improve students’ physical health (e.g., prevention of substance use, nutrition, pregnancy, STDs) or prevent psychopathology (e.g., depression) were excluded.), (4) the intervention was universal, so targeting all students, (5) the participants were in middle school or high school (Grades 6–12), (6) the study included a control group, (7) the study included a quantitative baseline and post intervention measurement of (subdomains of) the intrapersonal domain and/or interpersonal domain, (8) sufficient information concerning baseline and post intervention measurements was reported, or obtained after contact with the author, so that effect sizes could be calculated post intervention, corrected for baseline differences, (9) the study was written in English, and (10) the study was published as article, book, or book chapter. Research has shown that including unpublished studies does not reduce the possible impact of publication bias and is sometimes even counterproductive due to selection bias (Ferguson and Brannick 2012).

Literature Search

We searched four databases (i.e., PsycINFO, PubMed, ERIC, and CENTRAL). With these databases we searched the psychological, medical, and educational literature, and (quasi-)randomized controlled trials specifically. The search was not restricted to a time period. Search terms were used to elicit school-based interventions (e.g., school, class), interventions (e.g., prevention, intervention), adolescents (e.g., adolescent, youth), and intra- and interpersonal outcomes (e.g., self-esteem, social competence). Because these search terms led to an extremely high number of studies, we added some restrictions to the search, to avoid picking up interventions targeting other populations (e.g., preschool, clinical) or domains (e.g., substance use, lifestyle) than targeted in this study. The complete list of search terms is provided in “Appendix A”. This search (April 2019) resulted in 6102 studies in PsycINFO, 2964 studies in PubMed, 1683 studies in ERIC, and 567 studies in CENTRAL. Removal of duplicates resulted in 9,498 unique studies. In addition, reference lists of included studies and identified relevant reviews and meta-analyses were searched. This resulted in 22 additional studies.

All studies identified by the search were first screened for eligibility based on their title and abstract. Title and abstract were reviewed to assess whether the study met inclusion and/or exclusion criteria. We specifically focused on information where the intervention was conducted (e.g., school-based), at what type of school the intervention was conducted (e.g., middle/high school), and whether the intervention was implemented during regular school hours (rather than after school). Based on this screening 9,068 studies (95%) were excluded. The remaining 429 studies were studied full-text. In this second screening phase another 310 studies (72%) were excluded. See Fig. 1 for the flow diagram.

Fig. 1
figure 1

Flow diagram

In order to assess reliability of these two screening phases, a research assistant independently screened a random selection of 9% (800 studies) of all identified studies concerning the first screening phase and in the second screening phase 10% (45 studies) of the remaining studies. The research assistant judged the relevance of the studies for the current meta-analysis based on the inclusion and exclusion criteria. Reliability was substantial, with 98% agreement (Cohen’s κ = 0.71; Landis and Koch 1977) regarding the first screening phase and 89% agreement (Cohen’s κ = 0.76; Landis and Koch 1977) regarding the second screening phase. Any disagreements between the researchers concerning inclusion were solved through discussion.

Data Extraction

Studies were coded for information concerning the study (e.g., year of publication, country where study was conducted), sample (e.g., age, gender distribution), design and method (e.g., randomization, attrition analyses), intervention (e.g., intervention provider, aim of intervention), effect size data (e.g., outcome category), and intervention components (e.g., problem solving, practice, parental involvement). The intervention components were primarily based on the meta-analysis by Boustani et al. (2015) who, in turn, based their components on the PracticeWise Clinical Coding System (PracticeWise 2009). Additionally, we reviewed the components of the reviews and meta-analyses of Kaminski et al. (2008), Onrust et al. (2016), Peters et al. (2009), and Van der Put et al. (2018) to further strengthen the theoretical base of the components. Based on this broad theoretical basis, we adjusted some components of Boustani et al. (2015) due to highly overlapping content and co-occurrence (i.e., we combined communication skills and social skills; we combined cognitive coping and coping skills; emotion regulation contains anger management; practice contains role-play) and we deleted some components due to low frequency (i.e., civic responsibility, support networking). An overview of all components and their definitions is presented in “Appendix B”. Sources cited in the study and other freely available materials, such as descriptions from the developer or websites, were retrieved for coding the components (Boustani et al. 2015; Kaminski et al. 2008). In cases where insufficient data were reported for calculating the effect size, the first author was contacted. When this author had not responded after a reminder, the second or last author was contacted and, if necessary, reminded. If the required data could not be obtained after this, the study was excluded from the meta-analysis (see Fig. 1 for the flow diagram).

Of the included studies, 28% (30 studies) was coded independently by a second coder for reliability. The inter-rater reliability was moderate to excellent (Landis and Koch 1977) with an average intra-class-correlation of 0.97 (SD = 0.05), ranging from 0.88 to 1.00, for continuous variables, and an average Cohen’s kappa of 0.82 (SD = 0.11), ranging from 0.60 to 1.00, for categorical variables (see “Appendix C” for the reliability per individual code). Coding of the component ‘Insight building’ was not reliable with Cohen’s kappa of 0.52. Disagreements between the two coders were discussed and solved unanimously.

Calculation and Analyses of Effect Sizes

Effect sizes were represented as Cohen’s d, reflecting the standardized mean difference between the intervention and control condition, following the procedures of Lipsey and Wilson (2001):

$${\text{Cohen's }}d = \frac{{\bar{X}_{{G1}} - \bar{X}_{{G2}} }}{{s_{p} }}$$

Effect sizes were calculated at post intervention (i.e., within 6 months after the intervention) and corrected for baseline differences. Positive effect sizes indicated better results for the intervention compared to the control condition. All effect sizes were adjusted using the Hedges’ (1983) small sample correction prior to analyses:

$$\text{Hedges' adjusted effect size }= d *\left(1- \frac{3}{4N-9}\right)$$

Outliers were examined and, when believed to be unrepresentative, winsorized by replacing outliers with the value of the lower or upper value of two standard deviants from the mean (Lipsey and Wilson 2001).

Publication Bias

As commonly known, studies with nonsignificant or negative results are less likely to be published than studies with significant or positive results. The risk of publication bias was tested using a funnel plot. A funnel plot is a scatter plot in which the effect sizes are plotted against their precision (i.e., standard error). It is assumed that the effect sizes of the studies are symmetrically distributed around the true effect size, with more precise effect sizes (typically those from larger studies) at the top of the funnel and less precise effect sizes (typically those from smaller studies) at the base of the funnel. Asymmetry in the funnel plot can be an indication of publication bias (Light and Pillemer 1984). Whether or not a funnel plot is asymmetrical can be statistically tested with Egger’s regression test (Egger et al. 1997). When the funnel plot is asymmetrical according to Egger’s regression test the trim-and-fill analysis (Duval and Tweedie 2000a, 2000b) can be used to adjust the effect for possible publication bias. This analysis estimates how many studies fall outside the symmetric part of the funnel plot and trims this outlying part. With the remaining symmetric funnel plot the true center of the funnel is estimated. The trimmed studies and their missing counterparts are replaced in the funnel representing imputed ‘missing’ effect sizes. Based on this filled funnel plot, the corrected mean is estimated resulting in an adjusted effect size. Tests to visualize and examine publication bias assume independence of effect sizes, which is not the case in multilevel meta-analyses. We took this violation into account by using the variance of the effect sizes as a moderator in Egger’s regression test.

Analyses

We calculated an effect size for each reported measure of the intra- or interpersonal domain. To account for the clustering of effect sizes within a trial, we used multilevel meta-analytical models with three levels: Sampling variance around each effect size (level 1), variance between effect sizes within studies (level 2), and variance between studies (level 3; Assink and Wibbelink 2016; Van den Noortgate et al. 2013).

The unit of analyses were the interventions rather than the publications, since we are interested in the effectiveness of the intervention compared to the control condition. When one publication reported on two interventions, both interventions were included and analyzed separately. When multiple publications reported on the same intervention, evaluated in different studies with different samples, their effect sizes were analyzed together, clustered within the same intervention. When multiple publications reported on the same intervention, evaluated in the same study with the same sample, we coded the most comprehensive publication; the less comprehensive publication was checked for additional information and their effect sizes were analyzed together, clustered within the same intervention.

The multilevel analyses were conducted in R using the metaphor package (Viechtbauer 2010). First, the overall effects of universal school-based interventions on students’ intrapersonal and interpersonal domains were estimated in separate models. Methodological rigor was assessed to examine how well the overall effect sizes reflected the effects of the intervention rather than methodological influences or biases (Lipsey and Wilson 2001). Based on the Cochrane Risk of Bias 2.0 tool for Cluster Randomized Trials (Higgins et al. 2016) randomization (random vs. quasi-random assignment) and completeness of outcome data (percentage of drop-out) were analyzed as covariates. Additionally, the type of comparison group (passive: No intervention/waitlist vs. active: Care as usual/other intervention) was examined as covariate to examine absolute versus relative effects of the interventions. Characteristics of methodological rigor that predicted the overall effect sizes were included as covariates in further analyses.

To analyze which components were associated with stronger or weaker intervention effects, moderation analyses were conducted. Moderation analyses were conducted only if both levels of the moderator (i.e., component present or not) contained at least three effect sizes (Crocetti 2016). Note that these moderation analyses are based on correlations between interventions and effects. Besides significant effects (p < 0.05), effects with a trend towards significance (p < 0.10) were reported. These trends contribute to the hypotheses generation nature of the meta-analysis and provide an indication to what extent moderation by a certain component for a certain outcome can be generalized to other outcomes.

Results

Descriptive Characteristics

The present meta-analysis included 104 publications reporting on 99 unique interventions. In total, 529 effect sizes were extracted from the publications comparing the intervention with the control condition on the intrapersonal domain (k = 218) or the interpersonal domain (k = 311). Four effect sizes were extreme outliers, more than four standard deviations above the mean. All were derived from the same study (Haynes and Avery 1979) and believed to be unrepresentatively high. These four effect sizes were therefore winsorized.

The studies, published between 1979 and 2019 (Median publication year: 2013), were conducted in the USA (k = 36), Canada (k = 2), Europe (k = 45), Australia (k = 7), Asia (k = 13), and Africa (k = 1). Most studies randomly assigned participants to the conditions (k = 70). In 47 studies, the intervention group was compared to an active control group (i.e., Care As Usual or another intervention). The other 57 studies compared the intervention group to a passive control group (i.e., waitlist or no intervention). In total, the studies comprised 97,884 participants with an average age of 13.70 years (SD = 1.50) at the start of the intervention and mean sex distribution of 49% boys (SD = 16.43). Of the studies reporting ethnicity (59%), participants represented mostly ethnic majority in 59% of the studies, mostly ethnic minorities in 28% of the studies, and mixed ethnic majority and minorities in 13% of the studies. The mean drop-out rate of participants was 12.33% (SD = 10.65). The interventions consisted on average of 14 sessions (SD = 15.35) with a time span of 22.55 weeks (SD = 33.59). Roughly half of the interventions were provided by teachers (k = 46) and the other half (k = 58) by professionals. In addition, in 6 interventions the intervention was (also) provided by a peer. “Appendix D” provides the key characteristics of the included studies.

Overall Effect Sizes

Interventions had a small positive effect on students’ intrapersonal domain [d = 0.19, 95% CI (0.13; 0.25)]. More specifically, the positive intervention effects for self-esteem and self-regulation were somewhat stronger than for internalizing problems and wellbeing. No significant intervention effect was found for resilience (see Table 1). Interventions also had a small positive effect on students’ interpersonal domain [d = 0.15, 95% CI (0.10; 0.19)]. The magnitude of intervention effects was fairly similar for aggression, sexual health, social competence, and bullying. Interventions showed the strongest positive effects on school climate. However, this effect did not reach significance due to the small number of effect sizes for this subdomain. Definitions of the two general domains and the subdomains are provided in “Appendix E”.

Table 1 Effectiveness of interventions targeting the intra- and interpersonal domains

Publication Bias

For both the intra- and interpersonal domains, the distribution of effect sizes appeared to be symmetrical (Egger’s regression test: Intrapersonal z =  − 0.22, p = 0.826; Interpersonal z = 0.17, p = 0.862; see Fig. 2), indicating that there was low risk of publication bias.

Fig. 2
figure 2

Funnel plot of effect sizes concerning the intrapersonal (left) and interpersonal (right) domains

Intervention Components Related to Intervention Effects

Preliminary Analyses

Interventions targeting students’ intrapersonal domain (see Fig. 3) and those targeting students’ interpersonal domain (see Fig. 4) shared many commonly used components. Most commonly used content components are teaching students social skills, emotion regulation, and insight building. Most commonly used instructional components are implementing discussions, practice, and didactic instruction. The most commonly used structural component is additional individual guidance during the intervention.

Fig. 3
figure 3

Frequencies of components applied in interventions targeting the intrapersonal domain and subdomains

Fig. 4
figure 4

Frequencies of components applied in interventions targeting the interpersonal domain and subdomains

Concerning methodological rigor, whether or not participants were randomized, drop-out rate, and type of comparison group were not related to effect sizes concerning the intrapersonal domain or the subdomains. Whether or not participants were randomized was related to effect sizes concerning the interpersonal domain; randomized studies yielded stronger effects. Percentage of drop-out was related to effect sizes concerning social competence; studies with lower drop-out rates yielded stronger effects. Whether or not participants were randomized and drop-out rates were also related to effect sizes concerning bullying; randomized studies and higher drop-out rates yielded stronger effects. Therefore, randomization and drop-out were added as covariates when it were significant predictors of the effect size in the moderation analyses concerning the interpersonal domain, social competence, and bullying.

Intrapersonal Domain

Of the ten content components, none were significantly related to intervention effects on students’ intrapersonal domain in general (see Table 2). However, there was a trend that teaching emotion regulation was associated with weaker intervention effects on the intrapersonal domain overall, and specifically on self-esteem. For the subdomains, teaching assertiveness was associated with weaker effects on internalizing problems. Furthermore, there was a trend that insight building was associated with stronger effects on resilience.

Table 2 Effect sizes of interventions with and without components targeting the intrapersonal domain

Of the eight instructional components, practicing during the intervention was related to significantly stronger intervention effects on students’ intrapersonal domain overall. None of the other components were significantly related to intervention effects on students' intrapersonal domain in general.

Of the five structural components, none were related to intervention effects on students’ intrapersonal domain in general. For the subdomains, associations with stronger effects were found on internalizing problems when the whole school was involved and when the intervention had more sessions.

In sum, interventions that included insight building, where students practiced during the sessions, that involved the whole school staff, and those that had more sessions showed stronger effects for stimulating subdomains of the intrapersonal domain than interventions that did not include these components. Interventions that taught emotion regulation and assertiveness showed weaker effects for stimulating subdomains of the intrapersonal domain than interventions that did not include these components.

Interpersonal Domain

Of the ten content components, teaching problem solving was related to stronger intervention effects on students’ interpersonal domain in general, specifically for interventions targeting bullying and school climate (see Table 3). Teaching cognitive coping was related to weaker intervention effects on students’ interpersonal domain overall, and specifically for interventions targeting bullying. Furthermore, there was a trend that insight building was associated with stronger intervention effects on students’ interpersonal domain overall. The other components were not related to intervention effects on the interpersonal domain in general. For the subdomains, insight building was associated with stronger effects on social competence and bullying. Teaching emotion regulation and assertiveness were associated with weaker effects on respectively bullying and aggression.

Table 3 Effect sizes of interventions with and without components targeting the interpersonal domain

Of the eight instructional components, none of the components were related to intervention effects on the interpersonal domain in general. Regarding the subdomains, using multimedia was related to stronger effects on social competence. In addition, there were trends that using didactic instruction and relaxation were associated with stronger effects and using discussion had weaker effects on aggression.

Of the five structural components, none were related to intervention effects on students’ interpersonal domain in general. Concerning the subdomains, three components were related to stronger intervention effects. Interventions that included more sessions were associated with stronger effects on bullying. Interventions that involved the whole school were associated with stronger effects on bullying and school climate. Interventions with additional individual guidance were associated with stronger effects on school climate and showed a trend that it was related to stronger effects on aggression. Furthermore, there was a trend that interventions that involved parents were associated with stronger effects on school climate and that interventions that included more components were related to stronger effects on bullying.

In sum, interventions that taught insight building, and problem solving, used didactic instruction, relaxation, and multimedia, involved the whole school and parents, included additional individual guidance, more sessions, and more components showed stronger intervention effects for stimulating subdomains of the interpersonal domain than interventions that did not include these components. Interventions that taught emotion regulation, and assertiveness, and applied cognitive coping, and discussions showed weaker intervention effects for stimulating subdomains of the interpersonal domain than interventions that did not include these components.

Discussion

It is important to understand the intervention components that contribute to intervention effectiveness, or ineffectiveness, in order to guide intervention selection and implementation. Schools strive to improve their students’ wellbeing, but their time and resources to invest in interventions are limited. This meta-analysis aimed to identify the intervention components that contribute to the effectiveness of universal secondary school-based interventions aiming to stimulate students intra- and interpersonal domains. In line with previous meta-analyses examining universal school-based interventions, we found small positive effects on students’ intra- and interpersonal domains (e.g., Dray et al. 2017; Durlak et al. 2011; Jiménez-Barbero et al. 2016). Overall, none of the discrete components were consistently related to stronger or weaker effects on both students’ intra- and interpersonal domains across the subdomains. In other words, components that were related to stronger or weaker intervention effects typically were so for more specific domains, highlighting the importance of matching intervention to specific competencies or problems. In terms of the type of components that matters most, content components seemed more relevant for stimulating both the intrapersonal domain (e.g., internalizing behavior) and the interpersonal domain (e.g., bullying). Importantly, components related to stronger intervention effects were not necessarily frequently implemented in interventions (e.g., in 10–19% of the interventions). Similarly, components related to weaker intervention effects were generally implemented frequently (e.g., in 40–53% of the interventions).

Content components teaching students self-awareness (i.e., insight building) and problem solving were related to stronger effects, whereas components teaching emotion regulation, assertiveness, and cognitive coping were related to weaker effects. These findings are in line with previous research that indicated teaching self-awareness and problem solving as potential effective components (e.g., Boustani et al. 2015). Teaching emotion regulation, assertiveness, and cognitive coping might be more relevant in different contexts than the secondary school context in which the interventions were implemented. For instance, emotion regulation might be more relevant when implemented in psychotherapy (e.g., Weisz et al. 2012) and teaching assertiveness might be more relevant for students at elementary schools (e.g., Onrust et al. 2016). Teaching cognitive coping was related to weaker effects on interpersonal competences in general. This finding was somewhat surprising, given that cognitive coping is considered an effective component in other interventions as Cognitive Behavioral Therapy (Yovel et al. 2014) with well-trained therapists (Kobak et al. 2017). One possible explanation may be that school-based interventions, are often implemented by teachers who only received a short training (e.g., Challen et al. 2014) and have no to little experience in teaching cognitive coping. Taken together, content components might be differentially related to intervention effects in different contexts.

In general, instructional components that reflect an active learning approach were related to stronger intervention effects (e.g., relaxation, practice). This does not mean that interventions should only use active learning approaches and exclude more passive learning approaches. Interventions that used discussion as method delivery, an active learning component, were related to weaker intervention effects on aggression, whereas interventions using a didactic information delivery as method, a passive learning approach, were related to stronger intervention effects. These findings are in line with the meta-analysis of De Mooij et al. (2020) that showed that psychoeducation was related to stronger effects of Social Skills Training interventions. Using didactic instruction might fit better in the school context than using discussion. In a didactic instruction approach, the emphasis is on knowledge transfer between the teacher and the students, whereas a discussion approach is more dependent on the students and the social skills and cohesiveness of the group. Teachers might be less equipped to prevent a discussion from sidetracking than to teach psychological constructs (Horne et al. 2007).

The results concerning structural components showed that longer and more extensive interventions (e.g., involving parents and the whole school) were more effective for targeting system level outcomes such as school climate. Long-term and extensive interventions might be more effective when the intervention aims to increase students’ feelings of safety at school. By targeting multiple systems in which the students are involved (e.g., school, family) teachers and parents might become more sensitive for problems students encounter, such as bullying or problematic relations with peers (Ttofi and Farrington 2011) and a broad range of risk factors is addressed (Trip et al. 2015). For interventions targeting the individual level such as self-esteem, more extensive interventions were not related to stronger effects nor were less extensive interventions related to weaker effects. Based on these findings, less extensive interventions might be preferred to stimulate the intrapersonal domain due to the easier implementation (Bakermans-Kranenburg et al. 2003), while more extensive interventions may be better suited to stimulate the interpersonal domain.

Furthermore, our results showed that components related to stronger intervention effects were not necessarily commonly implemented. For instance, interventions that involved the whole school were related to stronger effects on internalizing problems, bullying and school climate. However, only 10% to 19% of the included interventions involved the whole school. In contrast, some components that were related to weaker effects are implemented more often. For example, teaching emotion regulation, included in 40% to 53% of the interventions, was related to weaker effect sizes on the intrapersonal domain in general, self-esteem, and bullying. Our frequency counts of components are in line with the frequency count by Boustani et al. (2015) of effective school-based interventions. These findings indicate that it is important to critically consider which components to include in an intervention and to not simply “do what previously has been done.”

Several limitations merit attention. First, we tested associations between components and intervention effectiveness. Based on these associations, we cannot state whether specifically these components are (in)effective or whether other components confounded with that specific component accounted for the association. Moreover, the analyzed components were not implemented in isolation, but in the context of an intervention program consisting of multiple components. Interactions among components can affect their effectiveness. In addition, it remains unclear how the components were implemented, how much time was allotted to certain components, and what the quality of implementation of the component was. These aspects could also influence components’ effectiveness. This meta-analysis should therefore be regarded as hypothesis generating; our results give future research indications which components are interesting to examine further. Future research should test causal individual and synergistic effects of components, and potential order effects of components. Second, the coding of components depended on the sufficiency of the intervention description in the included studies; if a component was not mentioned in the article, or other freely available information concerning the evaluated interventions, it was coded as not present. At the same time, components that are formally part of the intervention, and therefore reported and coded as such, may not necessarily be implemented. It might be that some components were thus coded as “present” while they were not actually implemented. Third, even though we included more than 500 effect sizes, some components (e.g., peer resistance, parental involvement) were less frequently implemented in interventions than other components (e.g., practice, discussion) resulting in better powered analyses for some components than for others. Last, our outcome categories, the two general domains as well as the specific competencies and problems, were relatively broadly defined. On the one hand, these broad outcome categories are a good representation of the broad range of problems that may be present in the heterogeneous student population which schools aim to address with these universal interventions. On the other hand, the outcome categories may be less sensitive to change and could mask associations between specific components and specific outcomes.

In conclusion, when designing and implementing universal school-based interventions, and especially when no rigorous evidence base for the intervention is available, it is important to consider the evidence base of its included components. Some components are often implemented in interventions without being actually related to stronger intervention effects. In fact, some commonly implemented components (e.g., emotion regulation, discussion) were related to weaker intervention effects in our meta-analysis. Vice versa, some components that were related to stronger intervention effects (e.g., involvement of the whole school or parents) were only rarely included in interventions. Thus, it is essential to examine the evidence base of components before including it in an intervention, and to not solely focus on which components have been included in previous interventions. Selecting an intervention for implementation is complex and stakeholders need to take numerous factors into account (e.g., training of providers, match with the context, required resources). We hope our findings can contribute to this process by informing stakeholders which components may be important to be included in an intervention when aiming to address certain competencies or problems; schools can look up which components are associated with stronger effects on outcomes relevant for them and take this evidence-base into consideration in their decision. This meta-analysis provides an empirical foundation for the evidence base of components related to stronger and weaker effects for universal school-based interventions addressing the intra- and interpersonal domain.