School is an important setting for children to meet and socialize with their age-mates and make friends. For many children, school is the only place where they can have peer interactions (Buchanan et al., 2022; Hoffman & Miller, 2020). By interacting with peers, children not only learn the social skills that prepare them for adulthood, such as how to negotiate and collaborate with others (Veiga et al., 2017); but also develop a sense of belonging to the school environment (Allen et al., 2021), which contributes to their psychosocial wellbeing in later life (Palikara et al., 2021; Tian et al, 2016).

However, for many autistic children, socializing in school is no easy task and could even be a major source of stress (Rieffe et al., 2021; Williams et al., 2019). They face many challenges when joining or initiating an interaction with peers (Brewster and Coleyshaw, 2011; Kasari et al., 2012), are often excluded or ignored by allistic (i.e., non-autistic) peers (De Boer & Pijl, 2016; Dean et al., 2014), and the physical environment could simply be too arousing for them to comfortably participate in group activities, such as the playground being too crowded or the hallway being too noisy (Corbett et al., 2014; Mayes et al., 2013; Rieffe et al., 2021; see Bailey and Baker (2020) for a review on the barriers). These challenges reflect the fact that the needs of autistic pupils are not well considered in the organization and design of current school environments, while the allistic preferences for social interaction are promoted. Special considerations are warranted to make schools more welcoming for autistic children. Such considerations are becoming more urgent with the global trend towards inclusive education over the past decade, which means that more and more children with special needs, such as autistic children, are attending mainstream schools.

School-based interventions for the inclusion of autistic pupils generally address one of four levels in a school environment. First, interventions targeting the autistic children (the child level) usually aim at improving the social skills of autistic children, by giving therapist-led training sessions in schools where autistic pupils learn skills to initiate and maintain an interaction based on allistic norms of social interaction (e.g., Dean et al., 2020, Feng et al., 2008, Laushey et al., 2009). Second, interventions that also involve allistic peers (the peer level) often promote autistic pupils’ interactions with their peers by training allistic peer partners or “coaches” to accompany or include autistic children during activities (e.g., Hughes et al., 2013b; Thiemann and Goldstein, 2004), or by forming peer groups with whom autistic pupils regularly meet to discuss school life issues and allistic social rules (e.g., Hart and Banda, 2018; Schaefer et al., 2018). Third, interventions that further involve the school staff (the staff level), e.g., the teachers or the paraprofessionals, usually aim to provide knowledge and training to adults who work directly with autistic children in school, so they acquire the skills to facilitate the interactions between autistic pupils and the peers (e.g., Locke et al., 2019; Kretzmann et al., 2015). Fourth, a small number of intervention programs addresses the physical environment of the classroom or school (the physical environment level), for example, by renovating the school playground with features that encourage autistic children to play together with peers (e.g., Baker et al., 1998), or by changing the seating arrangement to allow allistic peers to have opportunities to be in contact with autistic children (e.g., Chung & Douglas, 2015).

These interventions all tackle a different level of autistic children’s inclusion in schools, but a combined knowledge base is lacking. Previous review studies mostly centered around a single level (e.g., child level: Camargo et al., 2014; Dean & Chang, 2021; peer level: Ezzamel, 2016; Watkins et al., 2015). Although some studies did include multiple levels in their reviews, they primarily focused on the effects of these levels on educational and behavioral functioning, such as academic performance, social skills, and problem behaviors (Lang et al., 2011; Leifler et al., 2021; Watkins et al., 2019). To the best of our knowledge, Sutton et al. (2019) and Whalon et al. (2015) are the only review studies that focused on peer interaction. However, the outcome measures investigated in those reviews either involved only the quantity of peer interactions, i.e., the frequency and duration of social initiations and responses, but not on the quality of these interactions; or included only the social behaviors from the autistic pupils to their peers, rather than the other way around. In other words, thus far the review studies seem to focus on the extent to which interventions allow autistic pupils to “fit in” to allistic peer interactions, and meta-analyses also confirmed moderate-to-strong effect sizes of such interventions that increased autistic pupils’ skills to meet allistic standards of socialization towards their peers (Watkins et al., 2019; Whalon et al., 2015). However, these reviews may not necessarily reflect increased social inclusion of these pupils. This gap in our knowledge prevents us from drawing up a more holistic strategy to address social inclusion of autistic children in mainstream schools.

This current study, in the form of a systematic review, aimed to investigate i) what school-based interventions were available in the evidence base of journals that were designed to enhance autistic pupils’ social inclusion in primary and secondary mainstream schools, ii) at which level of the school environment they targeted at, and iii) the extent to which they were effective. Included interventions should have been designed to target at least one of the four levels of the school environment, i.e., the child-level (the autistic children), the peer-level, the staff-level, and the physical environment level, and adopted a design that allows for an indication of changes in autistic children’s quantity or quality of social participation when an intervention is applied. By synthesizing the knowledge available on this topic, the approaches at each level for socially including autistic children in schools and their effects can be summarized, allowing for a more holistic strategy to be implemented in the school settings.

Methods

Literature Search

The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) Checklist was used to guide the review process in this study (Moher et al., 2010). A search was conducted in April 2021 using four electronic databases, i.e., PsycInfo, PubMed, ERIC, and Web of Science, for all peer-reviewed published/in-press literature. Manual search of reference lists of the retrieved studies was conducted afterwards.

Search terms spanning five areas were used in combination with each other: (1) autism (autis* OR pervasive develop* OR Asperger*); (2) children (pupil* OR child* OR adolescen* OR student* OR youth* OR young* OR school age*); (3) school setting (primary school* OR secondary school* OR elementary school* OR high school* OR schoolyard* OR playground*); (4) intervention (interven* OR train* OR adapt* OR program*); (5) social participation (social* OR interaction* OR participation* OR initiation* OR engagement* OR belong* OR bully* OR friend* OR peer*).

Eligibility Criteria and Study Selection

Given the aim of this review to provide an overview of evidence-based interventions, this review included only studies that have been published in scientific, peer-reviewed journals. The studies should be published in English by the end of April 2021. Gray literature was not included because currently there has not been consensuses regarding how to systematically search for gray studies, include or exclude them in the review process, and evaluate the quality of data from such studies (Martín et al., 2005; Schmucker et al., 2017).

Moreover, a study had to meet the following criteria to be included in this review. First, it involved participants attending primary or secondary schools, and diagnosed with an autism spectrum disorder (ASD; including autism, Asperger syndrome, and pervasive developmental disorder not otherwise specified [PDD-NOS]). Studies were excluded if they were not conducted in primary or secondary schools (e.g., Boyd et al. (2018) only on preschoolers). Given the already wide age range considered in this review, studies were also excluded if they included only pupils above 18 years (although no studies were excluded for this reason). When a study included participants with other diagnoses, it was taken into the review process if results specific to autistic pupils were presented (e.g., Schaefer et al., 2018), and excluded if it reported only aggregated data (e.g., Bailey et al., 2021).

Second, the autistic pupils included in the study were in a mainstream, general education setting, which means that these students shared the school context and activities with allistic peers. Therefore, if a study involved only pupils in a self-contained special education class, it was excluded (e.g., Ackerman et al., 2021, Bambara et al., 2016).

Third, the study examined a school-based intervention (e.g., a program, a training session, or an adaptation) implemented at one or more of the four levels of the school environment: the child, the peers, the staff, and/or the physical environment. The intervention aimed to improve the social inclusion of autistic pupils, with a primary outcome measure for social participation with peers in terms of its quantity (e.g., frequency/duration of social interactions, initiations, or responses; number of friends) or quality (e.g., friendship quality, bullying, peer acceptance/rejection, or school belongingness/loneliness).

Fourth, the study should test the effect of the school-based intervention, by adopting a group design (i.e., with an experimental group of pupils who underwent the intervention, compared to a “treatment as usual” control group) or a single subject design (i.e., pupils serving as their own control, whereby their outcomes were examined and compared between baseline and intervention conditions, with at least one measurement to examine each condition).

Fifth, the methodological quality of the study had to be rated as “strong” or “adequate” (Reichow et al., 2008; see below for more details). Studies rated as “weak” were excluded.

The selection process involved two stages: first, the duplicates were excluded and the titles and abstracts were screened; second, the full texts were reviewed for eligibility. The screening and eligibility check were conducted by two individual coders (the second and third authors of this study). In both stages and throughout the review process, all studies were coded by the two coders individually, and disagreements were discussed between the two coders and a third tiebreaker (the first author) until reaching 100% agreement in biweekly project meetings. A training session took place before each stage for the discussion about the criteria and their definitions, during which five articles were coded iteratively until 100% agreement was reached. With this set-up, 96% and 99% agreement was respectively achieved in the two stages regarding which studies to exclude. The complete review process is presented with a PRISMA flow chart in Fig. 1.

Fig. 1
figure 1

PRISMA flow chart of the selection process

Methodological Quality

The methodological quality of the reviewed studies was assessed using the evaluation protocol for autism research proposed by Reichow et al. (2008). For group research, there are six primary quality indicators (i.e., participant characteristics, independent variables, dependent variables, comparison condition, link between research question and data analysis, and use of statistical tests) and eight secondary quality indicators (i.e., random assignment, interobserver agreement, blind raters, fidelity, attrition, generalization and/or maintenance, effect size, and social validity). A study was considered “strong” if it met all the primary indicators and at least four secondary indicators. When a study met at least four primary indicators and at least two secondary indicators, it was rated as “adequate.” Other studies were considered “weak.”

Single subject research can be assessed with six primary indicators (i.e., participant characteristics, independent variables, dependent variables, baseline condition, visual analysis, and experimental control), and six secondary indicators (i.e., interobserver agreement, kappa, fidelity, blind raters, generalization and/or maintenance, and social validity). When a study met all the primary indicators and at least three secondary indicators, it was deemed “strong.” An “adequate” study met at least four primary indicators and at least two secondary indicators. Other studies were rated as “weak.”

As mentioned, two coders coded all studies based on the provided protocol. After the first round of quality assessment, however, 36% of the studies were deemed “weak” by one coder and “adequate” by the other. Therefore, the third tie-breaker joined the second round of quality assessment. The protocol was reviewed again among the coders to resolve any concerns, and all the studies were discussed one-by-one until 100% agreement was achieved in the biweekly project meetings.

Data Extraction

The two independent coders each extracted data from all the eligible studies. Study characteristics were extracted through open-ended questions and/or forced choice questions. For open-ended questions, agreement was considered achieved when the coders selected the same range of information from the studies. For forced choice questions, agreement was reached when the same option was chosen. Before data extraction, a training session was organized where five articles were coded for practice and 100% agreement must be met. The coded data was discussed one-by-one in regular project meetings. Disagreements occurred in about 14% of the studies, due to doubts about the setting the autistic pupils were in, which measures could be seen as reflecting social participation, and the outcomes being compared. Also when extracting effect sizes, disagreements occurred in 25% of the studies. These disagreements were discussed with the third tie-breaker in regular project meetings until agreement was achieved among all coders.

The data extracted included: i) study characteristics; ii) participants characteristics; iii) setting; iv) measures; v) intervention characteristics; and vi) outcomes. See Table 1 for an overview of the characteristics that the two coders extracted. These characteristics were chosen primarily based on the research question of this study regarding the different levels of the school environment and the effects we aimed to examine. Moreover, they were chosen to meet the requirements of the methodological quality evaluation protocol of Reichow et al. (2008). Previous review studies (e.g., Sutton et al., 2019; Whalon et al., 2015) were also taken into account to finalize the list of characteristics for the extraction.

Table 1 Overview of characteristics extracted for each reviewed study

Results

Study Characteristics

A total of 56 studies met the criteria for the current review (see Table 2 for an overview, and Appendix Table 4 for a complete list of included studies and corresponding outcomes). Among them, 47 (84%) employed a single subject design, while 9 (16%) adopted a group design.

Table 2 Participant characteristics

Almost all studies were conducted on Western samples, in Canada (1), Ireland (1), Spain (1), United Kingdom (1), and United States (51). Only one study was on a non-Western sample, in Taiwan (0.1% of all the autistic children involved in this review; Feng et al., 2008).

Twenty (36%) studies received a “strong” methodological quality rating, while 36 (64%) were rated as “adequate” (see Appendix Table 5 and 6). Overall, the studies eligible for this review described the participants, the intervention, and the outcome measures in detail, conducted data analyses that fit the research questions, and provided information about generalization or maintenance of the targeted behaviors, thus making replications possible.

Table 3 Summary of outcome measures and corresponding outcomes

Participant Characteristics and Settings

A total of 981 autistic children were involved in this review (see Table 2). Among these, 138 (14%) were girls, yet four studies did not specify the sex of the participating autistic children. The sample size of autistic children was variable, from 1 to 197 autistic pupils.

Participants were reported as diagnosed with autism (n = 332), Asperger’s syndrome (n = 19), or ASD (n = 611). Five children did not have an autism diagnosis but met the criteria when assessed with the Autism Diagnostic Observation Scale (ADOS). Fourteen children who had an autism diagnosis when recruited were no longer on the spectrum during the intervention according to ADOS. Also, 50 children were reported to have additional diagnoses, including intellectual disability (n = 28), speech or language impairment (n = 7), attention deficit hyperactivity disorder (ADHD; n = 7), anxiety disorders (n = 2), oppositional defiant disorder (n = 1), Down syndrome (n = 1), seizure disorder (n = 1), hearing loss (n = 1), visual disability (n = 1), and specific learning disability (n = 1). In most studies (n = 37; 66%), the inclusion criteria only entailed an autism diagnosis, without specifying functionality or cognitive levels.

Thirty-five studies were conducted among primary-school students, although two of these studies also included kindergarteners (n = 1 [17% of the sample] in Vincent et al. (2018); and number unknown in Morgan et al. (2018)). Eighteen studies involved only secondary-school students, and three studies included children from both primary and secondary schools (Brock et al., 2018; Hanley-Hochdorfer et al., 2010; Kamps et al., 2002). In addition, two studies reported aggregated results of a sample that also included autistic children from self-contained special classes besides the autistic pupils from mainstream classes (n = 1 [25% of the sample] in Kamps et al. (2014); n = 117 [59% of the sample] in Morgan et al. (2018)).

Outcome Measures

As Table 3 shows, observations were the most used method (n = 54; 96%) for measuring autistic children’s social inclusion in school. Although the instruments used were wide-ranging, the dependent variables can be categorized into four types: initiations, responses, interactions, and engagement. First, a total of 35 studies (63%) examined the frequency and/or quality (valence, appropriateness, verbal or not, and prompted or not) of social initiations, from the target autistic children to their peers (n = 29), and/or from the peers to the autistic children (n = 6). Second, 22 studies (39%) examined the frequency and/or quality of the social responses, from the autistic children to their peers (n = 17), and vice versa (n = 5). Third, 26 studies (46%) looked at the back-and-forth nature of the observed interactions (e.g., counting both initiations and responses; the presence of turn-taking; the presence of reciprocal exchanges), and among them, two studies also investigated the time when the autistic children were alone. Lastly, 22 studies (39%) checked if the autistic children were engaged in an activity, whether jointly with peers (n = 21), solitarily (n = 3), or near other peers but doing a separate activity (n = 1).

Besides observations, peer nominations were conducted in nine studies (16%), where all participating students (autistic and allistic) were asked to nominate several friends or playmates, or to rate a certain aspect about their interaction with a given peer (e.g., whether they like to play with the peer). Through these nominations and ratings, autistic pupils’ sociometric status was measured. Some studies also collected self-report questionnaires from autistic children themselves (n = 1; 2%), or informant-report questionnaires from their parents (n = 2; 4%) or teachers (n = 3; 5%), to examine these children’s social participation in schools or outside the intervention.

Statistical Analyses

Among the 47 studies where a single-subject design was utilized, the majority (n = 43; 91%) conducted visual inspection/analysis and reported the descriptives. In these studies, levels, trend, and variability of data were inspected, and the immediacy of intervention effect, overlap between phases, and consistency of the patterns were examined. Twelve of these studies reported effect sizes, using Tau or Tau-U (n = 4; Kamps et al., 2014; Levy & Dunsmuir, 2020; Mason et al., 2014; Sabey et al., 2020); non-overlap of all pairs (NAP) (n = 4; Block et al., 2015; Radley et al., 2014, 2017; Rodríguez-Medina et al., 2016); percentage of non-overlapping data (PND; PNOD) (n = 2; Carter et al., 2017; Hanley-Hochdorfer et al., 2010); pairwise data overlap (PDO) (n = 1; Laushey et al., 2009); improvement rate difference (IRD) (n = 1; Brain & Mirenda, 2019). Besides visual inspection, one study also used a nonparametric Wilcoxon matched pairs test and reported r statistics for the comparison of pre-post friendship nominations (Rodríguez-Medina et al., 2016); two studies also conducted analyses of variance (ANOVA) (Frederickson et al., 2005; Kamps et al., 2002), where one of them (Frederickson et al., 2005) reported eta square values for effect sizes. Three studies with a larger sample size used (generalized) linear mixed models to model the changes over the different assessments (N of autistic pupils = 31 in Locke et al. (2019); N = 32 in Dean et al. (2020); N = 137 in Kasari et al. (2016)). Two studies among them specified the effect sizes with Cohen’s f (Dean et al., 2020; Locke et al., 2019).

Among the nine studies that adopted a group design, six studies used mainly multi-level modeling techniques (linear mixed models (LMM); hierarchical linear models (HLM)) to analyze the effects, while one study applied a generalized approach with truncated Gaussian models (Shih et al., 2019). In the remaining studies, one study used the analyses of covariance (ANCOVA) (Golzari et al., 2015), while the other used ANCOVA for comparing between groups in peer nominations and HLMs for modeling playground observation data (Kasari et al., 2012). The effect sizes were reported in all of these studies, using Cohen’s d (n = 7; Asmus et al., 2017; Brock et al., 2018; Carter et al., 2016; Kasari et al., 2012; Kretzmann et al., 2015; Lopata et al., 2019; Morgan et al., 2018); Cohen’s f (n = 1; Shih et al., 2019); eta square (n = 1; Golzari et al., 2015).

Type of Interventions and Outcomes

Twelve (21%) studies applied the intervention at child level; nine (16%) at peer level; seven (13%) at staff level; and four (7%) at physical environment level. The remaining 24 (43%) studies implemented the intervention at multiple levels of the school environment: four targeting at both the child and the peers, and one at both the child and the staff; 18 targeting at the peers, but also adding an adult facilitator who was either someone from the research team (n = 6) or a school staff who received training (n = 12); and one covering all of the four levels (Chung & Douglas, 2015).

Child Level

The studies that examined interventions at child level focused on implementing a training program, led by a therapist or the researcher separately from the allistic peers, for improving autistic children’s social skills (see Appendix Table 4). A commonly applied program was the Social Stories (Gray, 2010; Gray & Garand, 1993), which provides specific instructions for behavioral responses, such as how to appropriately join in and maintain an interaction within a defined context, via visual supports and text (Delano & Snell, 2006; Golzari et al., 2015; Hanley-Hochdorfer et al., 2010; Sansosti and Powell-Smith, 2006; all the four studies explicitly mentioned that the guidelines of Carol Gray for the Social Stories were followed). One study further provided teaching scripts to special education teachers in the school for teaching social skills through stories (Bock, 2007). Similarly, programs like the Concept Mastery Routine focus on defining a social concept, e.g., appropriate social initiations, with autistic pupils using visual diagrams (Laushey et al., 2009).

Some studies also included a small group of allistic peers in social skill training. These peers (trained or untrained) jointly acted as a collaborative reading partner after the training sessions (Reutebuch et al., 2015), or as models or role-playing partners when autistic pupils were taught or practiced social skills like joint attention, turn-taking, responding to questions, and maintaining conversations (e.g., the Superhero program, such as in Radley et al. (2014) and Block et al. (2015)). One study involved peers indirectly: autistic children had to nominate a peer to play with and was prompted to do so during recess (Kasari et al., 2016).

As Table 3 shows, following such interventions targeting at social skills, autistic children spent more time interacting with peers, engaged more often in joint activities with peers, and made initiations and responses to peers in a manner that more often met the researchers’ definition of “appropriateness.” Teachers also reported a higher level of social participation in autistic pupils after the intervention. However, these interventions did not seem to improve peers’ responses to the target autistic children. Whether with a group or single-subject design, no changes were observed in peers’ responses to the target (whether the responses were positive, negative, or absent). Similarly, no changes were noted in peer rejection or friendship nominations, as reported from peers.

Peer Level

The interventions at peer level usually involved trainings to teach allistic peers how to interact with an autistic classmate, and these peers were assigned as partners or life coaches to accompany autistic pupils during recess or in a certain school activity (e.g., Brain & Mirenda, 2019; Carter et al., 2016; Thiemann and Goldstein, 2004; see Appendix Table 4). Another common peer-mediated intervention is forming peer networks, where a group of allistic peers were trained beforehand, and teamed up with autistic pupils outside the regular school hours to have discussions on different issues related to social situations and interactions or on shared interests (e.g., Haring and Breen, 1992; Hochman et al., 2015; Kasari et al., 2016). These peer-network interventions were often in combination with an adult facilitator, either the researcher or a teacher in the school. For studies that included a school staff member as facilitator (e.g., Asmus et al., 2017; Carter et al., 2016; Rodríguez-Medina et al., 2016), trainings were given to staff beforehand to demonstrate strategies for facilitating discussions and interactions between autistic children and the allistic peers. In one study, all the allistic peers in a class participated in an adult-facilitated session, in which they discussed friendship and the focal autistic child’s strengths and difficulties (without the focal child’s presence), and afterwards the allistic peers who volunteered were included in the peer network (Frederickson et al., 2005).

As presented in Table 3, in interventions with trained allistic peers as partners who accompanied the autistic children in school activities or during recess, it was found that the reciprocity in peer interactions increased: there were more responses from peers to the target autistic child; more verbal exchanges and turn-taking between the autistic child and the peers; and the interactions were maintained for a longer time. The results of peer nominations also showed that autistic children received more friendship nominations from the peers and were in a more central position in a social group. Nevertheless, there were no clear effects on autistic children’s initiations to peers, and no changes in the friendship nominations given by autistic children. There were also no differences in peer acceptance when trained and untrained peers were both included in the investigation.

When peer-network meetings were held, the studies that included the researcher as the facilitator and those that included a schoolteacher as the facilitator produced similar results. They showed that, outside the peer network sessions, autistic children spent more time in peer interactions and engaging in joint activities with peers; there were more initiations and responses to and from the peers; and the peer acceptance increased while peer rejection and victimization decreased. Yet, although the autistic children engaged more often with peers, they did not get more friendship nominations from the peers, nor nominated more friends themselves. Furthermore, while teacher reports showed increased social contacts and gaining new friends, parents reported no changes in social contacts and friends.

Staff Level

The interventions at staff level all involved training programs for school teachers or paraprofessionals, aimed at helping school staff identify needs of autistic children and promote child-led activities (see Appendix Table 4). Among these, three studies made use of the Remaking Recess program that trained teachers or paraprofessionals to facilitate peer interactions specifically during school recess on the playgrounds, while the other programs were applied to general school settings (Kretzman et al., 2015; Locke et al., 2019; Shih et al., 2019). Also, three studies taught paraprofessionals to include the Pivotal Response Treatment procedures, a naturalistic behavioral methodology, in supporting autistic children’s peer interactions in school activities (Feldman & Matos, 2013; Pierce & Schreibman, 1997; Robinson, 2011).

Interventions that provided staff training had effects on autistic children’s engagement in joint activities with peers, their initiations to peers, and reciprocity in their interactions (see Table 3). Autistic children were also reported to nominate more friends and to be in a more central position in social groups according to peer reports. However, no differences were observed in the friendship nominations they received from peers, regardless of the study design.

Physical Environment Level

Only four studies targeted specifically at physical environment (see Table 3 and Appendix Table 4). Among them, two studies by the same research group launched social clubs in schools based on the autistic pupils’ preferred interest (e.g., a movie or a frisbee club; Koegel et al., 2012, 2013). Similarly, one study turned the special interests of autistic children into playground games (e.g., interest in geography was incorporated into a tag game on a giant map outlined on the playground; Baker et al., 1998). The remaining one study provided structured activities that motivate peer interactions (e.g., relay races, board games, and scavenger hunts) on the playgrounds (Vincent et al., 2018). These activities were open to all students in the schools or on the playgrounds, although in two studies, children’s social interactions were facilitated by an adult (Baker et al., 1998; Vincent et al., 2018). After the interventions, it was observed that autistic pupils spent longer time engaging in joint activities with peers and made more initiations to their peers. However, no studies examined peers’ responses, and only one study followed up to see if the effect maintained over time.

There was also one study that applied a combined strategy (Chung & Douglas, 2015), including all four levels of the school environment: offering speech generating devices, inviting peer partners, training paraprofessional facilitators, and rearranging the class seating to allow the target autistic students to sit with their allistic peer partners. The study however only examined the aggregated effects, and found improvements in the reciprocity during peer interactions, with more initiations by both the target autistic child and the peers.

Discussion

School is the place where many children spend most of their waking hours, acquiring new academic and physical skills, learning social conventions, hanging out with peers, and making friends. Providing a socially inclusive school environment for all children is therefore a necessity. However, this goal appears quite difficult to attain for many schools, partly due to the gap in our knowledge regarding which solutions are available and which ones work for whom.

This systematic review showed that interventions at different levels of the school environment can effectively increase autistic pupils’ interactions with their peers, yet it remains unclear whether these school-based interventions led to better social inclusion for these children. Below we discuss the reviewed outcomes and their implications for practice and future research.

Levels of Social Inclusion

A recent editorial article by Weaver and colleagues (Weaver et al., 2021) proposed the “community engagement continuum” for defining the extent to which an autistic individual is socially included in a community. This framework includes four layers: (1) tolerance, where an individual prepares tools themselves to be physically present in an environment and tolerates the unmodified environment; (2) accessibility, where an individual is given supports or accommodations to physically access an environment, but no changes are made for supporting social participation; (3) integration, where an individual is given support and opportunities for both physical engagement and meaningful social participation; and (4) inclusion, where an environment (e.g., social, cultural, or physical environment) is modified to allow all individuals to belong and contribute meaningfully.

Following this line of thought, interventions at the child level seem to relate to accessibility, as these programs do not create opportunities for social participation; instead, autistic children are taught social skills, with which they use to create those opportunities themselves. Echoing this definition of accessibility, the current review showed that autistic children did have an increased presence in joint activities with peers, and made more attempts to initiate an interaction or respond to peers, after receiving the child-specific social skill training in school. However, peers’ responses to the autistic children did not change. In fact, even the proportion of “no response” from peers stayed unchanged (Sabey et al., 2020), showing that autistic children were still ignored by their peers – a form of implicit aggression – regardless of their improved social skills.

As to the interventions at the levels of peers and staff, integration was achieved to a certain degree. Through training, (some of) the allistic peers and staff improved their interaction/facilitation skills and the understanding of the difficulties surrounding autism, which led to more reciprocity in the interactions between the autistic children and their peers. However, having more peer interactions does not necessarily mean making more friends. Whilst an adult facilitator could steer more social initiations and responses among children, the friendship nominations received by the autistic children appeared unaffected after the intervention, showing that these children were still not seen as a friend by their peers (e.g., Kasari et al., 2016). Moreover, although having the company of peer partners led to more peer responses and more friendship nominations from the peers, the autistic children seem to adopt a passive position in such peer-mediated interactions, which resulted in producing no effect on their initiations to peers nor on the friendship nominations made by themselves (e.g., Carter et al., 2016; Kasari et al., 2012). Also, the positive effects derived from these programs may not extend to the untrained peers and staff.

In theory, interventions applied at the level of physical environment could be an approach for achieving inclusion. The modifications made to the physical environment for fitting individual needs bring the message that individual differences are valued and respected, and that is the starting point for an autistic child to feel belonged in school. In the current review, positive effects were shown in three school-based interventions of this sort, including providing playground games and materials that encourage social interactions (Vincent et al., 2018), designing new playground activities based on autistic children’s preferred interests (Baker et al., 1998), and forming school social clubs based on autistic children’s preferred interests (Koegel et al., 2013). However, given the small body of literature and the fact that peers’ responses were not recorded in these studies, it is hard to confirm from the current review the extent to which autistic children may benefit from such approaches. This presents an urgency to understand the changes in peers’ behaviors towards autistic children after adaptations are made to the physical environment in schools.

Practical Implications, Limitations, and Future Research

As one of the first systematic review that addresses social inclusion of autistic children at different levels of mainstream school environment, this study provides an overview of school-based, methodologically sound interventions currently available to educators and researchers. It carries several implications for practice and future research, while some limitations should also be considered.

In this review, child-specific and peer-mediated interventions were the most common approaches, yet it is worth noting that such interventions mostly took place outside the regular school routine. For example, child-specific sessions and peer network meetings were held in a separate room from children’s usual classroom, and peer partners received the training outside the curriculum. Such an outcome shows that social inclusion remains an extra layer of school education and may reflect the harsh reality that faces mainstream school educators with a dearth of means for increasing social inclusion among children within the school routine.

However, when an intervention for social inclusion is considered an addition to school routine and focusing specifically on a child’s social skills, stigmatization is likely to occur (Turnock et al., 2022); and worse, when unsuccessful the child might feel he or she failed, most likely further damaging their self-esteem, their position in the group, and their sense of belonging (Rieffe et al., 2018). The prevalence of such a child-specific point of view among the reviewed studies should therefore be taken with caution. The fact that most of the reviewed studies measured only autistic children’s interactions towards peers, and provided limited information in the other way around, may also reflect an underlying child-specific perspective in the choice of measures in many studies. There was also a lack of differentiations in regard to autistic pupils’ motivation towards socialization and these children’s perception of “meaningful” participation. It should be noted that having a higher quantity of social interactions, such as making more initiations and responses, or having more “appropriate” interactions, does not indicate that these interactions are also of higher quality. Individual differences in what makes an interaction enjoyable, and the effects of the surrounding social and physical environment, must be accounted for when evaluating an intervention.

Therefore, future researchers as well as schools and teachers are encouraged to switch focus from “fixing” autistic pupils, to addressing the school environment that surrounds autistic pupils, including the awareness of diversity and equity among peers and staff, and the design of the physical environment such as providing places to seek quiet and more green spaces (e.g., Rieffe et al., 2021; also see Martin (2016) for an overview of recommendations for physical environment design for autistic individuals). To this end, two aspects require special attention.

First, special attention should be paid to the measures for evaluating the interventions across the different levels of school environment, including the experiences of the autistic pupils. Currently, data is primarily from observations, presumably by allistic observers. Future studies should consider including other methods, such as questionnaires by autistic pupils and other relevant informants (e.g., peers, caregivers, or teachers) to better account for autistic pupils’ perspectives and experiences, and to confirm that the effects are maintained outside the observed contexts. Moreover, recent research showed that sensing technologies may be used to assess social dynamics at both group and individual levels, and the interaction between pupils and the built structures (Eichengreen et al., 2024; Nasri et al., 2022).

Second, tools should be developed to support schools and teachers to create a welcoming setting where diverse needs and desires are respected and valued. Notably, the current review focused on primary and secondary schools that provide mainstream education, where autistic children shared (most of) their school time with allistic peers. Practices in special education schools were thus not considered in this review. However, these practices might be insightful to the design of interventions that fit the needs and experiences of autistic children also in other school settings. For example, Yuill and colleagues (Yuill et al., 2007) examined the effect of a new playground that was designed for autistic pupils, in a special-education primary school. This new playground provided a clear circuit between activities (e.g., the slide curved to the direction where the next activity starts) to encourage playful movements and interactions naturally and structurally. It also had observation points where autistic children could observe others’ play without needing to interact, or recover from sensory overarousal triggered by the surroundings, which is often experienced by autistic individuals.

Notably, this review included both group design and single-subject design, in the attempt to cover all published studies that evaluated the effect of an intervention. Yet, it should be taken into account that the majority (84%) of the included studies were of a single-subject design and thus mostly with a small sample size. Also, single-subject designs are prone to internal validity risks if the conventions are not adhered to and stability within conditions cannot be established (Petursdottir & Carr, 2018). In this review, we mitigated the effects of these risks by evaluating the studies’ methodological quality and synthesizing only the results from those with an adequate-to-strong quality rating. However, their potential effects should still be taken with caution.

Furthermore, this review did not include gray (unpublished) literature, due to the lack of guidelines for systematically searching, selecting, and evaluating such studies (Martín et al., 2005; Schmucker et al., 2017). Although this decision was based on our aim to provide an overview of evidence-based interventions, we could not rule out the possibility that publication biases might be present in our synthesis as a result (Tincani & Travers, 2019). Thus, it should be noted that some potentially effective interventions might be omitted in this review because they were not published in peer-reviewed journals nor in English.

Conclusion

The current synthesis of results shows that the school-based interventions currently available in the literature can improve the accessibility of school activities, helping autistic pupils gain skills to approach the peers; and can enhance the integration of autistic pupils in schools, through the support of trained peers and/or school staff. Yet, a more holistic strategy that interconnects the different levels of the school environment, moving the focus from individual children’s social skills to the modification of the larger context, is required to ensure the inclusion of autistic children in schools, where they can meaningfully contribute. It is thus recommended that future studies attend the social, cultural, and physical environments that surround autistic children, and the expectations and experiences of these children concerning social participation. The paucity of information regarding these aspects in the current literature prevents us from drawing conclusions about autistic children’s social inclusion in schools. To fill this gap, new methodologies for measuring children’s interactions with the environments (e.g., a multidisciplinary approach combined with sensor technology; e.g., Andersen et al., 2019; Veiga et al., 2017), and the use of measures that can reflect children’s own views (e.g., with self-reports and focus group interviews), are needed. Such considerations will improve social inclusion not only for a specific population, but for all children, who have different capacities and wishes.