Introduction

Autism Spectrum Disorder (ASD) is a neurodevelopmental disorder characterized by persistent impairments in reciprocal social communication and social interactions along with the presence of restricted, repetitive patterns of behaviors, interests, or activities [1]. Two recent studies conducted in Italy reported a prevalence of ASD in children (age range 7–9 years old) of 1.14% and 1.3% [2, 3], consistent with its prevalence in the world, which is reported between 1 and 2% [4]. The male: female ratio is about 4: 1 [5], with 48% of children having intellectual disability [5, 6].

No pharmacological treatment has currently shown to be effective for the treatment of the core symptoms of ASD. In general, pharmacological treatments, combined with psychological interventions, are directed at the treatment of associated symptoms (such as irritability) or coexisting psychiatric conditions (e.g. attention deficit disorder, oppositional disorder, schizophrenia spectrum disorders), which are frequent in patients with ASD [7].

Antipsychotics are used to treat associated comorbidities, such as schizophrenia spectrum disorders (SSD) and behavior disorders [8, 9]. A recent meta-analysis estimated a prevalence rate of SSD symptoms in ASD of 9.6% [10], and vice-versa individuals with SSD have significantly more autistic symptoms than healthy controls [11]. An observational study reported a high incidence of multiple treatment failure when children and adolescents with comorbid ASD and SSD are treated with antipsychotics [12].

A systematic review by Pillay et al. [13] suggested that antipsychotics may be useful in improving core symptoms, particularly stereotyped behaviors and narrow interests. The latest, larger clinical studies have been based on second generation antipsychotics (SGA) such as risperidone, aripiprazole and lurasidone, while older studies studied first-generation antipsychotics (FGA). Few studies compared the effect of two or more antipsychotics [13].

This systematic review was performed within the context of the development of evidence-based guidelines for the diagnosis and management of ASD in children and adolescents for the Italian National Institute of Health (in Italian: Istituto Superiore di Sanità—ISS). This systematic review has been specifically conducted by the Evidence Review Team, based on the manual developed and published by the ISS, to support the ISS autism guidelines panel in formulating recommendations [14, 15]. Following a public application process, the ISS established a multidisciplinary panel, including people with ASD and/or their caregivers, that formulated the following question for its first recommendation:

  • Should antipsychotics be used for the treatment of children and adolescents with ASD?

Methods

The key elements of the review protocol including participants, intervention, comparator, outcomes, study design (PICOS) were developed by the guideline panel. Panel members and evidence review team members declared conflict of interests and all the process was recorded and it is available for consultation upon request to the study authors.

Population

Children and adolescents aged 0–18 years with a primary diagnosis of ASD. A concurrent secondary diagnosis of another medical condition was not considered as an exclusion criterion.

Intervention

The following list of FGAs and SGAs was selected by panel members to be investigated in the systematic review, with no limitations of dose and administration route: aripiprazole, clozapine, haloperidol, levosulpiride, lurasidone, olanzapine, risperidone, trifluoperazine. We included also studies in which antipsychotics were used as adjunctive treatment (e.g. in addition to behavioral or other pharmacological interventions).

Comparisons

Placebo or no intervention.

Outcomes

To measure the desirable and undesirable effects of the treatment, we assessed the following outcomes:

  • Restricted and repetitive interests and behavior,

  • Hyperactivity, inattention, oppositional, disruptive behavior disorders,

  • Self-harm,

  • Insomnia,

  • Social communication, social interaction,

  • Serious adverse events,

  • Emotional dysregulation/irritability,

  • Anxiety,

  • Adverse events,

  • Global functioning, global improvement,

  • Quality of life,

  • Obsessions, compulsions,

  • Dropout due to any cause,

  • Dropout due to adverse events.

These outcomes were deemed to be highly relevant to children and adolescents with ASD by the guideline panel.

Subgroup analyses

We performed subgroup analyses dividing the studies between those that reported Aberrant Behavior Checklist (ABC)—Irritability subscale scores ≥ 18 as the participants’ selection criterion, and those that reported different inclusion criteria or no particular criteria in addition to the diagnosis of ASD. The ABC is a scale empirically developed to assess the effectiveness of psychotropic medications by measuring psychiatric and behavioral disturbances exhibited by individuals with intellectual and developmental disabilities through 5 subscales corresponding to as many domains (irritability [range 0–45], lethargy/social withdrawal [range 0–48], stereotypic behavior [range 0–21], hyperactivity/noncompliance [range 0–48], inappropriate speech [range 0–12], with higher scores indicating worse condition) [13, 16]. We assessed credibility of subgroup effects using the criteria proposed by Sun et al. [17], and we considered it in evaluating the certainty in the evidence of effects [18].

Types of studies included

Randomized controlled trials comparing antipsychotics with placebo or no treatment in the management of ASD were included. Both parallel, crossover and withdrawal design were included. Quasi-randomized trials, such as those allocating by using alternate days of the week, and open label trials were excluded. For trials that had a crossover design only results from the first randomization period were considered, since carry-over effect could not be excluded [19].

Literature search

We performed a comprehensive computer literature search of the CENTRAL, PubMed/Medline, Embase, PsycINFO, Web Of Science databases and of trial registers to find relevant peer reviewed articles on the effect of antipsychotics for children and adolescents with ASD from the date of database inception until January 2019. The full search strategies used are available in Additional file 1 and Additional file 2. No date limit and no language restrictions were used. Finally, we hand-searched references from relevant systematic reviews and included studies to identify any RCT missed by the search strategy.

Study selection and data extraction

Two reviewers (FDC, GD) independently evaluated the retrieved studies for inclusion and assessed the methodological quality of included studies. Information extracted included study characteristics (lead author, publication year, journal), participant characteristics (age range, setting, diagnosis), intervention details (dose ranges, mean doses of study drugs) and outcome measures of interest.

Data analysis

Data were entered and analyzed using STATA 16.1 software. Since different scales were used in the studies, we analyzed data as continuous outcomes using standardized mean difference (SMD) with 95% confidence intervals, utilizing the random effects model, because a certain degree of heterogeneity was expected among trials [20]. In interpreting SMD values, we considered SMD “small” if < 0.40, “moderate” from 0.40 to 0.70, and “large” if > 0.7. We analyzed dichotomous outcomes by calculating the risk ratio (RR) for each trial with the uncertainty in each result being expressed with 95% confidence interval (CI). Heterogeneity between studies has been investigated by the Q-test, by the I-squared statistic (I-squared equal to or more than 50% was considered indicative of heterogeneity), and by visual inspection of the forest plots.

Dealing with missing data

We managed missing data according to Higgins et al. [19]. If dichotomous outcome data were still missing, they were managed according to the intention-to-treat (ITT) principle, and we assumed that patients who dropped out after randomization had a negative outcome. Missing continuous outcome data were either analyzed using the last observation carried forward to the final assessment (LOCF) or on an endpoint basis, including only participants with a final assessment. When p values, t-values, 95% CIs or standard errors were reported in articles, we calculated SDs from their values [20]. If p values, t-values, 95% CIs or standard errors were not reported at the endpoint, SDs were imputed from their baseline values, or, if baseline values were not reported, from the mean value of SDs of individuals randomized to that drug (or to placebo) in the other included studies [21].

Risk of bias and overall certainty of evidence assessment

Two authors independently (GLD, FDC) assessed the risk of bias in the included studies using the Cochrane risk of bias assessment tool [19].

The main results of the review were presented in’Summary of findings’ (SoF) tables, as recommended by Cochrane [19, 22], using the methodology developed from the Grading of Recommendations Assessment, Development and Evaluation (GRADE) Working Group [23,24,25]. The confidence in the effect estimates was evaluated in four levels: high, moderate, low, very low. The results were summarized and presented to the panel through the GRADE evidence to decision (EtD) framework [26, 27]. Here we present the results for the following criteria: desirable effects, undesirable effects, and certainty of evidence. The results for the other EtD criteria will be published elsewere [28, 29].

Results

Selected studies

We retrieved from database searching 1987 citations of which 779 were removed, being duplicates. We excluded 1127 records on the basis of titles and abstracts and retrieved 82 documents in full text: 47 studies have been excluded and 35 full-text articles included since satisfied the inclusion criteria. Reasons for the exclusion of the 46 papers: 27 studies whose comparator did not meet inclusion criteria, being augmentation trials without placebo arm [30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51] or trials comparing two pharmacological interventions without placebo arm [52,53,54,55,56]; seven studies were pooled analyses or post-hoc analyses of randomized controlled trials [57,58,59,60,61,62,63]; nine studies whose design did not meet the inclusion criteria [64,65,66,67,68,69,70,71,72]; two studies for which we were not able to retrieve the full-text [73, 74]; one study included only adults [75]. We retrieved 52 records from trial registers, 22 of which evaluated in full text: five trials were excluded since they did not show any result (clinicaltrials.gov identifiers: NCT00147394 [76], NCT00198107 [77], NCT00468130 [78], NCT01171937 [79], NCT00057408 [80]), four trials considered comparators that did not meet inclusion criteria (clinicaltrials.gov identifiers: NCT00080145 [81], NCT00205699 [82], NCT01333072 [83], NCT01844700 [84]), three trial whose design did not meet inclusion criteria (clinicaltrials.gov identifiers: NCT00166595 [85], NCT00691080 [86], NCT00619190 [87]), and two ongoing trials (clinicaltrials.gov identifiers: NCT02574741 [88], NCT03487770 [89]).

Finally, 46 documents, corresponding to 21 RCTs (1309 participants) were included: nine RCTs comparing risperidone with placebo [90,91,92,93,94,95,96,97,98], five RCTs comparing aripiprazole with placebo [99,100,101,102,103], five RCTs comparing haloperidol with placebo [104,105,106,107,108], one RCT comparing lurasidone with placebo [109], and one RCT comparing olanzapine with placebo [110]. Articles’ selection process is shown in Fig. 1, while full references for included and excluded trials are reported in Additional file 3.

Fig. 1
figure 1

PRISMA Flow chart

Included studies characteristics

Six studies (28.6%) included pre-schoolers and school age children, while 15 studies (71.4%) included school-age children and adolescents. The majority were male (83.3%), with a mean age of 8.8 years. The studies included patients diagnosed with autism (71.4%), ASD (23.8%) or PDD-NOS (4.8%), and diagnosis was performed principally through DSM-IV (81.0%) or DSM-III criteria (14.3%). Overall, 318 patients were randomly assigned to aripiprazole, 248 to risperidone, 100 to lurasidone, 67 to haloperidol, 6 to olanzapine, and 545 to placebo. Mean sample size was of 62 (range 11–218). One study recruited patients from Europe, 17 from North America, and three from Asia. Study median duration was eight weeks (Interquartile range: 8–22). Full study characteristics are reported in Table1.

Table 1 Characteristics of included Randomized Controlled Trials

To see the results of risk of bias assessment of included studies, see Additional file 4. We assessed publication bias through funnel plots presented in the Additional file 7.

Results and overall certainty of evidence

Forest plots for the main analyses are shown in Additional file 5, while the GRADE evidence profile is shown in Additional file 8.

Antipsychotics probably reduce “hyperactivity, attention deficit, opposition, and disruptive behaviors”, and probably slightly reduce both “restricted and repetitive interests and behaviors” and “obsessions, compulsions”.

Antipsychotics seem to slightly reduce “emotional dysregulation/irritability” and seem to positively influence “global functioning, global improvement”.

There are uncertainties about the effect of antipsychotics on “anxiety” and “self-harm”.

With regard to the safety profile, antipsychotics are likely to increase the risk of “adverse events”, and may induce a slight increase in the incidence of “serious adverse events”.

We found no extractable data regarding “insomnia” and “quality of life” outcomes.

Details about effect estimates and certainty of evidence are reported in Tables 2 and 3: Summary of Findings tables.

Table 2 Summary of Findings (SoF) for the comparison antipsychotics versus placebo – continuous outcomes
Table 3 Summary of Findings (SoF) for the comparison antipsychotics versus placebo –dichotomous outcomes

Subgroup effect considerations

Antipsychotics in the subgroup with ABC-Irritability ≥ 18 showed a trend towards a higher efficacy for emotional dysregulation and global functioning, and a trend towards lower efficacy for social communication, social interaction and hyperactivity outcomes. In the same subgroup antipsychotics apparently showed a better safety and tolerability profile (see Additional File 6). However, when considering whether a clear subgroup effect was present or not, we noted that: summary effects for the subgroup versus all were overlapping; nearly all 95% CIs of individual studies were overlapping for all outcomes; there was not a clear suggestion of consistently greater benefits or harms in the subgroup; the p-values for subgroup effects indicated that differences were likely due to chance.

We considered that there were no subgroup effects that are substantially concerning or credible, and certainty of evidence was not downgraded for subgroup effects.

Discussion

We found antipsychotics for children and adolescents with ASD more efficacious, more acceptable, but less safe than placebo.

Antipsychotics are often used as off-label pharmacological treatments for children and adolescents, even though their use in this population is an evidence-based choice for certain conditions [111]. According to a meta-analysis conducted in 2017 by Pillay et al. [13], and similarly to what we found in our review, SGAs were probably effective in reducing irritability, and in producing a slight decrease in social withdrawal, stereotypy and inappropriate speech, as measured by ABC. SGAs also probably increased response rate and decreased global impression of severity. Recent meta-analyses comparing only one drug (i.e. risperidone, aripiprazole, and lurasidone) versus placebo showed all similar results, even if more often did not report significant estimates, probably due to wider confidence intervals, particularly for lurasidone [112,113,114]. Pillay et al. [13] described narratively head-to-head trials, reporting (1) no differences between aripiprazole and risperidone in main outcomes, (2) mixed results when comparing haloperidol to risperidone or olanzapine. Since the publication of Pillay et al. (2017) systematic review, results from 4 placebo controlled [77, 95, 100, 102] and 2 head-to-head [53, 83] RCTs have been published.

In our study, antipsychotics in short- and medium-term showed good tolerability, as they reduced the risk of dropouts by 39% (moderate certainty), and demonstrated to be relatively safe options, with an increase of 19% in adverse events (moderate certainty) and 9% in serious/severe adverse events (low certainty). Evaluating the long-term safety profile of the antipsychotics was not among the objectives of this systematic review, as we did not perform a systematic review of not-randomized studies. However, it can be stated that, in children and adolescents with ASD: long-term use of risperidone, although generally well-tolerated, has been associated with an increase in plasma glucose, insulin, prolactin and leptin proportional to the dosage; the major problems often resulted from continuous weight gain over time and judged excessive [115,116,117]; the continued use of aripiprazole has been associated with a decrease in prolactinemia, with a similar risk profile for weight gain [118, 119]. The transition of treatment from risperidone to aripiprazole seems to reduce adverse events such as drowsiness, hyper-prolactinemia and amenorrhoea [120]. The number needed to treat (NNT) evaluated for effectiveness on irritability was lower using risperidone than aripiprazole, but on the other hand the number needed to harm (NNH) for the onset of extrapyramidal symptoms was also lower with risperidone [8]. The use of antipsychotics in general has also been related to hyperuricemia and hyperprolactinemia [121,122,123].

Study limitations

We have not prospectively registered the protocol for our systematic review; however, the clinical question was formulated by a multidisciplinary panel of experts, and we followed the methodology reported in the manual developed and published by the ISS [14].

After the systematic review was performed and the recommendation formulated by the panel was submitted to public consultation (https://www.osservatorionazionaleautismo.it/attivita-istituzionali/linee-guida/consultazioni-pubbliche), the results of a previously ongoing trial (NCT00198107) [77] were published on clinicaltrial.gov: 40 individuals were randomized to aripiprazole and 41 to placebo for 8 weeks. Data were available for “restricted and repetitive interests and behavior”, “Hyperactivity, inattention, oppositional, disruptive behavior disorders”, “Social communication, social interaction”, “Emotional dysregulation/irritability” (ABC subscales), for “obsessions, compulsions” (CY-BOCS), for “global functioning, global improvement” (ADOS), and for safety and tolerability outcomes (adverse events, dropouts and serious adverse events). Outcome data were consistent with what we found in our meta-analyses; for all these outcomes, except for emotional dysregulation/irritability and global functioning, the certainty of evidence was already rated as moderate. The Evidence Review Team, together with the content expert and the panel chair, considered that the new findings did not change the body of evidence and that there was no need to carry out new statistical analyses nor to reformulate the recommendation for this intervention.

The main limitation in the subgroup analysis was that the “all ASDs” studies included a mixed population, while ideally, we would have had groups with and without problem behaviors.

Finally, the use of the EtD framework requires the panel to be familiar with the tool [124]. To overcome this potential limitation, about 2 months before the presentation of the body of evidence on antipsychotics, an EtD framework on a pilot question was presented to the panel [125, 126] for the formulation of a recommendation on the use of polyunsaturated fatty acids. This recommendation will not take part to the Italian guidelines on the diagnosis and treatment of children and adolescents with ASD. In other experiences, panel members have reported that, when familiarity with the EtD framework is achieved, the tool helped them in structuring discussion, saving time, ensuring systematicity in the process of recommendation formulation [127].

Conclusions

We found antipsychotics in children and adolescents with ASD to be significantly more efficacious than placebo in reducing stereotypies, hyperactivity, irritability and obsessions, compulsions, and in increasing social communication and global functioning. Antipsychotics were also found to be significantly more acceptable in terms of dropouts due to any cause, but significantly less safe in terms of patients experiencing adverse events. We found no evidence regarding the impact of antipsychotics on “insomnia” and “quality of life” outcomes in this population. Available evidence on efficacy and safety of antipsychotics in children and adolescents with ASD needs to be evaluated together with evidence on equity, acceptability, feasibility [28], resources required and cost-effectiveness [29] in formulating a recommendation.

Preferred reporting items for systematic reviews and meta-analyses (PRISMA) checklist is reported on Additional file 9.