Autism spectrum disorder (ASD) is a lifelong neurodevelopmental disorder characterised by impairments in social interaction and communication, as well as the presence of repetitive and restricted behaviours/interests/activities (American Psychiatric Association, 2013). Anxiety disorders commonly co-occur with ASD (Hossain et al., 2020), and there are ASD-specific challenges in their assessment and treatment, so the interface between anxiety and ASD is a research priority – a view shared by autisticFootnote 1 individuals and their parents (Wallace et al., 2013). A fundamental challenge is that anxiety symptoms in ASD can be atypical in how they are triggered, their presentation, and their conceptual overlap with ASD symptomatology (e.g. Lau et al., 2020; Magiati et al., 2017). Atypical anxiety triggers associated with ASD include idiosyncratic specific fears, disruptions to routine and change, social confusion, sensory overload, and being reoriented away from restricted or repetitive patterns of behaviour, interests, or activities. Atypical presentations of anxiety associated with ASD include increased sensory, repetitive, ritualistic, or socially inappropriate behaviours. Accompanying challenges that complicate the assessment and treatment of anxiety include ASD-associated impairments in social communication, emotional literacy, interoceptive abilities, and recollecting and narrating past experiences (Bordignon et al., 2015; Kinnaird et al., 2019; Marini et al., 2018; Palser et al., 2018), alongside the heterogeneity that characterises the autistic population. To address these ASD-specific challenges, assessment and treatments for anxiety need adapting. Digital technology may aid adaptations as it can be used to provide novel approaches that can ease personalisation. These approaches involve the use of electronic devices and software, such as apps, computer- and internet-based approaches, wearable devices, and virtual reality (VR), in adjunct or independent to mental health services (Hollis et al., 2018; see definitions in Table 1).

Table 1 Types and definitions of digital technologies

Digital technology may specifically aid the assessment of anxiety in ASD since it can facilitate the use of methods that are multi-modal (e.g. clinical interviews, rating scales, behavioural observations, and physiological measurements) and from multiple informants (e.g. parents, therapist, and self-report), which is recommended to reduce the reliance on self-report measures that can lack validity in ASD due to the reasons aforementioned (Macneil et al., 2009). For example, VR can simulate an immersive visual environment with a range of anxiety-evoking stimuli which can induce psychophysiological and behavioural symptoms that can be measured in real-time (Diemer et al., 2014). Similarly, other technology-based approaches can be used to capture real-time anxiety responses: wearables to measure blood volume pulse and galvanic skin response, and ecological momentary assessment (EMA) to measure subjective experiences during everyday life (see Table 1; Hektner et al., 2007). Capturing real-time responses to a range of potentially anxiety-evoking stimuli may help to capture the heterogeneity in ASD, to reduce the possible influence of pre-conceived notions of anxiety, and to detect atypical anxiety triggers and manifestations (Hare et al., 2015).

The most common use of digital technology for the treatment of anxiety is VR exposure therapy as part of cognitive behavioural therapy (CBT), which may have ASD-specific advantages. This is because, rather than asking the patient to think of situations that cause them anxiety, the patient can experience typical and atypical anxiety triggers in VR. VR may facilitate the building of exposure hierarchies by reducing the reliance on the patient’s recollection and narration of past experiences, generativity, and imagination—abilities typically impaired in ASD (e.g. Bordignon et al., 2015; Lai et al., 2017; Marini et al., 2018). Further, VR may facilitate the formulation of exposure hierarchies by providing a controlled, repeatable, and diversifiable environment for patients to practice strategies to manage anxiety during a live anxiety-evoking situation in front of the therapist (Newbutt et al., 2016; Parsons et al., 2004). On a practical level, VR can provide a safe setting for exposure therapy, such that any socially inappropriate behaviour exhibited as an atypical anxiety response would not have the same consequences as real-life exposure.

Considering both assessment and treatment, VR and computer-based approaches alike are inherently visual, engaging, and structured and thus in-fitting with ASD-specific recommendations for adapting CBT (NICE, 2013) and for aiding communication during anxiety assessment for autistic individuals (Attwood, 2006). Furthermore, autistic individuals tend to show an affinity towards electronic screen media (ESM), which includes computer applications and VR (Mineo et al., 2009; Shane & Albert, 2008). Additionally, computer-based and telemedicine approaches allow assessments/treatments to be delivered at-home which may be preferred by autistic individuals (Hepburn et al., 2016) due to not requiring being in an unfamiliar environment, travelling nor transitioning between contexts, and potentially involving fewer interactions with new people, all of which can be challenging or anxiety-evoking for autistic individuals (e.g. Evans et al., 2005; Goodall, 2018; Rezae et al., 2019; Rydzewska, 2016). Because of this, and since technology can facilitate the delivery of self-help tools, technology-based approaches can improve access to mental health care (Hollis et al., 2018), which has been inadequate in the autistic community (Lake et al., 2014; Vogan et al., 2017). Despite the potential benefits of technology-based assessment and treatment approaches for targeting anxiety in ASD, no reviews to date have examined this. Yet, numerous reviews have shown technology-based approaches hold promise for targeting anxiety generally (e.g. Cieślik et al., 2020; Grist et al., 2019; Walz et al., 2014) and for targeting functioning in ASD such as social communication skills (e.g. Knutsen et al., 2016; Koumpouros & Kafazis, 2019; Sandgreen et al., 2020; Valentine et al., 2020).

Aims and Objectives

The present review therefore aims to fill this research gap by narratively synthesising research on technology-aided assessments and treatments of anxiety in autistic individuals. This consists of four objectives: (1) identify the digital technology used to aid assessments and treatments used for anxiety in autistic individuals, as well as the assessments and treatments themselves; (2) identify the types of anxiety disorders and symptoms targeted in these studies; (3) synthesise and critique the quality of evidence regarding effectiveness, feasibility, acceptability, safety, and utility of technology-aided assessments and treatments in these studies; and (4) identify possible research avenues for future studies in this area. An ancillary objective will be to determine how well characterised samples are with respect to socio-demographic variables, IQ, anxiety diagnosis, anxiety severity, adaptive functioning, social communication skills, and mental health comorbidities (to inform objectives 2, 3, and 4, including gauging the generalisability of synthesised findings).

Methods

Study Design

A systematic review was conducted. Reporting guidelines from the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA; Moher et al., 2009) and the Synthesis Without Meta-analysis (SWiM; Campbell et al., 2020) were followed. The PRISMA and SWiM checklists are included in Online Resources 1 and 2. For a justification of the chosen methodology, see Online Resource 3.

Protocol and Registration

Details of the protocol for this systematic review that was registered prospectively on PROSPERO can be accessed at https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42019142910. Before registering the protocol, existing PROSPERO records and the Cochrane Database of Systematic Reviews were searched to check existing coverage on the topic. Changes to the protocol are listed and justified in Online Resource 4.

Eligibility Criteria

This review only included original articles written in English and published/in-press within peer-reviewed journals. Studies were only included if they used empirical methods and included at least 5 human participants with ASD diagnoses. It was further specified that only studies aiming to assess or treat anxiety using digital technology (see Table 1) in autistic participants would be included. While studies were excluded if they solely used computerised/telephone versions of measures due to being commonplace in research, studies for inclusion were required to measure anxiety at least once using standardised quantitative measures, digital task/assessments, or physiological measurements. Lastly, papers on genomics, digital phenotyping, and neurotechnology were excluded.

Information Sources

Searches were conducted in five electronic databases: Embase, MEDLINE, PsycInfo, Cochrane Library, and Web of Science. Databases were first searched on 16 August 2019 and updated on 27 November 2020.

Search Strategy

For each key concept identified from the research question (i.e. autism, digital technology, and anxiety), relevant search terms were identified (e.g. autis*, digital* and anxi*) and supplemented after examining keyword lists in seminal papers under each key concept and each concept combination therein, as well as controlled vocabulary across the databases. For each database, any terms that were descriptors (i.e. subject headings) were searched for in the [mesh], title, and abstract fields. The remaining terms were searched for in all text fields excluding the affiliation field and author field. However, since the terms “digital*” and “web*” yielded too many false results, they were searched for in the title and abstract fields only. Only subject headings with subheadings relevant to the research question were exploded. As controlled vocabulary varies database-to-database, search strategies were database-specific. Since Web of Science does not have controlled vocabulary, and due to fewer search fields being available, all terms were searched for in the title, abstract, and author keywords fields (see Table 2). No search filters or limiting commands were used on any databases to promote sensitivity. For the full electronic search strategy, see Online Resource 5.

Table 2 Search strategy for Web of Science

Study Selection

During study selection, records were categorised as “eligible”, “query”, and “not eligible” according to the eligibility criteria. Following de-duplication, records were screened at title and abstract level. A randomly selected 10% of records were also blindly screened at title and abstract by an independent researcher. Inter-rater reliability for this initial screening was strong (Kappa = 0.80; percentage agreement = 99.49%). Following this, records categorised as eligible or query were assessed for eligibility using full-text review. Full-text screening was completed independently by the first and second authors with moderate inter-rater reliability (Kappa = 0.64; percentage agreement = 82.14%). Reliability checks were conducted before the updated search. All discrepancies were resolved through discussion with the third author.

Data Collection Process

A standardised data extraction form (see Online Resource 6) was developed following guidance from the Centre for Reviews and Dissemination (2008). The form was piloted on 3 studies and supplemented to capture variation in methods. The first author independently completed the data extraction using the form and the second author checked the accuracy of the completed forms, except for the study identified in the updated search which was checked by an independent researcher. Key characteristics of the studies were then tabulated.

Data Items

Sample

Data on age, gender, diagnoses including ASD, and country were extracted. Other pre-defined sample characteristics, including those that may be potential confounds such as socio-demographic variables, IQ, anxiety diagnosis, anxiety severity, adaptive functioning, social communication skills, and mental health comorbidities were also extracted to inform the quality assessment and satisfy the ancillary objective of determining how well characterised samples are. Further to this, eligibility criteria, any sample size justifications, sampling methods, and recruitment procedures were extracted to inform the quality assessment.

Phenomenon of Interest

The type of digital technology used, assessment characteristics (e.g. tasks and assessment tools used), and intervention characteristics (e.g. anxiety targeted, treatment delivered and duration) were extracted.

Design

The research design employed, and any conditions, was extracted.

Outcomes of Interest

Anxiety measures were outcomes of interest in terms of treatment effects and whether multi-modal methods were used. Because feasibility and acceptability of the assessments and treatments were of interest, any information on rates of identification, enrolment, recruitment, retention/drop-out, response/data completion, and on usability (e.g. technical problems) was extracted. For treatment studies, session attendance, treatment completion rates, and therapist fidelity was also extracted.

Research Types

Since both qualitative and quantitative records could be included, the research type was determined.

Risk of Bias in Individual Studies

All authors reviewed the quality assessment tool and agreed adaptations (see Online Resource 7). During quality assessment, each study was assessed using an adapted version of the Effective Public Health Practice Project (EPHPP) tool for quantitative studies. This tool requires rating the risk of sampling bias, selection bias, experimenter/researcher bias, participant bias, attrition bias, or confounding, as well as the quality of study design, data collection tools, the intervention/assessment delivered, and any analysis conducted. Sections covering these aspects (see Fig. 2) are rated as “strong”, “moderate”, or “weak” in quality per study. Based on these ratings, following the adapted EPHPP instructions, a global rating was computed that was then judged in terms of applicability. The global rating was deemed applicable in all cases. Author 1 and an independent researcher conducted the quality assessments independently on all included papers. Disagreements due to clear oversight were identified and corrected through discussion. At this point, weighted kappa (κw) with quadratic weights (Fleiss & Cohen, 1973) was calculated to determine inter-rater reliability. It showed statistically significant agreement between the two raters for global and section ratings, κw = 0.762, 95% CI [0.634, 0.891], p < 0.01. Percentage agreement was 82.9%. The strength of the agreement was classified as good according to Altman’s (1991) classifications and excellent according to Fleiss et al. (2003) classifications. The additional paper yielded in the updated search was independently assessed by author 1 and another party with corrections made. For all papers, disagreements due to differing interpretation were resolved by consensus after consulting authors 2 and 3. One paper was co-authored by author 3, and so disagreement on this paper was discussed with author 2 only.

Synthesis of Results

With reference to the quality appraisals, papers’ methodologies and results were critically reviewed and combined textually by a formal narrative synthesis. This was informed by guidance from the Economic and Social Research Council (ESRC; Popay et al., 2006). Studies deemed directly relevant to objective 3 were prioritised for drawing conclusions from the synthesis. To address the present review’s aim, studies were grouped according to whether they focused on the assessment or treatment of anxiety under each research objective and were further grouped by the main methodology used.

Results

Study Selection

An overview of the study selection process is shown in Fig. 1. The database search yielded 3529 records. Following the removal of duplicates (n = 1,101), the titles and abstracts of records were screened according to the eligibility criteria, leading to the removal of 2,391 records. After a backward and forward reference search of eligible records, 2 additional records were screened at full-text and accepted. No additional records were identified after searching the reference lists of prior related reviews. The remaining records (n = 36) were assessed for eligibility using full-text review and 15 papers that consisted of 16 studies were deemed eligible for inclusion in the narrative synthesis.

Fig. 1
figure 1

PRISMA 2009 Flow diagram

Synthesis of Study Characteristics and Results

For a summary of the studies’ key characteristics and results, see Tables 3, 4, 5, and 6.

Table 3 Studies investigating the use of technology-aided assessments of anxiety in autistic individuals (N = 5)
Table 4 Studies investigating the use of ESM for targeting anxiety during medical visits in autistic individuals (N = 3)
Table 5 Studies investigating the use of ESM alongside CBT to treat anxiety in autistic individuals (N = 4)
Table 6 Studies investigating the use of telemedicine interventions in autistic individuals (N = 4)

Sample Characteristics

Apart from one study that recruited adults only (i.e. Hare et al., 2015), all assessment studies recruited adolescents with one recruiting adults as well (i.e. Chen et al., 2016). The overall age range for assessment studies was 10 to 61 years. Treatment studies recruited children and adolescents (age range: 3 to 19 years), except for two that recruited adults only (i.e. Gaigg et al., 2020; Maskey, et al., 2019c; age range: 18 to 66 years). Approximately 81% of participants across all samples were male, though some studies only reported demographics of their final sample and Suresh and George (2019) did not report the proportion of males. None of the assessment studies required participants to show clinically significant anxiety levels to participate, but for most samples a substantial proportion did and for one study this was unreported (i.e. Liu et al., 2008). Samples in most of the treatment studies had clinically significant anxiety (i.e. Conaughton et al., 2017; Hare et al., 2016; Hepburn et al., 2016; Maskey et al., 2014, 2019a, b), even for the three studies for which this was not an inclusion criterion (i.e. Gaigg et al., 2020; Hare et al., 2016; Maskey et al., 2014). In the remaining treatment studies, this was not reported. Most studies did not report comorbidities or excluded on this basis (e.g. severe mental health disorders), and those that did typically reported multiple anxiety diagnoses and low mood. Studies were mostly based in the UK, except for 1 in India (i.e. Suresh & George, 2019), 4 in the USA (i.e. Hepburn et al., 2016; Isong et al., 2014; Johnson et al., 2014; Liu et al., 2008), 2 in Australia (i.e. Chen et al., 2016; Conaughton et al., 2017), and some of Chen et al.’s (2016) participants were based in Taiwan.

Risk of Bias within Studies

As indicated in Figs. 2 and 3, no studies were rated strong with respect to sampling technique or blinding. Around half of all studies were moderate in quality with respect to sampling technique as their samples were considered at least somewhat likely to be representative, but since the rest of studies showed a high risk of selection bias, they were rated as weak in this respect. With respect to blinding, two factors were considered: (1) the risk of reporting bias and (2) the risk of researcher bias for assessment studies and detection bias for treatment studies. The risk of reporting bias was high in all studies as it could not be determined if participants were aware of the research question in any. This was coupled with a high risk of researcher or detection bias for most studies, hence why they were rated as weak in terms of blinding. Assessment studies were considered to exhibit high risk of researcher bias if they did not report whether researchers/evaluators present during assessment task were blind to the aims of the study. Treatment studies were considered to be at high risk of detection bias if outcome assessors were not blinded or this was not reported. The remaining studies were moderate with respect to blinding as detection/researcher bias was deemed unlikely.

Fig. 2
figure 2

Quality assessment ratings for assessment studies per category from the Adapted EPHPP Tool. Note. Stronger quality ratings indicate reduced risk of bias

Fig. 3
figure 3

Quality assessment ratings for treatment studies per category from the Adapted EPHPP Tool. Note. Stronger quality ratings indicate reduced risk of bias

Most designs were strong or moderate due to being deemed suitable for testing the research questions/hypotheses. However, because most treatment studies aimed to test feasibility and only 5 studies conducted randomised controlled trials (RCTs; potentially underpowered), they only provide a preliminary indication of efficacy. Additionally, the data collection methods belonging to around half of all studies were weak either because they were not valid, or both their validity and reliability could not be determined. While around half of studies showed high study completion rates of 80% or more and so were rated strong in quality for lack of retention, the remaining half showed lower completion rates, or the number of withdrawals and drop-outs was unreported. For most studies with potential confounding variables, these were often controlled or accounted for and so strong in this respect.

Ratings of the categories in Figs. 2 and 3, alongside consideration of the quality of intervention/assessment integrity and analyses using the adapted EPHPP tool, were used to inform the global rating for each study. As shown in Tables 3, 4, 5, and 6, most (N = 12) studies were rated weak, some (N = 3) studies were rated moderate, and 1 was rated strong.

Assessment Studies: Synthesis of Results and Strength of Evidence

Technology-based assessments of anxiety in ASD show promise for assessing state anxiety in everyday life and lab-based settings, but recurring issues amongst the assessment studies include the use of self-report scales that lack validity in ASD alongside lack of controlling for inter-individual differences that could act as potential confounds. Further, the generalisability of findings to the broad autistic population is questionable due to involving small convenience samples (n ≤ 30) of mainly adolescents, with at least average verbal intelligence, absence of reporting on participant enrolment and recruitment rates, and the variability in retention rates. Overall, these limitations notwithstanding, results collectively indicate that anxiety in ASD may have a distinct psychophysiological signature and be evoked by idiosyncratic triggers.

Everyday State Anxiety

Three of the assessment studies used EMA delivered via a mobile platform to assess everyday anxiety in autistic individuals. Participants received daily pre-programmed prompts to self-report their level of state anxiety, via a rating on a Likert scale (see Table 3), as well as the activity they were engaging in. Chen et al. (2016) used purely quantitative methods, whereas the others used mixed methods that included a qualitative exploration of the phenomenology of reported thoughts/anxiety. Studies produced conflicting findings regarding the type of everyday activities associated with heightened state anxiety. Chen et al. (2016) found elevated state anxiety to be associated with social activities, Hare et al. (2016) with being alone, and Hare et al. (2015) with no particular activity. As can be seen in Table 3, the cause of these conflicting findings may be attributable to differing methodologies since the way activities were coded, the range of the Likert scales used for self-reporting state anxiety, the age of participants, and the number of sampling days, varied between the studies. Crucially, while Hare et al., (2015, 2016) reported that autistic participants showed clinically significant levels of anxiety, neither study reported participants’ levels of social anxiety which Chen et al. (2016) found to have a moderating effect. Chen et al.’s (2016) findings are perhaps the most certain as the study was moderate in quality. Chen et al.’s (2016) findings must however be interpreted with caution considering the use of multilevel modelling with a small sample size (n = 30) and that, in contrast to the other EMA studies, inter-rater reliability for the coding of activities and thoughts was not calculated.

Findings did however consistently suggest that everyday state anxiety appears to be distinct in ASD in terms of its variability and idiosyncratic phenomenology (Hare et al., 2016), its increased presence (Chen et al., 2016; Hare et al., 2015), and its associated cognition (Hare et al., 2015). However, only Hare et al. (2015) demonstrated the distinctness of anxiety in ASD via group comparisons between autistic and neurotypical individuals (i.e. using a mixed design), with the other studies solely recruiting autistic participants (i.e. using a within-subjects design). Moreover, the group differences found by Hare et al. (2015) could instead be attributed to trait anxiety, rather than ASD, as the autistic group showed significantly higher anxiety levels. In terms of the validity of assessing anxiety using EMA in ASD, this remains unclear predominately because these studies relied solely on self-report ratings of state anxiety that can lack validity in ASD.

These studies indicate that EMA may be feasible and acceptable for assessing everyday anxiety in ASD, but recruitment and technical issues need addressing. While most participants in Hare et al.’s (2015) and Hare et al.'s (2016) studies found the technology acceptable, this was not reported on in Chen et al.’s (2016) study though the response and retention rates were high with only one participant dropping out due to technical issues. Lastly, Hare et al.’s (2016) participants suggested that participant-initiated EMA, as opposed to the pre-programmed prompts, may be more acceptable due to being less disruptive. These suggestions may however have been impacted by another component of Hare et al.’s (2016) study that increased participant burden. That is, in the second stage of the study, participants were asked to employ real-time stress management (RTSM) techniques if they rated their state anxiety as high (see Treatment section for more details).

Experimentally Induced State Anxiety

The two remaining assessment studies used quantitative methods, experimental designs, and measured physiological responses in autistic individuals during laboratory-based tasks designed to induce stress. The digital technology used in these two studies included a wearable device to measure physiology (Hollocks et al., 2014) and computer-based cognitive tasks (delivered via a computer monitor) for stress induction (Liu et al., 2008). Physiological parameters included heart rate and heart-rate variability. Both studies found physiological responses to be related to anxiety levels but in different respects due to the methodology employed. Liu et al. (2008) generated a physiology-based affective model that reliably predicted therapist-reported state anxiety, whereas Hollocks et al. (2014) did not report therapists’ ratings of state anxiety and showed a more nuanced pattern of findings. Compared to Liu et al. (2008) who did not report whether participants showed clinically significant anxiety, Hollocks et al. (2014) conducted group comparisons according to whether participants met criteria for an anxiety disorder and/or for ASD. For the group with anxiety and ASD diagnoses, it was uniquely found that physiological responses were inversely associated with anxiety symptoms, but these differences were not mirrored by parental- or child-reported levels of state anxiety during the task. Liu et al.’s (2008) results should be interpreted with caution as only therapists’ reports of state anxiety were used in the analysis, and neither blinding, participants’ levels of trait anxiety, nor presence of any anxiety diagnoses were reported. Most fundamentally, the suitability of the methodology and the pattern of results were difficult to discern due to unclear reporting hence why it was rated as weak. Since only Hollocks et al.’s (2014) findings provide sufficient certainty, it can only be concluded that autistic individuals with anxiety diagnoses likely show a blunted physiological stress response, but the shared limitations of the assessments studies did apply here.

Because neither study aimed to gauge feasibility and acceptability, which is not unusual for an assessment study, only some of these indicators were reported. All participants in both studies completed the assessment, although missing data in Hollocks et al.’s (2014) study and recruitment difficulties in Liu et al.’s (2008) study were reported. For now, it is challenging to generalise findings to the broader ASD population due to the limitations of both studies, but they do indicate that physiological parameters for anxiety assessments may be useful in potentially bypassing the aforementioned problematic validity of self-report scales in ASD, and Liu et al.’s (2008) study provides some indication that computerised tasks may be suitable for inducing state anxiety in ASD.

Treatment Studies: Synthesis of Results and Strength of Evidence

Two main types of digital technology were used for targeting anxiety in ASD: ESM and telemedicine (see Tables 4, 5, and 6). Contrary to the assessment studies, most treatment studies provided key feasibility indicators, and all were quantitative. As per Tables 4, 5, and 6, there were three research designs employed in the treatment studies: RCTs (N = 5), non-equivalent pre-test post-test design (N = 1), and case series (N = 5). While it is difficult to summate results due to the clinical and methodological heterogeneity, especially with respect to the nature of the interventions, these studies provide preliminary evidence for technology-based interventions being efficacious in reducing anxiety in ASD and for some maintenance of treatment effects. Only interventions involving peer modelling and social scripts did not show promise. Furthermore, the representativeness of the samples is likely constrained by the use of convenience samples of young people typically with verbal IQs of ≥ 70, and because most studies employed either potentially underpowered- or non-RCTs and/or focused on feasibility, treatment results can only provide a preliminary indication of the efficacy. Alongside this, in two of the strongest studies and the only two to test technology- and therapist-assisted CBT in an RCT, improvements did not translate into loss of the targeted anxiety diagnosis for around 62% of participants. Overall, aside from technical issues and some recruitment and retention issues reported, interventions were shown to be feasible and acceptable.

Electronic Screen Media

ESM was used to target anxiety by displaying a demonstration of a situation likely to be anxiety-evoking or as a relaxation/distraction task. Isong et al. (2014) and Johnson et al. (2014) used visual media (i.e. images and videos) to aid learning of medical visit procedures, via peer modelling and social scripts respectively, and were the only studies to involve interventions found not to reduce anxiety levels. However, for the group of Isong et al.’s (2014) participants that viewed the peer modelling via video goggles, rather than a DVD (i.e. television screen) as used by the other group or an iPad as used by Johnson et al.’s (2014) participants, the intervention was found to reduce anxiety levels. A further group of Isong et al.’s (2014) participants, for whom the assigned intervention reduced anxiety levels, wore video goggles to watch a favourite video as a distraction task during a medical visit. Likewise, Suresh and George (2019) successfully employed a distraction task for the same purpose but simulated in VR. In the remaining studies, VR was used to reduce a specific fear/phobia, as opposed to merely anxiety during medical visits like the others, via relaxation and exposure tasks constituting CBT (i.e. Maskey et al., 2014, 2019a, b, c). The majority of participants across the virtual reality-assisted cognitive behavioural therapy (VRCBT) studies were classed as treatment responders due to showing reduced specific fear/phobic behaviours post-treatment. Typically, treatment effects showed maintenance (see Table 5).

Commonly, interventions showing promise for reducing anxiety in ASD were those that used ESM for exposure, relaxation, or distraction. However, in the only VRCBT study that was an RCT instead of a case series and that was rated strong rather than moderate (Maskey, et al., 2019b), the majority of participants were not classed as treatment responders, and self- and parental-reported anxiety scores did not differ significantly between groups nor over time. Still, there were statistically significant improvements on the main anxiety outcome (Target Behaviour Ratings) for the treatment group. Nonetheless, all other studies using ESM (i.e. non-VRCBT studies addressing medical visit anxiety) were weak in quality. Isong et al. (2014) and Suresh and George (2019) included additional flaws that collectively diminish the certainty of efficacy findings. That is, psychopharmacological medication usage was unknown, which is a potential confounding variable and the analyses were unclearly reported.

The generalisability of feasibility and acceptability results is constrained by the variation in whether participants met the diagnostic criteria for a specific phobia. Furthermore, most ESM studies did not report recruitment rates, none reported participant satisfaction nor whether technical issues were experienced, and data completion rates were missing or variable. However, VRCBT was shown to be feasible and acceptable in that there was a 100% treatment completion rate and only 1 dropout at follow-up. This was not so apparent for studies targeting anxiety during medical visits as dropout and treatment completion rates tended to be less favourable and recruitment difficulties were reported by Isong et al. (2014). Additionally, while the VRCBT findings are promising, their generalisability is compromised by the eligibility requirement for participants to be interested in the intervention and to be experiencing a fear/phobia deemed fit for simulating in the VR environment, but due to lack of reporting it is difficult to determine which fears/phobias may be inappropriate and the likely degree of interest within this patient group.

Telemedicine

All four telemedicine interventions appeared to reduce anxiety levels in autistic individuals with clinically significant anxiety. Interventions involved Internet-based cognitive behavioural therapy (iCBT; Conaughton et al., 2017), CBT delivered via videoconferencing (Hepburn et al., 2016), online self-help CBT and mindfulness-based therapy (MBT) programmes (Gaigg et al., 2020), and RTSM delivered via a mobile platform (Hare et al., 2016). In the latter study by Hare et al. (2016), participants reported how anxious they felt at random intervals in the day when prompted to do so by their mobile device (i.e. using EMA) and if they rated their anxiety levels as high, the platform presented instructions for a range of common RTSM techniques could select from. Results must be interpreted with caution; however, as no telemedicine studies were rated as strong, none included active comparators, and Hare et al. (2016) had no control group. Additionally, only Conaughton et al. (2017) was rated as moderate and used multi-modal and multi-informant measures of anxiety that are more likely to produce valid results in autistic samples. Still, while Conaughton et al. (2017) found no group differences in their main anxiety outcome measure, remission of primary anxiety diagnosis, the iCBT group did show greater reductions in the clinical severity of the primary anxiety diagnosis, the number of anxiety symptoms and disorders, as well as improvements in functioning across diverse indicators. Furthermore, Conaughton et al.’s (2017) and Gaigg et al.’s (2020) follow-up and Hepburn et al.’s (2016) post-treatment measurements at 3 months indicated maintenance of treatment effects. However, only 50% of participants in Gaigg et al.’s (2020) 6-month follow-up showed maintenance and the waiting-list control group likewise showed reductions in anxiety over time such that at this timepoint active treatment showed no advantage.

In terms of feasibility and acceptability, only Hepburn et al. (2016) reported their recruitment rate, although it was deemed feasible and all studies reported low dropout rates during the treatment phase and most reported at least moderate treatment satisfaction. Conaughton et al. (2017) and Hepburn et al. (2016) found high session attendance, but a substantial proportion of participants in Gaigg et al.’s (2020) did not complete the self-help programme nor return requested diary records pertaining to the use of strategies, and in Hare et al.’s (2016) study did not engage in any RTSM techniques on at least 1 day. However, it is unclear whether this latter finding was a result of participants not rating their state anxiety as high which would prevent the RTSM techniques from being presented (i.e. triggered) on the device at all. Lastly, technical issues caused dropout in the baseline phase of Hare et al.’s (2016) study and impacted some sessions in Hepburn et al.’s (2016) study, but this was not reported on by Gaigg et al. (2020) or Conaughton et al. (2017) which might otherwise explain the low treatment completion rate in the latter study.

Discussion

This review aimed to narratively synthesise research on technology-aided assessments and treatments of anxiety in autistic individuals for the first time. Despite the long-established link between anxiety and ASD, and the potential of digital technology to facilitate ASD-specific adaptations for its assessment and treatment, searches yielded only 16 studies. This area has thus been neglected relative to the preponderance of research outside of ASD that supports the use of digital technology for aiding the assessment and treatment of anxiety (see reviews by Cieślik et al., 2020; Gujjar et al., 2019; Walz et al., 2014).

Summary and Critique of Evidence: Assessment Studies

While there were only five assessment studies, collectively they indicate the nature of anxiety to be distinct in ASD, as demonstrated by non-technology-based research (e.g. Lau et al., 2020; Magiati et al., 2017). Three of the assessment studies used EMA to assess everyday state anxiety (i.e. Chen et al., 2016; Hare et al., 2015, 2016) indicating it to be distinctly characterised, pronounced, variable, and idiosyncratic in ASD. ASD-specific characteristics included rumination, worries regarding the need for rules and coping with change, confusing and self-focused thoughts. These results are congruent with the wider ASD literature (e.g. Bearss et al., 2016; Lau et al., 2020; Ozsivadjian et al., 2012; Robertson et al., 2018). Conversely, review findings are incongruent with this literature base in that anxiety was not commonly shown to be triggered by social situations. This may be explained by the small samples or the fact that only one study (i.e. Chen et al., 2016) controlled for social anxiety levels which had a moderating effect. Equally, these mixed findings may be attributable to the extensive methodological variation. Similarly, the inference drawn from these EMA studies, that anxiety is distinct in ASD, is potentially spurious because only Hare et al. (2015) included an autistic and neurotypical group for comparison, and the autistic group showed significantly higher and clinically significant levels of anxiety that could have confounded results. Moreover, the studies relied solely on self-report measures of anxiety, which are known to lack validity in ASD (Macneil et al., 2009). These potential validity issues notwithstanding, as concluded in the non-ASD literature (see Walz et al., 2014), assessing anxiety using EMA can be insightful but its burdensomeness should be considered. Future studies should follow Santangelo et al. (2013) comprehensive guidance to address such issues. EMA may offer particular value to this field due to it being well-suited to capture inter-individual and situational variation in anxiety triggers and symptoms (Walz et al., 2014), and in everyday thoughts, feelings, and behaviours of this characteristically heterogenous population, at least in individuals with high cognitive functioning (Hare & Chen, 2019).

As a means of future studies reducing their reliance on subjective measures of anxiety, physiological responses could be captured passively during everyday activities using wearables. This may prove synergistic in enhancing our understanding of anxiety in ASD, as has been apparent beyond ASD (Walz et al., 2014). Only one reviewed study (i.e. Hollocks et al., 2014) used a wearable to measure physiology-based anxiety, and this was during a laboratory-based stress induction task. It was found that autistic participants with anxiety diagnoses showed a blunted stress response relative to controls, indicative of chronic autonomic hyperarousal which has since been demonstrated in autistic individuals experiencing emotional difficulties (Patriquin et al., 2019). However, autonomic responsivity in individuals with anxiety diagnoses has been shown to vary as a function of the test and stimuli used, with ecologically valid assessment tools more likely to evoke hyperactivity (Hu et al., 2016). This ought to be investigated in ASD, especially considering conclusions from a recent review that, due to mixed findings, the physiological markers of anxiety in ASD remain uncertain but their identification may permit a nuanced assessment that can thus accommodate for the heterogeneity in ASD (McVey, 2019). With respect to the acceptability of wearables in autistic participants, while Hollocks et al. (2014) did not aim to investigate this and participants were only required to use the wearable for a limited period in a laboratory, assessment completion rates were high. A more recent review directly investigated this across a range of wearables and found that obtrusive devices including the chest strap device used by Hollocks et al. (2014) were least suitable for autistic individuals, and that less obtrusive wrist-worn devices were preferred by participants and comparable in terms of clinical validity (Taj-Eldin et al., 2018), but it must be considered that less obtrusive devices tend to provide less accurate data.

In summary, although more research is required to reach a definitive conclusion and to overcome the limitations common to all reviewed studies that are discussed later, the assessment studies collectively indicate that autistic individuals’ psychophysiological stress response may have a distinct signature that varies according to the presence of anxiety symptoms and can be feasibly assessed in lab-based and everyday settings. A complementary area requiring research attention is the use of digital technology to experimentally induce and thus assess state anxiety under controlled conditions. Only Liu et al. (2008) tested this, using a computerised cognitive task, and limitations in the reporting of the methodology rendered it difficult to draw firm conclusions from this study. Relatedly, no studies were identified that tested the use of VR for assessing anxiety except in the context of VR exposure therapy—an area lacking research outside of ASD despite its well-recognised potential (Firth et al., 2018; Freeman et al., 2017). Generally, while computerised anxiety-induction tasks for use in laboratory-based studies are increasingly being adopted, there is a relative paucity of studies investigating the use of digital technology to assess anxiety (Firth et al., 2018; Walz et al., 2014).

Summary and Critique of Evidence: Treatment Studies

Technology-based interventions appeared to broadly reduce anxiety levels in autistic individuals. Findings are analogous with the general literature in terms of the types of technologies and interventions used, the anxiety targeted, and outcomes achieved (e.g. Cieślik et al., 2020; Grist et al., 2019; Gujjar et al., 2019). Findings similarly correspond with the ASD-specific literature in that technology-aided interventions, involving overlapping technologies, have been shown to be feasible and acceptable for targeting ASD-related functioning (e.g. Sandgreen et al., 2020; Valentine et al., 2020).

Two main types of digital technology were used in the reviewed treatment studies: telemedicine and ESM. Telemedicine interventions provided preliminary evidence of efficacy, feasibility, and acceptability for targeting transdiagnostic anxiety in ASD. These results must be interpreted with caution though as the usability of technology was impactful or not recorded, participant retention was variable, and only Conaughton et al. (2017) were not rated weak and used multi-modal and multi-informant measures of anxiety that are more likely to produce valid results in autistic samples. The only technology-based interventions that produced null results were ESM studies that targeted anxiety via a learning demonstration, specifically using visual media (i.e. photos and videos) to display peer modelling and social scripts of medical visits (i.e. Isong et al., 2014; Johnson et al., 2014). However, this pattern in findings ought to be interpreted with caution, chiefly because one of the two studies was minimally informative due to fundamental methodological limitations and found peer modelling to reduce anxiety when displayed via video goggles as opposed to via a DVD (i.e. Isong et al., 2014), but also in light of the limitations common to all studies discussed subsequently. Additionally, although extensive evidence shows that the situational factors inherent to medical visits do evoke anxiety that the peer modelling and social scripts attempted to address, such as novel social situations, unfamiliar procedures, and unclear rules/expectations (e.g. Bearss et al., 2016; Robertson et al., 2018), only two small-scale studies have provided evidence for the use of social scripts as a means of addressing this and they also only focused on medical visits (Drake et al., 2012; Johnson & Bree, 2014). Further, there is a lack of convincing evidence for the use of such methods for improving social skills and behaviour in ASD (Kokina & Kern, 2010; Leaf et al., 2020). Therefore, it is undeterminable whether these null results are attributable to the technology used, the interventions themselves, the presence of ASD, or the methodologies employed.

All 3 studies that used ESM for targeting anxiety during medical visits were weak in quality which thus diminishes the certainty of efficacy findings. On the other hand, the reported feasibility indicators were promising, and interventions across methodologies that involved ESM consistently reduced anxiety in most participants. Studies that used VRCBT for treating specific fears/phobias (i.e. Maskey et al., 2014, 2019a, 2019b, 2019c), specifically using VR for exposure and relaxation tasks, demonstrated the highest feasibility and acceptability, and all were moderate-to-strong in quality. In the non-ASD literature, the effectiveness of using VR for treating anxiety disorders in this way is the most established finding (Freeman et al., 2017). However, while all ESM studies used multimodal (e.g. self-report and behavioural) and multi-informant (e.g. child and parental) measures of anxiety, none of the VRCBT studies used physiological measures. Future studies may consider measuring physiological responses because, unlike the subjective measures used, their validity cannot be compromised by the ASD-specific challenges in atypical behavioural presentations of anxiety (Magiati et al., 2017) nor impairments in emotional literacy and interoceptive abilities (Kinnaird et al., 2019; Nicholson et al., 2019; Palser et al., 2018). This may explain why Maskey, Rodgers, Grahame et al. 2019b found mixed findings across the subjective measures used. Furthermore, a recent meta-analysis indicates that physiological process measures better predict exposure therapy outcomes than those that are non-physiological (Rupp et al., 2017). However, as signposted by McVey (2019), the physiological markers of anxiety in ASD first need explicating to aid interpretability.

Future treatment studies should improve on blinding methods, use adequately powered RCTs with active comparators, include longer follow-ups (i.e. > 3 months), and monitor concurrent psychopharmacological treatment regimes. Lastly, it must be considered that in the only two treatment studies that tested technology and therapist-assisted CBT using an RCT design (i.e. Conaughton et al., 2017 and Maskey, et al., 2019b), improvements did not translate into loss of the targeted anxiety diagnosis for around 62% of participants. On the other hand, at least in neurotypical adults, remission rates for CBT vary and are moderated by the anxiety disorder diagnosis, and the number of sessions in both studies was lower than the 11 to 19 sessions recently deemed necessary for at least 50% of individuals to show clinically significant improvement (Levy et al., 2020). Furthermore, in autistic children (i.e. aged under 18 years) who were the common demographic across studies, long-term CBT interventions for anxiety are especially crucial and variability in outcomes may be attributable to the presence of atypical anxiety symptoms (Perihan et al., 2020). Future studies ought to take these factors into account where possible.

Overcoming Common Limitations of Reviewed Studies

To bypass the common limitations of the reviewed assessment and treatment studies, future research ought to consider the influence of gender, levels of anxiety, ASD symptomatology, ASD and anxiety diagnoses, and comorbidities, as well as the use of behavioural measures of anxiety that capture atypical presentations. Furthermore, the generalisability of findings to the ASD population is questionable due to samples being self-selected and almost invariably consisting of young people with at least average verbal intelligence, no reported learning disabilities nor severe or complex mental health conditions.

Strengths and Limitations of Review Methodology

A strength of the search strategy was that it was informed by guidance from Salvador-Oliván et al. (2019) and McGowan et al. (2016), although due to resource constraints the eligibility criteria was restrictive (see Online Resources 3 and 4) and potentially afflicted by publication bias so it may not have fully captured relevant studies. There was also a risk of bias at data extraction since authors were not contacted to provide missing or additional data and since only one author undertook data extraction—though the completed data extraction form was checked by another author. Additionally, the common rubric used for the feasibility and efficacy indicators during synthesis must be interpreted with caution due to the variation in the indicators and definitions used across studies. Bias may have further been introduced when studies were grouped for synthesis as this was decided post-screening due to the wide range potential of assessments/treatments, technologies and anxiety disorders, and pairings therein. Nonetheless, another strength of the review was the approach to understanding discrepancies and uncertainties in the results which entailed systematically revisiting all elements of the final data extraction form so as to notice any patterns or differences in methodologies employed that may have explanatory power and to identify any research gaps that may provide a fuller understanding. Additionally, inter-rater reliability was desirable at study selection and quality assessment. Moreover, this review is the first of its kind and fulfilled its objectives of synthesising current evidence, alongside identifying the technologies used, anxiety targeted, and future research avenues.

Conclusion

Due to the lack of available studies, methodological heterogeneity, low quality, and small sample sizes, strong conclusions cannot be drawn. Nevertheless, this is to be expected of a research field in its infancy, and the narrative synthesis has broadly indicated that technology-aided assessments and treatments may be feasible and effective at targeting anxiety in ASD. The field would benefit from studies with universal feasibility indicators and outcome variables, adequate power, improved blinding procedures, and more representative samples. Future research should employ multi-modal and multi-informant measures of anxiety and consider the influence of gender, levels of anxiety, ASD symptomatology, and comorbidities. A notable research avenue is the direct testing of VR for inducing psychophysiological symptoms of anxiety, which could aid our conceptualisation of anxiety that findings indicate is distinct in ASD and elucidate the treatment mechanisms of VR exposure therapy.