Background

It is widely recognized that pediatric populations are vulnerable to sudden and pervasive disruptions in their daily life, such as those brought by the COVID-19 pandemic [1,2,3,4]. Those with neurodevelopmental disorders (NDD), broadly defined as childhood-onset chronic conditions characterized by atypical brain development [5,6,7], have been identified by parents, educators, clinicians, and policy makers alike, as requiring specific attention given their preexisting behavioral, emotional, and learning difficulties. The gamut of such difficulties is often observed in individuals with autism spectrum disorder (ASD). A neurodevelopmental disorder, ASD, is defined by impairments in social communication, and restricted and repetitive behaviors/interests that are often accompanied by comorbid neurological and psychiatric conditions, as well as varying degrees of speech-language and intellectual abilities [8,9,10,11,12,13,14]. Accordingly, a number of narrative articles and reviews have highlighted the burden on mental health that the disruptions in educational and treatment services, routine changes, and social isolation have posed for youth with ASD and related NDD [4, 15,16,17,18,19,20,21,22]. Here, we report on an international empirical effort aimed at assessing the heterogeneity of the impact of the COVID-19 pandemic in a large international sample of previously well-characterized youth with ASD and/or other NDD.

Our decision to focus on the heterogeneity of the pandemic outcomes was originally based on prior disaster research in the general population, showing that the degree of severity of prior mental illness, disaster exposure, as well as perceived risk, are predictors of negative outcomes [23,24,25]. These findings have been echoed, to some extent, in initial studies of the COVID-19 pandemic in ASD/NDD youth; these have documented care service disruptions [26,27,28,29] and negative mental health impact [4, 27, 29,30,31,32,33,34,35,36,37,38,39,40]. Most of them emphasized main group effects. Yet, findings suggest significant heterogeneity in outcomes. For example, 70% to 92% of those diagnosed with ASD/NDD have lost at least one special education or therapeutic service [26,27,28, 30], but the number of services lost during the pandemic has varied across individuals. Similarly, transitions to online service provision have occurred, but have only involved relatively smaller groups [26, 27]. In one study, approximately two-thirds of parents indicated that service disruptions negatively impacted their children’s functioning to varying degrees [27]. In parallel, studies documenting mental health difficulties varied in the symptom domain(s) examined, ranging from behavioral and/or emotional problems [30, 31, 36, 38, 39], ASD symptoms [28, 30, 39], living skills [32, 35], difficulties, and/or sleep disruptions [30, 32, 36, 40]. For example, the first study of the pandemic in children with ASD [32] reported more intense and frequent behavioral problems (in 36% and 42% of the sample, respectively). An additional study reported the onset of disruptive behaviors, anxiety, sleep problems, and irritability in varying proportions of children, ranging from 11 to 28% of the sample [34]. Overall, the scientific literature suggests that, for ASD/NDD youth, different profiles of changes in mental and service access from pre- to pandemic time occur. However, the concomitant pattern of variability in these two domains remains unclarified.

Understanding outcome heterogeneity in both service and clinical symptom changes is a necessary step to identify which individuals have greater needs. This may also facilitate the identification of protective and risk factors. Toward these goals, we adapted the Coronavirus Health and Impact Survey Initiative (CRISIS) [25] for Autism and Related Neurodevelopmental Conditions (AFAR). CRISIS was originally designed to capture the multifaceted nature of risk in the general population during the COVID-19 pandemic by quantitatively assessing life changes and perceived risk about COVID-19, as well as mental health before and during the pandemic. Previous work in the general population established the psychometrics of CRISIS as well as its feasibility in delineating distinct life stress profiles and their predictive role in mental health outcomes [25, 41]. The CRISIS adaptation for autism and related neurodevelopmental conditions aimed to quantify both changes in symptom domains affected by or known to impact daily life, along with changes in therapeutic services. Preserving the original structure of CRISIS also allowed a comprehensive assessment of a range of disaster- and clinically related potential predictors of pandemic-related outcomes—a topic that is emerging in the literature, likely becoming a critical focus in years to come [28, 32, 34].

To enhance the scope of AFAR assessments, we formed an international, collaborative network of investigators to collect AFAR data in clinically well-characterized youth with ASD/NDD diagnoses. Unlike most prior ASD/NDD pandemic-related work which samples relatively homogeneous geographical regions, our aggregating data from multiple, international sites offered a naturalistic observational framework to quantify the degree of disaster exposure (i.e., new COVID-19 case rates and local containment measures) as a potential predictor of outcomes. This complemented the AFAR assessment of pandemic-related experiences, pre-pandemic mental health and child characteristics, thus yielding a comprehensive investigation of putative predictors of the pandemic’s outcomes.

Additionally, the aggregation of data across the AFAR network allowed us to rapidly generate a sample of individuals with ASD/NDD richly characterized by clinicians, at a scale not readily achievable otherwise. Given the multidimensional nature of the outcomes and predictors examined, we leveraged multivariate data-driven approaches in a large sample (n = 1275) of cross-sectional AFAR data collected over the initial phase of the COVID-19 pandemic. Against this background, the present study aimed to identify potential subgroups of outcome and their predictors to inform recovery efforts and prepare for future crises.

Methods

Survey development and structure

A workgroup of ASD/NDD experts (ADM, LG, SG, PP, AT, BV) led the adaptation from the CRISIS Parent/Caregiver Baseline Form. Adapting the caregiver survey was prioritized in order to immediately capture the acute phase of the pandemic across the widest possible range of ASD/NDD youth using the same method. The adaptation aimed to include assessments of clinical domains relevant to ASD/NDD and services, while maintaining the existing structure of CRISIS [25, 42]. At the time, empirical evidence on the impact of disasters on ASD/NDD was limited to one study reporting worsening in adaptive functioning [43]. Therefore, along with adaptive skills, we prioritized the assessment of symptoms known to be affected by or to impact adjustment needs. These encompassed restricted and repetitive behaviors/interests (RRB) [44,45,46], externalizing, internalizing symptoms, and sleep problems that often co-occur in ASD/NDD [8, 12]. Parent/caregiver questions were developed to target observable behaviors rather than attempting to seek reports on internal states. In keeping with this goal and to contain the survey’s length, new questions on a range of co-occurring psychiatric symptoms replaced the Mood State domain. The Substance Use domain was also removed given that it is more accurately measured by self-report, which is beyond the scope of the present study [47, 48]. Like CRISIS, symptoms were rated on a Likert-scale based on (1) the three months prior to the start of COVID-19 pandemic in the respondent’s geographical area and (2) the two weeks prior to completion (Prior and Current time points, respectively).

To evaluate changes in service access, we derived items from a survey developed during the pandemic and piloted with people with syndromic intellectual disabilities and their caregivers [26]. Questions queried changes in therapeutic services typically received both within and outside school settings in the respondents geographical area, following the start of the pandemic. The remainder of the original CRISIS was unchanged, except for some rewordings or additional response options (e.g., sleep problems) as summarized in Table 1 and detailed in Additional file 1: Methods.

Table 1 CRISIS AFAR Parent Baseline Survey v0.5.1 (3–21 years): Domains and Items

Like CRISIS, questions of the parent/caregiver survey were developed for individuals aged five to 21 years; a later review identified a subset of questions developmentally applicable for children as young as three years. The initial adaptation was developed in English and then translated into five other languages and updated with rewordings in consultation with the larger AFAR network. The final version 0.5.1 of AFAR Parent/Caregiver Baseline Form (3–21) included 96 independent items, with 34 questions asked twice for Prior and Current time points. Clinical contacts with caregivers of youth with ASD/NDD during the acute phase of the pandemic informed the adaptation process; at its completion, several sites reached out to caregivers of ASD/NDD youth to review the survey; they favored its distribution for data collection (see Additional file 1: Methods for more details). AFAR is freely available for use by other investigators [42].

Following the AFAR data collection, we used exploratory and confirmatory factor analyses to identify the underlying structure of each of the domains developed a priori and to yield summary scores for quantitative investigations (see Additional file 1: Methods for more details). As detailed in Additional file 1: Results and in Tables S5 and S6, factor analyses yielded a stable and replicable factor structure, including a single factor for Adaptive Living Skills, two factors RRB largely reflecting lower- and higher-order RRB [46, 49], and four for Co-occurring Problem Behavior (i.e., Anxiety/Affect, Oppositional Behavior, Sleep Problems, Activity/Attention), among the AFAR-specific domains. Further, consistent with the general population [25], results yielded a single COVID-19 Worries factor, and multiple factors for the Life Changes and Behavior/Media domains (two and five, respectively).

Data collection and selection

AFAR Parent/Caregiver Baseline surveys were collected in 15 samples at 14 research and/or clinical institutions in Europe and North America. Data were collected cross-sectionally over the first six months of the pandemic (April–October 2020; Fig. 1, Additional file 1: Methods and Table S1). Parents of children aged three to 21 years with previously established clinician-based DSM-IV/5 [6, 50] or ICD-10 [5] diagnosis of ASD and/or other NDD were invited to complete AFAR. As discussed below, only data from individuals aged five years and older are included in the present analyses.

Fig. 1
figure 1

Data collection times across contributing samples. Data collection time periods for each contributing sample are color coded by country. Specific geographical regions for each sample are also indicated as state, or region. See Supplementary material in Additional file 1: Methods and Table S1 for details on data collection protocols. NY, New York; MO Missouri; CA California; ON Ontario; WHO World Health Organization

Along with diagnostics, when feasible, previously collected information on intellectual functioning and symptoms was shared. IRB approval for collection and sharing of de-identified AFAR and related data was obtained at each institution. For the present analyses, only data from AFAR surveys with available AFAR responses for the variables used in clustering analyses and completed within the time interval of 90% of a given sample were included. This served to minimize outliers regarding to COVID-19 infection rates and related responses.

Analyses

Overview

As shown in Table 2, to assess the heterogeneity of outcomes in symptom and service changes between prior and pandemic times, primary analyses leveraged data-driven hierarchical agglomerative clustering [51, 52] Clustering of the pandemic symptom changes was complemented by analyses of symptom and service changes across the whole aggregate sample (i.e., the sample derived from aggregating all eligible data) intended to further aid interpretation of results and comparisons with the literature, as well as to underscore the relevance of assessing heterogeneity. These whole-group-level analyses included repeated measures MANCOVA for symptom severity, measures of central tendency were used for access to services (Additional File 1; Methods). Following the identification of distinct outcome subgroups, we assessed and ranked a range of variables indexing pre-pandemic characteristics, as well as pandemic-related experiences and environments as their predictors. Analyses were corrected for multiple comparisons using false discovery rate (FDR) at q < 0.05, as applicable. The codes used in analyses are available in GitHub [53].

Table 2 Overview of analytical strategy and key results

Hierarchical clustering

Outcome heterogeneity was assessed across 11 features indexing symptom and service changes pre-pandemic to pandemic time using agglomerative hierarchical clustering [51, 52]. Among these features, seven features reflected changes in clinically relevant symptoms for ASD/NDD. They indexed differences between Current and Prior scores on the symptom domains that were empirically identified with factor analyses (see Additional file 1: Methods and Tables S5–S6). The remaining four features included the total number of services that were either lost or continued within and outside school. All scores were converted to standard z scores prior to clustering; thus, clusters (i.e., subgroups) were characterized by their profile of deviation from the aggregate sample average. The optimal cluster solution was determined using NbClust [54], according to a majority rule among multiple goodness-of-fit measures.

Random forest

To assess the relative contribution of multiple variables (i.e., features) as predictors of the outcome subgroups identified in hierarchical clustering, we used random forest classification. As depicted in Fig. 2, we assessed 20 features including family and child variables measuring pre- and pandemic-related experiences derived from AFAR, prior clinical characterization at each contributing sample (e.g., ASD vs. non-ASD NDD, number of psychiatric comorbidities), as well as COVID-19 pandemic environmental markers, empirically derived, for each individual, from open-data sources providing information by geographical area over time [55,56,57]. The variables derived from parent responses in AFAR included: socio-demographics, the child’s pandemic experiences, pre-pandemic services received, and a summary measure of clinical severity Prior (baseline) to the pandemic. This global severity summary score, computed across the seven AFAR symptom domains, was significantly correlated with previously collected standardized symptom measures (see Additional file 1: Fig. S4). The COVID-19 pandemic environmental markers included government responses, and new COVID-19 infection rates for each child in their geographical area at the time of the data collection. Briefly, to quantify new infection rates in a given child’s geographical area at the time of the AFAR data collection, we used the publicly available from Our World in Data’s (OWID) COVID-19 tracker [56] and COVID-19 European regional tracker [57]. To quantify government responses to the COVID-19 pandemic, for each child at the time of the AFAR data collection in their geographical area, we used the government stringency (GS) index computed by the Oxford Coronavirus Government Response Tracker [55]. The GS index combines metrics of infection containment and public information campaigns. It ranges from 0 to 100, higher scores reflect stricter government policies, and it is provided by day, in each territory. See Additional file 1: Methods for more details.

Fig. 2
figure 2

Features selected for predicting COVID-19 Impact Subgroups. The Venn diagram shows the 20 features examined as potential predictors of the four COVID-19 impact subgroups with random forest classification. Each feature is organized across three partially overlapping domains: child characteristics before the pandemic (red); COVID-19 pandemic experiences and environment (yellow); and family/household characteristics (blue)

We quantified the relative predictive role of the 20 features examined following a previously established analytical framework [58,59,60]. Briefly, using a permutation importance method [60], we indexed importance as the average out-of-bag-error (OOBE), computed across 4000 bootstrapped samples (2/3 training, 1/3 testing). The larger the OOBE value, the greater the importance for a given feature. These features were ranked in decreasing order of importance according to the obtained OOBE values (see Additional file 1: Methods for details).

Results

Aggregate sample characteristics

Data from 1275 youth, 5- to 21-year-old, aggregated across the 15 contributing samples met inclusion criteria for subsequent analyses. As shown in Figs. 1 and 3, contributing samples were from Europe (49%) and North America (51%). They included seven samples collected across research and clinical institutions in Italy, one each from Greece and Ireland, four samples collected in the USA (two in New York, and one each in California, and Missouri), and one sample collected across five centers in Ontario, Canada, as part of a COVID-19 multi-network collaboration in pediatrics, including the Province of Ontario Neurodevelopmental Disorders COVID Mental Health collaboration (POND-CMH) Network [61,62,63]. Key demographics (age and sex a birth), diagnosis, and intellectual functioning are summarized in Fig. 3 for the aggregate and each contributing sample. Most individuals (79%, n = 1004) met diagnostic criteria for ASD; 17% (n = 214) had ADHD without ASD, and 4% (n = 57) had other NDDs, without ASD. Reflecting the known male bias of ASD/NDD [64, 65], the sample predominantly included males (88%). Among 938 (80%) youth with available intelligence estimates, 62% (n = 624) were in the Average/Above Average range, 14% (n = 141) in the Borderline range, and 24% (n = 173) had mild to profound intellectual disability. Further, over half (64%, n = 811) of the caregivers had at least a college degree; the remaining had either a high school degree (30%), or elementary education (6%). Fifty-six percent of the aggregate dataset was of European/British ancestry (see Additional file 1: Table S4). Information on prior (i.e., baseline) AFAR symptom factor severity and number of services received within and outside school are in Fig. 5; complementary characteristics are in Additional file 1: Results, Figures S1, S2 and Tables S2–S4.

Fig. 3
figure 3

Characteristics of aggregate and each contributing sample. Age distribution (box plots), proportion of males and females, primary diagnoses, and intellectual functioning (stacked bar plots) are depicted for each of the n = 15 contributing samples, as well as for the aggregate sample (i.e., the dataset resulting from combining all contributing samples). CMI-AC Child Mind Institute-Autism Center; CMI-HBN CMI Healthy Brain Network; TC Thompson Center; UCSF University of California San Francisco; CADB Center for Autism and Developing Brain, Weill Cornell Medical College/New York Presbyterian Hospital; UAth University of Athens, National & Kapodistrian University of Athens, School of Medicine, First Department of Pediatrics, Unit of Developmental and Behavioral Pediatrics. “Aghia Sophia” Children’s Hospital; POND-CMH Province of Ontario Neurodevelopmental Network, COVID Mental Health collaboration; TCD Trinity College Dublin; UCA University of Cagliari, Child & Adolescent Neuropsychiatry Unit, A.Cao Paediatric Hospital; UBA University Bari, Child Neuropsychiatry Unit, Policlinic of Bari; UFII University of Naples Federico II, Child and Adolescent Neuropsychiatry Unit; SMF Stella Maris Foundation, University of Pisa; UTV University Tor Vergata; USS University of Sassari, Child Neuropsychiatry Unit, Azienda Ospedaliero-Universitaria; UONPI-LO Unita' Operativa di Neuropsichiatria dell’ Infanzia e dell' adolescenza, Lodi; ASD Autism Spectrum Disorder, ADHD Attention-Deficit/Hyperactivity Disorder; and ID Intellectual Disability. See Table S1 and Methods in Additional file 1 for details on data collection protocols

Pandemic outcome subgroups

Based on the NbClust approach, the four-cluster solution resulting from hierarchical clustering was optimal (see Figs. 4 and 5 and Additional file 1: Table S7). In other words, clustering analysis yielded four subgroups of ASD/NDD youth capturing distinct outcome profiles of symptom and service changes (Fig. 4). Based on their profile of deviations from the aggregate’s average of symptoms and/or service changes, the four outcome subgroups were defined as: broad symptom worsening only (20%), primarily modified services (23%), primarily lost services (6%), and average symptom/service changes (53%). The subgroup defined as broad symptom worsening only was characterized by worsening across all symptom domains as indexed by z scores ranging between 1 and 3 (i.e., 1 to 3 standard deviations above the aggregate’s mean) but by marginal service changes relative to the aggregate’s average (i.e., within 0.5 standard deviation from the aggregate’s average). The three remaining subgroups, totaling n = 1024 (80% of the aggregate), showed symptom changes within the aggregate sample’s average (z scores < 0.5) but differed in service access. As indicated by their name, one subgroup had the most services modified, another had the most services lost, and the third subgroup had the number of services lost and continued like those of the aggregate. These differential profiles of changes in symptom and services were confirmed by one-way ANOVAs and Tukey pairwise group mean comparisons (FDR q < 0.05; Additional file 1: Table S7). Since, as expected by our naturalistic study design, differences in subgroup distributions existed between contributing samples (Additional file 1: Fig. S3), we conducted follow-up analyses including “contributing sample” as covariate. Statistical group differences were unchanged. Similarly, secondary cluster analyses on data subsets distinct by survey completion rates yielded similar results (Additional file 1: Results).

Fig. 4
figure 4

Clustering results and COVID-19 Impact Subgroup Patterns. a The dendrogram shows the optimal 4-cluster solution of COVID-19 Impact including: broad symptom worsening only (n = 251; 20%, yellow), primarily modified services (n = 293; 23%, blue), primarily lost services (n = 78; 6%, green), and average symptom/service changes (n = 653; 53%, red). b Groups means and standard error bars of the z scored symptom factor changes (difference between Current and Prior scores; or Δ) and number of services lost or modified in and outside (Out.) of school are shown for each cluster (i.e., outcome subgroup). The dotted gray horizontal line at a z score 0 represents the average of the aggregate sample (N = 1275) across each variable examined. Abbreviations: Adaptive Liv., Adaptive Living skills; RRB-LO, Restricted and Repetitive Behaviors—Lower Order; RRB-HO, Restricted and Repetitive Behaviors—Higher Order; Activ-Inatt, Activity Inattention; Sleep Prob., Sleep Problems; Out., Outside; and Mod., Modified

Fig. 5
figure 5

Symptom and service access change pattern. a For each of the four outcome subgroups (color coded in the legend) and the aggregate sample (in gray), plots depict groups means and standard error bars of prior (T1) and current (T2) symptom raw scores across the seven factors examined. High scores indicate greater severity/impairment. The scores from the adaptive living skills domain were multiplied by minus 1 in order to follow the same direction as the other factors. b The bar height represents the mean total number of services received prior to the pandemic for each subgroup (color coded in the legend), as well as for the aggregate dataset (gray) at school (left plot), or outside school (right plot). The dotted pattern within each stacked bar illustrates the group mean number of services lost, the striped pattern depicts the group mean number of services modified

Finally, Additional file 1: Table S8 provides descriptive statistics and subgroup comparisons regarding key demographics (age and sex at birth) and intellectual functioning. As detailed in Supplementary Results in Additional file 1, the four subgroups did not significantly differ in sex distribution; there was a main effect of subgroup for age with the subgroup average service and symptom change being significantly older (11.6 ± 3.5 years) than the three other subgroups which included children in mid-childhood (mean and SD range across subgroups: 9.7–10.8 ± 3.1–3.8 years). Most youth fell in the average/above average intelligence category across subgroups except for the primarily lost services subgroup, which was characterized by significantly greater intellectual impairment.

Whole-sample average impact

In the aggregate sample, one-way repeated measures MANCOVA yielded a significant main effect of time across the seven AFAR symptom factor scores (F(7,2548) = 77.40, FDR corrected p < 0.001). Follow-up comparisons for each symptom factor revealed statistical significance worsening of sleep problems (prior = 4.9 ± 2.1, current = 5.2 ± 2.3; F(1,2548) = 8.158, FDR corrected p = 0.03; see Fig. 5). Over the initial phase of the pandemic, on average the aggregate sample lost 1.2 ± 1.7; 1.3 ± 2 services, within and outside school. One service, on average, continued either within or outside school (1.1 ± 1.4; 0.8 ± 1.3, respectively), mostly continuing via telehealth/email (88% in school and 69% outside of school). More details on service changes are in Additional file 1: Table S7 and Supplementary results.

Predictors of outcome subgroup

The RF classification model predicted subgroup membership with 81% accuracy (precision/sensitivity = 82%, recall/specificity = 75%). The top-ranked predictor (OOBE: 12%) was the number of services received at school before the pandemic (Fig. 6a). Six predictors followed, with OOBE ranging from 6 to 1%. In rank order, they included the number of services received in and outside of school, GS index, Lifestyle Stress, COVID-19 Worries, new COVID-19 infections, and child’s age. The remaining features had OOBE < 1% and, thus, were considered negligible predictors.

Fig. 6
figure 6

Random forest feature importance ranking and top-ranked features by subgroup. a Feature (predictor) ranking by importance indexed by mean out-of-bag errors (OOBEs) is shown in descendent order. b The radial plot shows the z-scored group means across the eight top-ranked predictors color coded by outcome subgroup (Yellow = broad symptom worsening only; Red = average symptom/service changes; Blue = primarily modified services; Green = primarily lost services)

Across the top seven predictors, each outcome subgroup was characterized by unique combinations of increases or decreases relative to the other subgroups. For example, the primarily modified services subgroup had a greater number of services before the pandemic relative to the broad symptom worsening only and to the average symptom/service changes subgroups. This latter subgroup, however, had lower ratings of COVID-19 Worries and Lifestyle Stress and included older youth in comparison with the other three subgroups. The primarily lost services subgroup had a similarly higher number of pre-pandemic services received but had the lowest GS index and COVID-19 new infection rates, as well as lower COVID-19 Worries and Lifestyle Stress relative to the other subgroups. Notably, the broad symptom worsening only subgroup had the highest COVID-19 Worries and Lifestyle Stress ratings, even with lower infection rates and GS index relative to the primarily modified services subgroup. These subgroups’ differences across predictors were confirmed by one-way ANOVAs and Tukey pairwise group mean comparisons (FDR q < 0.05; see Table 3). Since differences in subgroup distributions existed between contributing samples (Additional file 1: Fig. S3), we conducted follow-up analyses covarying for “contributing sample” in comparisons for the total baseline services received, Lifestyle Stress, COVID-19 Worries, and child age, but not for GS index and COVID-19 rates, as they reflect known differences across contributing samples per study design; statistical subgroup differences remained unchanged.

Table 3 Group means of top eight predictors in the aggregate sample and by subgroups

Notably, the predictor ranked eighth—i.e., pre-pandemic global severity—negligibly contributed to the classification model. Nevertheless, for illustrative and interpretation purposes, it is plotted in Fig. 6b and its descriptive statistics for each of the subgroups are summarized in Table 3. Similar to intellectual functioning, the four outcome subgroups did not differ with respect to pre-pandemic global severity except for the primarily lost services subgroup which was significantly more severe than the other three subgroups. Given that the primarily lost services subgroup was characterized by significantly greater intellectual impairment than the other three, we repeated supplementary RF analyses including intellectual categories in the subset of youth with intellectual functioning category available (n = 926). As detailed in Supplementary Results in Additional file 1, results were virtually unchanged relative to those from primary analyses; the feature indexing intellectual functioning had negligible predictive value for subgroup membership.

Discussion

Prior disaster research in the general population, including studies of the COVID-19 pandemic [25, 66, 67], has consistently highlighted heterogeneity of mental health outcomes. The present study extends this insight into a multinational large sample of ASD/NDD youth by concomitantly assessing variability of changes in symptoms and therapeutic service access over the early stage of the pandemic. Across contributing samples, data-driven analyses identified four ASD/NDD subgroups: broad symptom worsening only (20%), primarily modified services (23%), primarily lost services (6%), and average symptom/service changes (53%). Their profiles revealed that symptom and service changes have distinct patterns of covariation among youth with ASD/NDD. The subgroup with notable clinical worsening neither had the greatest number of services lost nor the greatest number of continued services, suggesting that other contributors to clinical worsening also exist. Conversely, youth with relatively stable symptoms were parsed into three subgroups that differed in service access changes going from pre-pandemic to pandemic times. Recognizing these subgroups elucidated unique effects of a set of predictors and highlighted different pathways to either stable or worsening clinical presentations. Indeed, the varying pandemic impact on symptoms in ASD/NDD youth was predicted by unique combinations of universal and ASD/NDD-related pre- and pandemic contexts in which service changes occur, rather than any one characteristic of the child, their family, or their environment alone.

Our results underscore that solely focusing on group-level effects leads to an incomplete picture of the COVID-19 pandemic impact on ASD/NDD youth. At the whole-sample level, multivariate analyses revealed that only sleep problems significantly worsened going from pre-pandemic to pandemic times. While results for this group-level approach confirm earlier pandemic reports of increased sleep problems in ASD/NDD [30, 36, 68], they failed to recognize a more vulnerable subgroup (i.e., broad symptom worsening only outcome subgroup). Indeed, cluster analysis revealed that 20% of the children worsened even more than their ASD/NDD peers. Significant worsening affected a broad range of symptoms, including sleep problems, and beyond—i.e., externalizing and internalizing symptoms, RRBs, and daily living skills. For the remaining participants, symptom changes from pre-pandemic to pandemic times were within the aggregate average picture—i.e., statistically significant increases in sleep problems with other symptom severity being relatively stable. Consistent with prior literature [26,27,28], most youth in our ASD/NDD sample experienced a variety of service disruptions across settings, resulting in loss or modification of services (e.g., telehealth). However, youth in our aggregate sample experienced notable variability in service access. The present study found that the association between service and symptom changes is complex. In fact, clustering individuals across both services and symptom changes allowed us to further parse the relatively clinically stable youth into three homogeneous subgroups differing by service access changes. This enabled a fine-grained identification of predictors of differing pandemic outcomes—i.e., distinct profiles of symptom and service access changes across the four subgroups.

In the present study, the most relevant predictors of outcome subgroup membership included factors shared with the general population [23, 69], as well as others. Specifically, elevated perception of COVID-related risk (indexed by the COVID-19 Worries factor) and pandemic-related lifestyle stressors (i.e., restrictions on leaving home, cancellations of important events) predicted broad and more severe symptom changes in youth with NDD/ASD (i.e., the broad symptom worsening only outcome subgroup). Beyond these universal stressors, the present work identified age and a set of pandemic-related and NDD/ASD-specific experiences and contexts that must be considered collectively to provide a meaningful picture of pandemic impact in NDD/ASD youth (i.e., number of services received prior to the pandemic, COVID-19 containment measures, and COVID-19 new rates). These findings indicate that, in middle childhood, a greater number of baseline services may foster resilience and be protective to broad symptom worsening during a disaster, at least over its initial phase. Additionally, for those in middle childhood and living in areas with greater restrictions at times of high COVID-19 rates, continuing services, even if in modified format, may lead to a relatively more stable clinical profile.

Of note, pre-pandemic global clinical severity and other features related to the youth clinical presentation, including intellectual skills, negligibly contributed to prediction. Most children across subgroups had equivalent baseline symptom severity and average intelligence, except for the primarily lost services subgroup which was characterized by greater pre-pandemic impairment. These findings are in contrast with earlier ASD studies [28, 32] suggesting that preexisting challenges or symptom severity are associated with greater impairment during the pandemic. In considering possible explanations, we note that unlike the present efforts, prior NDD/ASD work focused on samples from relatively narrow geographical areas with largely similar COVID-19 rates and institutional containment responses. Additionally, inter-study differences in sample severity may exist; future studies focusing on a wider range of severity ratings are needed to build upon the present findings. Finally, neither specific NDD diagnosis nor psychiatric comorbidity’s burden robustly contributed to prediction of outcome subgroups. Thus, although our sample included predominantly autistic youth, considering the breadth of clinical impairment typically observed in ASD, our results suggest that the insights gained from the present effort can inform other NDD more broadly.

Limitations

Although this is the most comprehensive and systematic study of the covariation of symptom and service changes in youth with NDD/ASD during the pandemic, results should be interpreted in light of several limitations. First, considering time constraints on questionnaire completion, albeit comprehensive, AFAR could not assess all possible domains of impact and/or prediction. For example, symptoms least expected to change over a short period of time, were given lower priority, most notably, social communication impairments [70]. Given the protracted nature of the pandemic, future studies should include assessment of longer-term changes in social communication skills. Similarly, although family demographics, parent education, and parent being an essential worker were assessed and included in our predictive model, parent’s mental health, recently reported to relate to children’s outcomes during the pandemic [71,72,73,74], was not assessed. Second, because of the urgency of the pandemic, we did not systematically involve other stakeholders in the survey adaptation process. However, the input of caregivers of children with ASD was accounted for in a number of informal ways during and at completion of the survey. Third, although the aggregate sample included youths with clinician-based diagnoses, previously collected measures of severity varied by contributing sample, and assessments of the role of prior severity were based on parent responses in the AFAR survey. Nevertheless, we found that the AFAR baseline global severity scores correlated with available standardized measures. Fourth, although relatively large, our aggregate convenience sample could not address all demographics. For example, females were underrepresented, and preschoolers were not included. Although consistent with a systematic review of pandemic studies in youth [4] sex at birth was not a robust predictor of impact, given recent reports of a male to female ratio of 3:1 in ASD [64], future studies should oversample females. Fifth, the present study focused on impact over the first six months of the pandemic using a cross-sectional design. Longitudinal coordinated study designs and infrastructures (e.g., common assessment measures) are needed to capture long-term outcomes and define stability of the subgroups over time. Finally, this first study relied exclusively on parent reports in order to capture the acute phase of the pandemic across a widest possible range of ASD/NDD youth and abilities using the same approach. Future studies should include self-report and/or direct observation to complement the parent’s perspective.

Conclusions

As in the general population, the COVID-19 pandemic impact varies across ASD/NDD youth. Risk and resilience are rooted in the pre- and pandemic contexts in which service disruptions occur. Provision of mental healthcare in preparation for and during disasters are critical for ASD/NDD youth, further motivating efforts assessing effectiveness for telehealth and/or hybrid treatment programs. As heightened perception of risk was among the predictors for broad symptom worsening, during disasters, special attention should be paid to how much youth are concerned about a current crisis, and guided access to clear and appropriately dosed information is needed. Finally, this study demonstrates the value of international data sharing and collaborations. It also underscores the need for increased global coordination to include common assessment protocols and data structures to facilitate data sharing and analysis aimed to more readily assess and address the needs of those most vulnerable.