Screening for Attention Deficit Hyperactivity Disorder in Young Autistic Adults: The Diagnostic Accuracy of Three Commonly Used Questionnaires

Palmer, Melanie; Fang, Zhaonan; Hollocks, Matthew J; Charman, Tony; Pickles, Andrew; Baird, Gillian; Simonoff, Emily

doi:10.1007/s10803-023-06146-9

Screening for Attention Deficit Hyperactivity Disorder in Young Autistic Adults: The Diagnostic Accuracy of Three Commonly Used Questionnaires

Original Paper
Open access
Published: 28 October 2023

(2023)
Cite this article

Download PDF

You have full access to this open access article

Journal of Autism and Developmental Disorders Aims and scope Submit manuscript

Screening for Attention Deficit Hyperactivity Disorder in Young Autistic Adults: The Diagnostic Accuracy of Three Commonly Used Questionnaires

Download PDF

1437 Accesses
1 Citation
3 Altmetric
Explore all metrics

Abstract

Objective

Attention Deficit Hyperactivity Disorder (ADHD) is a common co-occurring condition in autistic individuals. ADHD is sometimes first recognised in young adulthood because ADHD symptoms may be misattributed to autism due to superficial overlap in presentation and diagnostic overshadowing. It should be investigated whether ADHD questionnaires are accurate in screening symptoms in young adults with autism. The current study examined this.

Methods

Participants were autistic young adults (N = 119) who took part in the Special Needs and Autism Project (SNAP), a population-based cohort. ADHD research diagnoses were obtained through the parent-informed Young Adult Psychiatric Assessment. Parents and young adults (self-report sample N = 71) completed ADHD questionnaires (Aberrant Behavior Checklist hyperactivity/non-compliance subscale, Conners Adult ADHD Rating Scales ADHD Index, and Strengths and Difficulties Questionnaire ADHD subscale). Receiver operating characteristic analyses were conducted to explore if the questionnaires discriminated ADHD cases from non-cases. To assess whether results varied by intellectual functioning, subgroup analyses were completed for those with an IQ ≥ 70 vs. <70.

Results

Weighted ADHD rates were high. Overall although the measures were performing at or close to adequate levels (area under the curve was 0.66 to 0.79 for parent-report and 0.70 to 0.65 for self-report), no single measure met adequate thresholds for sensitivity and specificity simultaneously. Tool performance was not different for those with an IQ ≥ 70 vs. <70.

Conclusion

No single measure reported adequate performance for distinguishing ADHD from non-ADHD cases in this sample of young autistic adults. Use of current thresholds may lead to under-diagnosis.

The predictive validity of the Strengths and Difficulties Questionnaire for child attention-deficit/hyperactivity disorder

Article 15 September 2018

Predictive validity of attention-deficit/hyperactivity disorder from ages 3 to 5 Years

Article Open access 07 March 2021

ADD-H-Comprehensive Teacher’s Rating Scale (ACTeRS): A Measure for Attention Deficit Hyperactivity Disorder Among Children with Intellectual Disability in India

Article 30 September 2014

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Attention Deficit Hyperactivity Disorder (ADHD) is a neurodevelopmental condition characterised by impairing problems with inattention and/or hyperactivity/impulsivity that are beyond what is expected for an individual’s developmental level (American Psychiatric Association, 2013). ADHD is usually recognised in early childhood with symptoms remaining into adulthood (particularly inattention and impulsivity) with some reduction in level (Willcutt et al., 2012; Wootton et al., 2022). Although typically conceived as a childhood-recognised condition, it is now agreed that ADHD can first be recognised in adolescence or young adulthood (Sonuga-Barke et al., 2022).

There are several possible reasons why many individuals with ADHD are not identified until later in life. One possibility is that there may be truly adult-onset cases, although this view is still controversial (Sonuga-Barke et al., 2022). Another possibility is that onset was during childhood, but ADHD symptoms are not recognised until later due to a range of other factors. In support of this there is some evidence that late-recognised ADHD appears to differ slightly from early-recognised ADHD in that symptoms seem less severe, and individuals may have higher intellectual ability (e.g., Asherson & Agnew-Blais, 2019). Given the acknowledgement of late-recognised ADHD, it is important that instruments used for measuring ADHD, mainly developed for use with children, are accurate in identifying symptoms in these age groups. In addition, young adults may be capable of reliably self-reporting on their own symptoms, adding another source of information for clinical consideration.

Autism Spectrum Disorder (hereafter referred to as autism as it is the term preferred by the autistic community, Kenny et al., 2016) commonly co-occurs with ADHD. Autism is characterised by atypicality in social interaction and communication and repetitive, restricted, stereotyped patterns of behaviours and interests (American Psychiatric Association, 2013). ADHD occurs at rates of approximately ~30% in autistic young adults (Lever & Geurts, 2016; Rong et al., 2021). There is evidence of high stability of ADHD symptoms in autistic young people (Carter Leno et al., 2022).

Some autistic individuals with ADHD also do not have their co-occurring condition recognised until late adolescence or young adulthood. One reason is due to diagnostic overshadowing, which may occur when ADHD symptoms are misattributed to other developmental delays (Mason & Scior, 2004), such as autism, because of superficial overlap in symptoms. For example, it can be difficult to differentiate autistic mannerisms from fidgeting, a symptom of ADHD; or lack of understanding of social rules characteristic of autism from displays of impulsivity, such as interrupting conversations. Between childhood and adolescence, Hollocks et al. (2022) found that 16% of autistic young people received an ADHD diagnosis, which may in part be due to diagnostic overshadowing. Another possibility for late recognition may be the increased accuracy in diagnosing individuals with the inattentive type, because they are more capable of describing their own internal states as their age increases. It is therefore important that ADHD screening instruments are accurately picking up ADHD in autistic young adults. Furthermore, as many autistic individuals also have a co-occurring Intellectual Disability (ID) (e.g., Charman et al., 2011; Christensen & Zubler, 2020), understanding how well measures work for autistic individuals with and without ID is important. The overlap in ADHD symptoms and behaviours associated with an ID should also be considered here.

There is some research that has examined how ADHD measures perform as screening instruments in young adults without autism. A systematic review of 35 studies evaluating 14 different ADHD measures with adult populations reported variability in the psychometric properties for different screening instruments (Taylor et al., 2011). They found that self-report and informant-report versions of the Conners Adult ADHD Rating Scales (CAARS) ADHD Index looked adequate with internal consistency of 0.74–0.92, test-retest of 0.80–0.91, and sensitivity of 82% and specificity of 87%. A lower cut point of 4 rather than 6 symptoms was suggested as symptoms weren’t always endorsed to a high level, but impairment existed. However, most studies in the review were of poor quality and insufficiently reported, had small sample sizes, and many did not use a gold standard interview to ascertain ADHD caseness. Taylor et al. (2011) concluded that more research was needed to confirm their findings. Other research exploring the use of a widely used screening instrument for child psychopathology, the Strengths and Difficulties Questionnaire (SDQ), found that the SDQ ADHD subscale had high discriminant validity in distinguishing ADHD cases from non-ADHD cases (AUC = 0.90) in both males and females aged 25 years in a general population cohort, but that a lower cut point (≥ 5) was also needed (Riglin et al., 2021). Therefore, there is emerging evidence that such instruments are effective in picking up ADHD cases in young adults but cut point modification may be required.

Few studies have investigated the accuracy of ADHD measures when it co-occurs with other neurodevelopmental conditions. In one example, Yerys et al. (2017) reported unacceptable model fit for one- (ADHD), two- (hyperactivity/impulsivity vs. inattention), and three-factor (with hyperactivity and impulsivity also separated) solutions for parent and teacher reports of autistic children on the ADHD Rating Scale-IV (ADHD-RS-IV). The least problematic solution was the two-factor solution; however, this included items that were designed to tap into inattention cross-loading onto the hyperactivity/impulsivity factor or vice versa. They suggested that minor changes to item wording was needed when using such instruments with autistic individuals to reduce the influence of autism traits on item endorsement, and that follow-up diagnostic clinical interviews should explore separating inattention from other ADHD symptoms in greater detail. Other research conducted by La Malfa et al. (2008) explored the validity of the Observer: Screener version of the CAARS for assessing ADHD in adults with ID. They found the CAARS had good internal consistency. Scores on the hyperactivity subscale and ADHD index were significantly different across ID severity groups (e.g., mild, moderate, severe, profound), but sample sizes within each group were small limiting conclusions that can be drawn from their findings. The inattentive subscale did not appear to be influenced by ID severity. Taken together, these findings show some promise of ADHD screening instruments across young people with autism and ID, but further evidence of the diagnostic validity of ADHD rating scales is required and to our knowledge, no research has been conducted that has examined the diagnostic validity of ADHD questionnaires in young autistic adults.

The objective of the current study was to examine the discriminant validity of three widely used instruments for distinguishing ADHD from non-ADHD cases in young autistic adults. The measures examined included measures developed for assessing ADHD in the general population (CAARS ADHD Index, SDQ ADHD subscale), in addition to a questionnaire developed for use with individuals with developmental disabilities (the Aberrant Behavior Checklist [ABC] hyperactivity/non-compliance subscale [hereafter referred to as the hyperactivity subscale]). We explored the properties of the different parent- and self-reported versions, and whether the accuracy rates varied by intellectual ability of the young adult.

Method

Participants

Participants were autistic young adults and their families who took part in the Special Needs and Autism Project (SNAP). SNAP is a longitudinal study that followed a sample of young people drawn from a population-based cohort of 56,946 in South-East England (see Baird et al., 2006 for further details). The children invited into SNAP were born between July 1990 and December 1991, and either had a clinical diagnosis of autism during the first wave of data collection or were considered to be ‘at risk’ of having autism due to having a statement of Special Educational Needs. A stratified sample of children (N = 255) were then assessed for autism using gold standard measures of autism and language, and intellectual and adaptive functioning. Participants in the cohort who received an autism research diagnosis were followed-up when they were young adults at age 23 (Simonoff et al., 2008). Attempts were made to contact all autistic participants. 119 young autistic adults for whom we had ADHD diagnostic data at age 23 and at least one parent-report ADHD measure formed the sample for the current study. Sample characteristics are presented in Table 1 and the participant flow through the study is in the Supplementary Materials.

Written informed consent was obtained from all participating parents and the autistic young adults who had capacity to consent. Where it was suspected that the young adult did not have capacity to consent, a consultee was appointed to determine willingness to participate. Ethical approval for data collection at age 23 was granted by the Camberwell and St. Giles NRES Committee (reference 12/LO/1770, IRAS project number 112286).

Measures

Demographic information about the young adults and their parents was collected using a questionnaire developed for the study. A measure of neighbourhood deprivation, the Carstairs Index (Carstairs & Morris, 1990), was calculated from full post codes when the autistic individuals were originally assessed at age 12. This index combines overcrowding, male unemployment, population representation in Registrar General social class 4 and 5, and car ownership. Each component is standardised and summed to produce an index which may have positive or negative values. Positive scores indicate greater deprivation. The indices are ordered and grouped into five population quintiles, with quintile 1 representing the least deprived and quintile 5 representing the most deprived in the population.

ADHD diagnoses were obtained through the Young Adult Psychiatric Assessment (YAPA, Angold & Costello, 2000) designed for use with young adults. It is a semi-structured interview administered by a trained researcher/clinician to ascertain detailed descriptions and examples of emotions and behaviours associated with a range of mental health disorders. The modules used in SNAP cover separation anxiety disorder, generalized anxiety disorder, panic disorder, social anxiety, simple phobia, obsessive-compulsive disorder, major depressive disorder, dysthymic disorder, ADHD, oppositional defiant disorder, conduct disorder, Tourette syndrome, chronic tic disorder, trichotillomania, enuresis, and encopresis. The descriptions focus on the intensity, frequency, duration and impairment of symptoms and enable symptoms of different conditions to be disentangled. Use of behavioural descriptions of symptoms decreases the likelihood of diagnostic overshadowing, or incorrectly double coding symptoms, for example, repetitive language associated with autism being incorrectly coded as anxiety and reassurance seeking. In addition, ongoing supervision is used to help interviewers make these distinctions. The YAPA also probes about areas of functioning relevant to this age group, such as living situation and relationships. Standardised algorithms are then applied to generate DSM-V diagnoses and a variety of symptoms and impairment scores. In these analyses we use the parent-reported YAPA for ADHD diagnoses. This was because young adults with lower IQs could not complete the YAPA themselves due to intellectual challenges and we wanted to include the wider group of young autistic adults in the sample to enhance generalisability.

The Aberrant Behavior Checklist (ABC, Aman et al., 1985) was developed specifically to measure a range of problematic behaviours in individuals with developmental disabilities. It is used widely in both clinical and research settings, for example, to assess the effects of psychotropic medication for conditions such as ADHD (e.g., Capone et al., 2016). There is substantial literature examining its validity and reliability with a range of child (e.g., Research Units on Pediatric Psychopharmacology Autism Network, 2005) and adult populations (e.g., Newton & Sturmey, 1988). In the current study, the hyperactivity subscale, which taps into both hyperactive and non-compliant behaviours, was used. The ABC hyperactivity subscale was designed to be completed by someone close to the individual and consists of 16 items (see Supplementary Materials for example items). Items are rated on a 4-point scale ranging from ‘not at all a problem’ to ‘the problem is severe in degree’. Total hyperactivity subscale scores range from 0 to 48.

The Conners Adult ADHD Rating Scales (Conners et al., 1999) is a widely used normed measure of current ADHD symptoms. The Observer: Short version was used in the current study and completed by the young autistic adults and their parents. This version of the CAARS includes items that tap into inattention/memory problems, hyperactivity/restlessness and impulsivity/emotional lability, and self-concept problems. In the current study, responses to the 12 item ADHD Index were used (see Supplementary Materials for example items). Items are rated on a 4-point scale ranging from ‘not at all, never’ to ‘very much, very frequently’. Raw scores are converted to a T score with scores ≥ 60 indicating elevated ADHD problems requiring further assessment.

The Strengths and Difficulties Questionnaire (SDQ, Goodman, 1997) is a questionnaire measure for emotional and behavioural problems in children and young people aged 3–17 years. The SDQ has been used widely as a community screening instrument for child mental health problems (e.g., Goodman et al., 2000), along with screening amongst more vulnerable populations (e.g., Goodman et al., 2004). Although it has not formally been validated in young adults, there is some use with adults (e.g., Riglin et al., 2021). It was used in SNAP as the SDQ had been collected at previous data collection waves when the young adults were children. Five items make up the ADHD subscale which was used in the current study (see Supplementary Materials for example items) and completed by the young autistic adults themselves and their parents. Each item is rated on a 3-point rating scale from ‘not true’ to ‘certainly true’, with a possible range of 0–10.

Intellectual functioning was assessed through the Wechsler Abbreviated Scale of Intelligence (WASI-II, Wechsler, 2011), an abbreviated measure designed to test intellectual ability in individuals aged 6–90 years. The two-subset WASI-II (measuring vocabulary and matrix reasoning) was administered to the first 10 participants, after which participants received the four-subset version (measuring vocabulary, similarities, block design and matrix reasoning) to ensure a comprehensive accurate measure of intellectual functioning. When standard IQ scores showed a floor effect (IQ < 40), a ratio variable was generated by dividing the sum of subscale raw scores by age in months. A regression equation then predicted IQ from the ratio score, the total IQ raw score and age in months for the entire sample, where those with IQ < 40 were set to missing. Fitted residuals from this regression equation then provided IQ estimates for those with IQ < 40. When standard IQ scores were missing, WASI-II scores were imputed using parent-reported Adaptive Behavior Assessment System-II General Adaptive Composite scores (Harrison & Oakland, 2003) as the auxiliary variable (see Supplementary Materials for further information about this procedure). Two IQ subgroups were used in the current study: those with an IQ ≥ 70 and those with an IQ < 70.

Statistical Analysis

Analyses were completed in Stata version 17.0 (StataCorp., 2021). The participants who were included in the samples for which we had YAPA assessments and parent-/self-report ADHD measures were not mutually exclusive. Attrition at 23 was explored by comparing the parent-/self-report samples that were available for the current analysis to the original autism sample (N = 158) for drop out by child sex, parental education, and social deprivation (see Supplementary Materials for participant flow).

The accuracy of each measure was assessed using receiver operating characteristic (ROC) analyses for discriminating cases from non-cases. The resulting area under the curve (AUC) values range from 0 to 1 with an AUC of 1 indicating perfect discrimination and AUC of 0.5 meaning that the scale is not able to discriminate better than chance (Hanley & McNeil, 1982; Mandrekar, 2010). Sensitivity and specificity were calculated for established cut points where applicable (e.g., SDQ ADHD subscale) and for optimally identified cut points using the Youden’s Index (J) method (Youden, 1950) commonly used for assessing instrument detection properties which assumes equal weighting for both sensitivity and specificity. Sensitivity values of between 0.7 and 0.8 and specificity of 0.8 are typically regarded as acceptable when screening child psychiatric conditions (Glascoe, 2005). The ROC AUC for the different parent-report and self-report measures were compared. Where sample size allowed (e.g., for parent, but not self-report as the number of young adults in the IQ < 70 [n = 10] completed the SDQ ADHD subscale and CAARS ADHD Index vs. IQ ≥ 70 [n = 64]), the analyses were also conducted by IQ subgroups. AUC, sensitivity and specificity statistics were bootstrapped to obtain 95% confidence intervals (CI) using 1,000 repetitions. All analyses were weighted using frequency weights which consider the original sample stratification and characteristics, along with subsequent attrition in young adulthood. This means that estimates are applicable to the original population from which the sample is drawn. For the ROC analyses, this was done using frequency weights that in theory could generate occasional replicates with out-of-range AUC values (e.g., values of greater than 1) which have not been corrected. Prevalence of ADHD rates were weighted using population weights implemented by the svy procedure. All other descriptive analyses were unweighted and therefore represent the characteristics of the sample in the study at young adulthood.

Results

Attrition at Young Adulthood

The samples for which we had YAPA assessments and parent-/self-report ADHD measures did not differ from the original autism sample (N = 158) by child sex (parent-report sample: χ²[1, N = 158] = 0.00, p = .975, self-report sample: χ²[1, N = 158] = 0.18, p = .668), but for those who remained in the study, the parents were significantly more educated (N.B. for self-report sample only; parent-report sample: χ²[1, N = 158] = 1.93, p = .165, self-report sample: χ²[1, N = 158] = 4.66, p = .031), and the families less socially deprived (parent-report sample: t[156] = 2.38, p = .018, self-report sample: t[156] = 2.11, p = .036).

Prevalence of ADHD Cases

In young adulthood, 40 participants meet YAPA ADHD diagnosis for the parent-report sample and 20 amongst those for which we had self-report measures. This equated to weighted population estimates of 22.5% and 23.7% respectively, with an average of 4 symptoms being endorsed (see Table 2). Weighted population estimates for those in the parent-report sample with an IQ < 70 was 41.3% and 15.4% for those with an IQ of ≥ 70. The numbers of young adults who had combined, predominantly inattentive, and predominantly hyperactive ADHD diagnoses respectively for the parent-report sample were: 13 (32.5% of the 40 with ADHD), 12 (30.0% of those with ADHD) and 15 (37.5% of those with ADHD). This equated to weighted population estimates of 9.1%, 6.4%, and 7.1% for the combined, predominantly inattentive, and predominantly hyperactive ADHD diagnoses respectively. Amongst the 71 in the self-report sample 8, 8, and 4 had combined, predominantly inattentive, and predominantly hyperactive ADHD diagnoses respectively (40.0%, 40.0%, and 20.0% of those in the self-report sample with ADHD diagnoses).

Accuracy of Parent-Report ADHD Measures

Table 3 displays the AUC, sensitivity, specificity, and correctly classified values and their 95% CIs for the ADHD measures. Results are broken down into IQ subgroups and presented for optimal and pre-existing cut points where applicable. ROC curves are in the Supplementary Materials, along with the unbootstrapped reports of sensitivity and specificity for all cut points in the data. The overall AUCs for the parent-report measures were: ABC hyperactivity subscale 0.66 (95% CI 0.47–0.86), CAARS ADHD Index 0.78 (95% CI 0.61–0.94), and SDQ ADHD subscale 0.79 (95% CI 0.66–0.92). There were no statistically significant differences in the AUC values between the three parent-report measures, z-scores ranged from − 0.13 to 0.31, p’s ranged from 0.753 to 0.947. Nor were there any statistically significant differences in instrument performance for those with an IQ < 70 vs. IQ ≥ 70 (ABC hyperactivity subscale: AUC = 0.77 vs. 0.66, z = 1.14, p = .254; CAARS ADHD Index: AUC = 0.71 vs. 0.76, z=-0.10, p = .920; SDQ ADHD subscale: AUC = 0.74 vs. 0.75, z = 0.03, p = .978).

For the ABC hyperactivity subscale and CAARS ADHD Index, sensitivity was high (above 90%), but specificity was inadequate at the J optimally identified cut-point for those with an IQ < 70 and IQ ≥ 70 (ABC hyperactivity subscale: cut points of ≥ 4 and ≥ 3 respectively, and CAARS ADHD Index: cut points of ≥ 60 and ≥ 53). Using the pre-existing cut point of T score > 60 for the CAARS ADHD Index improved specificity for the whole sample (from 0.57 to 0.78), but reduced sensitivity (from 0.94 to 0.43). This pattern was consistent across IQ < 70 and IQ ≥ 70 subgroups, but more pronounced in the IQ ≥ 70 subgroup. In contrast, the parent-reported SDQ ADHD subscale demonstrated high specificity (93%), but sensitivity was low (60%) overall. This held across the IQ subgroups and for both the optimal (≥ 7) and pre-existing (≥ 8) cut points. The correct classification rates for the optimally identified cut points were 55% on the ABC hyperactivity subscale, 66% on the CAARS ADHD Index, and 86% on the SDQ ADHD subscale. To achieve 90% specificity required cut points of ≥ 9 on the ABC hyperactivity scale, ≥ 67 T score on the CAARS ADHD Index, and ≥ 7 on the SDQ ADHD subscale (see Supplementary Materials).

Accuracy of Self-Report ADHD Measures

The AUC, sensitivity, specificity, and correctly classified values and their 95% CIs for the self-report ADHD measures are also in Table 3. The ROC graphs are in the Supplementary Materials with the unbootstrapped reports of sensitivity and specificity for all cut points in the data. The AUC for the CAARS ADHD Index and the SDQ ADHD subscale was 0.70 (95% CI 0.51–0.90) and 0.65 (95% CI 0.44–0.87) respectively. For both the CAARS ADHD Index and the SDQ ADHD subscale, sensitivity was inadequate, although specificity was acceptable for both the optimal cut points (CAARS ADHD Index ≥ 56: 57% and 81% respectively, SDQ ADHD subscale ≥ 9: 28% and 100%) and pre-existing cut points (CAARS ADHD Index > 60: 36% and 90% respectively, SDQ ADHD subscale ≥ 7: 31% and 88%). The correct classification rates at the optimally identified cut point was 74% on the CAARS ADHD Index and 84% the SDQ ADHD subscale. To achieve 90% specificity, a cut point of ≥ 62 on the CAARS ADHD Index ≥ 8 on the SDQ ADHD subscale would be required.

Discussion

The current study examined the accuracy of parent- and self-reports of ADHD symptoms on three widely used measures for identifying ADHD cases from non-ADHD cases in young autistic adults. The accuracy of the measures was also compared across IQ subgroups to assess whether performance varies by this individual characteristic. Given the acknowledgement of late-recognised ADHD, it is important that instruments used for measuring ADHD are accurate in identifying symptoms in these age groups. Furthermore, due to diagnostic overshadowing, it may be more likely for co-occurring diagnoses to go undetected early in childhood amongst autistic individuals, with a co-occurring condition being identified later.

The AUC statistics indicate that overall, the measures were performing at or close to adequate levels. This is a key finding given that none of these measures were developed for screening of ADHD symptoms with autistic young adults. Parent-report measures appeared to perform similarly across both young autistic adults with IQs above and below 70, suggesting that the measures have similar accuracy for those with and without a co-occurring ID and are appropriate for use across the intellectual ability spectrum. These findings reflect parallel analyses completed by the authors using a sample of autistic children with and without ADHD and ID (in preparation). Although YAPA ADHD diagnoses were made based on parent-reports, self-reported ADHD symptoms from the young autistic adults themselves resulted in similar accuracy statistics to parent-report (N.B. this was a more restricted sample of those who could complete self-report measures). However, although overall the measures were performing adequately, no single measure met adequate thresholds for sensitivity and specificity simultaneously, and there was vast variation in sensitivity and specificity as sensitivity decreased. The current results indicate that there is utility in starting with a broad, short measure, such as the SDQ, which appears to be sufficient for ruling out non-ADHD cases in autistic young adults. In clinical practice, a balance between reducing stress associated with false positive screens, and the costs of false negatives should be considered. The high prevalence of ADHD in autistic individuals should also be taken into account to ensure that cases are not missed and below threshold results on a screening instrument should not be used to exclude consideration of the diagnosis. For research, a more stringent threshold for screening could be taken to ensure that samples include true ADHD cases.

It is important to note that the measures used in the current study were developed for different reasons with a range of populations in mind. The ABC was originally developed to track treatment outcomes, including ADHD symptoms, amongst adults with developmental disabilities (Aman et al., 1985). The CAARS ADHD Index was developed as a tool to assess the presence and severity of ADHD symptoms for use with adults in the general population, whereas the SDQ ADHD subscale was developed as a short questionnaire to be used as a screening instrument for the general child and adolescent population, in which identifying ADHD symptoms is viewed as particularly important. However, all are widely used in the United Kingdom and Europe, and it is important for clinicians to understand their accuracy in autistic populations.

As this is, to our knowledge, the first study exploring the accuracy of ADHD measures in young autistic adults, more research with different samples is required before recommendations about how to use such measures can be made. For example, the modification of cut points may be required for autistic young adults as suggested by research using samples of non-autistic adults (e.g., Riglin et al., 2021; Taylor et al., 2011), with lower cut points needed. However, the balance between a screener’s specificity and sensitivity should be considered (Trevethan, 2017). For example, the optimally identified cut point of 3–4 on the ABC hyperactivity subscale in this study may not be appropriate as this could lead to high false negative rates. This cut point would be equivalent to one symptom being endorsed to the highest degree, or four symptoms being endorsed as a mild level, which may not adequately identify the young adults with impairing ADHD symptoms. This low cut point identified in the current study could be related to the original purpose of the ABC hyperactivity subscale being a treatment monitoring measure, that it is not mapped to DSM ADHD diagnostic criteria, and that it also includes items about non-compliant behaviour. As can be seen in Supplementary Tables 3, which presents the unbootstrapped data for the parent-reported ABC hyperactivity subscale for all cut points observed in the data, the cut point of ≥ 19 would correctly classify the highest proportion of the sample at 77.9%. This cut point is similar to mean ABC hyperactivity subscale scores reported elsewhere in a sample of autistic young people (Mage = 10.6 years; 14.6 and 23.6 for those with high and low self-injury respectively) (Brinkley et al., 2007). However, other research has reported lower mean scores of 4.8 on the ABC hyperactivity subscale amongst 18–25-year-olds with Fragile X (Wheeler et al., 2014). Alternatively, utilisation of cut points that classify with 90% sensitivity could be adopted to ensure that cases are not missed. Another area of potential modification that should be tested in future research is item adaptation.

Limitations

The YAPA assessment used to derive ADHD diagnoses for the young adults has not been validated in autistic populations, so it is unclear how autism may influence reports on the YAPA. However, to our knowledge, no other psychiatric research interviews have been validated in this population. Furthermore, as ADHD diagnoses in the current study are based on a research interview, they do not equate to clinical diagnosis which may not account for other contextual factors, such as substance misuse. However, interviewers administering the YAPA are trained to obtain descriptions of emotions and behaviours so that symptoms of different conditions can be disentangled. Youden’s Index (J) method was used to identify optimal cut points, which places equal weighting on both sensitivity and specificity. This is just one method for balancing the properties of a measure’s accuracy to detect cases and in screening whole populations, one might place more weight on sensitivity to reduce false negatives. In addition, due to small cell sizes, we were unable to examine the performance of the self-reported ADHD screening instruments by IQ subgroups so we cannot generalise the accuracy reported for the self-report measures to autistic young adults with impairments in intellectual functioning. Further research should explore the performance of such measures for this group, as it is likely adaptation is needed for individuals with IQs < 70 to access such assessments. For example, this could involve using pictorial ratings scales to simplify response options.

Strengths and Conclusions

The study examined the accuracy of three widely used ADHD screening instruments amongst a well-characterised sample of young autistic adults with and without ADHD and ID. It included reports from parents as well as the young adults themselves and evaluates accuracy across those with a range of intellectual functioning making the findings more generalisable to the autism population. Although the measures were performing adequately overall for both those with IQs of < 70 and IQ ≥ 70, no single measure met satisfactory thresholds for sensitivity and specificity simultaneously. The potential remains to amend measures for better performance in autistic populations; for example, cut point reduction or changes to the wording of individual items may improve accuracy. Clinical use of ADHD screening instruments alone should not be used to determine who receives a diagnostic assessment for young autistic adults and such measures should be supplemented by clinical observation. Clinical judgement should be used to consider adjustments applicable to this population, for example, superficial overlap with autism symptoms should be taken into account and possibly less weight should be placed on these items.

Table 1 Sample characteristics

Full size table

Table 2 Scores on the ADHD measures

Full size table

Table 3 Bootstrapped area under the curve, sensitivity, specificity, and correctly classified for parent- and self-reported ADHD measures

Full size table

References

Aman, M. G., Singh, N. N., Stewart, A. W., & Field, C. J. (1985). Psychometric characteristics of the aberrant behavior checklist. American Journal of Mental Deficiency, 89(5), 492–502.
American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders: DSM-5 (5th ed.). American Psychiatric Association.
Angold, A., & Costello, E. J. (2000). The child and adolescent Psychiatric Assessment (CAPA). Journal of the American Academy of Child & Adolescent Psychiatry, 39(1), 39–48.
Article Google Scholar
Asherson, P., & Agnew-Blais, J. (2019). Annual research review: Does late-onset attention-deficit/hyperactivity disorder exist? Journal of Child Psychology and Psychiatry, 60(4), 333–352.
Article PubMed Google Scholar
Baird, G., Simonoff, E., Pickles, A., Chandler, S., Loucas, T., Meldrum, D., & Charman, T. (2006). Prevalence of disorders of the autism spectrum in a population cohort of children in south thames: The Special needs and Autism Project (SNAP). The Lancet, 368(9531), 210–215.
Article Google Scholar
Brinkley, J., Nations, L., Abramson, R. K., Hall, A., Wright, H. H., Gabriels, R., & Cuccaro, M. L. (2007). Factor analysis of the aberrant behavior checklist in individuals with autism spectrum disorders. Journal of Autism and Developmental Disorders, 37(10), 1949–1959.
Article PubMed Google Scholar
Capone, G. T., Brecher, L., & Bay, M. (2016). Guanfacine use in children with down syndrome and comorbid attention-deficit hyperactivity disorder (ADHD) with disruptive behaviors. Journal of Child Neurology, 31(8), 957–964.
Article PubMed Google Scholar
Carstairs, V., & Morris, R. (1990). Deprivation and health in scotland. Health Bulletin, 48(4), 162–175.
PubMed Google Scholar
Carter Leno, V., Hollocks, M. J., Chandler, S., White, P., Yorke, I., Charman, T., & Simonoff, E. (2022). Homotypic and heterotypic continuity in psychiatric symptoms from childhood to adolescence in autistic youth. Journal of the American Academy of Child & Adolescent Psychiatry, 61(12), 1445–1454.
Article Google Scholar
Charman, T., Pickles, A., Simonoff, E., Chandler, S., Loucas, T., & Baird, G. (2011). IQ in children with autism spectrum disorders: Data from the Special needs and Autism Project (SNAP). Psychological Medicine, 41(3), 619–627.
Article PubMed Google Scholar
Christensen, D., & Zubler, J. (2020). CE: From the CDC: Understanding autism spectrum disorder. The American Journal of Nursing, 120(10), 30–37.
Article PubMed PubMed Central Google Scholar
Conners, C. K., Erhardt, D., & Sparrow, E. (1999). CAARS. Conners’ Adult ADHD Rating Scales. Technical manual Toronto, Ontario, Canada.
Glascoe, F. P. (2005). Screening for developmental and behavioral problems. Mental Retardation and Developmental Disabilities Research Reviews, 11(3), 173–179.
Article PubMed Google Scholar
Goodman, R. (1997). The Strengths and Difficulties Questionnaire: A research note. Journal of Child Psychology and Psychiatry and Allied Disciplines, 38(5), 581–586.
Article PubMed Google Scholar
Goodman, R., Ford, T., Corbin, T., & Meltzer, H. (2004). Using the Strengths and Difficulties Questionnaire (SDQ) multi-informant algorithm to screen looked-after children for psychiatric disorders. European Child & Adolescent Psychiatry, 13(Suppl 2), II25–31.
Google Scholar
Goodman, R., Ford, T., Simmons, H., Gatward, R., & Meltzer, H. (2000). Using the strengths and difficulties questionnaire (SDQ) to screen for child psychiatric disorders in a community sample. The British Journal of Psychiatry, 177, 534–539.
Article PubMed Google Scholar
Hanley, J. A., & McNeil, B. J. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143(1), 29–36.
Article PubMed Google Scholar
Harrison, P., & Oakland, T. (2003). Adaptive behavior assessment system (ABAS-2) - second edition. The Psychological Corporation.
Hollocks, M. J., Leno, V. C., Chandler, S., White, P., Yorke, I., Charman, T., & Simonoff, E. (2022). Psychiatric conditions in autistic adolescents: Longitudinal stability from childhood and associated risk factors. European Child & Adolescent Psychiatry. https://doi.org/10.1007/s00787-022-02065-9.
Article Google Scholar
Kenny, L., Hattersley, C., Molins, B., Buckley, C., Povey, C., & Pellicano, E. (2016). Which terms should be used to describe autism? Perspectives from the UK autism community. Autism, 20(4), 442–462.
Article PubMed Google Scholar
La Malfa, G., Lassi, S., Bertelli, M., Pallanti, S., & Albertini, G. (2008). Detecting attention-deficit/hyperactivity disorder (ADHD) in adults with intellectual disability the use of conners’ adult ADHD rating scales (CAARS). Research in Developmental Disabilities, 29(2), 158–164.
Article PubMed Google Scholar
Lever, A. G., & Geurts, H. M. (2016). Psychiatric co-occurring symptoms and disorders in young, middle-aged, and older adults with autism spectrum disorder. Journal of Autism and Developmental Disorders, 46(6), 1916–1930.
Article PubMed PubMed Central Google Scholar
Mandrekar, J. N. (2010). Receiver operating characteristic curve in diagnostic test assessment. Journal of Thoracic Oncology, 5(9), 1315–1316.
Article PubMed Google Scholar
Mason, J., & Scior, K. (2004). Diagnostic overshadowing’ amongst clinicians working with people with intellectual disabilities in the UK. Journal of Applied Research in Intellectual Disabilities, 17(2), 85–90.
Article Google Scholar
Newton, J. T., & Sturmey, P. (1988). The aberrant Behaviour Checklist: A british replication and extension of its psychometric properties. Journal of Intellectual Disability Research, 32(2), 87–92.
Article Google Scholar
Research Units on Pediatric Psychopharmacology Autism Network. (2005). Randomized, controlled, crossover trial of methylphenidate in pervasive developmental disorders with hyperactivity. Archives of General Psychiatry, 62(11), 1266–1274.
Article Google Scholar
Riglin, L., Agha, S. S., Eyre, O., Bevan Jones, R., Wootton, R. E., Thapar, A. K., & Thapar, A. (2021). Investigating the validity of the Strengths and Difficulties Questionnaire to assess ADHD in young adulthood. Psychiatry Research, 301, 113984.
Article PubMed PubMed Central Google Scholar
Rong, Y., Yang, C., Jin, Y., & Wang, Y. (2021). Prevalence of attention-deficit/hyperactivity disorder in individuals with autism spectrum disorder: A meta-analysis. Research in Autism Spectrum Disorders, 83, 101759.
Article Google Scholar
Simonoff, E., Pickles, A., Charman, T., Chandler, S., Loucas, T., & Baird, G. (2008). Psychiatric disorders in children with autism spectrum disorders: Prevalence, comorbidity, and associated factors in a population-derived sample. Journal of the American Academy of Child and Adolescent Psychiatry, 47(8), 921–929.
Article PubMed Google Scholar
Sonuga-Barke, E., Becker, S. P., Bölte, S., Castellanos, F. X., Franke, B., Newcorn, J. H., & Simonoff, E. (2022). Annual research review: Perspectives on progress in ADHD science – from characterization to cause. Journal of Child Psychology and Psychiatry. https://doi.org/10.1111/jcpp.13696.
Article PubMed Google Scholar
StataCorp. (2021). Stata statistical software: Release 17. StataCorp LLC.
Taylor, A., Deb, S., & Unwin, G. (2011). Scales for the identification of adults with attention deficit hyperactivity disorder (ADHD): A systematic review. Research in Developmental Disabilities, 32(3), 924–938.
Article PubMed Google Scholar
Trevethan, R. (2017). Sensitivity, specificity, and predictive values: Foundations, pliabilities, and pitfalls in research and practice. Frontiers in Public Health, 20(5), 307.
Article Google Scholar
Wechsler, D. (2011). Wechsler Abbreviated Scale of Intelligence (2nd ed.). NCS Pearson.
Wheeler, A., Raspa, M., Bann, C., Bishop, E., Hessl, D., Sacco, P., & Bailey, D. B. J. (2014). Anxiety, attention problems, hyperactivity, and the aberrant behavior Checklist in Fragile X syndrome. American Journal of Medical Genetics Part A, 164A(1), 141–155.
Article PubMed Google Scholar
Willcutt, E. G., Nigg, J. T., Pennington, B. F., Solanto, M. V., Rohde, L. A., Tannock, R., & Lahey, B. B. (2012). Validity of DSM-IV attention deficit/hyperactivity disorder symptom dimensions and subtypes. Journal of Abnormal Psychology, 121(4), 991–1010.
Article PubMed PubMed Central Google Scholar
Wootton, R. E., Riglin, L., Blakey, R., Agnew-Blais, J., Caye, A., Cadman, T., & Tilling, K. (2022). Decline in attention-deficit hyperactivity disorder traits over the life course in the general population: Trajectories across five population birth cohorts spanning ages 3 to 45 years. International Journal of Epidemiology, 51(3), 919–930.
Article PubMed PubMed Central Google Scholar
Yerys, B. E., Nissley-Tsiopinis, J., de Marchena, A., Watkins, M. W., Antezana, L., Power, T. J., & Schultz, R. T. (2017). Evaluation of the ADHD rating scale in youth with autism. Journal of Autism and Developmental Disorders, 47(1), 90–100.
Article PubMed PubMed Central Google Scholar
Youden, W. J. (1950). Index for rating diagnostic tests. Cancer, 3(1), 32–35.
Article PubMed Google Scholar

Download references

Acknowledgements

We thank the adult and parent participants for giving generously of their time.

Funding

The present wave of data collection was funded by Autism Speaks grant number 7729. Previous data collection was funded by the Wellcome Trust and UK Department of Health (wave 1) and the Medical Research Council (wave 2). MP is supported by a research grant from the Baily Thomas Charitable Fund (TRUST/VC/AC/SG/5841–8993). AP and ES received support from the NIHR through a Senior Investigator Award (AP: NF-SI-0617-10120, ES: NF-SI-0514-10073) and from the NIHR Biomedical Research Centre at South London and Maudsley NHS Foundation Trust (IS-BRC-1215-20018).

Author information

Authors and Affiliations

Department of Child and Adolescent Psychiatry, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK
Melanie Palmer, Zhaonan Fang, Matthew J Hollocks, Tony Charman & Emily Simonoff
Service for Complex Autism & Associated Neurodevelopmental Disorders, South London and Maudsley NHS Foundation Trust, London, UK
Matthew J Hollocks, Tony Charman & Emily Simonoff
Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK
Andrew Pickles
Newcomen Neurodevelopmental Centre, Evelina Children’s Hospital, Guy’s and St Thomas NHS Foundation Trust, London, UK
Gillian Baird

Authors

Melanie Palmer
View author publications
You can also search for this author in PubMed Google Scholar
Zhaonan Fang
View author publications
You can also search for this author in PubMed Google Scholar
Matthew J Hollocks
View author publications
You can also search for this author in PubMed Google Scholar
Tony Charman
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Pickles
View author publications
You can also search for this author in PubMed Google Scholar
Gillian Baird
View author publications
You can also search for this author in PubMed Google Scholar
Emily Simonoff
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

ES, GB, TC and AP obtained the funding for the study. MP and ZF completed the analysis. The manuscript was drafted by MP, ZF and MH. All authors read, made revisions and approved the final version.

Corresponding author

Correspondence to Melanie Palmer.

Ethics declarations

Conflict of Interest

TC has received consultancy fees from F. Hoffmann-La Roche Ltd. and Servier and royalties from Sage Publications and Guilford Publications. AP has received royalties from Western Psychological Services. All other authors declare no conflicts of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Palmer, M., Fang, Z., Hollocks, M.J. et al. Screening for Attention Deficit Hyperactivity Disorder in Young Autistic Adults: The Diagnostic Accuracy of Three Commonly Used Questionnaires. J Autism Dev Disord (2023). https://doi.org/10.1007/s10803-023-06146-9

Download citation

Accepted: 25 September 2023
Published: 28 October 2023
DOI: https://doi.org/10.1007/s10803-023-06146-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Screening for Attention Deficit Hyperactivity Disorder in Young Autistic Adults: The Diagnostic Accuracy of Three Commonly Used Questionnaires