Factor Structure, Reliability and Criterion Validity of the Autism-Spectrum Quotient (AQ): A Study in Dutch Population and Patient Groups

Hoekstra, Rosa A.; Bartels, Meike; Cath, Danielle C.; Boomsma, Dorret I.

doi:10.1007/s10803-008-0538-x


Original Paper
Open access
Published: 27 February 2008

Volume 38, pages 1555–1566, (2008)

Journal of Autism and Developmental Disorders


Rosa A. Hoekstra^1,2,
Meike Bartels¹,
Danielle C. Cath³ &
…
Dorret I. Boomsma¹


Abstract

The factor structure of the Dutch translation of the Autism-Spectrum Quotient (AQ; a continuous, quantitative measure of autistic traits) was evaluated with confirmatory factor analyses in a large general population and student sample. The criterion validity of the AQ was examined in three matched patient groups (autism spectrum conditions (ASC), social anxiety disorder, and obsessive–compulsive disorder). A two factor model, consisting of a “Social interaction” factor and “Attention to detail” factor could be identified. The internal consistency and test–retest reliability of the AQ were satisfactory. High total AQ and factor scores were specific to ASC patients. Men scored higher than women and science students higher than non-science students. The Dutch translation of the AQ is a reliable instrument to assess autism spectrum conditions.


Introduction

Pervasive developmental disorders, of which the most common are autistic disorder, Asperger Syndrome (AS) and pervasive developmental disorder not otherwise specified (PDD-NOS), are characterized by a triad of impairments: difficulties in reciprocal social interaction, communication, and the presence of stereotyped behavior, interests, and activities (American Psychiatric Association 2000). Together, these conditions are referred to as autism spectrum conditions (ASCs). This term reflects the assumption that (high functioning) autism and AS lie on a continuum reflecting severity of social communication disability, from classical autism at the most severe end of the spectrum, decreasing via high functioning autism (HFA), AS, and PDD-NOS into normal behavior. Twin and family studies have shown that genetic factors play a major role in the risk for ASC (Rutter 2000). A conservative estimate of the concordance rates of autism in monozygotic twins is 60%. Concordance in dizygotic twins, however, is only 0–5% (Folstein and Rutter 1977; Bailey et al. 1995). Moreover, if one MZ twin has autism, not only is the likelihood of an autism diagnosis in the co-twin increased, but also the risk of other neurodevelopmental difficulties affecting language and social interaction (Le Couteur et al. 1996; Bailey et al. 1995). This notion led to the idea that the same genetic variants affecting the risk for autism may lead to a broader phenotype of autistic traits, as reflected in an increased rate of social deficits, impairments in communication and language, a preference for routines and difficulty with change in non-autistic relatives of autistic individuals (Bolton et al. 1994; Bailey et al. 1998). Rather than a distinct disorder, it is now thought that the autism spectrum conditions as defined in the DSM-IV represent the upper extreme of one or more quantitative traits, and these traits may be continuously distributed in the population (Constantino and Todd 2003; Piven et al. 1997; Spiker et al. 2002). Studies using quantitative measurements of autistic traits found elevated scores in relatives of autistic and pervasive developmental disorder patients (Constantino et al. 2006; Bishop et al. 2004) and high scores in children whose parents showed high (but sub-diagnosis) endorsement on autistic traits (Constantino and Todd 2005).

The quantitative approach to autistic traits has led to the development of the Autism-Spectrum Quotient (AQ; Baron-Cohen et al. 2001). This self-administered questionnaire was developed to quantify autistic traits in individuals with normal intelligence. The AQ consists of 50 items, assessing personal preferences and habits. Subjects rate to what extent they agree or disagree with the statements on a 4-point Likert scale, with answer categories “definitely agree”; “slightly agree”; “slightly disagree” and “definitely disagree”. For approximately half the items an “agree” response is in line with autistic traits (e.g. item 23: “I notice patterns in things all the time”); for the other half a “disagree” response is indicative of an autistic trait (e.g. item 11: “I find social situations easy”). All the item scores are summed; a high AQ score indicates a high autistic load, close to the autistic end of the autism spectrum. In the original version of the AQ (Baron-Cohen et al. 2001), the 50 items were divided into five theoretically derived subscales of 10 items each: Social skill; Communication; Imagination; Attention to detail; and Attention switching.

Both the British (Baron-Cohen et al. 2001) and the Japanese version (Kurita et al. 2005) of the AQ reported good test-retest reliability and moderate internal consistency (Cronbach’s α varying from .63 to .78). However, some aspects of the AQ merit further study. Firstly, the five domains of the AQ have been derived on a theoretical basis and have undergone little empirical testing. Austin (2005) conducted an exploratory factor analysis of the AQ in a group of 201 undergraduate students and found evidence for three, rather than five factors, with a focus on Social skill, Details/patterns, and Communication/mind reading. In a principal component analysis using data on the child-version of the AQ, Auyeung et al. (2008) found support for 4 rather than 5 factors. Confirmatory factor analyses can further extend our knowledge on the psychometric qualities of the AQ subscales and empirically test the goodness of fit of the 5-domain model. The first aim of the current study was to examine the model fit of the 5 domain structure proposed by Baron-Cohen et al. (2001) in a large student sample and a general population sample and compare the fit with alternative models. Some of the 5 domains assess traits that may show considerable overlap (e.g. satisfactory social skills are needed for good communication). Therefore we expected some of the domains to be substantially correlated and predicted that a model in which the highly correlated factors are clustered may be preferred.

Secondly, more research needs to be done on the criterion validity of the AQ. Studies in England (Baron-Cohen et al. 2001) and Japan (Wakabayashi et al. 2006) found significantly higher AQ scores in subjects with an AS or HFA diagnosis, compared to scores in a student sample and a general population sample. A preliminary study (Woodbury-Smith et al. 2005) reported that the AQ distinguished satisfactorily between subjects with and without an AS/HFA diagnosis, in a group of 100 referrals to a diagnostic clinic for adults suspected of having AS or HFA. However, a Dutch study comparing AQ scores in mild ASC patients with scores in patients referred to a general outpatient clinic found little difference between the two groups (Ketelaars et al. 2007). The subjects in the latter study were not matched on sex, education and age. It thus remains unclear whether high AQ scores are specific to ASCs, or may be common to psychiatric disorders in general. Apart from a general population and a student sample, the current paper includes a small sample of three different patient groups: one group with AS/HFA and PDD-NOS patients, a group of patients diagnosed with obsessive-compulsive disorder (OCD), and a group with social anxiety disorder (SAD). Because SAD patients demonstrate problems with social interaction, and OCD patients show symptoms of repetitive behavior, we predicted that the AQ scores in these patient groups would be higher than the general population mean. Extremely high AQ scores were expected to be specific to ASC patients.

Lastly, this paper is the first to examine the psychometric properties of the AQ in a Dutch population. The original version of the AQ was translated to Dutch using the backward translation procedure. The characteristics of the Dutch AQ, including test–retest reliability and internal consistency, were studied in a large sample of students and subjects from the general population. In line with the empathizing–systemizing theory of autism (Baron-Cohen 2002; Baron-Cohen et al. 2005) and findings from previous studies (Baron-Cohen et al. 2001; Hoekstra et al. 2007; Wakabayashi et al. 2006) we expected: (a) AQ scores to be continuously distributed in the general population; (b) significantly higher mean AQ scores in men compared to women; and (c) higher AQ scores in science students than in students in the field of humanities and social sciences.

Methods

Participants

This study encompassed four different samples. The first group consisted of 961 students from the VU university in Amsterdam (n = 809) and the University of Twente in Enschede (n = 152). During the break of one of their classes, the students were asked to complete the AQ. Students were recruited from the fields of humanities (history and law; n = 128), the social sciences (psychology, education, and communication science; n = 594) and natural and technical sciences (including mathematics, physics and information sciences; n = 239). Participation rates varied from 65 to 100%. The mean age of the students was 21.19 years (SD 3.69).

The general population sample (n = 302) consisted of parents of twins who visited an information day for parents of multiples. They were asked to either fill out the AQ immediately or to return the questionnaire to our research group by mail. The participation rate for this data collection was 62%; mean age of the participants was 35.68 years (SD = 6.33). The student and general population groups were not matched on age and IQ. However, the two groups were included for separate research purposes. The parent group served as a normative sample, whereas the student sample was included to address differences in AQ scores in different fields of study.

To obtain data on test–retest reliability, a group of 18-year old twins and their brothers and sisters filled out the AQ. These twin families participated in an ongoing study on cognitive development in late puberty and completing the AQ was part of the test protocol (see Hoekstra et al. 2007 for more information on this study). The first 117 participants of the study were re-contacted 1–6 months later (the mean time interval between the first and second assessment was 3.9 months), and were asked to fill out the AQ for a second time. Seventy-five participants returned the questionnaire for the second time (64%). AQ scores of the responders in the retest did not significantly differ from the subjects who did not respond in the retest (F(1, 115) = .066, p = .797).

The last group consisted of three subgroups of psychiatric patients, who were all adult outpatients recruited from the anxiety outpatient services of GGZ Buitenamstel in Amsterdam (see also Cath et al. 2008). All subjects were administered the Structured Clinical Interview on DSM-IV diagnoses (SCID-I; First 1996) to establish in-, and exclusion criteria. Subjects suffering from co-morbid depression, psychosis, substance abuse, mental deficiency or inability to read or speak Dutch were excluded. To exclude any risk of cognitive deficit and/ or below average intelligence, only patients who had successfully completed an educational degree were included in the study, and patient groups were matched on age (range 19–57 years), sex (10 males; 2 females in all groups) and educational level. The SCID-I does not contain a section on autism disorders, and at the time of data collection no validated Dutch version of either the Autism Diagnostic Observation Schedule-Generic (ADOS-G; Lord et al. 2000) or the ADI-R (Lord et al. 1994) was available for adult subjects. Therefore all subjects were assessed on presence of ASCs according to DSM-IV criteria by two independent experienced clinicians and with the aid of a structured retrospective interview taken from one of the parents of the patients on early infant development in all domains of the spectrum of autistic conditions. The structured interview encompassed the following topics: age at onset of problem behavior, contact and communication skills, stereotyped behavior, development of speech and language, motor and sensory development, particular interests and skills, ability to display imagination, resistance against change and unexpected events, and impulse control. Only subjects who had independently been diagnosed with an ASC by the two clinicians were included in the study; diagnoses were made independent of the AQ responses. 
Subjects meeting the inclusion criteria completed the AQ at home, after they had given written informed consent, and returned the questionnaire during their next visit to the outpatient service. The patient groups encompassed (a) 12 patients with an autism spectrum condition (n = 2 HFA; n = 4 AS; n = 6 PDD-NOS); (b) 12 patients with a “pure” obsessive-compulsive disorder; and (c) 12 patients with a “pure” generalized social anxiety disorder.

The Dutch Autism-Spectrum Quotient

The AQ was translated after permission from Prof. Simon Baron-Cohen (SBC). The translation into Dutch was conducted by an official translator. Subsequently, a second translator translated the Dutch version back into English. After comparing the outcome of the retranslated version to the original text, and discussing discrepancies in the retranslation with SBC, a final version was established (the Dutch version of the questionnaire is obtainable from the first author upon request). Total AQ and domain scores were based on the original 4-point Likert scale scores (1 = “definitely agree” up to 4 = “definitely disagree”). For the items in which an “agree” response is characteristic for autism, the scoring was reversed (“definitely agree” scored 4 points; “slightly agree” 3 points, etc.; this was the case for items 2, 4, 5, 6, 7, 9, 12, 13, 16, 18, 19, 20, 21, 22, 23, 26, 33, 35, 39, 41, 42, 43, 45, 46). All item scores were summed, resulting in a minimum total AQ score of 50 (no autistic traits) and a maximum score of 200 (full endorsement on all autistic items). Note that in most reports using the British version of the AQ (Baron-Cohen et al. 2001) the answer categories have been dichotomized into “agree”/“disagree” scores. In these studies, all item responses in line with the autism phenotype scored one point, resulting in a maximum total AQ score of 50.
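The scoring rule described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' scoring script: the names `AGREE_KEYED`, `score_item` and `total_aq` are hypothetical, and the input format (a mapping from item number to a raw response coded 1 to 4) is an assumption.

```python
# Items for which an "agree" response is characteristic for autism,
# taken from the list given in the text. For these items the raw
# 1-4 coding is reversed.
AGREE_KEYED = frozenset({2, 4, 5, 6, 7, 9, 12, 13, 16, 18, 19, 20, 21, 22,
                         23, 26, 33, 35, 39, 41, 42, 43, 45, 46})
N_ITEMS = 50


def score_item(item, response):
    """Score one item.

    `response` is the raw answer category:
    1 = "definitely agree" ... 4 = "definitely disagree".
    Returns 1-4, where a higher value means more autistic traits.
    """
    if response not in (1, 2, 3, 4):
        raise ValueError(f"item {item}: response must be 1-4, got {response!r}")
    # Reverse the coding for agree-keyed items (1 -> 4, 2 -> 3, ...).
    return 5 - response if item in AGREE_KEYED else response


def total_aq(responses):
    """Sum the item scores of a fully completed questionnaire.

    `responses` maps item number (1-50) to the raw response (1-4).
    The result lies between 50 and 200.
    """
    if set(responses) != set(range(1, N_ITEMS + 1)):
        raise ValueError("expected responses for items 1-50")
    return sum(score_item(item, resp) for item, resp in responses.items())
```

A respondent who gives the answer least in line with autistic traits on every item obtains the minimum of 50; the opposite pattern gives the maximum of 200.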

Missing Answers and Outliers

If more than five items were left blank (10% of the total number of items), the AQ was considered incomplete and the data were excluded from the analyses (n = 2 in the student group; n = 7 in the general population group; none in the twin family or patient groups). Two social sciences students and one subject from the general population group obtained a score >160. Since these subjects completed the AQ anonymously, it could not be verified whether this reflected a true score. These outliers (deviating >4 SDs from the mean) were therefore left out of the analyses.
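The two exclusion rules above can be expressed as simple checks. The sketch below is a hedged illustration: the function names are hypothetical, blank items are assumed to be stored as `None`, and the text does not state how the remaining blanks in accepted questionnaires were handled, so that step is deliberately left out.

```python
MAX_BLANK = 5        # more than five blanks: questionnaire is incomplete
OUTLIER_SD = 4.0     # scores deviating more than 4 SDs from the mean


def is_complete(responses, n_items=50, max_blank=MAX_BLANK):
    """Return True if at most `max_blank` of the items are blank.

    `responses` maps item number to a raw response, with None (or a
    missing key) standing for a blank item.
    """
    blanks = sum(1 for item in range(1, n_items + 1)
                 if responses.get(item) is None)
    return blanks <= max_blank


def is_outlier(score, group_mean, group_sd, n_sd=OUTLIER_SD):
    """Return True if `score` deviates more than `n_sd` SDs from the mean."""
    return abs(score - group_mean) > n_sd * group_sd
```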

Statistical Analyses

To examine the factor structure of the AQ, models were fitted on the student group data using confirmatory factor analyses in LISREL. The items of the AQ were measured on an ordinal scale, therefore the diagonally weighted least square procedure was used. First, a five-factor model, based on the 5 domains put forward by Baron-Cohen et al. (2001) was fitted. The factors were allowed to correlate since it is reasonable to assume that different aspects of autistic traits will be related. The fit of this model was compared with the fit of alternative models. Secondly, the model fitting procedure was repeated on the data from the general population sample. Based on the model fitting results in these two samples, the most parsimonious best fitting model was chosen.

To evaluate model fit, several model fit statistics were inspected. The χ² test statistic and the Standardized Root Mean Square Residual (SRMR) were included as a measure of overall goodness of fit. Low χ² values compared with the number of degrees of freedom indicate that the model fits the data well. The SRMR is a standardized overall badness-of-fit measure and is based on the fitted residuals. An SRMR value of zero indicates no residuals and thus a perfect fit. As a rule of thumb, an SRMR of less than .05 indicates a good fit, whereas values smaller than .10 can be interpreted as acceptable fit (Schermelleh-Engel et al. 2003). To enable a comparison between different models, the Goodness of Fit Index (GFI), Parsimony Goodness of Fit Index (PGFI); and the Expected Cross Validation Index (ECVI) were included as well. The GFI measures the relative amount of the variances and covariances in the observed matrix that is predicted by the model. The GFI ranges between zero and one with higher values indicating a better fit. The PGFI is a modification of the GFI and takes model complexity into account. High PGFI values indicate good fit in a relatively parsimonious (and thus less complex) model. If choosing between alternative models, the highest PGFI indicates the superior model. The ECVI evaluates how well the model fitted in the sample under study would generalize to other samples and is thus a measure of cross validation. The model with the smallest ECVI indicates the model with the best fit.
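To make the SRMR concrete, the sketch below computes it from an observed and a model-implied covariance matrix using the common textbook definition (the square root of the mean squared standardized residual over the non-redundant matrix elements). It is an assumption that this matches the exact computation in LISREL; the function names are hypothetical, and the cut-offs in `interpret_srmr` are the rules of thumb quoted above.

```python
import math


def srmr(observed, implied):
    """Standardized Root Mean Square Residual.

    `observed` and `implied` are square covariance matrices given as
    lists of lists. Residuals are standardized by the observed standard
    deviations and averaged over the p(p+1)/2 lower-triangle elements,
    diagonal included.
    """
    p = len(observed)
    total = 0.0
    for i in range(p):
        for j in range(i + 1):
            scale = math.sqrt(observed[i][i] * observed[j][j])
            total += ((observed[i][j] - implied[i][j]) / scale) ** 2
    return math.sqrt(total / (p * (p + 1) / 2))


def interpret_srmr(value):
    """Apply the rules of thumb: < .05 good fit, < .10 acceptable fit."""
    if value < 0.05:
        return "good"
    if value < 0.10:
        return "acceptable"
    return "poor"
```

A model that reproduces the observed matrix exactly yields an SRMR of zero, in line with the description above.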

Group differences in total AQ score and factor scores, and the validity and internal consistency of the scale were analyzed using SPSS. Group differences were tested using mixed model multivariate analysis of variance (MANOVA). The mixed linear model is an expansion of the general linear model and permits non-constant variability in the data. This way, it was possible to test for group differences in variables with unequal variances and unequal group sizes. In the student sample, overall sex differences and differences between students from different fields were explored. Effects of sex and age were studied in the general population sample. Furthermore, AQ score differences between patient groups and the general population sample were tested. Internal consistency was assessed in the student and general population using Cronbach’s alpha.
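For readers unfamiliar with Cronbach's alpha, a minimal sketch of the standard formula follows. The analyses reported here were run in SPSS; this Python function is only an illustration, its name and input format (one list of item scores per respondent, no missing values) are assumptions.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a set of items.

    `rows` holds one list of k item scores per respondent.
    alpha = k / (k - 1) * (1 - sum of item variances / variance of totals)
    """
    k = len(rows[0])
    if k < 2 or len(rows) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    # Sample variance of the summed scale score.
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)
```

Applied to the item scores of a scale (for example the 10 items of one domain), values closer to 1 indicate that the items covary more strongly relative to their individual variances.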

The test–retest reliability of AQ scores was assessed using AQ-scores collected in 18-year-old twins and their brothers and sisters (n = 75). These participants are genetically related. In order to use all available data and to satisfy the independent observations assumption for statistical testing, the test–retest analysis was performed using structural equation modeling in the computer program Mx (Neale et al. 2006).

Results

Factor Analysis

A correlated 5-factor structure, based on the domains suggested by Baron-Cohen et al. (2001), was fitted to the data of the student group using confirmatory factor analysis. The results of this analysis showed that the domains “Social skill”; “Communication”; “Attention switching”; and “Imagination” were highly correlated (correlations varying from r = .53 to r = .84). These correlations indicate a considerable overlap between the domains, and suggest that a model allowing these domains to cluster together may fit the data better. To examine this, a hierarchical model, encompassing 1 higher order factor, consisting of 4 lower order domains (“Social skill”; “Communication”; “Attention switching”; and “Imagination”) and 1 separate factor “Attention to detail” was fitted to the data. Lastly, a 1-factor model incorporating all 50 items was fitted, to evaluate whether the AQ really is multifactorial or rather measures 1 construct. The fit statistics of the three models are given in the top of Table 1. The SRMR was <.10 in all three models indicating adequate fit. The χ² value of the 1 factor model was considerably larger than the value of the two other models, suggesting that the 1 factor model showed relatively poor fit. The GFI and PGFI suggested the same; both values were low for the 1 factor solution. Based on these fit indices, the 1 factor solution was rejected. When comparing the fit indices for the two remaining models, the hierarchical factor model showed the lowest value for χ² and ECVI and the highest PGFI, suggesting that this model fitted the data best. The GFI was marginally higher in the 5-domain model, but the difference with the hierarchical model was very small. All in all, the fit indices suggest that the hierarchical model showed the best fit.

Table 1 Summary of the fit statistics of the several factor model structures


Subsequently, the fit of the three models was tested in the general population sample (see bottom half of Table 1). Similar to the analyses in the student sample, the 1 factor model showed the poorest fit and was rejected. The SRMR values for the two remaining models were the same and indicated near-acceptable fit. The ECVI value indicated a slightly better fit for the hierarchical model, while the GFI and the PGFI were the highest in the 5-domain model. Altogether the fit indices for the 5-domain model and the hierarchical model were very similar in the general population. Since the hierarchical model is more parsimonious, this model was chosen as the best fitting model.

The Hierarchical Factor Model

According to the analyses in two independent samples, the model that best described the data of the AQ consisted of 1 higher order factor, encompassing the 4 lower order domains Social skill; Communication; Attention switching; and Imagination, and 1 separate factor Attention to detail (see also Fig. 1). The items loading on the first higher order factor mainly focus on social situations, difficulties in communication with others, and empathic abilities. This factor was called “Social interaction”. The second factor is the domain “Attention to detail” and mainly consists of items assessing interests in patterns and details. For both factors, similar to the total AQ score, a high factor sum score implies a high autistic load. Since the same best fitting model was identified in both samples, the best fitting model was subsequently fitted on the combined student and general population sample in order to obtain 1 estimate for the domain and factor loadings (see Table 2 and Fig. 1) and to obtain the correlation between the factors. Apart from item 18 and 30, all domain loadings were positive. The correlation between the 2 factors was estimated at r = .19 (p < .001).

Table 2 Item content and loadings on the 5 domains in the best fitting structure, ordered per higher order factors and lower order domains


Internal Consistency and Test–Retest Reliability

The internal consistency was assessed in the student and general population sample. The internal consistency of the total AQ score was satisfactory in both samples (Cronbach’s α = .81 (student sample) and α = .71 (general population sample)). The internal consistency of the higher order factor Social interaction (40 items) was estimated at α = .84 and α = .77. The internal consistencies for the lower order domains and the Attention to detail factor were somewhat lower and were estimated at: α = .63/.68 (Attention to detail); α = .76/.68 (Social skill); α = .63/.62 (Attention switching); α = .52/.49 (Communication); and α = .63/.52 (Imagination). The test–retest reliability as assessed in 75 young adults recruited in the twin family study was r = .78 for the total AQ score and r = .79 for the Social interaction factor. The test–retest reliabilities for the lower order domains and the Attention to detail factor were respectively: r = .71(Attention to detail), r = .69 (Social skill), r = .68 (Attention switching), r = .60 (Communication), and r = .81 (Imagination).

Students: Effects of Sex and Field of Study

Table 3 shows the mean total AQ and factor scores in the student sample, separated by field of study. Effects of sex and field of study on the factor scores Social interaction and Attention to detail were tested using a mixed model MANOVA. A significant main effect of field of study (F = 41.41, p < .001) and a significant sex × field of study interaction effect (F = 5.92, p = .003) was found. Within the different fields of study, the sex effect was not significant (F = .40, p = .53). Science students scored significantly higher than students engaged in a humanities or social sciences degree (t = 8.64, p < .001), whereas the social sciences students scored significantly lower than the humanities and science students (t = 4.48, p < .001). The effect of field of study was significant both in the Social interaction factor (F = 28.22, p < .001) and in the Attention to detail factor (F = 5.34, p = .005).

Table 3 Mean total AQ and factor scores per group, sex, and field of study


General Population: Effect of Sex and Age

Mean total AQ and factor scores in the general population sample are shown in Table 3, separated by sex. A MANOVA of Social interaction and Attention to detail by sex and age showed a significant sex effect (F = 5.55, p = .02), with men scoring significantly higher than women. The effect of age was not significant (F = .46, p = .50). Contrast tests showed that the sex effect was significant in both the total AQ score (t = 2.01, p = .05) and in the Social interaction factor (t = 2.46, p = .01), but not in the Attention to detail factor (t = −.52, p = .61).

Patient Groups vs. General Population; Differences Between Patient Groups

Mean total AQ and factor scores in the different patient groups are shown in Table 3. A mixed model MANOVA of Social interaction and Attention to detail by diagnosis revealed significant differences between the patient and general population samples (F = 19.17; p < .001). Contrast tests showed that the subjects diagnosed with an ASC (t = 6.89, p < .001), the subjects diagnosed with OCD (t = 2.53, p = .02), and the subjects diagnosed with SAD (t = 2.24, p = .04) all scored significantly higher on the total AQ score than the general population sample. OCD and SAD patients did not differ in total AQ score (t = −.11, p = .91). The ASC subjects however obtained significantly higher total AQ scores than the OCD (t = 3.99, p < .001) and SAD patients (t = 3.99, p < .001). Moreover, the ASC subjects scored the highest on the Social interaction factor. Their scores on the Social interaction factor were significantly higher than the scores of the general population (t = 6.57, p < .001), the OCD sample (t = 3.56, p = .002), and the SAD group (t = 2.88, p = .009). Lastly, the ASC group scored significantly higher on the Attention to detail factor compared to the general population (t = 2.10, p = .04), the OCD patients (t = 1.99, p = .05), and the SAD sample (t = 4.28, p < .001).

The Spectrum of Autistic Traits

To obtain a better insight in the differences in AQ scores within each group, a frequency distribution is tabulated of the total AQ score in the general population and the three patient groups (Table 4). A high total AQ score (>145) was found to be specific to ASC subjects. Neither subjects from other patient groups, nor subjects from the general population obtained scores in this segment, while all subjects with an AS or HFA diagnosis scored in this range. Subjects with a PDD-NOS diagnosis obtained lower scores than the AS/HFA patients (total AQ scores between 114 and 134), but scored higher than the majority of the other psychiatric patients and the general population. Figure 2 illustrates that AQ scores followed a continuous distribution in the general population. The AQ scores of ASC patients all fell in the right end of this distribution.

Table 4 Frequency distribution of the total AQ score in the general population sample and three patient groups


Discussion

The results of our study indicate that the Autism-Spectrum Quotient is a valid and reliable instrument to assess individual differences in autistic traits. The Dutch AQ was found to have satisfactory internal consistency and test–retest reliability. Moreover, high AQ scores were specific to ASC patients. Previously, an exploratory factor analysis performed in a British student population suggested three underlying factors within the AQ, encompassing Social skill, Details/patterns, and Communication/mind reading (Austin 2005). A recent study using the child version of the AQ found evidence for 4 of the 5 domains (Auyeung et al. 2008). Our confirmatory factor analyses in both a general population and a student sample indicated that 4 of the 5 domains of the AQ (Social skill, Communication, Attention switching, and Imagination) were highly correlated. Rather than four separate domains, we propose a hierarchical model allowing these domains to cluster together. This way, one broad band “Social interaction” factor (incorporating the 4 highly correlated domains) could be identified, together with a small second factor, consisting of items focusing on a preference for details and patterns (the domain “Attention to detail”).

It may seem surprising that the broad band scale also includes the domain Attention switching. As put forward by Courchesne and colleagues (1994), difficulties in Attention switching make it harder to keep track of social information. Social interaction usually involves frequent and rapid changes in the source of information (visual or auditory information, change in objects or actions, etc.), and requires the ability to follow the flow of social cues (words, gestures, postures, background context, etc.). The marked deficit in attention switching in ASC subjects may directly harm their social and communication abilities, which explains why this domain is included within the Social interaction factor.

The internal consistency and test–retest reliability of the total AQ score (α = .81/.71 and r = .78) and the broad band Social interaction factor (α = .84/.77 and r = .79) were satisfactory. The internal consistency of the Attention to detail factor was moderate (α = .63/.68) and its test–retest reliability was acceptable (r = .71). The internal consistencies of the four domains making up the broad band Social interaction factor varied but were generally moderate (α between .49 and .76); the test–retest reliability varied between .60 and .81. Our results indicate that future studies using the AQ to assess autistic traits may benefit from a focus on the total AQ score and the 2 factor scores, rather than examining differences and similarities in the five original domains. The power to detect differences will be higher using the 2 factors; this could be of importance especially in linkage or association studies into autistic traits.

A small but significant correlation (r = .19) was found between the Social interaction factor and the Attention to detail factor. This result is in agreement with findings from a British twin project (Ronald et al. 2005) which assessed both social and non-social behaviors characteristic for autism using parent and teacher ratings. Social and non-social behaviors were weakly correlated with each other, both in the teacher (r = .15) and the parent data (r = .29).

Group differences in AQ scores were in line with previous studies using the AQ (Austin 2005; Baron-Cohen et al. 2001; Hoekstra et al. 2007; Wakabayashi et al. 2006). Males scored higher than females on the total AQ score and the Social interaction factor, although no sex difference on the Attention to detail factor was found. Similar to the findings reported by Baron-Cohen et al. (2001) and Austin (2005), science students obtained significantly higher scores than humanities and social science students. In our study, students enrolled in a social science degree in turn scored significantly lower than humanities students. Rather than a remarkable discrepancy with the British results (no differences between humanities and social science students), these findings are probably due to differences in the student sample. The majority of the social science students included in our study were enrolled in a psychology or education degree. Both of these degree programs have a strong focus on human interaction on an individual level and are likely to attract students who enjoy social interaction. If social sciences focusing on a broader level (such as economics and political science) had been included, the difference in AQ score might not have been significant.

Subjects diagnosed with an ASC scored significantly higher on the total AQ than the general population and the other patient groups. This is a satisfying result, considering the potential symptom overlap between these patient groups and ASC. The overlap between ASC subjects and SAD patients entails problems with social interaction in both groups; the overlap between ASC and OCD subjects encompasses repetitive behaviors that occur in both groups. Relatives of autistic individuals are reported to have up to 10-fold higher rates of social phobia compared to control families (Smalley et al. 1995; Piven and Palmer 1999). Similarly, an increased incidence of OCD is found in autism relatives (Bolton et al. 1998), and the occurrence of obsessive–compulsive traits in parents of an autistic child is significantly more likely if the child displays strong repetitive behavior (Hollander et al. 2003). Recent research suggests that a common genetic pathway, the serotonin transporter gene, could explain a small part of the association between OCD and rigid compulsive behaviors in autism (Ozaki et al. 2003; Sutcliffe et al. 2005). Our study however shows that high AQ scores are specific to the ASC patients. ASC patients scored significantly higher than the other patient groups on both the Social interaction factor and the Attention to detail factor.

Moreover, all subjects with an AS/HFA diagnosis could be distinguished from the other samples, as no subjects without an ASC diagnosis obtained a score >145. The difference is less clear-cut for subjects with PDD-NOS, who obtained intermediate AQ scores. These findings could explain the discrepancy in previous studies using the AQ. Woodbury-Smith et al. (2005) reported a satisfactory ability of the AQ to distinguish patients with an AS/HFA diagnosis from non-AS/HFA patients in a clinic for adults suspected of having AS or HFA. On the other hand, Ketelaars et al. (2007) found few differences in AQ scores between mild ASC patients and patients referred to the general outpatient clinic. The latter study mainly included PDD-NOS patients and few patients with AS or HFA. PDD-NOS constitutes a broad diagnostic category with criteria less stringent than for autistic disorder (American Psychiatric Association 2000) and is often seen as a milder form of autism. A lower AQ score in PDD-NOS patients compared to AS/HFA patients, yet higher than in most subjects diagnosed with another psychiatric disorder or subjects from the general population, is compatible with the notion of a continuum of autistic traits.
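The cutoff reasoning above can be sketched as a simple classification check. The example below is an illustration under stated assumptions, not the authors' procedure: it uses the score threshold of 145 mentioned in the text, while the function name `screen_at_cutoff` and the scores and diagnoses are invented for the example.

```python
def screen_at_cutoff(scores, has_asc, cutoff):
    """Return (sensitivity, specificity) when scores above `cutoff` count as positive."""
    true_pos = sum(1 for s, d in zip(scores, has_asc) if d and s > cutoff)
    false_neg = sum(1 for s, d in zip(scores, has_asc) if d and s <= cutoff)
    true_neg = sum(1 for s, d in zip(scores, has_asc) if not d and s <= cutoff)
    false_pos = sum(1 for s, d in zip(scores, has_asc) if not d and s > cutoff)
    sensitivity = true_pos / (true_pos + false_neg)
    specificity = true_neg / (true_neg + false_pos)
    return sensitivity, specificity


# Invented total scores and diagnoses, for illustration only.
toy_scores = [150, 160, 140, 120, 100, 145]
toy_has_asc = [True, True, True, False, False, False]

sensitivity, specificity = screen_at_cutoff(toy_scores, toy_has_asc, cutoff=145)
print(f"sensitivity={sensitivity:.2f}, specificity={specificity:.2f}")
```

In this toy data no undiagnosed respondent scores above the threshold, so specificity is perfect, while one diagnosed respondent with an intermediate score is missed. This mirrors the pattern described for PDD-NOS subjects, although the numbers themselves carry no empirical meaning.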

This study had some limitations. Firstly, the data collection in the student and general population sample was based on anonymous reports. We could therefore not verify whether any subjects in these groups had an ASC diagnosis or would warrant a diagnosis. Furthermore, the sample sizes of the patient groups were small and findings from these groups should be interpreted with care. Future studies in clinical samples should explore the criterion validity of the AQ more extensively, and should especially focus on the differences between severely impaired ASC patients and mildly impaired PDD-NOS subjects.

In conclusion, this study shows that the AQ is a reliable instrument for examining variation in autistic traits. The AQ can be divided into two reliable subfactors, focusing on difficulties in social interaction and on marked interests and attention to details and patterns. Total AQ scores follow a continuous distribution in the general population, and patients with an ASC diagnosis fall in the upper end of this distribution. OCD and SAD patients obtain scores between the general population mean and scores typical for ASC patients. These findings indicate that the AQ is a valuable instrument to assess where an individual lies on the autism spectrum.

References

American Psychiatric Association (2000). Diagnostic and Statistical Manual of Mental Disorders, 4th Ed., Text Revision (DSM-IV-TR). Washington: American Psychiatric Press.
Austin, E. J. (2005). Personality correlates of the broader autism phenotype as assessed by the Autism Spectrum Quotient (AQ). Personality and Individual Differences, 38, 451–460.
Auyeung, B., Baron-Cohen, S., Wheelwright, S., & Allison, C. (2008). The Autism Spectrum Quotient – Children’s Version (AQ-Child). Journal of Autism and Developmental Disorders, In press.
Bailey, A., Le Couteur, A., Gottesman, I., Bolton, P., Simonoff, E., Yuzda, E., et al. (1995). Autism as a strongly genetic disorder: Evidence from a British twin study. Psychological Medicine, 25, 63–77.
Bailey, A., Palferman, S., Heavey, L., & Le Couteur, A. (1998). Autism: The phenotype in relatives. Journal of Autism and Developmental Disorders, 28, 369–392.
Baron-Cohen, S. (2002). The extreme male brain theory of autism. Trends in Cognitive Sciences, 6, 248–254.
Baron-Cohen, S., Knickmeyer, R. C., & Belmonte, M. K. (2005). Sex differences in the brain: Implications for explaining autism. Science, 310, 819–823.
Baron-Cohen, S., Wheelwright, S., Skinner, R., & Martin, C. E. (2001). The Autism Spectrum Quotient (AQ): Evidence from Asperger syndrome/high functioning autism, males and females, scientists and mathematicians. Journal of Autism and Developmental Disorders, 31, 5–17.
Bishop, D. V. M., Maybery, M., Maley, A., Wong, D., Hill, W., & Hallmayer, J. (2004). Using self-report to identify the broad phenotype in parents of children with autistic spectrum disorders: A study using the Autism-Spectrum Quotient. Journal of Child Psychology and Psychiatry, 45, 1431–1436.
Bolton, P., Macdonald, H., Pickles, A., Rios, P., Goode, S., Crowson, M., et al. (1994). A case-control family history study of autism. Journal of Child Psychology and Psychiatry and Allied Disciplines, 35, 877–900.
Bolton, P. F., Pickles, A., Murphy, M., & Rutter, M. (1998). Autism, affective and other psychiatric disorders: Patterns of familial aggregation. Psychological Medicine, 28, 385–395.
Cath, D. C., Ran, N., Smit, J. H., van Balkom, A. J. L. M., & Comijs, H. C. (2008). The relationship between pervasive developmental disorders, OCD and social anxiety disorder in adults: A preliminary case controlled study. Psychopathology, 41, 101–110.
Constantino, J. N., & Todd, R. D. (2003). Autistic traits in the general population: A twin study. Archives of General Psychiatry, 60, 524–530.
Constantino, J. N., & Todd, R. D. (2005). Intergenerational transmission of subthreshold autistic traits in the general population. Biological Psychiatry, 57, 655–660.
Constantino, J. N., Lajonchere, C., Lutz, M., Gray, T., Abbacchi, A., McKenna, K., et al. (2006). Autistic social impairment in the siblings of children with pervasive developmental disorders. American Journal of Psychiatry, 163, 294–296.
Courchesne, E., Townsend, J., Akshoomoff, N. A., Saitoh, O., Yeung-Courchesne, R., Lincoln, A. J., et al. (1994). Impairment in shifting attention in autistic and cerebellar patients. Behavioral Neuroscience, 108, 848–865.
First, M. B. (1996). Structured clinical interview for DSM-IV axis I disorders. New York.
Folstein, S., & Rutter, M. (1977). Genetic influences and infantile autism. Nature, 265, 726–728.
Hoekstra, R. A., Bartels, M., Verweij, C. J. H., & Boomsma, D. I. (2007). Heritability of autistic traits in the general population. Archives of Pediatrics & Adolescent Medicine, 161, 372–377.
Hollander, E., King, A., Delaney, K., Smith, C. J., & Silverman, J. M. (2003). Obsessive-compulsive behaviors in parents of multiplex autism families. Psychiatry Research, 117, 11–16.
Ketelaars, C., Horwitz, E., Sytema, S., Bos, J., Wiersma, D., Minderaa, R., & Hartman, C. A. (2007). Brief report: Adults with mild Autism Spectrum Disorders (ASD): Scores on the Autism Spectrum Quotient (AQ) and comorbid psychopathology. Journal of Autism and Developmental Disorders, In press.
Kurita, H., Koyama, T., & Osada, H. (2005). Autism-Spectrum Quotient-Japanese version and its short forms for screening normally intelligent persons with pervasive developmental disorders. Psychiatry and Clinical Neurosciences, 59, 490–496.
Le Couteur, A., Bailey, A., Goode, S., Pickles, A., Robertson, S., Gottesman, I., et al. (1996). A broader phenotype of autism: the clinical spectrum in twins. Journal of Child Psychology and Psychiatry, 37, 785–801.
Lord, C., Risi, S., Lambrecht, L., Cook, E. H., Jr., Leventhal, B. L., DiLavore, P. C., et al. (2000). The autism diagnostic observation schedule-generic: A standard measure of social and communication deficits associated with the spectrum of autism. Journal of Autism and Developmental Disorders, 30, 205–223.
Lord, C., Rutter, M., & Le Couteur, A. (1994). Autism diagnostic interview-revised: A revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. Journal of Autism and Developmental Disorders, 24, 659–685.
Neale, M. C., Boker, S. M., Xie, G., & Maes, H. H. (2006). Mx: Statistical modeling. 7th Ed. Richmond: VCU.
Ozaki, N., Goldman, D., Kaye, W. H., Plotnicov, K., Greenberg, B. D., Lappalainen, J., et al. (2003). Serotonin transporter missense mutation associated with a complex neuropsychiatric phenotype. Molecular Psychiatry, 8(895), 933–936.
Piven, J., & Palmer, P. (1999). Psychiatric disorder and the broad autism phenotype: Evidence from a family study of multiple-incidence autism families. American Journal of Psychiatry, 156, 557–563.
Piven, J., Palmer, P., Jacobi, D., Childress, D., & Arndt, S. (1997). Broader autism phenotype: Evidence from a family history study of multiple-incidence autism families. American Journal of Psychiatry, 154, 185–190.
Ronald, A., Happe, F., & Plomin, R. (2005). The genetic relationship between individual differences in social and nonsocial behaviours characteristic of autism. Developmental Science, 8, 444–458.
Rutter, M. (2000). Genetic studies of autism: From the 1970s into the millennium. Journal of Abnormal Child Psychology, 28, 3–14.
Schermelleh-Engel, K., Moosbrugger, H., & Muller, H. (2003). Evaluating the fit of structural equation models: Tests of significance and descriptive goodness-of-fit measures. Methods of Psychological Research, 8, 23–74.
Smalley, S. L., McCracken, J., & Tanguay, P. (1995). Autism, affective disorders, and social phobia. American Journal of Medical Genetics, 60, 19–26.
Spiker, D., Lotspeich, L. J., Dimiceli, S., Myers, R. M., & Risch, N. (2002). Behavioral phenotypic variation in autism multiplex families: evidence for a continuous severity gradient. American Journal of Medical Genetics, 114, 129–136.
Sutcliffe, J. S., Delahanty, R. J., Prasad, H. C., McCauley, J. L., Han, Q., Jiang, L., et al. (2005). Allelic heterogeneity at the serotonin transporter locus (SLC6A4) confers susceptibility to autism and rigid-compulsive behaviors. American Journal of Human Genetics, 77, 265–279.
Wakabayashi, A., Baron-Cohen, S., Wheelwright, S., & Tojo, Y. (2006). The Autism-Spectrum Quotient (AQ) in Japan: A cross-cultural comparison. Journal of Autism and Developmental Disorders, 36, 263–270.
Woodbury-Smith, M. R., Robinson, J., Wheelwright, S., & Baron-Cohen, S. (2005). Screening adults for Asperger Syndrome using the AQ: A preliminary study of its diagnostic validity in clinical practice. Journal of Autism and Developmental Disorders, 35, 331–335.

Acknowledgment

Financial support was given by The Netherlands Organization for Scientific Research (NWO, grant number 575-25-006) & (NWO/SPI 56-464-14192). RAH is financially supported by NWO (Rubicon), MB is supported by NWO (VENI: 451-04-034). The authors would like to express their gratitude to Jacintha Scheerder and Danielle van Versendaal for their help in the data collection. We are indebted to Irene Rebollo for her assistance in the LISREL analyses and to Leo Beem for his help in the mixed model procedures. Furthermore we would like to thank Professor Simon Baron-Cohen for his valuable feedback.

Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Author information

Authors and Affiliations

Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands
Rosa A. Hoekstra, Meike Bartels & Dorret I. Boomsma
Autism Research Centre, Department of Developmental Psychiatry, University of Cambridge, Douglas House, 18b Trumpington Road, Cambridge, CB2 8AH, England, UK
Rosa A. Hoekstra
Department of Psychiatry, GGZ Buitenamstel, Amsterdam, The Netherlands
Danielle C. Cath

Corresponding author

Correspondence to Rosa A. Hoekstra.

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

About this article

Cite this article

Hoekstra, R.A., Bartels, M., Cath, D.C. et al. Factor Structure, Reliability and Criterion Validity of the Autism-Spectrum Quotient (AQ): A Study in Dutch Population and Patient Groups. J Autism Dev Disord 38, 1555–1566 (2008). https://doi.org/10.1007/s10803-008-0538-x

Received: 08 November 2006
Accepted: 02 January 2008
Published: 27 February 2008
Issue Date: September 2008
DOI: https://doi.org/10.1007/s10803-008-0538-x

Introduction