Introduction

The Autism Spectrum Quotient (AQ-Adult) was developed to quantitatively measure traits associated with the autistic spectrum in adults of normal intelligence. The AQ-Adult is a brief, self-administered, 50-item questionnaire. Results from the AQ-Adult demonstrate that those with an autism spectrum condition (ASC) diagnosis score significantly higher than a sample from the general population. 80% of individuals with a diagnosis of Asperger Syndrome (AS) or high functioning autism (HFA) scoring at or above a cut-off of 32 (out of 50), whilst only 2% of controls score at or above this cut-off (Baron-Cohen et al. 2001).

The AQ-Adult shows strong heritability as demonstrated in a general population twin study (Hoekstra et al. 2007), and in family genetic studies of parents of children with ASC, who score higher than unrelated individuals (Bishop et al. 2004; Wheelwright and Baron-Cohen, submitted). As such it not only identifies the narrow autism phenotype (i.e., individuals with a diagnosis of childhood autism) but also the broader autism phenotypeFootnote 1 (Bolton et al. 1994; Landa et al. 1992; Le Couteur et al. 1996; Sears et al. 1999). The AQ-Adult has also been found to predict a diagnosis of AS in a clinic sample (Woodbury-Smith et al. 2005). It shows high inter-rater and test–retest reliability (Baron-Cohen et al. 2001). The norms reported in the 2001 sample have been replicated very closely in independent samples in the UK (Austin et al. 2002; Wheelwright et al. 2006), in a Dutch sample (Hoekstra et al., submitted) and several Japanese samples (Kurita et al. 2005; Wakabayashi et al. 2004, 2006). In the Dutch sample there was also a test of the clinical specificity of the AQ-Adult, where it was found that adults with AS or HFA scored significantly higher than general adult psychiatric clinical controls (Hoekstra et al. 2007).

An adolescent version of the AQ has also been developed (AQ-Adol) (Baron-Cohen et al. 2006). The AQ-Adol is completed by a parent about their child, but in almost all respects items on the AQ-Adol are the same as the AQ-Adult. Results showed that adolescents scored very similarly to adults, with individuals with ASC scoring significantly higher than matched controls (Baron-Cohen et al. 2006). For the AQ-Adol, 90% of adolescents with a clinical diagnosis of an ASC scored at or above the cut-off of 30, compared to 0% of controls (maximum score = 50). Sex differences were found on both the adult and adolescent versions of the AQ, with typical males scoring higher than females. Specifically, the mean score for males on the AQ-Adult was 17.8 (SD = 6.8) and for females was 15.4 (SD = 5.7), which is comparable to the mean score for males (M = 20.2, SD = 4.8) and females (M = 15.3, SD = 5.7) on the AQ-Adol.

Given the similar results on the AQ-Adult and the AQ-Adol, it is of interest to design a child version of the AQ (AQ-Child). A questionnaire of similar structure would allow for comparison with adolescent and adult data for individual items or sub-domains. Over a longer period it may also be possible to measure any changes in the profile of autistic traits with age. This might help to determine how stable autistic traits are across the lifespan. In addition, if the AQ-Child is found to predict the presence of a clinical diagnosis of an ASC in clinical samples, this brief questionnaire may be a useful screening measure that could identify children at risk for ASC in a clinical setting as well as being invaluable for epidemiological research that requires large samples.

The AQ-Adult and AQ-Adol are divided into five different domains or categories of autistic traits. These are: social skills, attention to detail, attention switching, communication and imagination. Each domain is assessed by ten questions. AQ items consist of statements that are answered in a Likert scale (Definitely Agree, Slightly Agree, Slightly Disagree and Definitely Disagree). The minimum score on the AQ-Adult and AQ-Adol is 0 and the maximum score is 50. These questionnaires have demonstrated good internal consistency indicated by high α coefficients for items measuring each of the five sub-domains.

The present study aimed to develop a version of the Autism Spectrum Quotient suitable for children (AQ-Child). The questionnaire format is maintained and questions are modified in order to identify autistic traits through parent report. The psychometric properties of this measure are investigated by administering the AQ-Child to n = 1,225 control and n = 540 children with an ASC diagnosis. Data from these samples are analyzed to identify an appropriate cut-off that may predict the presence of ASC as well as to analyze the factor structure for comparison with the AQ-Adult and AQ-Adol. It was expected that similar scoring patterns would be found in children as has been found previously in older populations: children with ASC scoring the highest, followed by control males then by control females (Baron-Cohen et al. 2001, 2006).

Method

The AQ-Child

The AQ-Child is a 50-item parent-report questionnaire developed to detect autistic traits in children at 4–11 years of age. The AQ-Child was designed to be a parent-report questionnaire, since self-report by children might be restricted by reading and comprehension difficulties. It was adapted from the adult and adolescent versions of the AQ, and items that were not age-appropriate in the adult questionnaires were revised accordingly. Items in the AQ-Child were kept as close to the AQ-Adult and AQ-Adol as possible, with most questions aimed at the same behaviors (see Appendix 1). Items were worded to produce an approximately equal agree/disagree response in order to avoid a response bias.

The AQ-Child consists of a series of descriptive statements designed to assess five areas associated with autism and the broader phenotype: social skills (items 1, 11, 13, 15, 22, 36, 44, 45, 47, 48), attention switching (items 2, 4, 10, 16, 25, 32, 34, 37, 43, 46), attention to detail (items 5, 6, 9, 12, 19, 23, 28, 29, 30, 49), communication (items 7, 17, 18, 26, 27, 31, 33, 35, 38, 39) and imagination (items 3, 8, 14, 20, 21, 24, 40, 41, 42, 50), each represented by ten items. Higher scores correspond to more ‘autistic-like’ behavior.

In the scoring system of Baron-Cohen et al. (2001) items are scored as 1 for a response in the ‘autistic’ direction and 0 for a ‘non-autistic’ response. The response scale in the present study adopted the scoring scheme used in recent studies of the AQ-Adult (Austin 2005; Hoekstra et al. 2007) where the response scale is treated as a 4-point Likert scale. Parents rate to what extent they agree or disagree with the statements about their child, with the following answer categories: 0 representing definitely agree; 1 slightly agree; 2 slightly disagree; and 3 definitely disagree. Items were reverse scored as necessary. This method was used because it was felt that the degree of endorsement of each item contained additional information and was therefore retained. Total AQ scores were represented by the sum each item score. The minimum AQ score (0) indicates no autistic traits; the maximum score (150) suggests full endorsement on all autistic items.

Participants

Group 1 (n = 1,225, 618 girls, 607 boys) included children who were participating in a large epidemiological study of social and communication skills in children aged 4–9 years old (Scott et al. 2002; Williams et al. 2005). Children were excluded if the child had any of the following conditions: ASC (n = 34), language delay (n = 58), dyspraxia (n = 13), epilepsy (n = 3), or ADHD (n = 28). Twin births (n = 32) or siblings (n = 62) of children with an ASC were also excluded. In the case where more than one child in a family was participating, the child whose age was closest to the mean age of the control group (M = 9.82, SD = 1.27) was retained, and the other siblings were excluded (n = 48). Siblings and twin births were excluded to ensure independence of data. Initially, 2,777 questionnaires were sent out by post, resulting in a response rate of approximately 50%. This sample was drawn from primary schools in Cambridgeshire, UK. Results from a smaller sample of 89 control children (42 girls, 47 boys) suggest that IQ [measured by the Wechsler Abbreviated Scale of Intelligence (WASI; Wechsler 1999)] was not correlated to the AQ-Child, therefore IQ data were not collected (r = −0.10, p > 0.5).

Group 2 comprised children diagnosed with an ASC by psychiatrists using established DSM-IV criteria (APA 1994). Children with a diagnosis of autism [n = 192, mean age = 7.58 (SD = 2.43)] or AS/HFA (n = 348) were included in the study, mean age = 9.31 (SD = 2.10). Children with a diagnosis of PDD-NOS (n = 26) or atypical autism (n = 4) were excluded from the study due to the small sample sizes. Children with a diagnosis of autism were grouped separately from the children with an AS/HFA diagnosis. Mothers of these children were recruited via the University of Cambridge Autism Research Centre website and completed the questionnaires online (http://www.autismsresearchcentre.com).

Questionnaires with more than five blank items were considered incomplete and these data were discarded in subsequent analyses (n = 57). If five or fewer answers were missing, the AQ-Child score was corrected for missing items by making the following calculation: total AQ-Child score + (mean item score × number of missing items) (Hoekstra et al. 2007).

Results

Item Analysis

An item analysis was conducted to examine scoring patterns on each item (see Table 1). Inspection of these scores showed that there were three items where controls scored higher than Group 2 (items 29, 30, 49). All three of these items focus on ‘attention to detail’. Closer inspection suggests that these items may be difficult to examine for young children and so these were eliminated in subsequent analyses.

Table 1 Item analysis—mean scores for each item by group each group

Internal Consistency

Cronbach’s α coefficients were calculated and for the measure as a whole, the α coefficient was high (α = 0.97). The internal consistency of the five AQ-Child subscales were also satisfactory (social skills = 0.93; attention to detail = 0.83; attention switching = 0.89; communication = 0.92; and imagination = 0.88).

Test–Retest Reliability

A random selection of 500 parents were asked to complete a second copy of the AQ-Child to examine test–retest reliability, resulting in 272 test–retest pairs (141 girls, 131 boys). The mean time interval between the first and second test was 12.3 weeks (SD = 2.01). For the AQ-Child, the correlation between the two tests was good (r = 0.85, p < 0.001).

Factor Analysis of the AQ-Child

The retained 47 items of the AQ-Child were subjected to principal components analysis (PCA). Prior to performing PCA, the suitability of the control AQ-Child data for factor analysis was assessed. Inspection of the correlation matrix revealed the presence of many coefficients of 0.3 and above. Diagnostic checks suggested that the data were suitable for analysis: the Kaiser–Meyer–Oklin value was 0.93, and the Bartlett’s test of sphericity reached statistical significance (χ 2 = 19841.29, df = 1081, p < 0.001).

An oblique rotation was used, since it is reasonable to assume that the autism sub-components would be related. Five components arose explaining 21.8, 8.9, 5.1, 4.2 and 3.2% of the variance. An inspection of the scree plot revealed a clear break after the first, second and fourth components. It was decided to retain four components for further investigation. Before rotation, the four-component solution explained a total of 40% of the variance. A correlation (r = 0.33) was found between the first and third factors, confirming that the factors are not independent. To aid in the interpretation of these components, oblimin rotation was performed. All items with factor pattern matrix elements greater than 0.3 are included. The factors were named mind-reading, attention to detail, social skills and imagination. These factors were found to be respectively highly correlated with the original AQ sub-scales of communication (r = 0.97, p < 0.001), attention to detail (r = 0.95, p < 0.001) social skills (r = 0.97, p < 0.001), and imagination (r = 0.97, p < 0.001). Table 2 presents the items of the four scales in order of loadings on the components (highest first). The eigen values of the rotated factors and the percentages explained by each of the factors are also shown. The internal reliabilities of the new factors were 0.96, 0.85, 0.94 and 0.90 respectively.

Table 2 Factor structure of the AQ-Child

Group Differences

Group differences were examined using both the original AQ-Child subscales and on scores for the factors discussed above. Mean AQ scores (total) for each group, broken down by sex and by subdomain, are shown in Table 3.

Table 3 Mean scores for subscales and total by group

Examination of the original subscales using an ANOVA of Total AQ score by group (control, AS/HFA and autism) and sex showed a significant effect of group [F(2, 1759) = 1277.66, p < 0.001]. Post hoc Dunnett T3 tests revealed that the two clinical groups scored significantly higher than the typically developing group (p < 0.0001), but that the two clinical groups did not differ from each other. The main effect of sex was also significant [F(1, 1759) = 6.33, p = 0.01]. The interaction between group and sex was also significant [F(2, 1759) = 3.56, p < 0.05].

T-tests confirmed that there was a significant sex difference [t(1154.11) = 7.02, p < 0.001, equal variances not assumed] in the control group (males scoring higher than females), confirming the same effect reported with the AQ-Adult and AQ-Adol. There were no significant sex differences in the clinical groups [Autism group: t(190) = 0.26, p > 0.05; AS/HFA group: t(346) = 0.15, p > 0.05]. See Fig. 1 for a visual representation of AQ distribution scores for control girls and boys and for the AS/HFA and Autism groups. The clinical groups differed from the control group on all subdomain scores (for t-tests results see Table 3). No association between age and AQ total was found (r = 0.03, p > 0.05) suggesting that AQ scores are independent of age in this sample.

Fig. 1
figure 1

Scoring patterns on the AQ-Child by group

Cut-off Scores

Figure 2 shows the receiver-operating-characteristic (ROC) curve. The area under the ROC curve was 0.99 (95% C.I.: 0.98–0.99), indicating that total score is a good indicator of ASC diagnosis.

Fig. 2
figure 2

ROC curve of the sensitivity and specificity of AQ-Child Score. Area under the curve = 0.99

Table 4 also shows sensitivity and specificity values for a range of potential cut-offs as well as the percentage of each group (by sex) scoring at or above each cut-off (maximum obtainable score = 141 when only using retained items). Adult (Baron-Cohen et al. 2001) and adolescent (Baron-Cohen et al. 2006) data are also shown. The ROC analysis showed that a score of 76 showed both high sensitivity (95%) as well as high specificity (95%). With a cut-off of 76, less than 2% of control girls and 7% of control boys scored at or above the ASC cut-off, whereas 95% of children with AS/HFA and 95% of children with Autism scored at or above this cut-off.

Table 4 Comparison of cut-off points

For the AQ-Adult, a score of 32 or above was chosen as the cut-off since 79.3% of the clinical group scored at or above this score, whereas only 2% of the control adults did so. The AQ-Adol used a cut-off of 30.

Discussion

This study examined the psychometric properties of the Autism Spectrum Quotient-Children’s version (AQ-Child) The AQ-Child showed excellent test–retest reliability and high α coefficients for the questionnaire as a whole as well as for each of the original five sub-scales (communication, attention to detail, social skills, imagination and attention switching). Factor analysis provided support for four factors which were highly correlated with four of the original subscales: communication (renamed mind-reading), attention to detail, social skills and imagination. All of these four factors also showed high reliability coefficients.

A cut-off of 76 was adopted and showed high sensitivity (0.95) and specificity (0.95). In addition, 95% of the AS/HFA and 95% of the autism group scored at or above this cut-off, whereas only 4% of controls did so. These results demonstrate that the AQ has good construct validity as those with a diagnosis score significantly higher than those without a diagnosis. Figure 1 shows the clear differentiation of total score between those with and without a clinical diagnosis. However, caution should be exercised when using any cut-off to indicate diagnosis because diagnosis is not dependent on an absolute score but on whether the traits cause impairments in everyday functioning (APA 1994; ICD-10 1994). A further limitation is that we have no confirmation that individuals in Group 1 with no clinical diagnosis do not have an ASC, and future population-based studies could assess those who score above the cut-off to see whether those without a diagnosis would warrant one. Such studies would inform the psychometrics of the AQ.

Comparisons between the AS/HFA and autism group showed no significant difference in total AQ-Child score. This may be because as a whole, this instrument is not biased toward language skills. However, several items (e.g., 10, 27, 33) concern behaviors that may require some conversational competence, and we recommend that the AQ would be most useful for individuals with some speech, and with an intelligence in the borderline average range (70) or above. It would be interesting for future researchers to examine the relationship between AQ-Child score and severity of symptoms, as this was outside the scope of this study.

Sex differences in the control group were found, with typically developing boys scoring higher than girls. Similar results are reported in many measures of ASCs. No sex differences were found in the AS/HFA or autism group. This may be because in the clinical groups, many more boys were in the study than girls. This is a common issue since the high male to female ratio of ASC limits the number of females available to participate in research studies. It would be beneficial for future research to obtain larger samples that can more closely examine scoring patterns between males and females with ASC.

Age was not associated with AQ-Child score. These results are consistent with those obtained from the Adolescent version of the AQ (Baron-Cohen et al. 2006). The items in the AQ-Child were designed to be appropriate for children between the ages of 4 and 11, which is the broadest range that the AQ has measured. Our results suggest that the AQ-Child measures traits that are consistent throughout the age groups. It would be useful in future studies to test the correlation between the AQ-Child and related measures (such as the Childhood Autism Spectrum TestFootnote 2) (Scott et al. 2002; Williams et al. 2005) which has been used in primary school age children, and to test if one instrument is more sensitive than another.

The use of a Likert format for responses in the questionnaire allowed parents to express the severity of a particular trait in their child. This was believed to be more appropriate for conditions like ASC where individuals lie on a spectrum of conditions. Eaves et al. (2006) comment on how a categorical ‘yes/no’ response can lead to missing data where the parent skips or misses questions or responds with a written answer in similarly designed questionnaires.

In summary, the AQ-Child is a strong measure that is able to discriminate the presence of ASC in a clinical sample and will be an important research tool for investigating autistic traits. The AQ-Child could also be a useful tool for researchers to follow the developmental trajectory in clinical samples, or as a measure of the broader autism phenotype in epidemiological samples. Future work needs to consider whether scores on the AQ-Child or AQ-Adol predict AQ-Adult score, and the potential value of the AQ as a developmentally sensitive screening instrument used to detect ASC in the general population.

Conclusions

This study reports the development of the AQ-Child, a new instrument that aims to quantify autistic traits in children aged 4–11 years. This 50-item parent-report questionnaire records a child’s behavior on a range of autistic traits. Factor analysis identified four subscales (named mind-reading, attention to detail, social skills and imagination). These were highly correlated with four of the original five designed subscales for the AQ-Child (communication, attention to detail, social skills and imagination, respectively). The high test–retest and reliability coefficients further support the structure of the questionnaire. The measure was administered to a group with an ASC diagnosis (n = 540) and a control group with no history of ASC in their immediate family (n = 1,225). Scoring patterns show a very clear differentiation between those with ASC and a control group. The questionnaire may be useful in screening for ASC rather than as a diagnostic tool, and is not a substitute for a clinical diagnosis. The development of the AQ-Child also presents the possibility of following autistic traits throughout an individual’s lifetime in a longitudinal study. Understanding the nature of the AQ continuum, in terms of its components and how it can be derived and predicted from these (Wheelwright et al. 2006), is an important goal for future research.