Background

Vision is the primary sensory modality in primates, reflected by the visual cortex being the largest of all the sensory cortices. Our eyes perform quick orienting movements ('saccades') towards interesting features of stimuli in the external world [1]. In general, we tend to look longer at more rewarding stimuli [2]. This rationale lies behind the 'preferential looking' technique in infancy research, where gaze duration and direction are assumed to reflect visual preference [26]. Gaze not only informs us about normative variation in the visual processing of stimuli but also is relevant to the understanding of complex neurodevelopmental conditions such as autism spectrum conditions (ASC), which are characterised by atypical gaze fixation patterns [7, 8]. This has led to the suggestion that gaze fixation patterns could constitute potential endophenotypes for such conditions. Gaze patterns show high test-retest reliability as well as a moderate to high heritability when tested in twins [911], suggesting a significant genetic contribution. This raises the possibility that variation in candidate genes underlie normative variation in gaze patterns.

The measure of particular interest to us is the duration of gaze fixation, given the evidence that people with ASC show reduced gaze fixation towards social stimuli [8, 1215]. Research in primates suggests that the striatal region plays a major role in directing gaze [16]. The striatum is thought to encode a 'value map' of the visual stimuli. Both ventral striatal neurons as well as a subpopulation of caudate neurons encode reward magnitude of the stimuli [17, 18]. This 'value map', in addition to further frontal cortical inputs, is then passed to the lateral intraparietal area (LIP), where a fine-tuned map of 'relative expected subjective value' is created. The LIP projects to the frontal eye fields, which send excitatory projections to the caudate nucleus. A subset of neurons from the caudate nucleus inhibit the substantia nigra and consequently disinhibit the superior colliculus, which in turn controls the gaze control nuclei in the brainstem, leading to a gaze shift [19].

One of the key molecular systems involved in the functioning of the striatal circuit is the endocannabinoid system. It is a neuropeptidergic circuit involved in reward processing and works in tandem with the mesolimbic dopaminergic system [20]. Expressed selectively in the brain, the cannabinoid receptor 1 (CNR1) is the best-studied molecule of this system. Immunolocalisation studies in rats and humans have indicated high CNR1 expression levels in the striatum, a region known for its central role in reward processing [2024]. CNR1 is believed to modulate striatal dopamine release through a trans-synaptic mechanism involving both GABAergic and glutamatergic synapses and is expressed strongly in the caudate, putamen, globus pallidus internal and substantia nigra, as well as in the shell of the nucleus accumbens [25]. Phasic release of striatal dopamine is the primary mechanism encoding for reward [26].

Recent studies have suggested abnormalities in ASC in striatal volume [27, 28], connectivity [29] and activity in response to social stimuli [30]. In addition, a gene expression study of postmortem brain tissue of people with ASC found reduced expression of CNR1[31]. In view of the atypical gaze behaviour of people with ASC, together with the observed striatal atypicalities, it is reasonable to examine the phenotype of gaze patterns as a function of variation in genes expressed in the striatum.

As gaze fixation is linked to striatal activity [16, 17, 19], we might expect that molecular variation in the genes involved in striatal function would be associated with differences in gaze towards socially rewarding stimuli. Using functional magnetic resonance imaging (fMRI), we previously found genetic variation in CNR1 modulated activity in the striatal region while watching happy (but not disgust) faces [32]. This result has been independently replicated in larger samples [33]. In the current study, we aimed to conduct an identical experiment using gaze-tracking in a new sample of volunteers. Specifically, we tested whether CNR1 genetic variation influences gaze duration towards happy faces. To ensure that this was closely matched to the original fMRI experiment, we also analysed gaze fixation duration for disgust expressions as a function of CNR1 genetic variation. Disgust faces are potential signals of 'nonreward', in contrast to rewarding happy faces, and hence provide a high-level control condition (that is, matched for face-specific qualities, such as configural features, as well as more general visual qualities of the stimuli, such as colour, shape and luminosity) for our experiment. We predicted that variation (single-nucleotide polymorphisms (SNPs)) in the CNR1 gene would be significantly associated with individual variability in gaze duration towards happy but not disgust faces.

Methods

Participants

A total of 30 student volunteers (13 males and 17 females; 29 right-handed and 1 left-handed; mean age ± SD, 24.1 ± 3.41 years old) were recruited by advertisement from the local universities. Participants were included only if all four grandparents were of Caucasian European ancestry to avoid genetic heterogeneity between different populations. Participants were also excluded if they reported any history of psychiatric diagnoses or regular drug abuse. They were equated for educational background in that all had completed high school and were studying towards a college degree. All had normal (or corrected to normal) vision. The study was approved by the Psychology Research Ethics Committee of the University of Cambridge.

Buccal swabs were collected from all participants, and DNA was extracted. The four SNPs of choice were identical to those selected in our earlier fMRI study (rs1049353, rs806377, rs806380 and rs6454674), chosen to ensure a minor allele frequency > 0.2 in a Caucasian population and to cover as much of the gene as possible (see Figure 1) [32]. The DNA was genotyped by Geneservice, Inc. (Cambridge, UK) using standard TaqMan™ assays (Applied Biosystems, Inc., California, USA). Genotyping for these SNPs failed for two of these participants, resulting in a sample size of 28 participants for the final analysis.

Figure 1
figure 1

Schematic structure of the cannabinoid receptor CNR1 gene with all four genotyped single-nucleotide polymorphisms indicated. Top: White boxes indicate untranslated regions, black boxes indicate translated regions and intervening straight line indicates an intronic region. Bottom: The linkage disequilibrium structure of the gene in the Caucasian (CEU) population is shown (using the publicly available HapMap version 3, release R2, database available at http://hapmap.ncbi.nlm.nih.gov/.

Procedure

The stimuli were taken from the Mindreading™ set developed by Baron-Cohen et al. [34], since dynamic facial expressions of emotion are assumed to be more ecologically valid than static photographs. The Mindreading set consists of video, audio and textual examples for 412 different emotions arranged into 24 emotion groups and organised according to six different developmental levels (based on emotions recognised in childhood through adulthood). These stimuli have been validated in typical populations and in people with ASC [3538]. These stimuli were chosen over other existing available stimuli because Mindreading stimuli comprise dynamic emotional expressions whilst alternatives (such as the Ekman and Friesen set [39], the Karolinska Directed Emotional Faces set [40] and the NimStim set [41]) comprise static expressions. The Mindreading stimuli have excellent interrater reliability and external validity [36, 38] (stimuli are available at http://www.jkp.com/mindreading/).

Participants were seated comfortably at a fixed distance of 60 cm from the screen and were instructed to keep movement to a minimum. Participants watched 80 videoclips (three seconds each and sixteen clips for each of the five emotions) presented in a pseudorandom order using GazeTracker™ software (DynaVox Inc., Virginia, USA) with an interstimulus interval of six seconds. Participants were shown a fixation cross during the interstimulus interval. All stimuli were centred on a 19-inch monitor and occupied 70% of the screen area. To ensure that participants were attending to the stimulus, they were asked to say aloud what emotion they thought was being displayed (choosing one of five emotion words: 'happy', 'sad', 'angry', 'disgust' or 'fear'). Their responses were recorded by the experimenter.

The Eye Response Interface Computer Aid camera (ERICA; http://www.eyeresponse.com/) was used to measure fixation time at each point at 60 Hz. ERICA uses reflected low-frequency infrared rays (λ = 880 nm) to map macrosaccades and fixation times at each point. The data were preprocessed using GazeTracker™ software. To ensure that the measured gaze duration was specific to the socially informative regions of the emotion expressions [35], 'look zones' were manually drawn around the eyes (the eyebrows and lower eyelids) and mouth region (the region from the bottom of the nose to the bottom of the lower lip) of all stimuli (see Figure 2). All look zones were 'dynamic'; that is, they tracked the eyes and the mouth regions while allowing for head movement of the actors.

Figure 2
figure 2

Example gaze trail from a single participant and group mean gaze durations for happy faces classified by genotype. (a) Example gaze trail from a single participant on a still frame from a video stimulus showing a happy expression. The black circles represent fixation points and the amount of time (in seconds) spent in each. The dotted lines demarcate each look zone (eyes region and mouth region). (b) Gaze duration for happy faces grouped by genotype for rs806380 (top) and rs806377 (bottom), respectively. Unfilled circles indicate mean gaze duration, and error bars represent ± 1 SEM. From the Mindreading™ set developed by Baron-Cohen et al. [34]

The sum of duration of all fixations was recorded for each look zone. A fixation was defined as a continuous gaze for 100 ms within a 40-pixel diameter (corresponding to a 1.3° visual angle), which was in line with parameters used in similar studies [15]. Gaze duration for each expression was calculated by summing the mean fixation time for eyes and mouth regions. Different regions of the face (that is, eyes and mouth) are relevant for processing different basic emotions [12, 4244]. Thus it is not ideal to compare the fixation time to the eyes region for happy and disgust faces, since disgust faces are associated with greater gaze duration to the mouth region. Hence, total fixation time across eyes and mouth regions was used as the dependent variable.

Results

Both happy and disgust expressions were recognised with > 80% accuracy. Genetic association was measured using the UNPHASED programme (http://www.mrc-bsu.cam.ac.uk/personal/frank/software/unphased), which computes the retrospective likelihood, that is, the probability of observing different genotypes given an observed distribution of a quantitative trait [45]. The two dependent variables (gaze duration for all happy and all disgust faces) and the genotypes for all four SNPs were included in a single analysis. This analysis revealed a significant association of the gaze duration for happy faces with rs806377 (χ2 = 8.88, df = 2, P = 0.011) and rs806380 (χ2 = 8.46, df = 2, P = 0.014). No significant associations (at P ≤ 0.05) were noted for gaze duration for disgust faces (nominal prs806377 = 0.104 and nominal prs806380 = 0.086). To test whether the observed lack of significant association with disgust faces was due to one video clip that was misclassified by a majority of the participants, the data were reanalysed after removing all fixation data associated with this one video clip. This revealed an identical pattern of results, with a nominal prs806377 = 0.111 and a nominal prs806380 = 0.105.

However, when multiple SNPs are in linkage disequilibrium (LD), hypothesis tests in single-locus analyses are not independent. To take this into account, Li and Ji [46] proposed a method for estimating the true number of independent tests (Meff), which takes into consideration the LD between SNPs. This method was implemented using the SNPSpD software programme [47], which revealed that Meff was 3 in the current sample. The Bonferroni correction for three independent tests gave a corrected P = 0.033 for rs806377 and a corrected P = of 0.042 for rs806380 for association with gaze duration for happy faces.

To further analyse genotypic differences for each SNP that were significantly associated with gaze duration for happy faces, post hoc t-tests were conducted. In rs806377, the CC genotype was associated with longer gaze duration than the CT genotype (t = 2.92, P < 0.025 with the Bonferroni correction). In rs806380, the GG genotype was associated with longer gaze duration than the AA genotype (t = 2.78, P < 0.05 with the Bonferroni correction) (see Figure 2).

The main effects of all possible haplotypes were tested with various possible window sizes (two, three and four marker combinations) using UNPHASED software. None of these haplotypic association tests were significant at P < 0.05. While the small sample size did not allow for a robust test of sex differences in this genetic association, we report the nominal P values for these tests for the sake of completeness. rs806377 was significantly associated with the gaze duration for happy faces in both females (P = 0.021) and males (P = 0.004). Additionally, in males, rs806380 (P = 0.019) and rs1049353 (P = 0.004) were found to be associated with gaze duration for happy faces.

Discussion

In this experiment, we predicted that CNR1 genetic variations would be associated with differences in gaze fixation duration towards happy faces. This prediction was confirmed: two SNPs in this gene (rs806377 and rs806380) were associated with differences in gaze duration for happy (but not disgust) faces. This finding fits well with the established role of the CNR1 gene in reward processes [20] and is consistent with the results of fMRI studies [32, 33] in showing that this gene is a component of the molecular architecture of social reward processing. Social reward processing has been suggested to be impaired in people with ASC [4850], particularly as reflected in atypical gaze patterns towards social stimuli. Hence the current results could be relevant to understanding the genetic underpinning of the social behavioural symptoms in people with ASC.

A comparison of these results with those from our earlier fMRI study reveals that for the SNP rs806377, the allelic group (CC) associated with the highest striatal response is also associated with the longest gaze duration for happy faces. For rs806380, the allelic group associated with the highest striatal response (GG) is also associated with the longest gaze duration for happy faces. rs806377 is located in an untranslated region (UTR) of the gene (Figure 1), and rs806380 was found to be in significant LD with a 5'-UTR SNP (rs78074274) using CandiSNPer [51]. The observed effects can thus be mediated by either or both of these UTR SNPs by potentially altering gene transcription and/or mRNA stability. Since the fMRI and gaze duration data come from largely independent samples (only three of thirty participants were common to both studies), it is likely that the observed genetic differences reflect real effects.

We interpret the genetically linked biasing of visual perception in terms of individual differences in the reward circuitry. The two processes of increased visual preference (indicated by longer gaze duration) and increased striatal response for happy faces are linked in a positive feedback loop [5]. We tend to look longer at preferred stimuli, which in turn increases our preference/'reward value' for these stimuli. Consequently, we interpret the observed effect in biasing visual perception of social stimuli in terms of differences in the individual reward circuitry. Whether such intrinsic differences in reward circuitry change the formation and nature of 'saliency/value maps' formed during gaze fixation is a question for future research [19, 52].

A second broader question for future research is whether the observed CNR1 genotypic differences in fixation duration for happy faces is specific to social rewards or whether this holds true for all classes of rewards. Variation in CNR1 has been linked to polysubstance abuse and associated with increased activity in reward-processing areas of the brain in response to drug cues for both marijuana and alcohol addicts [53, 54]. Hence it is possible that the observed genotypic differences in the general population may extend to other classes of rewards. Crucially, however, a reduced experience of rewards in response to social stimuli such as happy faces (as has been suggested by Dawson et al. [48] to apply to ASC) has more far-reaching consequences, since if an infant is looking less at happy faces and is finding them less rewarding, this will make social interactions less reinforcing, which in turn can exacerbate the social difficulties observed in people with ASC.

It is possible that a number of genes, each of small to medium effect size, determine the striatal response to social stimuli such as happy faces. Other potential candidate genes might include those involved in the oxytocin-vasopressin system (OXTR, AVPR1A and AVPR1B) as well as those coding for key proteins involved in neutrotransmission (for example, MAOA and GABRB3) [55]. We speculate that these genes have an additive effect and might potentially underlie complex traits related to social functioning [56]. In a larger population-based genetic association study of empathy, we found a nominally significant association of CNR1 genetic variation with the Empathy Quotient [55, 57]. Additionally, reduced expression of CNR1 was found in postmortem brain tissue of individuals with ASC [31]. Together, these findings further support the implication that variation in the CNR1 gene modulates the response to social stimuli such as happy faces.

However, the current findings should be interpreted with caution, since, in the absence of any expression data, any functional role for the SNPs can only be speculative; that is, the observed SNP effects may be caused by being in LD with other functional polymorphisms and/or through mechanisms that affect mRNA stability or splicing as mentioned earlier. However, the observation of genetic differences in two separate (albeit small) samples using an identical paradigm with two different techniques points towards a putative role played by CNR1 in the response to happy faces.

Conclusions

In this study, we tested whether common variants in the CNR1 gene modulate gaze duration towards happy faces. We found that two SNPs in this gene were significantly associated with gaze duration for happy (but not disgust) faces. This result is consistent with that of our previous fMRI study [32]. Specifically, the allelic groups that were found to be associated with the strongest striatal response in our fMRI study were associated with the longest gaze duration for happy faces in the current sample. This finding suggests a role for CNR1 in social reward processing and could have significance for clinical conditions such ASC, which are marked by a deficit in social reward processing as well as atypical responses to facial expressions of emotion [35, 36, 49].