Training pet dogs for eye-tracking and awake fMRI

Karl, Sabrina; Boch, Magdalena; Virányi, Zsófia; Lamm, Claus; Huber, Ludwig

doi:10.3758/s13428-019-01281-7

Training pet dogs for eye-tracking and awake fMRI

Open access
Published: 16 July 2019

Volume 52, pages 838–856, (2020)
Cite this article

Download PDF

You have full access to this open access article

Behavior Research Methods Aims and scope Submit manuscript

Training pet dogs for eye-tracking and awake fMRI

Download PDF

Sabrina Karl¹,
Magdalena Boch^2,3,
Zsófia Virányi¹,
Claus Lamm² &
…
Ludwig Huber¹

7830 Accesses
23 Citations
28 Altmetric
Explore all metrics

Abstract

In recent years, two well-developed methods of studying mental processes in humans have been successively applied to dogs. First, eye-tracking has been used to study visual cognition without distraction in unrestrained dogs. Second, noninvasive functional magnetic resonance imaging (fMRI) has been used for assessing the brain functions of dogs in vivo. Both methods, however, require dogs to sit, stand, or lie motionless while yet remaining attentive for several minutes, during which time their brain activity and eye movements are measured. Whereas eye-tracking in dogs is performed in a quiet and, apart from the experimental stimuli, nonstimulating and highly controlled environment, MRI scanning can only be performed in a very noisy and spatially restraining MRI scanner, in which dogs need to feel relaxed and stay motionless in order to study their brain and cognition with high precision. Here we describe in detail a training regime that is perfectly suited to train dogs in the required skills, with a high success probability and while keeping to the highest ethical standards of animal welfare—that is, without using aversive training methods or any other compromises to the dog’s well-being for both methods. By reporting data from 41 dogs that successfully participated in eye-tracking training and 24 dogs IN fMRI training, we provide robust qualitative and quantitative evidence for the quality and efficiency of our training methods. By documenting and validating our training approach here, we aim to inspire others to use our methods to apply eye-tracking or fMRI for their investigations of canine behavior and cognition.

Head-mounted mobile eye-tracking in the domestic dog: A new method

Article 05 July 2022

Functional mapping of the somatosensory cortex using noninvasive fMRI and touch in awake dogs

Article Open access 20 April 2024

Neurobehavioral evidence for individual differences in canine cognitive control: an awake fMRI study

Article 09 April 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

After primates (see, e.g., Tomasello & Call, 1997), rodents (Pineno, 2010), corvids and parrots (ten Cate & Healy, 2017), and cetaceans (Mann, 2018), canines have become the main model system for the investigation of cognitive behavior in nonhuman animals (see, e.g., Katz & Huber, 2018; Miklósi, 2014). From a scientific point of view, dogs have become particularly attractive because of the interesting but yet not well understood interplay of long-term (canine evolution) and short-term (domestication) phylogenetic as well as ontogenetic (lifetime experiences) influences on cognition and behavior. After several decades of assuming both a special sensitivity and also cognitive ability of understanding humans (see, e.g., Huber, 2016) due to domestication and an increased dependence on humans (e.g., Hare & Tomasello, 2005), a current trend in the attempts to explain dog cognition and behavior is to emphasize socio-ecological factors (changed feeding ecology and social organization) and remaining traits from their wild progenitor, the wolf (Marshall-Pescini, Cafazzo, Virányi, & Range, 2017; Range & Virányi, 2015). In addition, the enormous amount of experience during their life with humans, which is often characterized by close, intimate relationships, must not be underestimated (Udell & Wynne, 2008). From an applied point of view, dog research is producing a great impact on society, spanning a range from how to handle man’s best friend as pets, or even as therapists to “bad dogs” (dog biting), let alone the practical importance of a better understanding of dogs for the growing number of industries (e.g., scent detection) that utilize the behavior of domestic dogs.

So far, the methodology for the investigation of dog cognition has been based predominantly on behavioral experimentation or observational studies. The great majority of studies have relied on analysis of the performance of dogs when they are confronted with challenging tasks in either the physical or the social domain (Bensky, Gosling, & Sinn, 2013; Lea & Osthaus, 2018). Only a few studies have applied advanced psychophysical techniques to examining perceptual and cognitive abilities with the aid of highly controlled, experimentally manipulated stimulation. An example of such a sophisticated stimulus device is the touchscreen (Steurer, Aust, & Huber, 2012; Wallis et al., 2017), which has been used to test discrimination, categorization, concept formation, and even inferential reasoning (Aust, Range, Steurer, & Huber, 2008; Range, Aust, Steurer, & Huber, 2008; Wallis et al., 2016). However, the use of more naturalistic stimuli, such as humans showing specific behavior, facial expressions, or gestures (e.g., pointing live, presentation), requires measuring the dog’s looking behavior via its head or eye movements (e.g., Adachi, Siebrits, Peirce, & Desroches, 2007; Barnard et al., 2016; Faragó et al., 2010; Huber, Racca, Scaf, Virányi, & Range, 2013; Mongillo, Scandurra, Kramer, & Marinelli, 2017; Racca et al., 2010; Schmidjell, Range, Huber, & Virányi, 2012; Wallis et al., 2015). These orientation movements that signal looking preferences, attention patterns, or gazing have usually been examined with video analysis—that is, by recording the dog’s head movements with video camera(s) and subsequently “manually” coding the video files using behavioral event recording software such as The Observer XT (Noldus Information Technology, The Netherlands) or Solomon Coder (developed by András Péter, www.solomoncoder.com).

These traditional methods are, however, not fine-grained enough to reveal the subelements of the looking behavior that are necessary to uncover the underlying mental mechanisms of the performance (Quinn, Doran, Reiss, & Hoffman, 2009). When only head movements are used, it is impossible to define to which parts of human faces the dogs’ attention is drawn (Somppi, Törnqvist, Hänninen, Krause, & Vainio, 2012). However, the quality of the technology or methods strongly depends on the research question, so that the sophisticated, mostly much more expensive methods only make sense if the rough methods are unable to answer the question. For instance, a video camera placed in front of the dog’s head can measure whether the dog is looking left or right, which is sufficient if the stimuli are distantly positioned in the left and right viewing fields of the dog. Examples are studies about size constancy (Müller, Mayer, Dörrenberg, Huber, & Range, 2011), looking preference for novel objects, face discrimination and inversion responses (Racca et al., 2010), and looking preferences for emotional human stimuli (Racca, Guo, Meints, & Mills, 2012). In conclusion, the choice of the methods depends on the necessary accuracy for measuring of the dog’s looking behavior. Only with the measurement of the dog’s eye movement with high spatial and temporal resolution one can determine how the dog scans the human face, for example, including fixations and quick shifts.

In human psychology, eye movement tracking has been developed as a technique for directly, objectively, and accurately assessing human gazing behavior (for an overview, see Holmqvist et al., 2011). For example, researchers aimed at determining the patterns of human face scanning by measuring frequencies, durations, and probabilities of fixations. The resulting spatial and temporal characteristics of fixation sequences could be used to examine human face perception (Walker-Smith, Gale, & Findlay, 1977) or the cognitive development of joint attention (Carpenter & Tomasello, 1995).

Three decades later, eye-tracking has also found its way into research on nonhuman animals. It was first used in veterinary medicine (neuro-ophthalmology) to diagnose ocular motor abnormalities such as nystagmus. For this purpose, the heads of untrained dogs were stabilized and held rigidly by locking arms or velcro head harnesses (Dell’Osso, Williams, Jacobs, & Erchul, 1998; Jacobs, Dell’Osso, Wang, Acland, & Bennett, 2009). Primatologists had been the first to recognize the advantage of eye tracking in monkeys (Guo, Robertson, Mahmoodi, Tadmor, & Young, 2003) and apes (Kano & Tomonaga, 2009, 2010) that are neither rewarded (as in conditioning paradigms) nor restrained (as in some preferential-looking paradigms), and therefore could show more natural behavior. The first attempt to apply eye-tracking in dog research utilized a head-mounted, portable, video-based eye-tracking camera. Williams, Mills, and Guo (2011) modified a VisionTrak head-mounted eye tracker (ISCAN ETL 500; Polhemus, Vermont, USA) to be used with one dog that was trained to wear the apparatus. Nowadays, lightweight goggles with in-built infrared cameras, such as the Tobii Pro Glasses 2 (Tobii AB, Stockholm, Sweden), are available for humans, but so far they do not work with dogs.

Contact-free eye-tracking with dogs has utilized standard, table-mounted eye-tracker systems with a remotely placed infrared camera. Such systems measure the dogs’ eye movements using infrared corneal reflection techniques. In most studies, the camera was integrated below a computer monitor placed at some distance from the dogs’ eyes, such as an iView X RED (SensoMotoric Instruments GmbH, Germany; Somppi et al., 2012, Somppi, Törnqvist, Hänninen, Krause, & Vainio, 2014; Somppi et al., 2016; Törnqvist et al., 2015), Tobii X50 (Tobii AB, Stockholm, Sweden; Téglás, Gergely, Kupán, Miklósi, & Topál, 2012), or EyeLink 1000 (Barber, Randi, Müller, & Huber, 2016). In these first studies using eye-tracking for dogs, the researchers investigated how dogs perceive human gestures in different ostensive contexts (Téglás et al., 2012), how dogs look at actual objects within pictures that differ in terms of novelty and categorical information (human and dog faces, toys, alphabetic characters; Somppi et al., 2012), whether dogs show a human-like facial inversion effect (Somppi et al., 2014), and how dogs with different social experience (pet dogs vs. kenneled dogs) look at pictures showing interactions between humans and dogs (Törnqvist et al., 2015). In five studies, the focus was on the dog’s response to emotional expressions. Researchers asked whether dogs show an attentional bias toward threatening social stimuli and whether their gaze fixation patterns are influenced by the different facial areas of human and dog faces (Somppi et al., 2016), how dogs with different experience with humans (pet vs. laboratory dogs) scan human emotional faces (Barber, Müller, Randi, Müller, & Huber, 2017; Barber et al., 2016; ), whether oxytocin has an impact on the processing of human facial emotions (Kis, Hernádi, Miklósi, Kanizsár, & Topál, 2017; Somppi et al., 2017), and whether the latter effect correlates with cardiac responses (Barber et al., 2017). Most recently, researchers have investigated con- and heterospecific auditory–visual matching in dogs when seeing a woman’s face and hearing her voice or seeing a dog’s face and hearing its barking (Gergely, Petró, Oláh, & Topál, 2019).

A crucial feature of investigating response patterns to such sophisticated stimuli by means of eye-tracking is that (1) the animals need to be in a relaxed enough condition to pay attention to and process the stimuli presented, and (2) at the same time, they need to stay motionless so that their head does not move throughout calibration and validation, as well as during the whole subsequent sequence of stimulus presentations. Calibration is used to collect fixation samples from known target points in order to map the raw eye data to gaze positions. Targets like white disc patterns on the black screen or even animated images are presented serially on a screen. The dog fixates each while samples are collected, and feedback graphics are presented on the host PC display. The calibration is checked automatically when it is finished, and diagnostics are provided. The subsequent validation provides the experimenter with information about calibration accuracy. This is measured in terms of the difference between the computed fixation position and the fixation position for the target obtained during calibration. This error reflects the gaze accuracy of the calibration.

It is obvious that the gazing patterns of subjects that are stressed by being restrained or forced to perform the task will not provide reliable data on how animals process the pictures, videos, and other visual stimuli they are presented with (Niehorster, Cornelissen, Holmqvist, Hooge, & Hessels, 2018). Moreover, physical fixation, such as being harnessed (Jacobs et al., 2009) or kept still by a human (i.e., the experimenter or the caregiver restrains the dog’s body or head manually), likely compromises both the natural looking behavior of dogs and their welfare. Habituating animals to such treatments requires intensive training that, we argue, is better to invest in getting reliably motionless and attentive subjects without any use of physical restraint. Not only may the well-being of the dogs favor this solution but, as long as pet dogs are being tested, the availability of subjects also may increase with this approach, which dog caregivers are likely to prefer.

An even more radical step forward in assessing cognitive processes as well as their neural correlates noninvasively in awake dogs is the use of functional magnetic resonance imaging (fMRI). Although dogs have been tested in behavioral studies of how they solve challenging problems or interact with humans, when perceiving human gestures, expressions, or even voice, we are limited in our conclusions about the underlying cognitive processes. It is not enough to infer from behavior what dogs think, how they feel, and what they understand. And we do not know whether similar behaviors across species result from the same proximate mechanisms. Neuroimaging provides a first look into the working brain during perception and the subsequent mental processes.

The advantages of this neuroimaging technique of estimating brain activity by changes in hemodynamic responses are at least threefold: It can localize neural activity in the brain with high precision, it allows network-level analyses, and, if applied properly, causes no harm nor requires invasive procedures in the tested subject. However, this comes at a cost. For instance, MRI data are highly susceptible to corruption from subject motion. The precise spatial localization of neural activity in the relatively small dog brain therefore requires that the dog lie motionless in the noisy, vibrating, and spatially restrained MRI scanner bore. Because anesthesia or sedation would negatively affect both brain function and cognition, by impeding attentiveness, altering the state of consciousness, and reducing rates of blood flow and respiration (Thompkins, Deshpande, Waggoner, & Katz, 2016), alternative ways are needed to achieve stillness. For the same reasons we described for eye-tracking, testing animals that, due to their training, stay in the scanner on a voluntary basis is highly preferred over physical restriction. Yet not only from an experimental perspective, but also from an ethical one, dogs must not be restrained but be free to leave the scanner whenever they want.

A breakthrough in training animals to remain still, wakeful, and attentive during scanning was achieved only a decade ago (Berns, Brooks, & Spivak, 2012; Tóth, Gácsi, Miklósi, Bogner, & Repa, 2009), and soon it was envisioned as a proper, noninvasive research technique to understand the neural mechanisms of canine cognitive function.

So far, five independent research groups—two in the USA (Atlanta, GA, and Auburn, AL), one in Mexico (Querétaro), and two in Europe (Budapest, Hungary, and Vienna, Austria)—have captured brain images of nonsedated and largely unrestrained dogs, and their work and publications indicate the interest in and the importance of this new frontier in functional neuroimaging (see Andics & Miklósi, 2018; Berns & Cook, 2016; Bunford, Andics, Kis, Miklósi, & Gácsi, 2017; Cook, Brooks, Spivak, & Berns, 2016; Huber & Lamm, 2017; and Thompkins et al., 2016, for reviews). Starting with studies on reward processing (Berns et al., 2012; Berns, Brooks, & Spivak, 2013; Berns, Brooks, Spivak, & Levy, 2017; Cook, Spivak, & Berns, 2014), subsequent studies investigated the default mode network (Kyathanahally et al., 2015), olfactory processing (Berns, Brooks, & Spivak, 2015; Jia et al., 2014; Jia et al., 2015), face processing (Cuaya, Hernández-Pérez, & Concha, 2016; Dilks et al., 2015; Thompkins et al., 2018), response inhibition (Cook, Spivak, & Berns, 2016), auditory processing (human and dog vocalizations: Andics, Gácsi, Faragó, Kis, & Miklósi, 2014; human words: Andics et al., 2016; Prichard, Cook, Spivak, Chhibber, & Berns, 2018), and emotion processing (“jealousy”; Cook, Prichard, Spivak, & Berns, 2018; human emotional faces: Hernández-Pérez, Concha, & Cuaya, 2018). These studies have not only provided a “proof of concept,” but also demonstrated the great potential of this neuroimaging approach to canine cognition. Still, a number of technological and methodological challenges need to be overcome to fully tap this potential (Huber & Lamm, 2017). Among them are appropriate training programs, which are both efficient and ethically responsible—that is, promoting rapid acclimation to the scanner environment with minimal stress and discomfort to the dogs. The challenge is to train animals to remain attentive and cognitively responsive without moving for long enough to make the necessary recordings. In eye-tracking, this means keeping the head still for at least 1 min, a time period that is needed to conduct the calibration and validation procedure and to record the dogs’ gazing patterns in response to the test stimuli presented. In the case of fMRI, a rule of thumb is that dogs need to stay still for at least 4 min, since this usually corresponds to the time period required to collect a sufficient number of fMRI images (with the actual duration depending on the imaging sequence and the experimental design).

So far, researchers who have successfully published studies about fMRI in awake dogs have used slightly different training methods that included chaining (e.g., Berns et al., 2013), target stick (e.g., Jia et al., 2014), and model–rival (e.g., Andics et al., 2014) training. Despite some differences that we will discuss later, all of them include techniques based on the principles of classical and operant conditioning (Dickinson, 1980). These denote learning processes in which new behaviors are acquired and modified through their association with consequences. All of these training methods, by strictly avoiding aversive methods, rely on reinforcing desired behaviors in order to increase the likelihood that the behaviors will occur again, as well as using negative punishment to decrease the probability of undesired behaviors. The training methods used so far have not been systematically compared; thus, we are far from describing a gold standard. Due to various unpredictable circumstances or to events unrelated to dog training—such as dogs that become sick, caregivers stopping participation for personal reasons, and so forth—we cannot compare the different variations quantitatively in terms of training success (e.g., the ratio of dogs that have been tested successfully of all dogs with which the researchers started training). Still, the overarching goal of any training methodology is to reduce training time while maintaining success in the desired behavior (Thompkins et al., 2016).

Here we aim to provide a comprehensive training program that has proved highly successful and, thus, can serve as a reference approach for future research. In short, this program is based on (a) systematic desensitization and habituation to the potentially stressful environment and (b) the shaping and ultimate chaining of several requisite behaviors by using primary and secondary reinforcers. In the case of fMRI, the dogs need to be habituated to the very loud MRI scanner noise (sound pressure levels of up to 96 dB), the operating vibrations caused by the magnet, the tight scanner enclosure (a constricted tube that may provoke apprehension in animals with enclosure anxiety), and the scanner ramp to get onto the elevated and narrow “patient table.” In case of eye-tracking, habituation is less of an issue, but shaping the necessary behaviors, such as putting the head on the chin rest and sitting or standing still, represents a similar training challenge.

Method

Ethics

All experimental procedures described here were discussed and approved by the institutional ethics and animal welfare committee in accordance with Good Scientific Practice (GPS) guidelines and national legislation at the University of Veterinary Medicine Vienna (approval number: 09/08/97/2012). In the case of the fMRI training, the decision was made on the basis of a pilot study at the University of Vienna.

Dog training for accurate eye-tracking

Subjects

Recruiting

All subjects were privately owned pet dogs recruited from human caregivers in Vienna via our Clever Dog Lab (CDL) website and database. The pet dogs were of various breeds, of both sexes (16 males, 25 females), and their ages ranged from eight months to nine years when they started the training (see Table S1). Most dogs were participating in dog activities such as agility, dog dance, therapy dog training, man trailing, search and rescue dog training, dummy training, training in obedience classes, and so forth, at least one or two times a week and were experiencing individual dog training on a daily basis by their caregivers. Their caregivers gave written consent for them to participate in the study.

Suitability check

Before starting the training, we checked whether the dog was suitable for the eye-tracker task. Limiting factors for choosing the subjects were, for example, age (maximum 10 years old) and eyesight, the eye shape, general state of health, length of hair around the eyes, excitement level of the dog, and whether the eye tracker could track the dog’s eyes. We needed to be sure that the dogs were able to see the visual stimuli presented on the screen. Therefore, we made a rapid eye check with a flashlight to exclude cataracts. The shape of the dogs’ eyes was also important. If the eyes were too droopy or, as in some dog breeds, tended to have visible third eyelids, it could happen that the eye tracker would have problems detecting the pupil because of the additional reflection (of wet areas). Since the dogs should sit or stand calmly while doing the eye-tracking, they should be in good health condition so they could repeat the procedure for several minutes. If the hair or the eye lashes around the dog’s eye were too long, this might also have distracted the eye-tracker system from detecting the pupil. The color of the iris could influence the pupil detection, as well. If the color of the dog’s eye was very bright—for instance, light blue—or the edge of the pupil was not really clear, the eye-tracking system could hardly distinguish between the iris and the pupil itself. The last suitability criterion was that the dog should be able to behave calmly and conduct a task during which it needed to be almost motionless for a certain amount of time (maximum of up to 3 min per trial). If all crucial criteria were fulfilled by the dogs, the training started and took place in the Clever Dog Lab, Vienna, at least once a week.

Study sample

The sample of subjects used for training and finally for the eye-tracking studies in Vienna consisted of 41 pet dogs (Table S1). All of them have been trained with a big screen, and out of these ones, 30 dogs learned to perform eye-tracking tests on a small monitor, as well (see below and Table S2).

Experimental setup

The dog training took place in the eye-tracking room of the Clever Dog Lab, Messerli Research Institute, at the University of Veterinary Medicine Vienna. The eye-tracking room is a large (588 × 356 cm), quiet, windowless room equipped with the chin rest device and the eye tracker (see Fig. S1). Light conditions in the room were kept constantly at 75 lux using LED light bulbs (9.5 W, 2700 K Philips GmbH Market DACH, Germany). We used the EyeLink 1000 eye-tracking system (SR Research, Ontario, Canada) because it allows a maximum amount of flexibility with regard to both data analysis and stimulus presentation. Because the camera was sitting just below the tracked area the subject was viewing, it could be used with life-sized stimuli being back-projected onto a large projection screen, or a computer screen, or even with live presentation. For details about the system, see Barber et al. (2016). Of course, other systems, such as the Tobii system or the iView X RED, can be used as well.

The maximum head movement the EyeLink 1000 could track without accuracy reduction was 25 mm horizontal and vertical, and 10 mm back and forth. The setting included a chin rest for stabilizing the participant’s head, either a big or small screen encompassing stimuli display area in the middle, and an eye movement recording camera connected with an infrared illuminator to its right.

We used a customized chin rest device for head stabilization (Fig. 1). A pillow with a v-shaped depression was mounted on a frame, to allow vertical adjustment of the chin rest to the height of the individual dog. The frame consisted of aluminum profiles (©MayTec, Dachau, Germany) that allowed the easily adjustable but stable fixation of additional equipment (e.g., cameras). The chin rest was positioned at a distance of 200 cm from a big projection screen (200 × 200 cm) and 50 cm from a small computer monitor (display PC monitor, 27-in., Asus PB 278), the camera and infrared illuminator. The precise height of the camera and chin rest and the angle of the camera were adjusted to each participant.

We built a wooden box (170 × 120 × 84.5 cm) around the eye-tracker stand as a means to reduce the dogs’ distraction (Fig. 2). This box (which we refer to as the “dog cinema”) had several doors (44 × 44 cm) in the side walls (75 × 40 cm), to be able to check the dog’s behavior inside, to give treats, and to adjust the eye tracker before each training or test session.

Procedure

The participating dogs were trained at least once a week and each training session lasted approximately 30–45 min—including breaks, depending on the dog’s condition.

During the entire dog training we used a clicker (small device that produces a metallic click sound) as a secondary reinforcer and worked with positive reinforcement of the correct behavior. The clicker marked the correct behavior of the dog and the reward for the dog followed immediately. This enabled us to announce the following reward for the dog even over a distance or when we were outside of the eye-tracker room. The reward used was dry food, and pieces of sausage (or another higher-quality food like cheese, depending on subject preferences or allergies) were used as “jackpot treats”—for instance, for outstanding performance or fast improvement, or to increase the dog’s motivation.

First, all potential eye-tracking dogs were trained to be able to perform the calibration and validation procedure and to perform eye-tracker tests with a big screen, following a slightly different training protocol. The dog training process for using the eye tracker and a small computer monitor in front of them consisted of three basic phases: (1) chin rest training, (2) white disc pattern presentation on the monitor training, and (3) calibration and validation training with the eye-tracker system.

Phase 1: Chin rest training

The first phase of training began with teaching the dog to remain calm at the chin rest. This process started with so-called free shaping to form the correct behaviors in the dogs. The dog to be trained was free to move around the eye-tracker apparatus and familiarize himself with the room and the equipment first. If necessary, the dog could be lured—for instance, with food or pointing gestures—into the apparatus at the beginning. At this stage it was important to observe the dog’s reactions. If it showed any kind of avoidance or fear-related signals, training could be modified in order to comfort the dog and reward him for approaching and interacting with the apparatus. Dogs were then guided—for example, with hand signals—toward the chin rest, and reinforced once any interest in the chin rest was expressed. If needed, dogs were initially lured over the chin rest, but then they often offered to lay their head on it on their own. Free shaping was then used to gradually increase the time while resting. To train the dog to reliably lay its head on the chin rest, we established a hand signal first. When the dog entered the apparatus after the cue, the clicker marked the correct behavior and the treat was given immediately thereafter. In addition to the hand signal, when the behavioral response was provided reliably, a vocal signal was introduced and was used to verbally send the dog to the chin rest (“Rest”). To train the dog to remain in the apparatus and stay still for longer periods, the dog was reinforced by gradually increasing the rest time. To generalize this behavior, we began to move around the dog and slowly increased the distance until we were able to completely leave the room. At the end of the chin rest training phase, the dog was required to stay calm on the chin rest for up to 1 min without moving even when distracted—as, for instance, by outside noise.

To make the situation most comfortable for the dog, the subjects were allowed to either sit or stand at the chin rest (Fig. 1). Age, health, physical condition, and overall well-being were considered when making this determination. For example, older dogs would likely prefer to sit. Therefore, during training this behavior was reinforced right from the beginning, to help them be more comfortable. When the dog chose the sitting position, it was necessary to regularly control for the dog’s head and body location. For example, while sitting it could happen that the nose was tilted upward, especially when the dog sat too far away from the chin rest, which could result in inaccurate detections or results in the eye-tracking. In comparison, when the dog stood at the chin rest, the head was more straight and to the front. Adaptations such as lowering the chin rest or training the dog to sit closer to the chin rest were especially helpful. Note that the prone position of the dog in the eye-tracker apparatus is not recommended, since it would likely encourage the dog to fall asleep.

Phase 2: White disc pattern presentation training on the display PC monitor

To accustom the dogs to the new equipment, the display PC monitor was first placed 100 cm away from the dog’s head. We used a variety of animal videos to facilitate the dogs’ attention to the monitor (Fig. 2). To prepare the dogs for the calibration and validation procedure, we created a special screen presentation with Microsoft PowerPoint 2010. The presentation consisted of several black slides with a big (diameter: 7.5 cm) white-disc pattern, placed at different positions on the screen. The dogs learned a certain vocal signal (“Guck”) to look at the monitor and were immediately rewarded when they seemed to gaze at the white disc pattern. The trainer was outside the eye-tracking room and presented the stimulus on the monitor while she controlled for the dog’s eye movements on the camera setup screen next to her. As soon as the dogs were used to the monitor at that distance, we placed it right behind the eye-tracker camera (distance, chin rest to monitor: 50 cm). Then we decreased the size of the white disc pattern (from 7.5 cm to 2 cm in diameter) in a step-wise fashion and presented it in dynamic mode along a triangle-shaped trajectory (similar to the EyeLink three-point calibration mode). From our experience, a moving stimulus increased the dogs’ attention and motivation to gaze-follow it, as compared to a static one. Next, we presented the stimulus according to the EyeLink calibration/validation mode: namely a white disc pattern (diameter: 2 cm) with a black hole (diameter: 1 cm). This stimulus only appeared on the three positions of the triangle without any movement. During this training step, the dogs had to learn to stay focused and to gaze at the position of the white disc stimulus for at least 3 to 4 s at five to six different locations in a row. We estimated the accuracy of the gazing behavior with the camera setup screen of the eye-tracker system and motivated and rewarded the dog for accurate gazing with verbal praise during the training trial. At the end of each training trial, the dogs were rewarded with food. The next black slide, with the stimulus at a different position, was only shown after correct gazing behavior. At the end of Phase 2, we showed four to eight consecutively appearing white disc patterns and combined them with videos, to get the dogs used to the future experimental trials. We slightly increased the duration of the presented videos over sessions, and the dogs were rewarded at the end of each training trial.

Phase 3: Calibration and validation training with the eye-tracker system

In Phase 3, we started to practice the real calibration and validation procedure with the eye-tracker system and the display PC monitor. We repeated this training procedure until the system confirmed a successful calibration and the deviation from the validation points was minimal (less than 0.5 deg). To optimize the training progress and practicing of the calibration procedure, it would be possible to use animated targets—for instance, flickering or looming dots, designs, or pictures—instead of static dots. This might help to get the dogs’ attention faster and keep it longer. Therefore, this could eventually shorten the entire calibration/validation training process and the “refresh time” after a break between different studies. With the eye-tracking system, we took snapshots of the dogs gazing at the calibration points and checked whether these represented the shown key points of an isosceles triangle from the calibration mode. Then we again added animal videos in order to imitate a real test trial. We slowly increased the time of the videos shown to 45 s, to get the dogs used to staying longer in the chin rest. The dogs were rewarded after we had already stopped the videos, to avoid training effects on their watching patterns for future eye-tracker tests. It turned out that it was necessary to randomize the different training episodes (calibration, validation, calibration followed by validation and watching videos) to prevent the dogs from estimating the duration of the training trials in order to stop looking or to change their position. If we repeated the same training episode too often, some dogs started to assess the length of the trial and stopped looking in order to get the treat earlier.

Finally, we introduced the wooden box (“dog cinema”) around the eye-tracker apparatus to reduce the dogs’ distraction. We slowly habituated the dogs to it by adding and then closing all parts of the box one by one, to avoid any fear reactions in the dogs caused by the sudden darkness (Fig. 2). Afterward, we practiced the whole procedure of calibration, validation, and watching videos on the monitor in the closed box. The rear side of the box, behind the dog, always remained open.

Statistics

To investigate whether there was an effect of age or sex on the number of training sessions in the “big-screen” (N = 41) and the “small-monitor” (N = 30) dog samples, we used generalized linear mixed models (GLMM; Baayen, 2008). We included the breed of the dogs as a random effect. The predictor variables with fixed effects were age (covariate) and sex (factor), and as a response variable we included the number of training sessions. We included no random slopes in the model. To test for the influences of age and sex, we initially compared the fit of the full model (i.e., a model with age and sex included) with that of a respective null model (i.e., a model including only the intercept, with age and sex excluded), on the basis of a likelihood ratio test. All models were fitted in R (version 3.4.4; R Core Team, 2015) using the function glmer provided in the R package lme4 (version 1.1-13; Bates, Mächler, Bolker, & Walker, 2015). Overdispersion was no issue (dispersion parameters: big screen, 1.05; small screen, 0.53). We determined confidence intervals using the function bootMer in the lme4 package, and model stability by dropping the levels of the random effects one at a time and comparing the estimates obtained with those obtained for the full data set, which revealed no influential levels of the random effect.