Genome scanning of behavioral selection in a canine olfactory detection breeding cohort

Eyre, Alexander W.; Zapata, Isain; Hare, Elizabeth; Lee, Katharine M. N.; Bellis, Claire; Essler, Jennifer L.; Otto, Cynthia M.; Serpell, James A.; Alvarez, Carlos E.

doi:10.1038/s41598-022-18698-4

Genome scanning of behavioral selection in a canine olfactory detection breeding cohort

Article
Open access
Published: 02 September 2022

Volume 12, article number 14984, (2022)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Genome scanning of behavioral selection in a canine olfactory detection breeding cohort

Download PDF

Alexander W. Eyre¹,
Isain Zapata²,
Elizabeth Hare³,
Katharine M. N. Lee⁴^nAff11,
Claire Bellis ORCID: orcid.org/0000-0001-5073-9610^5,6,
Jennifer L. Essler⁷^nAff12,
Cynthia M. Otto⁸,
James A. Serpell⁹ &
…
Carlos E. Alvarez^1,10

2745 Accesses
5 Citations
5 Altmetric
Explore all metrics

Abstract

Research on working dogs is growing rapidly due to increasing global demand. Here we report genome scanning of the risk of puppies being eliminated for behavioral reasons prior to entering the training phase of the US Transportation Security Administration’s (TSA) canine olfactory detection breeding and training program through 2013. Elimination of dogs for behavioral rather than medical reasons was based on evaluations at three, six, nine and twelve months after birth. Throughout that period, the fostered dogs underwent standardized behavioral tests at TSA facilities, and, for a subset of tests, dogs were tested in four different environments. Using methods developed for family studies, we performed a case-control genome wide association study (GWAS) of elimination due to behavioral observation and testing results in a cohort of 528 Labrador Retrievers (2002–2013). We accounted for relatedness by including the pedigree as a covariate and maximized power by including individuals with phenotype, but not genotype, data (approximately half of this cohort). We determined genome wide significance based on Bonferroni adjustment of two quasi-likelihood score tests optimized for either small or nearly-fully penetrant effect sizes. Six loci were significant and five suggestive, with approximately equal numbers of loci for the two tests and frequencies of loci with single versus multiple mapped markers. Several loci implicate a single gene, including CHD2, NRG3 and PDE1A which have strong relevance to behavior in humans and other species. We briefly discuss how expanded studies of canine breeding programs could advance understanding of learning and performance in the mammalian life course. Although human interactions and other environmental conditions will remain critical, our findings suggest genomic breeding selection could help improve working dog populations.

Genetic testing of dogs predicts problem behaviors in clinical and nonclinical samples

Article Open access 07 February 2022

Transposons in the Williams–Beuren Syndrome Critical Region are Associated with Social Behavior in Assistance Dogs

Article 13 December 2023

Comparative neurogenetics of dog behavior complements efforts towards human neuropsychiatric genetics

Article 14 August 2023

Introduction

In addition to the long term and large body of research on dog behavior¹, there is a rapidly growing literature focused on behavior and cognition in working dogs². This is in part due to interest in the evolutionary biology of selected traits such as herding, but also to increasing needs for various types of assistance functions, such as guide dogs for blind people; and for police, military, and security working dogs for purposes including olfactory detection and deterrence. The average cost of a trained service dog is $15,000–$30,000 (Ref. ³), and working dogs at US federal agencies can cost approximately $50,000 or more. As demand for such dogs grows, there is increasing interest in improving their training success rate and performance⁴. Historically, most research on working dog training has studied behavior and temperament, but cognitive analyses are becoming more common and functional magnetic resonance imaging in awake dogs has been applied to this question⁵ (reviewed by²).

Neuroscience has recently experienced an explosion of progress in several areas, including genetics and brain imaging. However, there are widespread concerns that too much research lacks a behavioral context or is dissociated from biology and psychiatry^6,7. The emergence of neuroscience apart from neurology in the early 1900s was largely based on experimentation on a great array of diverse species⁸. However, since the 1970s there has been increasing reliance on mouse models. Mice were used in almost half of studies funded by the US National Institutes of Health in 2015, whereas the next three species combined (fruit fly, zebrafish, and the worm C. elegans) were funded at a ten-fold lower level. Unlike mice and most other animal models, dogs present many human-like aspects such as membership in human families, living to advanced age, epidemiology, advanced health care and even different types of long-term work experiences⁹. Moreover, the evolutionary history of dogs makes them a uniquely powerful model for the study of complex genetics¹⁰. Canine model advantages over human investigation include reduced heterogeneity, larger effect sizes of variations resulting from strong positive selection and relaxed negative selection, lack of socioeconomic confounding, and a shorter generation time. Dogs have hundreds of isolated populations or breeds with diverse personality and behavior traits. Recent genome scans in dogs have mapped personality, normal and problem behaviors, and cognitive traits both in single breeds and across diverse breeds^{11,12,13,14,15}. In parallel, brain imaging studies have revealed structure–function correlations that suggest neurodevelopmental and physiological potential^16,17.

For the reasons mentioned above, dogs are an ideal model for identifying genetic variation broadly associated with learning, defined as a change in an individual’s behavior or abilities resulting from experience; and work performance, defined for our purposes here as an individual’s effectiveness at doing a job well. It is widely agreed that working dog success reflects the maximal matching of breed physical and behavioral conformation with the necessary performance criteria (incl. the lowest rate of excluding faults). The basis for this is that dogs work because they find the activity inherently rewarding¹⁸. Retrievers have the desired size and agility, and variation of the predatory motor pattern sequence: accentuating searching orientation; directly going to accentuated grabbing-biting without first proceeding through eyeing, stalking, and chasing; and never continuing to kill-biting. However, selection of optimal breed conformations for different work is not sufficient. Success also requires proper development during the early developmental period and both general and specific environmental exposures. All working dogs need socialization with humans, sled dogs require socialization with other sled dogs, and hunting dogs require exposure to guns firing in their first year of life. Genetic and environmental factors can influence these types of success in humans and other animals. Those influences can involve behavior, temperament, cognition, and their interactions. For instance, the prevalence of attention deficit hyperactivity disorder (ADHD) in humans is ~ 4% and its most common comorbidities are learning disabilities (45%), anxiety (38%) and other behavioral (31%) disorders¹⁹. Natural hyperactivity, impulsivity, and inattention as in human ADHD are common in dogs, vary in frequency and severity across breeds, and are highly comorbid with fearfulness, aggressiveness, and compulsive behaviors²⁰. Longitudinal data from working dog breeding programs present an opportunity to advance knowledge of effects that influence success in early behavioral development, preselection for training (the subject of this work), training, and work performance and longevity.

The following three examples show recent progress in predicting success of working dog training. The first study measured cognitive skills and temperament from birth to adulthood in Labrador and Golden Retrievers, and German Shepherd Dogs from The Seeing Eye breeding and training program²¹. The traits most predictive of successful training were faster solving of a multistep task and lower levels of a type of anxiety (lower maternal behavior predicted success but is not consistent across all working dogs). A second study used a battery of 25 cognitive tests on independent samples of retrievers trained for assistance or olfactory detection²². The main finding was that the different work types of the two populations were reflected in different cognitive tests being the most predictive of success. The third study used retrievers trained for assistance work²³. It determined predictive performance by modeling behavioral assessment scores from instruments designed to identify problematic behaviors in pet and assistance dogs. Unsurprisingly, that third study assessing problem behaviors was far better at predicting dogs that failed whereas the first two studies focused on cognition were far better at predicting success.

The present work addresses what may be referred to as preselection of dogs for subsequent training as working dogs. Generally, such preselection takes place in the context of requests for and in-person brief evaluation of dogs for purchase based on breed, medical, and behavioral criteria. However, this work is a study of a Transportation Security Administration (TSA) detector-dog breeding and training program in which dogs were fostered for 15 months and observed and tested for performance-related behavior at 3, 6, 9 and 12 months (in a period ending in 2013). Several of those tests are dependent on odor detection, including finding objects by olfaction. Others measure other traits such as interest in possessing a toy or playing tug of war. Specifically, we used a multigenerational TSA cohort of 528 Labrador Retrievers to perform a genome scan of behavioral risk of elimination prior to entering training. Studies of the validity of the behavioral testing in this TSA program^24,25 and, by the same investigators, in the Australian Border Force Detector dog program²⁶ that overlapped our time-period have been published; but this is the first genetic study. For the few loci in which only one gene is primarily implicated, the biological relevance is consistent with behavior. We discuss the potential of such studies to improve the working dog population⁴ and advance the understanding of complex behavioral systems.

Results

Cohort and population structure analysis

We used a multigenerational population from the TSA detector dog breeding and training program to map elimination due to behavioral reasons before commencing training (Table 1). In that pretraining period, dogs were tested for behavior at three, six, nine and twelve months, both at TSA facilities and in four different environments²⁵. We accessed behavioral testing data and biosamples collected from four generations of related Labrador Retrievers from 2002–2013. That data also included whether puppies were selected for training and, if not, whether they were eliminated for medical or behavioral reasons. All breeding Labrador Retrievers were sourced from US breeders of dogs with hunting titles (US) or the Australian Customs Detector Dog Program (Aus. Customs Service, ACS). Breeding Labrador Retrievers were otherwise undocumented to us except that they did not include dogs with brown coat color (associated with show vs. working dog status²⁷) and that ACS dogs presumably were part of a breeding and selection program initiated in 1993²⁸. The ACS detector dog program followed a breeding and selection program for guide dogs that was initiated by the Royal Guide Dogs Associations of Australia (RGDAA) in 1964 and shortly after was joined by the University of Melbourne. The RGDAA program and Kadnook Kennels (Aus.), a key source of their dogs, provided the base population of the detector dog program. That guide dog population also contained contributions from UK and US dogs referred to as “outside” stock (extent unknown)²⁸. At least in its early phase, the detector program received dogs determined to be unsuited for the guiding program. The selection goals for the detector program were referred to as “to provide a steady supply of dogs suitable for training; dogs with a stable temperament, free from genetic disorders and with a long and healthy working life”²⁸. The founders and dogs produced in our TSA cohort collected through 2013 were 74.3% US, 5.6% ACS, 18.0% US x ACS, and 2.1% of unknown source. The pedigree of the full cohort shows a greater risk of elimination for behavioral reasons among dogs most closely related to the founder population (Suppl. Fig. S1). The TSA pretraining program for each dog spanned approximately 12 months after which the dogs were either accepted into training (58.9%) or eliminated for medical (17.6%) or behavioral (23.9%) reasons. Behavioral testing data were available for 528 dogs, of which 296 had biosamples used for genome wide SNP genotyping (~ 173 k SNPs, Illumina CanineHD). After quality filtering, the final genome wide SNP set contained ~ 112 k markers.

Table 1 Case/control information for dogs in the 2013 TSA cohort.

Full size table

We performed principal component analysis (PCA) of the genotyped Labrador Retrievers to identify population structure (Fig. 1). Both PC1 and PC2 showed a slight separation between the US and ACS Labs, and the expected intermediate location of the US x ACS crosses. Visualizing additional PCs did not reveal any further separation between the groups. STRUCTURE model-based clustering analysis²⁹ of the same genotypes failed to detect more than one population when run using both admixture and linkage models over a range of K values and burnings/MCMC reps.

Genome scan for risk of elimination due to behavior

We used ROADTRIPS2 for genome wide association because it controls for both population structure and relatedness, and increases power by including dogs with phenotype but no genotype³⁰. It makes use of kinship information calculated from the pedigree and an imputation procedure that accounts for the relatedness of individuals. The program calculates three statistics. As the original ROADTRIPS work did, we designed our study to use two of those and to correct for multiple testing for all tests. RM is an extension of the M_QLS test that uses pedigree-based weights to improve power and is optimal for two-allele disease models with small effect sizes. RW is an extension of the W_QLS test, which accounts for correlation among related individuals by incorporating optimal weights based on pedigree information and is optimal for rare allele disease models that are close to fully penetrant. Genome wide significance was based on Bonferroni adjustment (P ≤ 2.2 × 10⁻⁷) and suggestive significance was arbitrarily set at P ≤ 1 × 10⁻⁵. We performed GWA for elimination due to behavioral reasons and identified six significant and five suggestive loci (Fig. 2; Table 2). The λ inflation factors for the two tests were slightly above the 1.11 generally considered benign. The Q-Q plots showed population structure and relatedness were well controlled. That is consistent with the robust control of type 1 error due to population structure and family relatedness demonstrated in studies of ROADTRIPS³⁰.

Table 2 Genome scan for behavioral elimination in 2013 TSA cohort.

Full size table

By the RM test, two intervals were strongly associated with behavioral elimination, chr13:55,534,649–59,902,870 (4.37 MB) and chr1:22,989,459–25,289,424 (2.30 MB); and single markers at chr7:66,358,701 and chr19:21,040,815 were also significant (CanFam3.1 coordinates). By the RW test, the same regions on chromosomes 13 and 1 were strongly associated, in addition to single markers chr6:76,632,282, chr36:25,252,101, and chr15:40,757,218. As expected, there was little overlap between the loci detected by RM and RW. The two tests had similar yields and numbers of loci with single vs multiple SNPs. None of the significant or suggestive loci mapped for behavioral elimination here overlapped C-BARQ behavioral GWAS loci or evolutionary selection regions reported for UK Labrador Retrievers^12,27. Comparison to behavioral and cognitive markers previously mapped across dog breeds also showed no overlap with the present findings.

We created comprehensive models to simultaneously determine the effect sizes of candidate loci (Table 4). For RM loci, the effect of the chr7:66,358,701 AA allele was so large that it overwhelmed all others. For RW loci, the chr3:47,134,935 AA allele also had a great effect, but its estimation is uncertain due to quasi-complete separation (due to low A allele frequency). Quasi-complete separation occurs when levels in a dependent variable separate an independent variable perfectly. When this happens, the estimate can be assumed to be very large, but its numeric estimator is unreliable. The RW loci chr13:57,789,399 and chr36:25,252,101 had large effects for the heterozygous state (odds ratio, OR = 14.42 and 5.05, respectively), but the homozygous risk state was absent in the cohort. In a combined RM and RW assessment, only chr3:47,134,935 and chr36:25,252,101 remained significant with large effect sizes (quasi-complete separation for chr3:47,134,935 and OR = 4.63 for chr36: 25,252,101). The effects detected for chr7 in RM in and for chr13 in RW became non-significant.

Candidate gene annotation for theory building

Several loci implicate one or few genes positionally (Table 2). Some single-SNP loci lie within or near one gene, and one multi-SNP locus overlaps a single gene. Such candidate genes known to have major behavioral effects in other species include CHD2, NRG3 and PDE1A. Exclusion of puppies for behavioral reasons prior to entering training is a complex trait likely to involve many brain functions. That and the small number of candidate genes suggests geneset enrichment analysis is unlikely to be useful here. While brain expression of candidate genes is consistent with this behavioral trait, at least 80% of mammalian genes are expressed in the brain. We thus avoid using the known biology of candidate genes to support the mapping and allow interpretation of our GWA candidate genes. However, for purposes of prioritization and theory building, we performed a survey of brain relevance and genomic demographics (Table 3). Four genes each were associated with human educational attainment and intellectual disability, of which CHD2, NRG3 and LRRC7 were associated with both. NRG3 and LRRC7 were also associated with accelerated divergence in humans. Very few candidate genes were implicated in human neurodevelopmental disorders (N = 2 for high confidence geneset), autism (N = 1 for known and suspected) or epilepsy (N = 2). One gene, CHD2, was associated with all intelligence and neurodevelopmentally related traits mentioned above. CHD2 was also the only tier 1 candidate known to be a haploinsufficient disease gene and to be intolerant to loss of function mutation. DRAM1 is notable because it is among the genes most strongly associated with structure of many brain regions across seven studies (GWAS Catalog). For example, DRAM1 was mapped with a P = 5 × 10⁻⁵² in a GWAS of sub-cortical volume³¹.

Table 3 Brain trait and genomics demographics of behavioral elimination GWA candidate genes.

Full size table

For two loci that have more than one candidate gene, this analysis together with the behavioral relevance noted in Table 2 implicate one gene at each locus: LRRC7 on chr6 and HS6ST1 on chr19. HS6ST1 was one of only two candidates known to be a curated neurogenesis gene and is related to HS6ST2, which we previously mapped for increased social behavior. Although ABHD3 was classified as a positional single-gene locus, the gene harbors expression quantitative trait loci (eQTLs) for ROCK1 and ESCO1; and all three genes are good behavioral candidates (Tables 1 and 2). Two loci contain 66 other candidate genes that were not analyzed, but which contain genes with established (e.g., the neurodevelopmental genes SMAD4 and EPHA5; and MC2R, the adrenocorticotropic hormone receptor expressed in the adrenal gland which receives the last signal within the hypothalamic, pituitary, adrenal axis) or more enigmatic (e.g., CCN2/CTGF³²) behavioral relevance.

Discussion

Experimental approach and robustness

Our genome scan of risk of elimination due to undesirable behavior in Labrador Retrievers in the TSA detection dog breeding and raising program yielded six genome-wide significant and five suggestive loci. As far as we know, none of the mapped haplotypes has been reported in previous genetic mapping studies or genome scans for footprints of selection under domestication. Our mapping approach was to use the ROADTRIPS method developed for case–control association testing in related individuals sampled from structured populations. The method is not constrained by how subjects are related and allows for inclusion of individuals with pedigree and phenotype data but lacking genotypes. The PCA analysis distinguished the slight difference that resulted from the geographical origins of breeding Labrador Retrievers from the US and Australia. However, the two Labrador Retriever populations are closely related and US dogs contributed to the Australian program²⁸. That was supported by the STRUCTURE analysis, which failed to detect population structure in our cohort. The Q-Q plots showed good control of the false positive rate due to population structure and family relatedness (consistent with studies of ROADTRIPS’ control of such type 1 error in case–control family-based studies³⁰). A limitation of the cohort and the indicated mapping approach is the inability to calculate heritability. We were also not able to perform a type of validation using the same cohort because it could not be split into learning and testing sets (as the pedigree was included in the association analysis).

There are several challenges to the replication of this study related to the definitions of behavioral traits. Studies which overlapped the time of this work assessed the screening methods of the TSA^24,25 and Australian Border Force Detector dog²⁶ breeding and training programs. Based on survey data from 34 TSA dog handlers, 13 of 15 traits measured in TSA puppy testing showed content validity (i.e., the TSA standardized tests given to dogs in their first year matched well with handlers’ understanding of the most important operational traits)²⁵. Unmeasured traits that were also predictive of success included “play” and off-duty “calmness”. However, for our cohort, the pretraining elimination of dogs for behavioral reasons was subjective and we do not have data describing the bases for those final decisions. Any behavior that is perceived as incompatible with successful training and deployment can result in elimination in olfactory detection programs²⁸. Examples include, different types of fearfulness and anxiety, aggression, hyper- and hypo-activity, lack of innate drive to search and possess training toys, and insufficient human socialization. We don’t know how this information was used, but for some dogs eliminated due to behavior, our data noted traits such as distractibility, lack of motivation, and submissive urination. A recent study of 17 experienced explosive detection canine practitioners in the law enforcement, military, federal, and private sectors highlighted the following traits as being associated with success: “hunt drive” (motivation, a high level of energy, focus, and the ability to ignore or recover quickly from distractions), stamina to continue working, and the ability to generalize odors (e.g. identify explosives that are similar but not identical to those used in training)³³. At the same time, behavioral tests, descriptive vocabulary, and other aspects of explosives dog selection are inconsistent between working dog organizations³⁴. Such variable criteria for selecting dogs, as well as differences in work specialization and environments, may make it challenging to find genetic associations unless they have large effects and are correlated across multiple traits. Strengths of the present model include standardized TSA puppy program evaluation, pedigree information, and presence of vastly reduced genetic, and thus also phenotypic, heterogeneity within single breeds. In ongoing studies, we are analyzing the longitudinal TSA puppy testing data for these same dogs—separately and in combined analyses with the program elimination data reported on here.

Implications for the working dog and behavioral genetics fields

Because Labrador Retrievers are very popular working dogs, an important finding is that behavioral haplotypes previously mapped or shown to be under selection in this breed^12,27 were not associated with pretraining elimination due to behavior. That was also true for large effect behavioral variation mapped in interbreed GWASs^13,14,35,36. For instance, a haplotype with a coding variant of IGSF1 that is strongly associated with anxiety and fear traits across breeds and has an allele frequency of 0.18 in pet Labradors¹⁴ was homozygous non-risk in the present cohort. These findings are consistent with effective negative selection of such variations in hunting line Labradors or in the breeding programs involved here. Our identification of new loci associated with pretraining elimination due to behavioral reasons suggests that selective breeding might be used to improve the success rates in working dog breeding populations. The replication of this study in other working dog populations will be critical to decisions about whether and how to incorporate these markers into breeding strategies. If genome wide polygenic risk could be measured in well-powered cohorts, genomic estimated breeding values built on that would likely surpass the efficiency of methods based on pedigree or phenotype⁴.

Joint modeling of mapped loci showed at least two that have moderate-to-large effect sizes. The chr36:25 Mb locus had an OR of 4.63 and chr3:47 Mb can be considered incalculable due to quasi-complete separation. Importantly, the chr3:47 Mb estimate was from comparing homozygotes states whereas there were no dogs homozygous for the elimination risk allele at the chr36:25 Mb locus. Further studies of all mapped haplotypes are necessary to understand the evolutionary, genetic, and physiological mechanisms, and relevance to other breeds. The rapid genetic divergence and population bottlenecks in the development of dog breeds over the last few hundred years have resulted in phenotypes with simple inheritance patterns that have been associated with different genetic variants between breeds³⁷. However, complex behavioral traits are influenced by many genetic and environmental factors and thus are challenging to understand. Historically in animal breeding, quantitative traits were described by the “infinitesimal model” in which phenotypes result from large numbers of genetic factors, each with an infinitesimal additive effect. More recently, the “omnigenic model” describes two types of genetic effects³⁸. Core variants have larger effect sizes and occur in biochemical pathways related to the phenotype, while peripheral variants have smaller effects. If additional investigation showed our candidate loci are characteristic of the omnigenic model, that would suggest the large effect variations may have diagnostic and interventional utility.

The present working dog model is part of an untapped birth to death cohort with standardized health, behavior, cognitive, training, performance, and environmental data. If the necessary resources were in place, molecular epidemiological and computational psychiatric approaches would present a powerful framework for studying learning and working performance as described in the Introduction. Although ethology requires deconstruction to describe and measure isolated effects, it is clear the complex behaviors studied here are integrated across physical and behavioral breed conformations, cognition, temperament, critical period development, and gene-environment effects. As an example of the problem, consider a study of over 11,000 cadets assessed throughout their time in the US Military Academy³⁹. At entry, cognitive ability was negatively correlated with both physical ability and grit (i.e., zeal, hard work, and perseverance). However, whereas cognitive ability was the best predictor of academic and military grades, both completion of initiation training and 4-year graduation were predicted better by non-cognitive traits. We are not equating TSA dog training with West Point cadet training, but rather drawing attention to the limitation of studying subjects sampled at elite institutions. Compared to the total population, elite cohorts tend to have very low variances across desired traits (i.e., everyone is preselected from the tail of a distribution). Our goal here was to genetically identify the likelihood of success in entering TSA training. If we had instead mapped failure to graduate training (i.e., using only dogs who succeeded in entering training), it is unlikely we would have identified the loci we did. Moreover, a complete working dog model would allow life course studies across a 12-year mean lifespan.

Gene annotation for gene prioritization, theory building and future validation

Because the mapped trait is complex and the GWA cannot be confirmed yet, the gene annotation provides little weight as evidence that the mapping is true. Without knowing that and whether variation affecting a candidate gene contributes to the phenotype, any biological interpretation of that gene is tentative. However, both the effect sizes of loci and the biology of candidates can be used to prioritize follow-up studies. Some investigators may want to pursue the largest effect loci for breeding purposes. Others may prioritize validating and dissecting the molecular mechanism of a candidate gene at a single-gene locus based on, say, known protein function, brain expression patterns, or human or mouse phenotypes (see brain relevance of candidate genes in Table 2). Candidate genes of interest include CHD2, NRG3, DRAM1 and PDE1A among the tier 1 loci that positionally implicate one gene. The CHD2 and PDE1A loci are also of interest for having large effect sizes in the combined RM/RW comprehensive model. The tier 1 locus implicating ABHD3 is supported by the comprehensive RM model. Notably, human ABHD3 contains variations associated with expression levels of nearby ROCK1 and ESCO1, which are strong behavioral candidate genes. At two tier-2 loci implicating more than one gene, the biological relevance favors one gene at each locus: LRRC7 on chr6 and HS6ST1 on chr19. LRRC7 is particularly interesting here because it is associated with human educational attainment; intellectual disability; neuroticism, depression, and subjective wellbeing; and tobacco and alcohol use. It has also been shown to be under positive selection in humans and to be differentially expressed in three pairs of mammals comparing wild vs. domesticated species⁴⁰. HS6ST1 is interesting because we previously mapped canine social behavior to the locus of its paralog HS6ST2, which is also a neuroticism GWA gene in humans^14,41.

The extensive mapped intervals on chr1 and chr13 are suggestive of recent positive selection. Of those, the hits on chr13 are also supported by the comprehensive RW model (Table 4). Both loci contain at least one prominent neurodevelopmental gene (SMAD4 and EPHA5, respectively) and one neuropeptide receptor central to the Hypothalamic–Pituitary–Adrenal/-Gonadal axes (melanocortin/adrenocorticotropic hormone receptor MC2R, adrenal/HPA; and gonadotropin-releasing hormone receptor GNRHR, pituitary/HPG). A coding variant of MC2R common in ancient and herding dog breeds (~ 25% and ~ 8% allele frequencies, respectively), but absent or rare in other breeds including Labrador Retrievers, was shown to be associated with reduced gazing at experimenters in the “unsolvable test”⁴². Of our three mapped SNPs in the interval containing MC2R, the nearest to that variant position is 538,389 bp away (the others ~ 1 Mb or more). That coding variant and our risk allele at the nearest SNP are both present in the canFam4 German Shepherd genome assembly (alleles A and G, respectively; compared to alleles G and A in the Boxer assembly canFam3.1). Further studies are necessary to determine if the haplotype we mapped contains this MC2R coding variant.

Table 4 Comprehensive modeling for simultaneous effect size determination of GWAS hits.

Full size table

It will be interesting to see if closely related breeds, such as Golden and Flat Coated Retrievers, or more distantly related working dog breeds like German Shorthaired Pointers carry the risk alleles we mapped on chr1 and chr13 and can thus be used for fine mapping⁴³. Alternatively, brain eQTL data for Labrador Retrievers could reveal if any genes in the intervals have differential expression associated with that haplotype⁴⁴. Whereas that requires postmortem samples, it is also possible to validate mapped loci behaviorally, especially where both risk and non-risk alleles are common. For some candidate genes, there are testable clues of associations with one type of brain trait, such as NCK1 with neuroticism, and DRAM1 and ESCO1 with brain structure. For instance, the NCK1 locus can be explored for association with related traits in genotyped dogs with C-BARQ dog owner behavioral questionnaire data³⁶. DRAM1 and ESCO1 can be tested for genetic associations with MRI-based brain structure differences^16,35. Lastly, the several dog loci that in humans are associated with cognitive traits can be tested for epistasis with each other and for association with the TSA training test data for this population that is currently being analyzed (and with emerging canine cognitive tests¹¹).

Conclusions

Our genome scan of pretraining elimination due to undesired behavioral traits in the TSA detector dog breeding and training program identified six genome wide significant loci. We used family-based association methods that controlled for relatedness by inclusion of the pedigree as a covariate and increased power by inclusion of dogs with phenotype but no genotype data. The top limitation of the study is the current lack of a validation cohort, which we are addressing in ongoing studies. One suggestion that the mapping is likely to be true is the strong behavioral relevance of multiple candidate loci which implicate a single gene. These findings are consistent with the possibility of improving the efficiency and quality of working dog programs through genomic estimated breeding values.

Materials and methods

Data acquisition

Data used in this study were generated from an olfactory detection dog breeding and training program the TSA ran from 2002 to 2013. Samples from litters born between 2002–2012 were genotyped in 2012–2013. The behavioral data were stored in a RedCap⁴⁵ database that was designed specifically for genomic and behavioral research. Phenotype data for 528 Labrador Retrievers were provided. In that original study, 296 of the same dogs were randomly selected for genotyping (Illumina Infinium Assay and CanineHD Beadchip (Illumina Part No. 11322460)) and the resulting information was included in the data we received. That genotyping data yielded 173,662 SNPs spanning the dog genome.

Ethics statement

The biological samples and genotype data, and the phenotype data were collected within the US Transportation Security Administration’s (TSA) canine olfactory detection breeding and training program between 2002–2013 following all necessary guidelines and regulations. The present study is based on access to those data.

Data processing and statistical analysis

Genotyping data were first converted from the CanFam2.1 to CanFam3.1 dog genome build using the UCSC Lift Genome Annotations tool (http://www.genome.ucsc.edu/cgi-bin/hgLiftOver). Quality control was performed in PLINK 1.90⁴⁶ using a minor allele frequency cutoff of 2.5%, Hardy–Weinberg equilibrium filter of 1E-5, and maximum SNP missingness rate of 10%. This resulted in a cleaned dataset of 112,284 SNPs.

To investigate underlying population structure, a Principal Components Analysis (PCA) was performed on the genotyped dogs after imputing missing genotypes and clumping SNPs utilizing the bigsnpr package in R⁴⁷. Variance explained for each included SNP was exported for the top 25 PC’s to assess their loading values. A pedigree for the family of dogs was developed using the kinship2 package in R⁴⁸.

To identify genomic regions associated with behavioral elimination, a Genome Wide Association Study (GWAS) was performed using ROADTRIPS2³⁰. Prior to running, kinship coefficients⁴⁹ were calculated using the KinInbcoef v1.1 software⁵⁰ and missing genotypes generated for non-genotyped, related dogs with known elimination status. ROADTRIPS2 was run with both genotyped and non-genotyped dogs that shared the same pedigree and had elimination status. Figures were generated for the output utilizing the qqman package in R⁵¹. Genome wide significance was based on Bonferroni adjustment (α = 0.05/112,246 SNPs/2 GWAS tests = 2.2 × 10⁻⁷) and suggestive significance was arbitrarily set at P ≤ 1 × 10⁻⁵.

The comprehensive models for simultaneous effect size determination were performed on genotyped dogs alone using only the significant hits generated by the GWAS. This approach was carried on as multiple logistic regressions where the elimination status was defined as the dependent variable and the specific alleles for the selected hits defined as categorical dependent variables. This approach was done independently for hits obtained through RM and RW statistics in addition the combined RM and RW list. All regression analysis was performed in SAS/STAT v.9.4.

Genome annotation

Genome annotation was performed on the UCSC Genome Browser. All canine genome coordinates reported here correspond to the canFam3 assembly. Gene annotation was performed using the Broad Improved Canine Annotation v1⁵² and checked for different or missing gene content and accepted gene nomenclature⁵³ by analyzing the syntenic intervals in the human genome (In Other Genomes (Convert) function; hg19 and hg38 assemblies).

Data availability

Phenotype data are included as Supplementary Information in this work. Genotyping data are available in a public repository: https://data.mendeley.com/, https://doi.org/10.17632/hrtpmfyypm.1.

References

Serpell, J. (ed.) The Domestic Dog (Cambridge University Press, 2017).
Google Scholar
Bray, E. E. et al. Enhancing the selection and performance of working dogs. Front. Vet. Sci. 8, 644431 (2021).
Article PubMed PubMed Central Google Scholar
How Much Does a Service Dog Cost: A Buyer's Guide for Your Service Dog [https://www.nsarco.com/blog/service-dog-buyers-guide.html]
Chen, F. L. et al. Advancing genetic selection and behavioral genomics of working dogs through collaborative science. Front. Vet. Sci. 8, 662429 (2021).
Article PubMed PubMed Central Google Scholar
Berns, G. S., Brooks, A. M., Spivak, M. & Levy, K. Functional MRI in awake dogs predicts suitability for assistance work. Sci. Rep. 7, 43704 (2017).
Article ADS PubMed PubMed Central Google Scholar
Marshall, M. The hidden links between mental disorders. Nature 581(7806), 19–21 (2020).
Article ADS CAS PubMed Google Scholar
Nivm Y. The primacy of behavioral research for understanding the brain. Behav. Neurosci. 2021.
Laurent, G. On the value of model diversity in neuroscience. Nat. Rev. Neurosci. 21(8), 395–396 (2020).
Article CAS PubMed Google Scholar
Fenger, J. M. et al. Dog models of naturally occurring cancer. In Animal Models for Human Cancer Vol. 69 (eds Martic-Kehl, M. I. et al.) 153–221 (Wiley-VCH Verlag GmbH & Co. KGaA, 2016).
Chapter Google Scholar
Rowell, J. L., McCarthy, D. O. & Alvarez, C. E. Dog models of naturally occurring cancer. Trends Mol Med 17(7), 380–388 (2011).
Article CAS PubMed PubMed Central Google Scholar
Gnanadesikan, G. E. et al. Breed differences in dog cognition associated with brain-expressed genes and neurological functions. Integr. Comp. Biol. 60, 976–990 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ilska, J. et al. Genetic characterization of dog personality traits. Genetics 206(2), 1101–1111 (2017).
Article PubMed PubMed Central Google Scholar
MacLean, E. L., Snyder-Mackler, N., vonHoldt, B. M. & Serpell, J. A. Highly heritable and functionally relevant breed differences in dog behaviour. Proc. R. Soc. Biol. Sci. 2019(286), 20190716 (1912).
Google Scholar
Zapata, I., Serpell, J. A. & Alvarez, C. E. Genetic mapping of canine fear and aggression. BMC Genomics 17, 572 (2016).
Article PubMed PubMed Central Google Scholar
Tang, R. et al. Candidate genes and functional noncoding variants identified in a canine model of obsessive-compulsive disorder. Genome Biol. 15(3), R25 (2014).
Article PubMed PubMed Central CAS Google Scholar
Hecht, E. E. et al. Significant neuroanatomical variation among domestic dog breeds. J. Neurosci. 39(39), 7748–7758 (2019).
Article CAS PubMed PubMed Central Google Scholar
Hecht, E. E. et al. Neurodevelopmental scaling is a major driver of brain-behavior differences in temperament across dog breeds. Brain Struct. Funct. 226(8), 2725–2739 (2021).
Article CAS PubMed Google Scholar
Coppinger, R. & Coppinger, L. Dogs: A New Understanding of Canine Origin, Behavior, and Evolution (University of Chicago Press, 2001).
Google Scholar
DuPaul, G. J., Gormley, M. J. & Laracy, S. D. Comorbidity of LD and ADHD: implications of DSM-5 for assessment and treatment. J. Learn. Disabil. 46(1), 43–51 (2013).
Article PubMed Google Scholar
Sulkama, S. et al. Canine hyperactivity, impulsivity, and inattention share similar demographic risk factors and behavioural comorbidities with human ADHD. Transl. Psychiatry 11(1), 501 (2021).
Article PubMed PubMed Central Google Scholar
Bray, E. E., Sammel, M. D., Cheney, D. L., Serpell, J. A. & Seyfarth, R. M. Effects of maternal investment, temperament, and cognition on guide dog success. Proc. Natl. Acad. Sci. U. S. A. 114(34), 9128–9133 (2017).
Article CAS PubMed PubMed Central Google Scholar
MacLean, E. L. & Hare, B. Enhanced selection of assistance and explosive detection dogs using cognitive measures. Front. Vet. Sci. 5, 236 (2018).
Article PubMed PubMed Central Google Scholar
Bray, E. E. et al. Predictive models of assistance dog training outcomes using the canine behavioral assessment and research questionnaire and a standardized temperament evaluation. Front. Vet. Sci. 6, 49 (2019).
Article PubMed PubMed Central Google Scholar
Fratkin, J. L. et al. Do you see what I see? Can non-experts with minimal training reproduce expert ratings in behavioral assessments of working dogs?. Behav. Process. 110, 105–116 (2015).
Article Google Scholar
Rocznik, D., Sinn, D. L., Thomas, S. & Gosling, S. D. Criterion analysis and content validity for standardized behavioral tests in a detector-dog breeding program. J. Forensic Sci. 60(Suppl 1), S213-221 (2015).
Article PubMed Google Scholar
Munch, K. L., Wapstra, E., Thomas, S., Fisher, M. & Sinn, D. L. What are we measuring? Novices agree amongst themselves (but not always with experts) in their assessment of dog behaviour. Ethology 125(4), 203–211 (2019).
Article Google Scholar
Friedrich, J. et al. Unravelling selection signatures in a single dog breed suggests recent selection for morphological and behavioural traits. Genet. Genomics Next 2020, e10024 (2020).
Google Scholar
Champness, K. A. Development of a Breeding Program for Drug Detector Dogs: Based on Studies of a Breeding Population of Guide Dogs (The University of Melbourne, 1996).
Google Scholar
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155(2), 945–959 (2000).
Article CAS PubMed PubMed Central Google Scholar
Thornton, T. & McPeek, M. S. ROADTRIPS: case-control association testing with partially or completely unknown population and pedigree structure. Am. J. Hum. Genet. 86(2), 172–184 (2010).
Article CAS PubMed PubMed Central Google Scholar
van der Meer, D. et al. Understanding the genetic determinants of the brain with MOSTest. Nat. Commun. 11(1), 3512 (2020).
Article ADS PubMed PubMed Central CAS Google Scholar
Chae, S. et al. Molecular laterality encodes stress susceptibility in the medial prefrontal cortex. Mol. Brain 14(1), 92 (2021).
Article CAS PubMed PubMed Central Google Scholar
Farr, B. D., Otto, C. M. & Szymczak, J. E. Expert perspectives on the performance of explosive detection canines: Performance degrading factors. Animals 11(7), 1978 (2021).
Article PubMed PubMed Central Google Scholar
Lazarowski, L. et al. Methodological considerations in canine olfactory detection research. Front. Vet. Sci. 7, 408 (2020).
Article PubMed PubMed Central Google Scholar
Zapata, I., Hecht, E. E., Serpell, J. A. & Alvarez, C. E. Genome scans of dog behavior implicate a gene network underlying psychopathology in mammals, including humans. bioRxiv https://doi.org/10.1101/2020.07.19.211078 (2020) (non-reviewed preprint).
Article PubMed PubMed Central Google Scholar
Zapata, I., Lilly, M. L., Herron, M. E., Serpell, J. A. & Alvarez, C. E. Genetic testing of dogs predicts problem behaviors in clinical and nonclinical samples. bioRxiv https://doi.org/10.1101/20200813249805 (2020) (non-reviewed preprint).
Article PubMed PubMed Central Google Scholar
Mooney, J. A., Yohannes, A. & Lohmueller, K. E. The impact of identity by descent on fitness and disease in dogs. Proc. Natl. Acad. Sci. U.S.A. 118(16), 6118 (2021).
Article CAS Google Scholar
Boyle, E. A., Li, Y. I. & Pritchard, J. K. An expanded view of complex traits: From polygenic to omnigenic. Cell 169(7), 1177–1186 (2017).
Article CAS PubMed PubMed Central Google Scholar
Duckworth, A. L. et al. Cognitive and noncognitive predictors of success. Proc. Natl. Acad. Sci. U.S.A. 116(47), 23499–23504 (2019).
Article CAS PubMed PubMed Central Google Scholar
Albert, F. W. et al. A comparison of brain gene expression levels in domesticated and wild animals. PLoS Genet. 8(9), e1002962 (2012).
Article CAS PubMed PubMed Central Google Scholar
Luciano, M. et al. The influence of X chromosome variants on trait neuroticism. Mol. Psychiatry 26(2), 483–491 (2021).
Article CAS PubMed Google Scholar
Tonoike, A. et al. Identification of genes associated with human-canine communication in canine evolution. Sci. Rep. 12(1), 6950 (2022).
Article CAS PubMed PubMed Central Google Scholar
Stephan, W., Song, Y. S. & Langley, C. H. The hitchhiking effect on linkage disequilibrium between linked neutral loci. Genetics 172(4), 2647–2663 (2006).
Article CAS PubMed PubMed Central Google Scholar
Sandor, S., Czeibert, K., Salamon, A. & Kubinyi, E. Man’s best friend in life and death: scientific perspectives and challenges of dog brain banking. Geroscience 43, 1653–1668 (2021).
Article PubMed PubMed Central Google Scholar
Harris, P. A. et al. Research electronic data capture (REDCap)—A metadata-driven methodology and workflow process for providing translational research informatics support. J. Biomed. Inform. 42(2), 377–381 (2009).
Article PubMed Google Scholar
Chang, C. C. et al. Second-generation PLINK: Rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Article PubMed PubMed Central CAS Google Scholar
Prive, F., Aschard, H., Ziyatdinov, A. & Blum, M. G. B. Efficient analysis of large-scale genome-wide data with two R packages: bigstatsr and bigsnpr. Bioinformatics 34(16), 2781–2787 (2018).
Article CAS PubMed PubMed Central Google Scholar
Sinnwell, J. P., Therneau, T. M. & Schaid, D. J. The kinship2 R package for pedigree data. Hum. Hered. 78(2), 91–93 (2014).
Article PubMed Google Scholar
Karigl, G. A recursive algorithm for the calculation of identity coefficients. Ann. Hum. Genet, 45(3), 299–305 (1981).
Article MathSciNet CAS MATH Google Scholar
Bourgain, C., Zhang, Q. KinInbcoef: Calculation of Kinship and Inbreeding Coefficients Based on Pedigree Information. Release 1.1 edn; 2009.
Turner, S. D. qqman: an R package for visualizing GWAS results using Q-Q and manhattan plots. bioRxiv 2014, 005165 (2014).
Google Scholar
Hoeppner, M. P. et al. An improved canine genome and a comprehensive catalogue of coding genes and non-coding transcripts. PLoS ONE 9(3), e91172 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Tweedie, S. et al. Genenames.org: The HGNC and VGNC resources in 2021. Nucleic Acids Res. 49(D1), D939–D946 (2021).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank Mr. Scott Thomas for important discussions and information on detection dog training and assessment. We thank the leadership and staff at the TSA National Explosives Detection Canine Program, Lackland AFB, for access to their program and for valuable demonstrations and information. We thank the leadership at the Department of Defense Military Working Dog Veterinary Service (Army), Lackland AFB, for discussions and information. The data were made available through a cooperative research and development agreement between the Department of Homeland Security and the University of Pennsylvania (Docket No. DHS-2013-0064).

Funding

This work was supported by Department of Homeland Security Science and Technology (S&T) Directorate, Contract No.70RSAT19CB0000014 with Battelle Memorial Institute (C.E.A., C.M.O., J.A.S.). C.E.A. was also supported by grants from the American Kennel Club CHF (01660) and the Scottish Deerhound Club of America.

Author information

Katharine M. N. Lee
Present address: Department of Anthropology, Tulane University, New Orleans, USA
Jennifer L. Essler
Present address: Department of Animal Science, State University of New York College of Agriculture and Technology at Cobleskill, New York, USA

Authors and Affiliations

Center for Clinical and Translational Research, The Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH, 43205, USA
Alexander W. Eyre & Carlos E. Alvarez
Department of Biomedical Sciences, Rocky Vista University College of Osteopathic Medicine, Parker, CO, 80134, USA
Isain Zapata
Dog Genetics LLC, Astoria, NY, 11102, USA
Elizabeth Hare
Division of Public Health Sciences, Department of Surgery, Washington University in St. Louis School of Medicine, St. Louis, USA
Katharine M. N. Lee
Human Genomics, Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore, 138672, Singapore
Claire Bellis
Centre for Genomics and Personalised Health, Genomics Research Centre, School of Biomedical Sciences, Queensland University of Technology (QUT), Kelvin Grove, QLD, Australia
Claire Bellis
Penn Vet Working Dog Center, School of Veterinary Medicine, University of Pennsylvania, Philadelphia, PA, 19146, USA
Jennifer L. Essler
Penn Vet Working Dog Center, Department of Clinical Sciences and Advanced Medicine, School of Veterinary Medicine, University of Pennsylvania, Philadelphia, PA, 19146, USA
Cynthia M. Otto
Department of Clinical Sciences and Advanced Medicine, School of Veterinary Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
James A. Serpell
Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, 43210, USA
Carlos E. Alvarez

Authors

Alexander W. Eyre
View author publications
You can also search for this author in PubMed Google Scholar
Isain Zapata
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth Hare
View author publications
You can also search for this author in PubMed Google Scholar
Katharine M. N. Lee
View author publications
You can also search for this author in PubMed Google Scholar
Claire Bellis
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer L. Essler
View author publications
You can also search for this author in PubMed Google Scholar
Cynthia M. Otto
View author publications
You can also search for this author in PubMed Google Scholar
James A. Serpell
View author publications
You can also search for this author in PubMed Google Scholar
Carlos E. Alvarez
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.W.E., I.Z. and C.E.A. designed the work. A.W.E. performed the genetic analyses, I.Z. the modeling studies and C.E.A. the bioinformatics analyses. E.H, K.M.N.L. and C.B generated and processed the phenotyping and genotyping datasets. J.L.E., C.M.O., J.A.S. and E.H. provided canine behavior and working dog expertise used for study context and interpretation. C.E.A. wrote the manuscript with help from A.W.E. and I.Z., and contributions and editing by J.L.E., C.M.O., J.A.S. and E.H.

Corresponding author

Correspondence to Carlos E. Alvarez.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figure S1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Eyre, A.W., Zapata, I., Hare, E. et al. Genome scanning of behavioral selection in a canine olfactory detection breeding cohort. Sci Rep 12, 14984 (2022). https://doi.org/10.1038/s41598-022-18698-4

Download citation

Received: 05 May 2022
Accepted: 17 August 2022
Published: 02 September 2022
DOI: https://doi.org/10.1038/s41598-022-18698-4
Springer Nature Limited

This article is cited by

Machine learning prediction and classification of behavioral selection in a canine olfactory detection program
- Alexander W. Eyre
- Isain Zapata
- Carlos E. Alvarez
Scientific Reports (2023)
Comparative neurogenetics of dog behavior complements efforts towards human neuropsychiatric genetics
- Kathleen Morrill
- Frances Chen
- Elinor Karlsson
Human Genetics (2023)

Genome scanning of behavioral selection in a canine olfactory detection breeding cohort

Abstract

Similar content being viewed by others

Genetic testing of dogs predicts problem behaviors in clinical and nonclinical samples

Transposons in the Williams–Beuren Syndrome Critical Region are Associated with Social Behavior in Assistance Dogs

Comparative neurogenetics of dog behavior complements efforts towards human neuropsychiatric genetics

Introduction

Results

Cohort and population structure analysis

Genome scan for risk of elimination due to behavior

Candidate gene annotation for theory building

Discussion

Experimental approach and robustness

Implications for the working dog and behavioral genetics fields

Gene annotation for gene prioritization, theory building and future validation

Conclusions

Materials and methods

Data acquisition

Ethics statement

Data processing and statistical analysis

Genome annotation

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Figure S1.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Machine learning prediction and classification of behavioral selection in a canine olfactory detection program

Comparative neurogenetics of dog behavior complements efforts towards human neuropsychiatric genetics

Search

Navigation