Abstract
Background
Canine hip dysplasia (CHD) is a common disease, with a complex genetic background. Dogs with severe CHD sometimes also suffer from osteoarthritis (OA), an inflammatory, often painful and incurable condition. Previous studies have reported breed-specific genetic loci associated with different hip dysplasia and OA phenotypes. However, the independent replication of the known associations within or across breeds has been difficult due to variable phenotype measures, inadequate sample sizes and the existence of population specific variants.
Results
We execute a validation study of 46 genetic markers in a cohort of nearly 1600 dogs from ten different breeds. We categorize the dogs into cases and controls according to the hip scoring system defined by the Fédération Cynologique Internationale (FCI). We validate 21 different loci associated on fourteen chromosomes. Twenty of these associated with CHD in specific breeds, whereas one locus is unique to the across-breed study. We show that genes involved in the neddylation pathway are enriched among the genes in the validated loci. Neddylation contributes to many cellular functions including inflammation.
Conclusions
Our study successfully replicates many loci and highlights the complex genetic architecture of CHD. Further characterisation of the associated loci could reveal CHD-relevant genes and pathways for improved understanding of the disease pathogenesis.
Similar content being viewed by others
Background
Revealing the genetic background of canine hip dysplasia (CHD) has remained one of the biggest veterinary conundrums in the past few decades. Although there has been a lot of effort to uncover risk loci and causal variants, their validation and replication has proven difficult. An inadequate sample size has been an issue in many studies but studies have also been hampered by the complexity and inaccuracy of the phenotypes, and an apparent genetic heterogeneity across breeds.
The prevalence of CHD is highly variable between breed groups and individual breeds. For example, in USA and Canada, where dogs scored within the American Veterinary Medical Association system, the extreme values vary from a zero disease prevalence in Italian Greyhounds to a high prevalence of 77.7% in the Bulldogs [1]. Consequently, it is important to find both the genetic factors that are breed-specific and those that are shared between breeds. Validation of breed-specific loci is a tedious effort, since one needs to collect a robustly phenotyped independent cohort of each breed. It is usually easier to collect a large multi-breed cohort that can be used to define which CHD associated loci are shared between breeds. The accuracy of the studied phenotypes is imperative. The Finnish Kennel Club (FKC) implements the hip scoring system defined by the Fédération Cynologique Internationale (FCI) [2], and uses only few specialised veterinarians to evaluate the hip scores, therefore reducing the inter-observer bias [3].
The single nucleotide polymorphic genetic markers (SNPs) evaluated in this study have been associated with CHD by us and others over the past ten years. Zhou et al. (2010) identified a total of six SNPs in their association study of Norberg angle (a measure of hip joint laxity) and OA in several breeds [4]. Friedenberg et al. (2011) found a homozygous deletion haplotype (intronic to Fibrillin 2; FBN2) associated with a severe form of CHD in Labrador Retrievers, as well as in 14 other breeds and in a cross-bred (Labrador Retriever - Greyhound) dog cohort [5]. Pfahler and Distl (2012) conducted a genome-wide association study (GWAS) of the FCI hip score in Bernese Mountain dogs and found three significantly associated SNPs representing two different loci [6]. Fels and Distl (2014) carried out a validation study with the FCI hip score in the German Shepherd [7]. They reported three significantly associated SNPs from canine chromosomes CFA24, CFA26, and CFA34 [7]. Fels et al. (2014) uncovered novel SNPs in previously identified quantitative trait loci (QTL) and found that nine SNPs in five loci associated significantly with the FCI hip score in their German Shepherd cohort [8]. Lavrijsen et al. (2014) used GWAS and exon sequencing to identify multiple alleles associated with the FCI hip score in Dutch Labrador Retrievers [9]. Sanchez-Molano et al. (2014) also studied Labrador Retrievers using a hip score defined by the British Veterinary Association and the Kennel Club; they revealed two significantly associated QTL including several SNPs [10]. Bartolome et al. (2015) reported a genetic predictive model for CHD based on seven SNPs they found using GWAS and candidate gene approaches in Labrador Retrievers [11]. Additional markers were provided by our collaborator at Genoscoper Laboratories [12]. Our own previous case-control GWAS of the FCI hip score in German Shepherds [13] revealed loci on CFA1 and CFA9 that harboured markers with either risk or protective alleles.
This study aimed to validate the previously reported associations to better understand their significance and overall genetic heterogeneity of CHD. We selected 52 SNPs based on prior research, and carried out across- and within breeds replication studies in nearly 1600 dogs from 10 breeds. We successfully replicated five markers across breeds and identified 20 markers in different breeds, which highlights the heterogeneous genetic architecture of CHD.
Results
We genotyped 52 SNPs that have previously been associated with CHD (Additional files 1 and 2), and carried out various case-control association analyses of CHD using an independent cohort of 1607 dogs consisting of 10 breeds. After quality control 46 SNPs and 1570 dogs (751 cases and 819 controls, 666 males and 904 females) remained for our analyses. The variant data of the 46 SNPs in our cohort are available through the European Variation Archive (https://www.ebi.ac.uk/eva). These SNPs are referred to by their ssIDs in this paper (see Additional file 1). We used the FCI hip score to categorize dogs into cases (hip score C or worse on both joints) and controls (hip score A/A). Dogs with hip score B were excluded to minimize phenotype ambiguity because although considered normal the FCI hip score B represents a borderline normal phenotype. We used the Cochran-Mantel-Haenszel 2x2xK (CMH) test for the breed-stratified data in our analyses.
Four SNPs associated with CHD across breeds
The raw P-values, the empirical P-values from the permutation procedure, and odds ratios (OR) from the across-breed CMH test as implemented in PLINK [14] are shown in Table 1. Only the markers with a significant association to CHD, and which also passed a test of homogeneity of OR across the breeds (see Methods), are reported. A total of four SNPs associated significantly with CHD in the across-breed analysis (Table 1). The markers are located on CFA1, CFA14, CFA26, and CFA37. Of these, the SNP on CFA1 is from our previous study of the FCI hip score on German Shepherds [13]. The variant ss7212922135 on CFA14 was originally associated with the FCI hip score in Bernese Mountain dogs [6]. On CFA26, ss7212922151 originally associated with the FCI hip score in German Shepherds [7], as did ss7212922122 on CFA33 [8]. The variant ss7212922139 on CFA37 originally associated with OA in a multi-breed analysis [4]. In contrast to the other four markers, ss7212922139 in CFA37 was not significantly associated with CHD in any of the breed-specific analyses.
Within-breed analyses reveal additional markers
The definition of cases and controls was the same as it was in the across-breed analysis. We used two methods, basic association analysis by X2-test of allele frequencies and logistic regression, to assess breed-specific association of the SNPs to CHD. The analysis method that demonstrated a better model fit, as evaluated by visual examination of quantile-quantile (Q-Q) plots (see Additional file 3), was chosen for each breed. Logistic regression showed better model fit for five breeds and the basic association test for the other five (see Additional file 3). Some inflation of the expected versus observed P-values was observed in three breeds: Finnish Lapphund, Golden Retriever, and Labrador Retriever (see Additional file 3). All these breeds have at least partially separated breeding lines of herding/working dogs (meaning restricted mixing of the breeding dogs between lines), which may be the source for the inflation.
The within-breed analyses showed multiple significant associations for different SNPs per breed (Table 2). The number of significantly associated SNPs varied between breeds, ranging from none in Bernese Mountain dogs to six significant associations in the Labrador Retrievers (Table 2). In total, 22 SNPs on CFA1, CFA3, CFA8, CFA11, CFA12, CFA14, CFA17, CFA21, CFA24, CFA25, CFA26, CFA33, and CFA34 demonstrated significant associations to CHD (Table 2). Six SNPs were significant in more than one breed (ss7212922122, ss7212922151, ss7212922154, ss7212922155, ss7212922161, ss7212922163 Table 2). These six SNPs are located on CFA33, CFA26, CFA34, CFA11, CFA1, and CFA24. The marker ss7212922155 on CFA11 was significant on three breeds (Table 2).
ss7212922126 and ss7212922153 on CFA1, as well as ss7212922156 and ss7212922152 on CFA8 were in high linkage disequilibrium (r2 > 0.80) and therefore these SNP pairs were interpreted to represent one locus each. Thus, the 22 markers associating with CHD and with OR’s deviating from 1 represent 20 different loci on 13 chromosomes (Table 2).
Enrichment of the neddylation-pathway
Our across and within breed analyses highlighted altogether 21 loci on fourteen chromosomes. These loci contain hundreds of candidate genes and we wanted to understand whether they are enriched in any cellular pathways. We performed two analyses using the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) searching for the dog genes/proteins [15], including all the positional candidate genes within 1 Mb of the associated SNPs. In the first analysis, no enriched pathways were detected among the 58 positional candidate genes from the across-breed study (see Additional File 4). For the second analysis, we pooled all the 272 positional candidate genes from the across and within breed studies (Additional File 4). An enrichment in the neddylation pathway (Reactome ID: CFA-8951664) was spotted by STRING with a false discovery rate of 0.0314 (12 observed genes / 220 genes in the neddylation-pathway gene set, see Fig. 1 and Table 3).
Discussion
Previous efforts by us and others have discovered tens of loci associating with CHD across breeds. However, the significance of these findings has remained vague due to the lack of proper replication because of the variability of the phenotypes, varied study approaches and inadequate sample sizes.
Our replication study in over 1600 dogs across 10 breeds with 52 reported CHD markers validates 21 loci on fourteen chromosomes. Five loci were associated across breeds. Two loci included more than one validated marker. Inclusion of more markers in the study would have made the validation of each locus more robust and possibly allowed further selection of associated loci by the presence of clusters of validated SNPs. Also, after our analyses several interesting markers were published [16].
In total, the replicated loci include over 250 genes with an enrichment of candidate genes in a neddylation pathway. Neddylation contributes to various cellular functions, including inflammation, commonly found in CHD and OA. Collectively, these results highlight the complex genetic background of CHD and provide important insights to the significance of the common and breed-specific loci in CHD. This helps to prioritize loci for further studies to identify causal variants and suggest a novel hypothesis for the possible contribution of the affected neddylation pathway to CHD and OA.
Four loci on CFA1, CFA14, CFA26, and CFA37 associated with CHD across breeds. The variants ss7212922118 and ss7212922120 on CFA1 were reported by us to associate with CHD in German Shepherds [13]. These SNPs locate within and upstream of NADPH Oxidase 3 (NOX3), which is a catalyst for the formation of superoxides and other reactive oxygen species. NADPH oxidases are an essential part of a reaction chain that has been suggested to contribute to the initiation of articular cartilage degradation [17]. However, NOX3 is mainly expressed in the inner ear and foetal tissues, which leaves its role in CHD equivocal.
The SNP on CFA14 (ss7212922135) was originally reported to associate with CHD in Bernese Mountain dogs [6]. This SNP did not associate with CHD in our Bernese Mountain dog cohort, which may result from different case definition between the current and the earlier study. The variant ss7212922135 originated from a study cohort where all cases (N = 33) had mild CHD (FCI score C) [6], while our Bernese Mountain dog cases represented mild-to-severe CHD based on FCI hip scoring (Ncases = 88; 42 with mild (FCI score C), 40 with moderate (FCI score D), and six with severe (FCI score E) CHD). However, ss7212922135 was significant in the across-breed analysis. This SNP lies within the ninth intron of Cortactin Binding Protein 2 encoding gene (CTTNBP2) [6]. CTTNBP2 participates in brain development and the regulation of synapse organisation. Although no obvious connection was found between this gene and CHD in the original study [6], some more recent phenotype associations have been listed in the GWAS catalogue (https://www.ebi.ac.uk/gwas/home) for CTTNBP2 that are worth noting: juvenile idiopathic arthritis and idiopathic osteonecrosis of the femoral head. Furthermore, body mass index adjusted waist-hip ratio, a measure for storage fat in humans, is listed in the GWAS catalogue [18] for ST7, WNT2, ASZ1, and CFTR, all within ±1 Mb of ss7212922135. Obesity is a known environmental risk factor for hip dysplasia and OA in both dogs and humans [19], although the mechanism is still poorly understood. Finally, we want to highlight the gene encoding Wnt Family Member 2 (WNT2) ~ 401 kb downstream ss7212922135. Wnt signalling pathways have been shown to participate in joint development, cartilage maintenance homeostasis, and the development and progression of OA [20].
Variant ss7212922151 on CFA26 was the third SNP, which demonstrated significant association to CHD in the across-breed analysis. This SNP originates from an association study of CHD in German Shepherds and locates within the fifth intron of Kinase Suppressor Of Ras 2 encoding gene (KSR2) [7]. KSR2, a scaffolding protein in the Ras-Raf-MEK-ERK pathway, has been associated with obesity in mice and humans [21, 22]. Although the non-SMAD-dependent TGF-β/BMP signalling in the osteoblastic lineage involves MKK3/6 and p38 [23], we are not aware of any studies reporting on their interaction with KSRs. Another gene of interest in the same locus is Nitric Oxide Synthase 1 (NOS1; ~ 262 kb away from ss7212922151). A large variety of different phenotypes have been reported in Nos1 murine models, including abnormal skeletal and skeletal muscle phenotypes for Nos1tm1Plh-mutants [24].
ss7212922122 on CFA33 locates within the gene encoding PEST Proteolytic Signal Containing Nuclear Protein (PCNP), and was originally found to associate with CHD in German Shepherds [8]. PCNP expression is omnipresent in different tissues, and with its ubiquitination partner Np95/ICBP90-like RING finger protein (NIRF) it may be involved in a signalling pathway of cell cycle regulation and/or genome stability [25, 26]. ss7212922122 is also about 469 kb upstream from ABI Family Member 3 Binding Protein (ABI3BP), which is a collagen and glycosaminoglycan binding molecule and an extracellular matrix structural component. Moreover, an intron variant of ABI3BP (rs9828061) has been associated with joint hypermobility measurement in humans [27].
Lastly, ss7212922139 on CFA37 locates within the first intron of Neuropilin 2 (NRP2). It was the only marker that despite not being significantly associated with CHD in any particular breed was still significantly associated with CHD in the across-breed analysis. Indeed, this SNP originated from a study, which utilised association and linkage populations of multiple breeds and their crosses [4]. In our study the marker had OR < 1, indicating protective effect. The original study did not report ORs, and the SNP associated with OA [4]. Although the current study data did not contain the direct OA phenotypes, OA is nevertheless assessed and considered when determining the FCI score. Considering the original and current study, ss7212922139 could represent a genuine “across-breed locus” for CHD and OA. Zhou et al. (2010) suggested Par-3 Family Cell Polarity Regulator Beta (PARD3B) as a candidate gene due to its association with OA of the knee in humans [4, 28]. In addition to PARD3B, NRP2 could also be a plausible candidate for OA. Neuropilin 2 is as a co-receptor for vascular endothelial growth factors (VEGFs) [29]. Increased VEGF expression has been indicated to associate with increased severity of OA [30], and a functional study demonstrated that VEGF injections into knee joints of mice induced OA [31]. Moreover, VEGF and its receptors have been studied as targets for treatment of OA [32], and an experimental study of a VEGF antibody (rhu-Mab-VEGF; Bevacizumab or Avastin®) has shown that Bevacizumab could offer a potential therapy for OA [33]. However, it remains unknown how NRP2-VEGF interaction could affect the development and progression of OA in dogs.
The current within-breed analyses revealed a multitude of loci that associated with CHD, strengthening the hypothesis that CHD has a complex genetic architecture and distinct genetic backgrounds in different breeds. The highest number of associated SNPs was found with Labrador Retriever (6), while none of the tested markers associated with CHD in our Bernese Mountain dog cohort. The associated markers and loci varied between breeds. Some SNPs demonstrated association to CHD in more than one breed and ss7212922151 on CFA26 was also significant in the across-breed analysis. We want to acknowledge here that the results in the Finnish Lapphund, Golden Retriever and Labrador Retriever breeds are inflated due to possible population stratification (as evidenced by the Q-Q plots, see Additional file 3). These breeds have both show and working/herding lines, which we could not account for in this study due to the missing line information. Thus, the breed-specific results for these three breeds should be regarded with some caution.
Some of the significant markers from the within-breed analyses had notably higher or lower ORs (Tables 1 and 2), which is probably due to higher across-breed variation for these markers. Also, worth noting was that some breeds had ORs of over 5 for certain markers (Table 2), which indicates relatively strong association to the disease outcome for such a complex disorder. Future studies should concentrate on these loci for breeds such as the Great Dane that had OR of 5.07 for ss7212922133 on CFA14, and had no marked inflation observed (see Additional file 3, plots M and N).
This study replicated altogether 21 loci with over 250 potential candidate genes raising an interesting question of the possible relationship and enrichment of the candidate genes in the associated loci predisposing to CHD. The STRING analysis revealed that the candidate genes were enriched in a single pathway, neddylation, which is a ubiquitination-like conserved post-translational protein modification process [34]. The key actor in neddylation is NEDD8, which is conjugated to its substrates with the help of enzymes E1 (activation of NEDD8; NAE1-UBA3 heterodimer), E2 (conjugation; UBE2M or UBE2F), and E3 (ligase; RBX1 or RBX2) [35]. Neddylation modifies the biochemical properties of the target substrates [34], such as many members of the cullin-family, p53, or EGFR [35, 36]. Neddylation is essential for cell cycle progression [37, 38], and it has been linked to many pathologies, especially to human cancers [36, 37]. Neddylation was recently linked to inflammatory arthritis via increased NF-κB activation, although increased expression of neddylation-related genes (NEDD8 and CUL1) were observed only in the synovium of the rheumatic arthritis and not in the controls with non-inflammatory OA [39].
Interestingly, searching the STRING database with the 12 neddylation pathway associated candidate genes from the current study (Table 3) and with 14 genes (CYBA, MAPK14, MMP2, MMP9, NCF1, NCF2, NCF4, NOX3, NOXA1, NTN1, RAC1, RAC1, TRIO, VCAM1) highlighted in our previous studies on CHD on German Shepherds [13, 40] produced two gene clusters that shared 12 genes associated with Class I MHC mediated antigen processing & presentation (R-CFA-983169, https://version-11-0b.string-db.org/cgi/network?networkId=bPwtieGk6WuV, and Additional file 5).
There is cumulating evidence that inflammatory mechanisms are active in OA [41, 42]. Neddylation is demonstrated to participate in regulation of i.a. T-cell and macrophage functions during inflammation [39, 43, 44]. Both T-cell and macrophage mediated inflammatory responses have been observed in OA [42, 45], and therefore, the possible role of neddylation-pathway in the immune base of OA should be explored further.
Conclusions
We replicate 21 previously reported CHD-associated loci on fourteen chromosomes and identify neddylation as a novel candidate pathway for CHD and OA. We identify common and breed-specific loci and highlight the complex genetic architecture of CHD. Identification of the causal genes and variants in the associated loci remains as an important future task to better understand the molecular pathogenesis of CHD and its subtraits towards improved treatment and diagnostic options.
Methods
Study cohorts and phenotype
We used SNP genotyping to validate 52 SNPs on several chromosomes in a large cohort comprising of ten breeds. Our cohort consisted of 1607 dogs with FCI hip scores (Table 4), which were used to categorize dogs into cases (hip score C or worse on both joints, N = 772) and controls (hip score A/A, N = 835). It is worth noting that the FCI scoring does not represent a quantitative phenotype. Furthermore, the distribution of phenotype categories of the cohort is very skewed and does not meet the assumptions required for the analysis of datasets in the ordinary scale. Because of these reasons, only case-control analysis was possible. To minimize phenotype ambiguity, dogs with hips scored B were excluded. Although score B is considered normal, it nevertheless represents a borderline normal phenotype.
The participating breeds were (in order of number of samples per breed): Finnish Lapphund, Golden Retriever, Lagotto Romagnolo, Bernese Mountain dog, Samoyed, Spanish Water dog, Great Dane, Labrador Retriever, Karelian Bear dog, and Finnish Hound. These breeds were chosen for the project because the prevalence of CHD in them is at least moderate. FKC collects FCI hip scoring data into an open-access breeding database from which the phenotypes were gathered for this study [2, 46]. We investigated the prevalence of CHD in the above-mentioned breeds as the mean of observed yearly prevalences including all cases from FCI hip score C to E, measured in an eleven-year period (2006–2016, year of birth) [2, 46]. The lowest mean prevalence was observed for Labrador Retriever (19%) and the Finnish Hound (28%) [46]. Finnish Lapphund, Golden Retriever, Samoyed, Spanish Water dog, and Great Dane all had a mean prevalence of CHD between 30 and 40% during this time period [46]. The highest mean prevalence was observed for the Karelian Bear dog (41%), Bernese Mountain dog (43%), and Lagotto Romagnolo (45%) [46].
To minimise possible genomic stratification and subsequent inflation of the test statistics we were careful not to incorporate close relatives and included only one individual from all core-families during the initial data collection. Nevertheless, more distant relatedness may exist within the breeds. Also, Finnish Lapphund, Golden Retriever and Labrador Retriever are breeds that have working/herding breeding lines, which are at least partially separate from the rest of the breeding dogs. In the current study we did not have the breeding line information for these dogs and could not account for it in the analyses. This may cause additional inflation of the test statistics due to stratification within the breed. The breed-specific results for these three breeds must therefore be interpreted with some caution.
SNP genotyping
Agena MassARRAY® iPLEX was used to genotype 52 SNPs (Additional files 1 and 2) in 1607 dogs from ten breeds. Six of the SNPs were chosen for the project from our own study in German Shepherds [13]. Ten markers from CFA1, CFA5, CFA8, CFA20, and CFA25 were selected based on investigations by Wisdom Health. These markers are included in a patent (Patent no.: US10150998B2 [12]) and five of them were also in Lavrijsen et al. (2014) [9]. The rest of the markers were chosen from earlier studies of Zhou et al. (2010) [4], Friedenberg et al. (2011) [5], Pfahler and Distl (2012) [6], Bartolome et al. (2015) [11], Fels and Distl (2014) [7], Fels et al. (2014) [8], and Sanchez-Molano et al. (2014) [10] to evaluate whether results were replicable in our data set. Data from the previous studies (breed, size of cohort and reported raw and corrected P-values) are summarized for each marker in Additional file 1.
The genotyping was performed by the Institute of Molecular Medicine Finland (FIMM) Technology Centre, University of Helsinki. The genotyping was done in two separate batches; the first batch included samples from breeds Labrador Retriever, Golden Retriever, and Bernese Mountain Dog; the second batch included samples from breeds Spanish Water Dog, Karelian Bear Dog, Lagotto Romagnolo, Finnish Hound, Samoyed, Finnish Lapphund, and Great Dane. Initial quality control of data was carried out in FIMM. In the first batch one sample was discarded due to success rates lower than 70%, and four SNP assays were discarded due to unreliable or no results. In the second batch, five samples were discarded due to success rates lower than 70%, and two SNP assays were discarded due to unreliable or no results. The resulting data were delivered to us as map and ped files. The map file was based on the CanFam3.1. reference, Annotation Release 104.
Quality control
We carried out the quality control in stages for within-breed and across-breed association analyses. Initially there were 1607 samples and 52 SNPs before the QC steps. The QC thresholds for the breed-specific data were: 0.90 for per ID and per SNP call rates, 0.05 for the minor allele frequency (MAF), and 0.0001 for the cut-off p-value in check for Hardy-Weinberg Equilibrium (HWE) (done in controls only). The resulting breed-specific data is described in Table 4. SNP-specific failures to meet the QC criteria are presented in Additional file 1.
The QC for the across-breed data was done in two steps in PLINK. The first QC step was done at the breed-level before merging, with the following thresholds: 0.90 for per ID and per SNP call rates, 0.0001 for the cut-off p-value in check for HWE check in controls. MAF cut-off level was set to zero in this initial step, because some SNPs that might not pass the MAF threshold within a breed, might however pass it in the across-breed cohort. Subsequently, the SNPs and individuals that passed this initial QC step were merged into the across-breed data set. The final QC with the MAF cut-off threshold at 0.05 was then executed over the whole across-breed data, which left us 46 SNPs and 1572 samples. However, one Bernese Mountain dog and one Golden Retriever had hip scores of B/D (left hip/right hip) and they were excluded from the analysis, because we could not rule out the possibility of unilateral CHD induced by an injury or other environmental factor. Therefore, we finally had 1570 dogs in our analyses, of which 751 were cases, 819 were controls, and 666 were males and 904 were females.
Association analysis
Before the analyses we tested the quality-controlled SNP data for possible batch effects in R with the glm-function. This was done because the genotyping was executed in two batches. We did not observe any significant batch effects. The within-breed analyses were carried out with the --assoc (X2-test of allele frequencies) or with --logistic (logistic regression) functions in PLINK, with age at radiographing as covariate in the logistic model (--assoc cannot use covariates).
As the SNPs in the study are a strongly selected and small subset of all the SNPs in the genome, the quantification of inflation is a challenge. QQ-plots were used to assess the overall fit of the alternative models and to select between them (Additional file 3). The QQ-plot is a more representative proxy for the fit of the model than the lambda value by PLINK which is based on the ratio of single (median) value of the test variable.
Some inflation of the test statistics in both analysis methods was observed in three breeds (see Additional file 3, plots A–D and O–P): Finnish Lapphund, Golden Retriever, and Labrador Retriever. We did not attempt to quantify or control the stratification in these breeds by PCA as it is not expected to work when the number of markers is small and the effect of any single marker is expected to be low [47] as is the case in our study.
Odds ratios and their 95% confidence intervals were calculated in PLINK using the default settings and the function --ci. PLINK assigns the less frequent allele as the minor allele that increases the risk when the odds ratio is greater than one.
We used 2x2xK (K = 11) Cochran-Mantel-Haenszel (CMH) statistics for the across-breed analysis. CMH is a standard test for a stratified case-control analysis. This was carried out in PLINK with the function --mh and with breed clusters defined with the --within function. Odds ratios and the respective 95% confidence intervals were automatically calculated by PLINK. The CMH test assumes homogeneity of the odds ratios between strata (breeds in our case), and violation of this assumption may lead to false positive associations [48]. Therefore, we used --homog function in PLINK to check if any of the SNPs demonstrating association in the CMH test, would violate the homogeneity assumption; all associated SNPs passed the homogeneity test. All permutation analyses (using 10,000 permutations) within and across breeds were executed with the max(T) permutation procedure in PLINK with the function -- mperm 10,000. We used a fixed seed (--seed 873,051,416) generated in Unix shell with “date +%N” to ensure reproducible results in all of the permutation analyses.
STRING analysis
We used STRING (Search tool for retrieval for interacting genes/proteins) (V11.0) [15] to carry out a pathway analysis of the candidate gene sets, which we acquired from our association analyses. All genes from within 1 Mb from the variant that demonstrated significant association to CHD were listed and then used as an input for the STRING Multiple proteins search. The used candidate gene sets are listed in the Additional file 4. The STRING database was queried with 272 canine genes but seven genes (TMEM244, STRA6L, FOXE1, RPS13, SERGEF, ESPNL, and FAB172B) were not found. In addition, U6 and CNKSR3 were not recovered in the expected chromosome and were discarded. Thus, the STRING network for the positional candidate genes consisted 263 nodes (https://version-11-0.string-db.org/cgi/network.pl?networkId=5Sqk4IV9gi5b). We also performed an additional search with a combined set of neddylation pathway associated genes from Tables 3 and 14 genes highlighted in our previous studies (https://version-11-0b.string-db.org/cgi/network?networkId=bPwtieGk6WuV).
Availability of data and materials
The datasets generated and analysed during this study are available in the FIGSHARE repository: https://doi.org/10.6084/m9.figshare.11369511. The data was anonymised to protect the privacy of the dog owners.
References
Loder RT, Todhunter RJ. The demographics of canine hip dysplasia in the United States and Canada. J Vet Med. 2017;2017:5723476. https://doi.org/10.1155/2017/5723476.
Brass W. Hüftgelenkdysplasie und Ellbogenkrankkung im Visier der Fédération Cynologique Internationale. Kleintierpraxis. 1993;38:191–266.
Verhoeven G, Coopman F, Duchateau L, Bosmans T, Van Ryssen B, Van Bree H. Interobserver agreement on the assessability of standard ventrodorsal hip-extended radiographs and its effect on agreement in the diagnosis of canine hip dysplasia and on routine FCI scoring. Vet Radiol Ultrasound. 2009;50:259–63.
Zhou Z, Sheng X, Zhang Z, Zhao K, Zhu L, Guo G, et al. Differential genetic regulation of canine hip dysplasia and osteoarthritis. PLoS One. 2010;5:e13219. https://doi.org/10.1371/journal.pone.0013219.
Friedenberg SG, Zhu L, Zhang Z, van den Berg Foel W, Schweitzer PA, Wang W, et al. Evaluation of a fibrillin 2 gene haplotype associated with hip dysplasia and incipient osteoarthritis in dogs. Am J Vet Res. 2011;72:530–40. https://doi.org/10.2460/ajvr.72.4.530.
Pfahler S, Distl O. Identification of quantitative trait loci (QTL) for canine hip dysplasia and canine elbow dysplasia in Bernese Mountain dogs. PLoS One. 2012;7:e49782. https://doi.org/10.1371/journal.pone.0049782.
Fels L, Distl O. Identification and validation of quantitative trait loci (QTL) for canine hip dysplasia (CHD) in German shepherd dogs. PLoS One. 2014;9:e96618. https://doi.org/10.1371/journal.pone.0096618.
Fels L, Marschall Y, Philipp U, Distl O. Multiple loci associated with canine hip dysplasia (CHD) in German shepherd dogs. Mamm Genome. 2014;25:262–9. https://doi.org/10.1007/s00335-014-9507-1.
Lavrijsen ICM, Leegwater PAJ, Martin AJ, Harris SJ, Tryfonidou MA, Heuven HCM, et al. Genome wide analysis indicates genes for basement membrane and cartilage matrix proteins as candidates for hip dysplasia in Labrador retrievers. PLoS One. 2014;9:e87735. https://doi.org/10.1371/journal.pone.0087735.
Sánchez-Molano E, Woolliams JA, Pong-Wong R, Clements DN, Blott SC, Wiener P. Quantitative trait loci mapping for canine hip dysplasia and its related traits in UK Labrador retrievers. BMC Genomics. 2014;15:833. https://doi.org/10.1186/1471-2164-15-833.
Bartolomé N, Segarra S, Artieda M, Francino O, Sánchez E, Szczypiorska M, et al. A genetic predictive model for canine hip dysplasia: integration of genome wide association study (GWAS) and candidate gene approaches. PLoS One. 2015;10:e0122558. https://doi.org/10.1371/journal.pone.0122558.
Martin A. US10150998B2; 2018. p. 1–30. https://patents.google.com/patent/US10150998B2/en. Accessed 10 Oct 2019
Mikkola LI, Holopainen S, Lappalainen AK, Pessa-Morikawa T, Augustine TJP, Arumilli M, et al. Novel protective and risk loci in hip dysplasia in German shepherds. PLoS Genet. 2019;15:e1008197. https://doi.org/10.1371/journal.pgen.1008197.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–75. https://doi.org/10.1086/519795.
Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47:D607–13. https://doi.org/10.1093/nar/gky1131.
Hayward JJ, Castelhano MG, Oliveira KC, Corey E, Balkman C, Baxter TL, et al. Complex disease and phenotype mapping in the domestic dog. Nat Commun. 2016;7:10460. https://doi.org/10.1038/ncomms10460.
Daumer KM, Khan AU, Steinbeck MJ. Chlorination of pyridinium compounds. Possible role of hypochlorite, N-chloramines, and chlorine in the oxidation of pyridinoline cross-links of articular cartilage collagen type II during acute inflammation. J Biol Chem. 2000;275:34681–92. https://doi.org/10.1074/jbc.M002003200.
GWAS catalog. https://www.ebi.ac.uk/gwas/efotraits/EFO_0007788. Accessed 16 Jan 2020.
Marshall WG, Bockstahler BA, Hulse DA, Carmichael S. A review of osteoarthritis and obesity: current understanding of the relationship and benefit of obesity treatment and prevention in the dog. Vet Comp Orthop Traumatol. 2009;22:339–45. https://doi.org/10.3415/VCOT-08-08-0069.
Stampella A, Monteagudo S, Lories R. Wnt signaling as target for the treatment of osteoarthritis. Best Pract Res Clin Rheumatol. 2017;31:721–9. https://doi.org/10.1016/J.BERH.2018.03.004.
Pearce LR, Atanassova N, Banton MC, Bottomley B, van der Klaauw AA, Revelli J-P, et al. KSR2 mutations are associated with obesity, insulin resistance, and impaired cellular fuel oxidation. Cell. 2013;155:765–77. https://doi.org/10.1016/j.cell.2013.09.058.
Revelli J-P, Smith D, Allen J, Jeter-Jones S, Shadoan MK, Desai U, et al. Profound obesity secondary to Hyperphagia in mice lacking kinase suppressor of Ras 2. Obesity. 2011;19:1010–8. https://doi.org/10.1038/oby.2010.282.
Wu M, Chen G, Li Y-P. TGF-β and BMP signaling in osteoblast, skeletal development, and bone formation, homeostasis and disease. Bone Res. 2016;4:16009. https://doi.org/10.1038/boneres.2016.9.
Nos1 Targeted Allele Detail MGI Mouse (MGI:1857227). http://www.informatics.jax.org/allele/MGI:1857227. Accessed 24 Sep 2019.
Mori T, Li Y, Hata H, Kochi H. NIRF is a ubiquitin ligase that is capable of ubiquitinating PCNP, a PEST-containing nuclear protein. FEBS Lett. 2004;557:209–14. https://doi.org/10.1016/S0014-5793(03)01495-9.
Mori T, Li Y, Hata H, Ono K, Kochi H. NIRF, a novel RING finger protein, is involved in cell-cycle regulation. Biochem Biophys Res Commun. 2002;296:530–6. https://doi.org/10.1016/S0006-291X(02)00890-2.
Pickrell JK, Berisa T, Liu JZ, Ségurel L, Tung JY, Hinds DA. Detection and interpretation of shared genetic influences on 42 human traits. Nat Genet. 2016;48:709–17. https://doi.org/10.1038/ng.3570.
Valdes AM, Loughlin J, Timms KM, van Meurs JJB, Southam L, Wilson SG, et al. Genome-wide association scan identifies a prostaglandin-endoperoxide synthase 2 variant involved in risk of knee osteoarthritis. Am J Hum Genet. 2008;82:1231–40. https://doi.org/10.1016/j.ajhg.2008.04.006.
Favier B. Neuropilin-2 interacts with VEGFR-2 and VEGFR-3 and promotes human endothelial cell survival and migration. Blood. 2006;108:1243–50. https://doi.org/10.1182/blood-2005-11-4447.
Yuan Q, Sun L, Li J-J, An C-H. Elevated VEGF levels contribute to the pathogenesis of osteoarthritis. BMC Musculoskelet Disord. 2014;15:437. https://doi.org/10.1186/1471-2474-15-437.
Ludin A, Sela JJ, Schroeder A, Samuni Y, Nitzan DW, Amir G. Injection of vascular endothelial growth factor into knee joints induces osteoarthritis in mice. Osteoarthr Cartil. 2013;21:491–7. https://doi.org/10.1016/j.joca.2012.12.003.
Hamilton JL, Nagao M, Levine BR, Chen D, Olsen BR, Im H-J. Targeting VEGF and its receptors for the treatment of osteoarthritis and associated pain. J Bone Miner Res. 2016;31:911–24. https://doi.org/10.1002/jbmr.2828.
Nagai T, Sato M, Kobayashi M, Yokoyama M, Tani Y, Mochida J. Bevacizumab, an anti-vascular endothelial growth factor antibody, inhibits osteoarthritis. Arthritis Res Ther. 2014;16:427. https://doi.org/10.1186/s13075-014-0427-y.
Rabut G, Peter M. Function and regulation of protein neddylation. “Protein modifications: beyond the usual suspects” review series. EMBO Rep. 2008;9:969–76. https://doi.org/10.1038/embor.2008.183.
Brown JS, Jackson SP. Ubiquitylation, neddylation and the DNA damage response. Open Biol. 2015;5:150018. https://doi.org/10.1098/rsob.150018.
Jiang Y, Jia L, Sun Y. Induction of Protective Autophagy in Cancer Cells by NAE Inhibitor MLN4924. In: Hayat MA, editor. Autophagy: Cancer, Other Pathologies, Inflammation, Immunity, Infection, and Aging. Elsevier; 2015. p. 135–43. https://doi.org/10.1016/B978-0-12-801033-4.00008-4.
Enchev RI, Schulman BA, Peter M. Protein neddylation: beyond cullin–RING ligases. Nat Rev Mol Cell Biol. 2015;16:30–44. https://doi.org/10.1038/nrm3919.
Tateishi K, Omata M, Tanaka K, Chiba T. The NEDD8 system is essential for cell cycle progression and morphogenetic pathway in mice. J Cell Biol. 2001;155:571–9. https://doi.org/10.1083/jcb.200104035.
Liu K, Chen K, Zhang Q, Zhang L, Yan Y, Guo C, et al. TRAF6 neddylation drives inflammatory arthritis by increasing NF-κB activation. Lab Investig. 2019;99:528–38. https://doi.org/10.1038/s41374-018-0175-8.
Mikkola L, Holopainen S, Pessa-Morikawa T, Lappalainen AK, Hytönen MK, Lohi H, et al. Genetic dissection of canine hip dysplasia phenotypes and osteoarthritis reveals three novel loci. BMC Genomics. 2019;20:1027.
Goldring MB, Otero M. Inflammation in osteoarthritis. Curr Opin Rheumatol. 2011;23:471–8. https://doi.org/10.1097/BOR.0b013e328349c2b1.
Robinson WH, Lepus CM, Wang Q, Raghu H, Mao R, Lindstrom TM, et al. Low-grade inflammation as a key mediator of the pathogenesis of osteoarthritis. Nat Rev Rheumatol. 2016;12:580–92. https://doi.org/10.1038/nrrheum.2016.136.
Jin H, Liao L, Park Y, Liu Y-C. Neddylation pathway regulates T-cell function by targeting an adaptor protein Shc and a protein kinase Erk signaling. Proc Natl Acad Sci U S A. 2013;110:624. https://doi.org/10.1073/PNAS.1213819110.
Chang F-M, Reyna SM, Granados JC, Wei S-J, Innis-Whitehouse W, Maffi SK, et al. Inhibition of neddylation represses lipopolysaccharide-induced proinflammatory cytokine production in macrophage cells. J Biol Chem. 2012;287:35756–67. https://doi.org/10.1074/jbc.M112.397703.
de Lange-Brokaar BJE, Ioan-Facsinay A, van Osch GJVM, Zuurmond A-M, Schoones J, Toes REM, et al. Synovial inflammation, immune cells and their cytokines in osteoarthritis: a review. Osteoarthr Cartil. 2012;20:1484–99. https://doi.org/10.1016/J.JOCA.2012.08.027.
Finnish Kennel Club. Breeding Database. 2019. https://jalostus.kennelliitto.fi/frmEtusivu.aspx? FCI=0. Accessed 23 Aug 2019.
Freedman ML, Reich D, Penney KL, McDonald GJ, Mignault AA, Patterson N, et al. Assessing the impact of population stratification on genetic association studies. Nat Genet. 2004;36:388–93.
Wiberg RAW, Gaggiotti OE, Morrissey MB, Ritchie MG. Identifying consistent allele frequency differences in studies of stratified populations. Methods Ecol Evol. 2017;8:1899–909. https://doi.org/10.1111/2041-210X.12810.
Acknowledgements
We acknowledge the Institute of Molecular Medicine Finland (FIMM) Technology Centre for their help in the genotyping. We thank Milla Ahola for valuable discussions about model fitting and other statistical aspects of our study. We also wish to thank Sini Karjalainen for her technical assistance during the project. Finally, we are sincerely grateful for all the dog owners who donated samples from their dogs for the study.
Funding
This study was funded by the Academy of Finland (252602, www.aka.fi/en, to AI), The American Kennel Club Canine Health Foundation (01828, www.akcchf.org, to AI), The Aatos and Jane Erkko Foundation (jaes.fi/en, to HL), and Biocentrum Helsinki (to HL).
Author information
Authors and Affiliations
Contributions
Design and concept: HL, AI, LM. Materials, methods and reagents: LM, HL, MKH, AL, JD. Data analyses: LM and KK. The manuscript was drafted by LM, HL, and AI, and was commented and approved by others.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
We followed guidelines for research ethics and good scientific practice. We have an ethical license to collect EDTA blood samples (ESAVI/343/04.10.07/2016) from the Animal Experiment Board in Finland that is under the Regional State Administrative Agency of Southern Finland. The dog owners have signed a form of consent at the time of the blood sampling.
Consent for publication
Not applicable.
Competing interests
JD is an employer of Genoscoper Laboratories, a unit of Wisdom Health in Mars Petcare, which provides genetic testing to dogs. HL provides consultancy to Wisdom Health.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1.
List of markers used in the validation study. 52 selected markers with their position in CanFam3.1 are listed. From the original publications the following details are presented: marker ID, associated phenotype, raw and corrected P value, and the cohort size and represented breeds. Regarding the current study, the breeds for which the QC failed are indicated as well as the ssID’s for the markers that passed the QC.
Additional file 2.
Reference, alternative, major and minor alleles, and genotype frequencies for the 23 validated SNPs in the across-breed and within-breed studies.
Additional file 3.
Quantile-Quantile plots of the within-breed association analyses. The image shows breed-wise comparisons of P-values (−log10) from a logistic regression model (logistic) or basic association analysis (X2 test; assoc).
Additional file 4.
Candidate gene sets for the STRING (V11.0) pathway analysis. Two lists are provided: set of candidate genes from the across-breed association analysis (N = 41), and a set of candidate genes from the within-breed association analyses (N = 261). Total number of unique genes in the analyses is 272. Each of the listed genes are located maximum ±1 Mb from a SNP that demonstrated significant association to CHD in one of the analyses.
Additional file 5.
Reactome pathway R-CFA-983169 on Class I MHC mediated antigen processing & presentation bridges the clusters of neddylation pathway associated genes (ASB7, PSMC6, FBXO10, DCAF10, ASB4, ASB18, COPS8, UBE2F, ASB1, FBXW8, FBXO21, WSB2) and the NOX3/MMP2/MMP9/TRIO-associated genes (CYBA, MAPK14, MMP2, MMP9, NCF1, NCF2, NCF4, NOX3, NOXA1, NTN1, RAC1, TRIO, VCAM1) from our previous studies on German Shepherds. Genes belonging to R-CFA-983169 are marked red. For a high resolution image and the original analysis, see: https://version-11-0b.string-db.org/cgi/network?networkId=bPwtieGk6WuV.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Mikkola, L., Kyöstilä, K., Donner, J. et al. An across-breed validation study of 46 genetic markers in canine hip dysplasia. BMC Genomics 22, 68 (2021). https://doi.org/10.1186/s12864-021-07375-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12864-021-07375-x