A combined GWAS approach reveals key loci for socially-affected traits in Yorkshire pigs

Wu, Pingxian; Wang, Kai; Zhou, Jie; Chen, Dejuan; Jiang, Anan; Jiang, Yanzhi; Zhu, Li; Qiu, Xiaotian; Li, Xuewei; Tang, Guoqing

doi:10.1038/s42003-021-02416-3

A combined GWAS approach reveals key loci for socially-affected traits in Yorkshire pigs

Article
Open access
Published: 20 July 2021

Volume 4, article number 891, (2021)
Cite this article

Download PDF

You have full access to this open access article

Communications Biology

A combined GWAS approach reveals key loci for socially-affected traits in Yorkshire pigs

Download PDF

Pingxian Wu ORCID: orcid.org/0000-0002-6098-9549¹^na1,
Kai Wang¹^na1,
Jie Zhou¹,
Dejuan Chen¹,
Anan Jiang¹,
Yanzhi Jiang²,
Li Zhu¹,
Xiaotian Qiu³,
Xuewei Li¹ &
…
Guoqing Tang ORCID: orcid.org/0000-0002-6920-5050¹

3207 Accesses
10 Citations
Explore all metrics

Abstract

Socially affected traits in pigs are controlled by direct genetic effects and social genetic effects, which can make elucidation of their genetic architecture challenging. We evaluated the genetic basis of direct genetic effects and social genetic effects by combining single-locus and haplotype-based GWAS on imputed whole-genome sequences. Nineteen SNPs and 25 haplotype loci are identified for direct genetic effects on four traits: average daily feed intake, average daily gain, days to 100 kg and time in feeder per day. Nineteen SNPs and 11 haplotype loci are identified for social genetic effects on average daily feed intake, average daily gain, days to 100 kg and feeding speed. Two significant SNPs from single-locus GWAS (SSC6:18,635,874 and SSC6:18,635,895) are shared by a significant haplotype locus with haplotype alleles ‘GGG’ for both direct genetic effects and social genetic effects in average daily feed intake. A candidate gene, MT3, which is involved in growth, nervous, and immune processes, is identified. We demonstrate the genetic differences between direct genetic effects and social genetic effects and provide an anchor for investigating the genetic architecture underlying direct genetic effects and social genetic effects on socially affected traits in pigs.

Whole-genome re-sequencing association study for direct genetic effects and social genetic effects of six growth traits in Large White pigs

Article Open access 04 July 2019

Single-step genome-wide association study for social genetic effects and direct genetic effects on growth in Landrace pigs

Article Open access 11 September 2020

Statistical model and testing designs to increase response to selection with constrained inbreeding in genomic breeding programs for pigs affected by social genetic effects

Article Open access 04 January 2021

Introduction

Social interactions inevitably occur among groups of individuals and these interactions influence the genetic variation of a trait in livestock¹. In current pig production systems, pigs are penned together into contemporary groups, social interactions, and observed phenotypes genetically contribute to socially affected traits of individuals and their group mates². Socially affected traits are influenced both by the genes of the individual itself (direct genetic effects, DGE) and by the genotype of other individuals in the same group (social genetic effects, SGE, also called indirect genetic effects)^2,3,4,5. Previous research has proven that many phenotypes are affected by both DGE and SGE and the contribution of SGE to total genetic variance is large in pigs, such as growth rate, body weight (BW)^6,7, feed intake, backfat thickness (BFT), muscle depth², behavior^8,9, and average daily gain (ADG)^10,11. The existence of SGE affect the estimation of DGE and therefore result in unfavorable consequence on genetic improvement of complex traits. Until recently, the genetic architecture of socially affected traits that involve both DGE and SGE is still poorly understood in pigs.

With the availability of genomic information, GWAS have increased the understanding of the contributions of genetic variants toward various complex traits. However, previous studies have mainly focused on DGE and have not considered the contributions of SGE. For many phenotypes, DGE and SGE can be directly quantified using a social genetic model². Based on the social genetic model, the classical (direct) genetic model is expanded with SGE¹². This model simultaneously contains both DGE and SGE that enables researchers to simultaneously estimate DGE and SGE values.

Detecting the genetic architecture of DGE and SGE may provide deeper insight into a better understanding of complex traits. To date, only a few studies have identified some SNPs associated with SGE. Using the Illumina 60 K SNP BeadChip, an important SNP was identified to be associated with DGE and SGE in chickens¹³. In pigs, five SNPs on chromosome 6 associated with social ADG were detected with an Illumina panel¹⁴. Additionally, in our previous study, a total of 27 SNPs associated with six growth traits were identified using whole-genome sequences (WGS) of 40 Yorkshire pigs (Zhejiang Tianpeng Group Co., Ltd. Zhejiang, China)¹⁵. However, because of the limited sample size, the study was not efficient in detecting credible associations with DGE and SGE.

Assessing the impact of SGE on socially affected traits is challenging. Haplotype-based GWAS can capture those genetic variants that are not detected by single-locus GWAS¹⁶. Therefore, to better investigate the genetic architecture of socially affected traits in pigs, the present study used both GWAS approaches to estimate the DGE and SGE on each trait by employing a social genetic model, WGS of 60 pigs, and genotyping of 1204 Yorkshire pigs with Illumina 50 K SNP chips. Therefore, the main objectives of this study were (i) to construct haplotype loci based on imputed WGS data and identify associations between haplotype-DGE and haplotype-SGE by GWAS; (ii) to perform single-locus analysis to explore SNPs associated with socially affected traits by considering both DGE and SGE in Yorkshire pigs; (iii) to compare the results of GWAS based on haplotypes and SNPs for DGE and SGE; and (iv) to combine single-locus and haplotype-based GWAS to reveal important SNPs and genes for DGE and SGE in pigs.

Results

Phenotype statistics

Summary statistics for all phenotypic data from 1204 Yorkshire pigs were shown in Table 1. Based on the social genetic model, the estimated accuracies and deregressed EBVs of DGE and SGE for each trait were presented in Table 2. All the DGE and SGE values were used for further analysis.

Table 1 Descriptive statistics of phenotypes for eight socially–affected traits in Yorkshire pigs.

Full size table

Table 2 Summary of the estimated accuracies and deregressed EBVs for direct genetic (DGE) and effects social genetic effects (SGE) in Yorkshire pigs.

Full size table

Evaluation of sequencing data and genotype imputation

A total of 60 pigs were sequenced and the statistic results were list in Supplementary Data 1. After initial quality filtering, 3.0-TB clean data with the average mapped read depth of 17.11 was retained. The mean reads were 336,176,275 for each sample. After mapping to the pig reference genome, the mean mapped reads and the mean uniquely mapped reads were 312,374,099 and 293,904,451 for each sample, respectively. The average mapping and unmapping ratios were 92.92 and 1.28% per individual, respectively.

This study conducted genotype imputation from a 50 K SNP chip to WGS data in Yorkshire pigs using a two-breed reference population (20 Landrace and 40 Yorkshire pigs). In total, 36,969 and 14,539,997 SNPs were identified in the target and reference populations, respectively. Beagle 5.1 software was used to impute the WGS reference panel across the 1204 genotyped pigs. After genotype imputation, a total of 14,539,997 SNPs with an average accuracy of 0.63 were available for this study. For each chromosome, the average imputation accuracy was in the range of 0.59–0.67 (Fig. 1). The SSC14 had the highest imputation accuracy (0.67), while SSC10 had the lowest imputation accuracy (0.59). After quality control, a total of 3,072,572 SNPs with an average imputation accuracy of 0.90 were retained for further analyses.

**Fig. 1: Imputation accuracy for each chromosome in Yorkshire pigs.**

The average imputation accuracies of imputed SNPs with different minor allele frequency (MAF) were calculated and are shown in Fig. 2. The imputation accuracies were comparatively stable for different MAFs where the MAF was higher than 0.1. However, the imputation accuracy decreased sharply when the MAF was lower than 0.1.

**Fig. 2: Imputation accuracy against minor allele frequency (MAF).**

Single-locus GWAS for eight socially affected traits

For average daily feed intake (ADFI)

The imputed-GWAS results for DGE in ADFI are shown in Table 3, Supplementary Data 2, and Fig. 3a. The imputed-GWAS identified 413 SNPs ($P \; < \; {3.25\times 10}^{-7}$) associated with DGE at the suggestive threshold. Of these 413 SNPs, the 17 SNPs ($P \; < \; {1.63\times 10}^{-8}$) located on SSC6 and SSC8 showed a significant signal. The most significant SNP ($P={1.07\times 10}^{-10}$) was located on SSC6: 46,448,592. In a narrow 9.75 kb region (18.64–18.65 Mb) on chromosome 6, there were ten consecutive genome-wide significant SNPs that showed significant peaks. In the region of SSC8: 79.51–79.53 Mb, five genome-wide significant SNPs showed high signals. A λ of less than 1.05 is considered to indicate a lack of population stratification¹⁷. The λ calculated for the DGE on ADFI was 1.01; implying no population stratification (Supplementary Fig. 1a). For chip-based GWAS, the significant threshold was set at $P={2.70\times 10}^{-5}$. On the basis of the 50-K chip data, a total of 97 SNPs were identified (Supplementary Fig. 2). Three SNPs from the chip-based GWAS were also in the imputed GWAS ($P \; < \; {2.70\times 10}^{-5}$, Supplementary Table 1).

Table 3 Single-locus GWAS for direct genetic effects (DGE) in Yorkshire pigs.

Full size table

**Fig. 3: GWAS results for direct genetic effects (DGE) and social genetic effects (SGE) in average daily feed intake (ADFI).**

For SGE on ADFI, a total of 26 SNPs reached the suggestive threshold using imputed GWAS (Table 4, Supplementary Data 2, and Fig. 3b). Among these, 14 genome-wide significant SNPs were detected on chromosome 6, which had the highest signals (Fig. 3b). The λ was equal to 1.00, which indicates no population stratification (Supplementary Fig. 1b). A total of 89 SNPs were found from the chip-based GWAS (Supplementary Fig. 2). Among them, two overlapped with the imputed GWAS ($P \; < \; {2.70\times 10}^{-5}$, Supplementary Table 1).

Table 4 Single-locus GWAS for social genetic effects (SGE) in Yorkshire pigs.

Full size table

A comparison between the imputed GWAS of DGE and SGE on ADFI revealed a total of 17 common SNPs shared between DGE and SGE (Fig. 4). Notably, the use of SGE identified associations for ADFI: two SNPs (SSC6: 25,649,764, $P={1.67\times 10}^{-9}$ and SSC6: 25,651,261, $P={2.68\times 10}^{-9}$) were significant for SGE but were not significant for DGE ($P \; > \; {1.00\times 10}^{-7}$).

**Fig. 4: The common SNPs were shared by both direct genetic effects (DGE) and social genetic effects (SGE).**

For ADG

On the basis of imputed GWAS, seven common SNPs reached the genome-wide significant threshold ($P={1.63\times 10}^{-8}$) and were identified for both DGE (Table 3 and Figs. 4, 5a) and SGE in ADG (Table 4 and Figs. 4, 5b). Of these SNPs, five consecutive loci located within SSC8: 79.51–79.53 Mb were identified with a P value of ${1.85\times 10}^{-9}$ for DGE and ${3.17\times 10}^{-9}$ for SGE. Ten consecutive SNPs were detected at a suggestive threshold for DGE in ADG in the region between 18.64 and 18.65 MB on chromosome 6 (Supplementary Data 2). The λ was 1.02 for DGE (Supplementary Fig. 1c) and 1.03 for SGE (Supplementary Fig. 1d). These λ values were similar to the values found for DGE and SGE in ADFI. Moreover, in the chip-based GWAS, 87 SNPs were identified for DGE and 85 SNPs were identified for SGE ($P \; < \; {2.70\times 10}^{-5}$, Supplementary Fig. 2).

**Fig. 5: GWAS results for direct genetic effects (DGE) and social genetic effects (SGE) in average daily gain (ADG).**

For the other six traits

Using imputed GWAS, no significant SNPs were identified for either DGE or SGE for the backfat thickness to 100 kg (B100), days to 100 kg (D100), feed conversion ratio (FCR), residual feed intake (RFI), time in feeder per day (TPD), and feeding speed (FS) traits (Supplementary Fig. 3). However, two peaks were found for D100; one of which was located on SSC6 for DGE and the other on SSC5 for SGE. For RFI, a distinct peak located on SSC10 was found for both DGE and SGE.

In the chip-based GWAS, the total number of SNPs detected ($P \; < \; {2.70\times 10}^{-5}$) was 4 for B100, 15 for D100, 20 for RFI, 3 for FS, and 5 for TPD. For SGE, the total number of SNPs detected ($P \; < \; {2.70\times 10}^{-5}$) was 1 for B100, 5 for RFI, 1 for FS, and 1 for TPD (Supplementary Fig. 2). Among them, three SNPs were also detected by imputed GWAS for the B100, D100, and FS traits ($P \; < \; {2.70\times 10}^{-5}$, Supplementary Table 1).

Haplotype-based GWAS for eight socially affected traits

For ADFI

Four significant haplotype loci were identified for DGE (Supplementary Table 2 and Fig. 3c) and seven for SGE (Supplementary Table 3 and Fig. 3d). These were distributed on SSC1, SSC6, SSC7, SSC10, and SSC12 ($P \; < \; {1.82\times 10}^{-7}$). Among these, four common haplotype loci (HapL1834, HapL910, HapL1412, and HapL1193) were shared by DGE and SGE. Only the MT3 gene was located within the haplotype HapL1412 “GGG”, with a haplotype frequency of 0.05. Furthermore, 12 suggestive haplotype loci were detected for DGE and 17 for SGE (${1.82\times 10}^{-7} \; < \; P \; < \; {3.64\times 10}^{-6}$, Supplementary Table 4). The λ was 1.06 for DGE (Supplementary Fig. 4a) and 1.08 for SGE (Supplementary Fig. 4b); implying no population stratification.

For ADG

Three haplotype loci surpassing the significance threshold ($P={1.82\times 10}^{-7}$) were detected for DGE (Supplementary Table 2 and Fig. 5c) and one was detected for SGE (Supplementary Table 3 and Fig. 5d). The haplotype loci (HapL910) were shared by DGE and SGE. In addition, this haplotype was shared by ADFI and ADG with a haplotype frequency of 0.03. A total of ten suggestive haplotype loci were identified for DGE and SGE (Supplementary Table 4). The λ was 1.04 for DGE (Supplementary Fig. 4c) and 1.03 for SGE (Supplementary Fig. 4d); implying no population stratification.

For D100 and B100

When considering the DGE and SGE, we identified 20 different haplotype loci that reached the significance threshold ($P={1.82\times 10}^{-7}$) for D100 (Supplementary Tables 2, 3 and Figs. 6a, b). Among these, three common haplotype loci (HapL36 HapL1601, and HapL213) were detected for DGE and SGE. Eighteen candidate genes were located within these significant regions. The SDK1 gene within the significant haplotype HapL36 was shared by DGE and SGE. On basis of the suggestive level ($P={3.64\times 10}^{-6}$), a total of 21 haplotype loci were detected for DGE and 13 for SGE (Supplementary Table 4). For B100, no haplotype loci were found to be associated with DGE (Supplementary Fig. 5a) or SGE (Supplementary Fig. 5b).

**Fig. 6: Haplotype-based GWAS results for direct genetic effects (DGE) and social genetic effects (SGE).**

For FCR and RFI

No significant haplotype loci were detected for DGE or SGE (Supplementary Fig. 6); however, five suggestive haplotype loci (${1.82\times 10}^{-7} \; < \; P \; < \; {3.64\times 10}^{-6}$) were identified for DGE (three for FCR and two for RFI) and one for SGE on RFI (Supplementary Table 4 and Supplementary Fig. 6).

For TPD and FS

Significant haplotype loci located on SSC4: 130,340,342–130,390,531 were identified for the DGE on TPD (Fig. 6c) and significant haplotype loci were detected for the SGE on FS (Supplementary Fig. 7). Furthermore, 15 different haplotype loci that surpassed the suggestive level ($P={3.64\times 10}^{-6}$) were identified for DGE and SGE (Supplementary Table 4).

Comparing single-locus and haplotype-based GWAS

Haplotype-based GWAS identified more associations than single-locus GWAS for the DGE and SGE on the eight socially affected traits in pigs. Importantly, the significant HapL1412 with haplotype allele “GGG” overlapped with two significant SNPs (SSC6: 18,635,874 and SSC6: 18,635,895) for both DGE and SGE on ADFI. A candidate gene, MT3, was found in these regions. Furthermore, multiple significant SNPs and haplotype loci were shared by different traits, which indicates the pleiotropism.

Discussion

The socially affected traits are influenced by multiple genes. This study is designed to investigate DGE and SGE for eight socially affected traits using imputed WGS, as there is strong evidence that these traits are socially affected in pigs^2,8,9,10,18. Using the social genetic model¹², we estimated the DGE and SGE of each socially affected trait in Yorkshire pigs. Then the single-locus and haplotype-based GWAS were performed to investigate associations for DGE and SGE using imputed data.

Different imputation strategies can result in different levels of accuracy¹⁹. The use of multi-breed reference populations is an effective way to improve the accuracy of imputation^19,20. However, a large genetic distance between the reference and target populations can result in extremely low accuracies (less than 0.49)²¹. Landrace and Yorkshire pigs are bred according to the same breeding goals and are therefore likely genetically similar²². Additionally, because of the small effective population size for pigs²³, the imputation parameter of effective population size was set at 100 in the Beagle 5.1 software. After imputation, the average accuracy at the whole-genome level was 0.63, and 3,072,572 SNPs had a Beagle R² higher than 0.80 in the imputed WGS dataset. The imputation accuracy observed in the current study was higher than that reported in other studies^19,20,21, for example, a study using a ten-breed reference population of 168 pigs and using imputation from low- or high-density SNP data to WGS resulted in relatively low accuracy (0.39–0.49)²¹. Our results were more similar to the accuracy obtained from a multi-breed reference population of cattle (>0.70)^19,20. Thus, the present study indicates that using mixed reference populations with similar genetic backgrounds can result in reasonable imputation accuracy in pigs. The imputation accuracy increased with reference population size²⁴. In this study, a total of 40 sequenced pigs was used. Although the reference population size is smaller than that used in previous studies (>100) for pigs²⁵, this study also obtained a high imputation accuracy, and therefore, the imputed data can provide important information regarding socially affected traits in pigs.

Imputation accuracy is heavily affected by the MAF²⁶. Rare variants with a low MAF result in poor imputation accuracy and therefore decrease the average imputation accuracy. In our study, the accuracy decreased sharply when the MAF was lower than 0.10. Filtering out the SNPs with MAF < 0.10 from the imputed data increased the average imputation accuracy from 0.63 to 0.71. In our study, the imputation accuracy increased with the MAF, which is in agreement with previous studies in livestock^21,24,27,28. Rare variants are expected to have larger effects on complex traits than common variants²⁹. It is also harder to construct the haplotype background using rare SNPs and they decrease the set of template haplotypes for genotype imputation because they are observed only a few times in reference genotype datasets²⁶. Thus, genotype imputation of rare SNPs does not provide high accuracy levels. It is important to investigate imputation methods for rare variants in further studies.

In the processes of natural selection and the artificial selection of pigs, complex traits have developed high levels of genetic variation. The single-locus analysis focuses on identifying SNPs and mining functional genes for complex traits¹³, but this method has only identified a small fraction of genetic variants in pigs. Haplotype-based GWAS is a complementary method that intensifies the benefits from linkage disequilibrium (LD) and enables the evaluation of the genetic determinants of complex traits. The use of haplotype information in GWAS would likely be beneficial for detecting further genetic variants³⁰. However, haplotype-based GWAS using imputed WGS has not yet been reported in pigs. The rare variants may contribute large effects to complex traits²⁹. Our analysis were done on imputed sequence data, which may contain imputation errors³¹. To alleviate this issue, further quality control was also conducted on the imputed sequence genotypes, rare variants with a MAF lower than 0.01 were moved. After quality control, the average MAF is 0.30 for chip-based data, 0.27 for imputed data, and 0.09 for haplotype alleles. Our results show that haplotype-based GWAS can be effective and can provide more genetic information than single-locus GWAS. These results are in agreement with previous studies; for example, haplotype-based GWAS in plants³² and cattle¹⁶ captured genetic variants that were not identified by single-locus analysis. Importantly, by combining single-locus and haplotype-based GWAS, we confirmed the presence of two important SNPs (SSC6: 18,635,874 and SSC6: 18,635,895) for DGE and SGE in ADFI. Therefore, our results highlight the advantages of haplotype-based GWAS for detecting genetic variants that influence complex traits in pigs. The power of GWAS is limited by sample size and SNP density. Using the imputed WGS instead of the 50 K SNP chip is powerful to investigate socially affected traits in pigs. Although only 1204 imputed WGS were used in this study, our results also provided important genetic variation for socially affected traits. Further studies are recommended to validate these results using a larger sample size.

Socially affected traits are known to be affected by SGE, but the mechanisms of SGE are not fully known. Many studies have been conducted to understand the genetic architecture of complex traits and they have identified many important SNPs and genes associated with these traits³³. However, these studies have mainly focused on investigations of DGE, despite the majority of complex traits in pigs being affected by social interactions among individuals². Pigs are highly social animals, so ignoring SGE could incorrectly estimate genetic variance and restrict genetic improvement.

Recently, numerous studies have shown that using social genetic models during genetic evaluation can enhance the genetic improvement of socially affected traits in pigs^1,10. However, few studies have investigated the genetic architecture of DGE and SGE in pigs^14,15,34. On the basis of our GWAS results, we found that the significant loci and candidate genes of DGE are different from SGE for each trait in Yorkshire pigs. These results are in agreement with previous studies, for example, a distinct difference was found between DGE and SGE for socially affected traits in laying hens¹³, and a study based on single-step GWAS found only one QTL (SSC6: 19.9–20.9 Mb) for DGE and SGE on ADG in pigs³⁴. The population composition may affect the estimation of DGE and SGE. An experimental design using groups comprising two families is optimal for estimating SGE³⁵. Our group’s composed of numerous families because all pigs in our study were from a commercial pig performance testing station. The group design in this study may have affected the estimation of DGE and SGE; however, it is difficult to conduct an experiment containing only two families from a commercial breeding program. Moreover, in our study, all pigs were derived from the same herd and there were 20 pigs in each group, which made it easier to estimate DGE and SGE⁵.

Our results provide further evidence for the genetic differences in DGE and SGE with regard to socially affected traits. Economically important traits are selected by artificial selection when based on the classical model that only considers DGE. SGE is, therefore, less frequently selected when compared with DGE. The complex traits in pigs are highly polygenic, and more research is required to understand the genetic architecture of the SGE on the complex traits of pigs.

To assess whether the identified SNPs associated with socially affected traits in our study replicate previously identified QTL, we compared our results with the PigQTL database based on the genome position of each SNP and QTL. A total of 78 important QTL were found to overlap with previously identified QTL for DGE of complex traits in pigs (Supplementary Table 5). No previous QTL has been reported to be associated with SGE. SGE has been evaluated to play an important role in livestock¹². Thus, the investigation of the genetic architecture of SGE has important implications for pig breeding programs.

In our study, a total of 56 candidate genes associated with SGE and DGE were detected. Gene enrichment analysis showed that enriched pathways were associated with nucleic acid, calcium, and metal ion binding (Supplementary Table 6). In particular, two candidate genes, IQCM and CDH11, were associated with both DGE and SGE on the ADG and ADFI traits. CDH11 gene encodes a calcium-dependent glycoprotein and mediates calcium-dependent cell-cell adhesion³⁶. The CDH11 gene is essential for tissue development, regulation of cell proliferation, and survival^37,38. This gene has also been associated with postnatal bone development in mice³⁹. Previous studies have shown that the CDH11 gene is associated with growth traits in cattle including weaning weight⁴⁰ and RFI⁴¹. In pigs, this gene was found to be associated with fat and meat quality traits⁴² and to regulate neural development and cell motility⁴³. In this important chromosome region, six QTL for growth traits were overlapped, including ADG⁴⁴ and BW⁴⁵ in pigs. Additionally, these significant SNPs were located on the QTL for health traits⁴⁶.

In the region of 45.95–46.95 Mb on SSC6, numerous zinc-finger protein family genes were found to be associated with both DGE and SGE. The zinc-finger protein is one of the most abundant classes of transcription factors and regulates cell growth and differentiation. According to our gene ontology (GO) analysis, these genes regulate transcription, nucleic acid binding, and metal ion binding. Furthermore, this chromosome region overlapped with four reported QTL affecting ADG^44,47,48 and eating behavior⁴⁹ traits in pigs. Thus, daily interactions (SGE) between group pigs would influence their health. Interestingly, the significant SNPs within the region SSC6: 45.95–46.95 Mb were located within 20 QTL known to be associated with health traits^46,50 in pigs (https://www.animalgenome.org/).

In the region of SSC6: 18.14–19.14 Mb, MT3 gene was found in both single-locus and haplotype-based GWAS for socially affected traits. This gene encodes a growth inhibitory factor that regulates many biological processes, particularly within the nervous and immune systems, and thereby influences health. MT3 plays an important role in zinc homeostasis and cell death⁵¹, and inhibits cell growth under zinc-deficient conditions⁵². Interestingly, our study also identified numerous zinc-finger protein family genes to be associated with DGE and SGE in SSC6: 45.95–46.95 Mb.

The previous studies identified several QTL on SSC6 for DGE and SGE in pigs^14,15,34. The QTL (SSC6: 18.14–19.14 Mb) in the current study was close to the reported QTL (SSC6: 19.9–20.9 Mb) for both DGE and SGE on ADG in pigs³⁴. Furthermore, there were eight reported QTL associated with growth traits in the region of SSC6: 18.14–19.14 Mb. Among them, five QTL were related to ADG⁵³ and feeding intake⁵⁴. The significant SNPs associated with DGE and SGE are located in nine reported QTL for health traits⁴⁶. And, it was reported that three QTL located on SSC6: 18.14–19.14 Mb in pigs were associated with social interaction traits, and health traits^49,55.

In summary, we report the study employing combined single-locus and haplotype-based GWAS to identify the genetic architecture of socially affected traits that are influenced by both DGE and SGE in pigs. Our study shows the feasibility of mapping genomic variants that underlie SGE and provides genomic information for socially affected traits in pigs. These results provide evidence that the genomic architecture of SGE and DGE is different for socially affected traits. Hence, we recommend the use of both DGE and SGE to evaluate genetic architecture for socially affected traits in pigs.

Methods

Ethics declarations

All experimental procedures were performed in accordance with the Institutional Review Board (IRB14044) and the Institutional Animal Care and Use Committee of the Sichuan Agricultural University under permit number DKY-B20140302.

Animals and housing

During the period of 2017–2019, phenotypic data were collected for Yorkshire pigs from the national nucleus pig breeding farm of New Hope Group, Co., Ltd. (Sichuan, China) using the Osborne FIRE Pig Performance Testing System (Osborne, KS, United States). As target population, a total of 1204 Yorkshire pigs were placed in a temperature-controlled room at 25 ± 2 °C and relative humidity of 65–80% during the period of performance test from 30 to 110 kg. As reference population, a total of 60 pigs (20 Landrace and 40 Yorkshire pigs) were randomly selected from core populations. These pigs were group-housed in cement-floor pens (20 pigs in each pen) and the nutrient requirements were met as recommended by the National Research Council (NRC 2012).

Phenotypic data

A total of 1112 pigs’ phenotypic data were collected, including ADG (kg/d), D100, B100, ADFI (kg/d), RFI, FCR, TPD (min/d), and FS (g/min). In this study, each pig was labeled with a unique electric identification tag on ear that was detected by the Osborne FIRE Pig Performance Testing System. The feed time, feed consumption, and BW were recorded at each visit to the feeder for each pig. At the end of the performance test, BFT between the third and fourth last ribs of each pig was calculated by PIGLOG 105B ultrasound machine (SFK Technology, Søborg, Denmark). ADG was calculated as linear regressions of BW from 30 to 110 kg on the performance test days. ADFI was calculated based on the total amount of recorded total feed intake (TFI) divided by the number of corresponding feed days at the feeder. TPD was calculated based on the total amount of recorded total time divided by the number of corresponding feed days (min/d), and FS = ADFI/TPD (g/min)^56,57. FCR, D100, B100, and RFI were calculated as follows⁵⁸:

$${{{{\mathrm{FCR}}}}}=\frac{{{{{\mathrm{TFI}}}}}}{{{{{\rm{Weight}}}}}_{2}-{{{{\rm{Weight}}}}}_{1}}$$

(1)

$${{{\rm{D}}}}100={{{\rm{tested}}}}\,{{{\rm{days}}}}+(100-{{{{\rm{Weight}}}}}_{2})\;\times\; \frac{{{{\rm{tested}}}}\,{{{\rm{days}}}}-{{{\rm{A}}}}}{{{{{\rm{Weight}}}}}_{2}}$$

(2)

$${{{\rm{B}}}}100={{{\rm{BFT}}}}+\left(100-{{{{\rm{Weight}}}}}_{2}\right)\times \frac{{{{\rm{BFT}}}}}{{{{{\rm{Weight}}}}}_{2}-B}$$

(3)

$${{{\rm{RFI}}}}={{{\rm{ADFI}}}}-14.1{{{\rm{ADG}}}}-2.83{{{\rm{BFT}}}}-110.9{{{\rm{AMW}}}}$$

(4)

$${{{\rm{AMW}}}}=\frac{\left({{{{\rm{Weight}}}}}_{2}^{1.6}-{{{{\rm{Weight}}}}}_{1}^{1.6}\right)}{1.6\times \left({{{{\rm{Weight}}}}}_{2}-{{{{\rm{Weight}}}}}_{1}\right)}$$

(5)

where ${{{{\bf{Weight}}}}}_{{{{\bf{1}}}}}$ and ${{{{\bf{Weight}}}}}_{{{{\bf{2}}}}}$ are weights at the start and end of the performance test, respectively; ${{{\bf{tested}}}}$ ${{{\bf{days}}}}$ is the duration of the performance test in days. ${{{\bf{A}}}}$ is 50.775 for males and 46.415 for females; ${{{\bf{B}}}}$ is −7.277 for males and −9.440 for females. A and B were calculated based on an actual dataset of performance tests involving 5000 pigs⁵⁹. The true days and B100 were firstly obtained based on two data that were the closest to 100 kg using linear interpolation. Then the nonlinear models of D100 and B100 were constructed based on linear interpolation as models (2) and (3). Finally, the A and B were calculated based on models (2) and (3) using the NLIN procedure in SAS software, respectively. BFT is the tested backfat thickness at the end of a performance test. AMW is an average metabolic BW.

The deregressed EBVs of DGE and SGE

The DGE and SGE were estimated for eight socially affected traits using a social genetic effect model¹², as follows:

$${{{\boldsymbol{y}}}}={{{\boldsymbol{Xb}}}}+{{{{\boldsymbol{Z}}}}}_{{{{\boldsymbol{D}}}}}{{{{\boldsymbol{a}}}}}_{{{{\boldsymbol{D}}}}}+{{{{\boldsymbol{Z}}}}}_{{{{\boldsymbol{S}}}}}{{{{\boldsymbol{a}}}}}_{{{{\boldsymbol{S}}}}}+{{{\boldsymbol{Wl}}}}+{{{\boldsymbol{V}}}}{{{\boldsymbol{g}}}}+{{{\boldsymbol{e}}}}$$

(6)

where ${{{\boldsymbol{y}}}}$ is the vector of phenotypic values; ${{{\boldsymbol{b}}}}$ is the vector of fixed effects, including sex, test year and month, birth year and month effects; ${{{{\boldsymbol{a}}}}}_{{{{\boldsymbol{D}}}}}$ and ${{{{\boldsymbol{a}}}}}_{{{{\boldsymbol{S}}}}}$ are vectors of DGE and SGE, respectively; ${{{\boldsymbol{l}}}}$ is the vector of random litter effects; ${{{\boldsymbol{g}}}}$ is the vector of random group effects in which the pigs were penned during the performance test; $e$ is the random residual vector; ${{{\bf{X}}}}$, ${{{{\boldsymbol{Z}}}}}_{{{{\boldsymbol{D}}}}}$, ${{{{\boldsymbol{Z}}}}}_{{{{\boldsymbol{S}}}}}$, ${{{\boldsymbol{W}}}}$,and ${{{\boldsymbol{V}}}}$ are the incidence matrices of ${{{\boldsymbol{b}}}}$, ${{{{\boldsymbol{a}}}}}_{{{{\boldsymbol{D}}}}}$, ${{{{\boldsymbol{a}}}}}_{{{{\boldsymbol{S}}}}}$, ${{{\boldsymbol{l}}}}$, and ${{{\boldsymbol{g}}}}$, respectively. This model was implemented by AI-REML in DMU software⁶⁰. The estimated accuracies of DGE and SGE were calculated based on the formula $r=\sqrt{1-{s}_{e}^{2}/{\sigma }_{a}^{2}}$, ${s}_{e}^{2}$ is the error variance of DGE and SGE, ${\sigma }_{a}^{2}$ is the corresponding genetic variance. Based on the estimated accuracies of DGE and SGE, their deregressed breeding values (EBVs) were obtained based on the formula $D{{{\rm{eregressed}}}}$ ${EBVs}={g}_{i}/{r}_{i}^{2}$, ${g}_{i}$ is the EBVs of $i$th individual, ${r}_{i}^{2}$ is the square of estimated accuracies for $i$th individual⁶¹. Then, the deregressed EBVs were used to implement for association analysis.

Genomic DNA extraction

The ear tissue samples were collected and stored in 75% alcohol. Genomic DNA from 1264 ear tissues was extracted using the Tissues Genomic DNA (Omega Bio-Tek, Norcross, GA, USA) kit according to the manufacturer’s instructions, and then the quality and quantity were measured using a Nanodrop-2000 spectrophotometer. The genomic DNA with the ratio of light absorption (A260/280) between 1.8 and 2.0, concentration ≥50 ng/µL, and total volume ≤50 µL were eligible.

Genotyping by Illumina Porcine SNP50K BeadChip

A total of 1204 Yorkshire pigs, 442 boars and 762 gilts, were genotyped by the Illumina Porcine 50K SNP Chip (Neogen, Lincoln, NE, USA), which contained 50,697 SNPs. Firstly, the SNPs with no position information and located on sex chromosomes were removed from the genotype data, which contains 11,874 SNPs. Then, quality control of the genotype data was performed using PLINK software⁶². The SNPs with call rate < 0.90, MAF < 0.05, and Hardy–Weinberg equilibrium test (HWE) <10⁻⁶, were excluded from the dataset. After quality control, a total of 1854 SNPs were removed. Finally, a total of 36,969 SNPs were used as target genotype data.

Whole-genome sequencing and SNP calling

The WGS reference data were obtained for 60 pigs (20 Landrace and 40 Yorkshire pigs), with 150 bp paired-end reads on the Illumina HiSeq PE150 platform. The sequencing was performed by BGI Co., Ltd. (Wuhan, China). After sequencing, the quality of raw reads was checked with a Phred score of 20 as the minimum to filter the adapter polluted reads and multiple N reads (where N > 10% of one read) to produce clean reads by FastQC (http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc/). Then the clean reads were mapped to the pig reference genome (Sscrofa11.1) using BWA (version 0.7.15) software with the parameters mem -t 10 -k 32 -M⁶³. The SAM files produced from BWA were converted to BAM files using SAMtools (version 1.19)⁶⁴. The potential PCR duplicates were removed by MarkerDuplicates utility in Picard release 1.119 (https://sourceforge.net/projects/picard/files/picardtools/1.119/). After that, BAM files were used to call SNPs using GATK (version 3.5) software⁶⁵ with multi-sample approaches. The raw SNPs generated from GATK were filtered with QualByDepth (QD) < 2.0, FisherStrand (FS) < 60.0, RMSMappingQuality (MQ) < 40.0, MappingQualityRankSumTest < –12.5, and ReadPosRankSumTest < –8.0. After the initial filtering, a total of 21,104,245 SNPs remained. For further analyses, the SNPs with MAF >0.05, missing rate <0.1, HWE <10⁻⁶, read depth (dp) >6, and the SNPs located on autosomes were considered as reference genotype data.

Imputation from 50 K chip to WGS

Genotype imputation between target and reference genotype data were performed by Beagle (version 5.1)⁶⁶ with default parameter settings, except for setting the effective population size to 100²². The imputation accuracy of each SNP was assessed using the Beagle R², which is the estimated squared correlation between the estimated allele dosage and the true allele dosage⁶⁷. A two-breed reference population (20 Landrace and 40 Yorkshire pigs) with a small genetic distance²² was used to perform genotype imputation from 50 K SNPs to WGS. After imputation, to maintain a balance between the average accuracy and the number of SNPs, SNPs with a Beagle R² < 0.8 were excluded. Furthermore, SNPs with MAF less than 0.01 and HWE less than 10⁻⁶ were removed from imputed data. Finally, a total of 3,072,572 SNPs were retained for further analyses.

Constructing haplotype loci

Haplotype loci were identified from the imputed WGS data in 1204 pigs. Haplotype loci were detected for each chromosome as proposed by Gabriel et al.⁶⁸ using PLINK v1.90 software⁶². The parameters were set to “-blocks no-pheno-req -blocks-max-kb 1000 -blocks-strong-lowci 0.8 -geno 0.1”. A haplotype block containing two or more SNPs with high LD was defined, and loci with a confidence interval of r² higher than 0.8 were considered into one block. Then, the haplotype calling and identification of haplotype alleles were performed by GHap package⁶⁹. Furthermore, haplotype loci were transformed into multi-allelic markers, and the haplotype genotype matrix was used to perform GWAS. Using the imputed data with 3,072,572 SNPs, a total of 33,708 haplotype loci were constructed and each haplotype loci contained 34.8 alleles. Finally, 274,741 haplotype alleles with MAF >0.01 were retained.

Association analysis

GWAS were performed independently for eight socially affected traits by considering both DGE and SGE using GEMMA software⁶⁵. Before association analysis, the centered genotypes were used to estimate the n × n genomic relationship matrix between the individuals. The genomic relatedness matrix was calculated as follows:

$$G=\frac{1}{p}\mathop{\sum }\limits_{i=1}^{p}{\left({X}_{i}-{1}_{n}{\bar{x}}_{i}\right)\left({X}_{i}-{1}_{n}{\bar{x}}_{i}\right)}^{T}$$

(7)

where $G$ is the genomic relatedness matrix between the individuals; $n$ is the number of individuals; $p$ is the number of genotypes; $i$ represents the $i$th SNP; X denote n × p matrix of genotypes; ${X}_{i}$ as its $i$th column representing genotypes of $i$th SNP; ${\bar{x}}_{i}$ as the sample mean of $i$th SNP; ${1}_{n}$ as a n × 1 vector of 1’s.

The deregressed EBVs were used to perform single-locus GWAS (including chip-based GWAS and imputed-GWAS) and haplotype-based GWAS, the following unified mixed linear model was used:

$${{{\bf{y}}}}={{{\bf{Xm}}}}+{{{\bf{Wa}}}}+{{{\bf{e}}}}$$

(8)

$${{{\boldsymbol{a}}}} \sim {{{{{\mathbf{MVN}}}}}}\left({{{\bf{0}}}},{{{\boldsymbol{G}}}}{{{{\boldsymbol{\sigma }}}}}_{{{{\boldsymbol{a}}}}}^{{{{\bf{2}}}}}\right)$$

$${{{\boldsymbol{e}}}} \sim {{{{{\mathbf{MVN}}}}}}\left({{{\bf{0}}}},{{{\boldsymbol{I}}}}{{{{\boldsymbol{\sigma }}}}}_{{{{\boldsymbol{e}}}}}^{{{{\bf{2}}}}}\right)$$

where ${{{\bf{y}}}}$ is the vector of the deregressed EBVs of DGE or SGE; ${{{\bf{m}}}}$ is the SNP effects; ${{{\bf{a}}}}$ is the vector of residual polygenic effects; ${{{\bf{e}}}}$ is the vector of random residuals; ${{{\bf{X}}}}$ and ${{{\bf{W}}}}$ are the incidence matrices for ${{{\bf{m}}}}$ and ${{{\bf{a}}}}$, respectively; G is a genomic relationship matrix, ${{{\boldsymbol{I}}}}$ is an identity matrix; MVN denotes multivariate normal distribution.

The genome-wide significant threshold value was determined using the Bonferroni correction method⁷⁰. For chip-based GWAS, the genome-wide significant and suggestive levels were set as P = 0.05/N₁ = 1.35 × 10⁻⁶ and P = 1/N₁ = 2.70 × 10⁻⁵, respectively, where N₁ is the number of analyzed SNPs. For imputed-GWAS, the genome-wide significant and suggestive levels were set as P = 0.05/N₂ = 1.63 × 10⁻⁸ and P = 1/N₂ = 3.25 × 10⁻⁷, respectively, where N₂ is the number of analyzed SNPs. For haplotype-based GWAS, the genome-wide significant and suggestive levels were calculated as P = 0.05/N₃ = 1.82 × 10⁻⁷ and P = 1/N₃ = 3.64 × 10⁻⁶, respectively, where N₃ is the number of studied haplotype alleles.

The Manhattan plots were drawn using qqman package⁷¹. Genomic inflation factor (λ) was calculated to judge the extent of false-positive signals with estlambda function in GenABEL package (${{{\rm{\lambda }}}}=\frac{{{{\rm{the}}}}\,{{{\rm{observed}}}}\,P\,{{{\rm{values}}}}}{{{{\rm{the}}}}\,{{{\rm{expected}}}}\,P\,{{{\rm{values}}}}}$)⁷².

Candidate genes and functional analysis

The region within a 1 Mb region centering each significant SNP was defined as the QTL region. The candidate functional genes were searched within the identified QTL regions on the pig genome assembly 11.1 (https://asia.ensembl.org/Sus_scrofa/Info/Index). Then, the biological functions of these candidate genes were investigated on PubMed (https://www.ncbi.nlm.nih.gov/pubmed) and the reported literature. For functional annotation, GO analysis was performed on DAVID Bioinformatics Resources (https://david.ncifcrf.gov). The Fisher’s test was used to assess the significance of the determined enriched terms^73,74. Enriched GO terms (P < 0.05) were selected to investigate the genes involved in biological processes⁷⁵.

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The datasets supporting the results of this article are included within the article. All genotypic, phenotype data, and supplemental material were deposited at the figshare repository (https://doi.org/10.6084/m9.figshare.12611786). All other data are available from the corresponding authors upon reasonable request.

References

Ellen, E. D. et al. The prospects of selection for social genetic effects to improve welfare and productivity in livestock. Front. Genet. 5, 377 (2014).
Article PubMed PubMed Central Google Scholar
Bergsma, R., Kanis, E., Knol, E. F. & Bijma, P. The contribution of social effects to heritable variation in finishing traits of domestic pigs (Sus scrofa). Genetics 178, 1559–1570 (2008).
Article CAS PubMed PubMed Central Google Scholar
Moore, A. J., Brodie, E. D. 3rd & Wolf, J. B. Interacting phenotypes and the evolutionary process:I. Direct and indirect genetic effects of social interactions. Evolution 51, 1352–1362 (1997).
Article PubMed Google Scholar
Canario, L., Lundeheim, N. & Bijma, P. Pig growth is affected by social genetic effects and social litter effects that depend on group size. In Proc. 9th World Congress on Genetics Applied to Livestock Production (ed. Giessen) 1–6 (German Society of Animal Science, 2010).
Bijma, P. The quantitative genetics of indirect genetic effects: a selective review of modelling issues. Heredity 112, 61–69 (2014).
Article CAS PubMed Google Scholar
Rostellato, R., Sartori, C., Bonfatti, V., Chiarot, G. & Carnier, P. Direct and social genetic effects on body weight at 270 days and carcass and ham quality traits in heavy pigs. J. Anim. Sci. 93, 1–10 (2015).
Article CAS PubMed Google Scholar
Kim, Y. et al. Social genetic effects on days to 90 kg in Duroc and Yorkshire pigs. Korean. J. Agric. Sci. 43, 595–602 (2016).
Google Scholar
Camerlink, I., Turner, S. P., Bijma, P. & Bolhuis, J. E. Indirect genetic effects and housing conditions in relation to aggressive behaviour in pigs. PLoS ONE 8, e65136 (2013).
Article CAS PubMed PubMed Central Google Scholar
Camerlink, I., Ursinus, W. W., Bartels, A. C., Bijma, P. & Bolhuis, J. E. Indirect genetic effects for growth in pigs affect behaviour and weight around weaning. Behav. Genet. 48, 413–420 (2018).
Article PubMed PubMed Central Google Scholar
Nielsen, H. M., Ask, B. & Madsen, P. Social genetic effects for growth in pigs differ between boars and gilts. Genet. Sel. Evol. 50, 4 (2018).
Article PubMed PubMed Central Google Scholar
Bouwman, A. C., Bergsma, R., Duijvesteijn, N. & Bijma, P. Maternal and social genetic effects on average daily gain of piglets from birth until weaning. J. Anim. Sci. 88, 2883–2892 (2010).
Article CAS PubMed Google Scholar
Muir, W. M. Incorporation of competitive effects in forest tree or animal breeding programs. Genetics 170, 1247–1259 (2005).
Article PubMed PubMed Central Google Scholar
Brinker, T., Bijma, P., Vereijken, A. & Ellen, E. D. The genetic architecture of socially-affected traits: a GWAS for direct and indirect genetic effects on survival time in laying hens showing cannibalism. Genet. Sel. Evol. 50, 38 (2018).
Article PubMed PubMed Central Google Scholar
Hong, J. K. et al. A genome-wide association study of social genetic effects in Landrace pigs. Asian-Australas. J. Anim. Sci. 31, 784–790 (2018).
Article PubMed Google Scholar
Wu, P. et al. Whole-genome re-sequencing association study for direct genetic effects and social genetic effects of six growth traits in Large White pigs. Sci. Rep. 9, 9667 (2019).
Article PubMed PubMed Central CAS Google Scholar
Chen, Z., Yao, Y., Ma, P., Wang, Q. & Pan, Y. Haplotype-based genome-wide association study identifies loci and candidate genes for milk yield in Holsteins. PLoS ONE 13, e0192695 (2018).
Article PubMed PubMed Central CAS Google Scholar
Price, A. L., Zaitlen, N. A., Reich, D. & Patterson, N. New approaches to population stratification in genome-wide association studies. Nat. Rev. Genet. 11, 459–463 (2010).
Article CAS PubMed PubMed Central Google Scholar
Reimert, I. et al. Backtest and novelty behavior of female and castrated male piglets, with diverging social breeding values for growth. J. Anim. Sci. 91, 4589–4597 (2013).
Article CAS PubMed Google Scholar
Brøndum, R. F., Guldbrandtsen, B., Sahana, G., Lund, M. S. & Su, G. Strategies for imputation to whole genome sequence using a single or multi-breed reference population in cattle. BMC Genomics 15, 1–8 (2014).
Article CAS Google Scholar
Bouwman, A. C. & Veerkamp, R. F. Consequences of splitting whole-genome sequencing effort over multiple breeds on imputation accuracy. BMC Genet. 15, 1–9 (2014).
Article Google Scholar
van den Berg, S. et al. Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies. Genet. Sel. Evol. 51, 2 (2019).
Article PubMed PubMed Central Google Scholar
Grossi, D. A. et al. Genetic diversity, extent of linkage disequilibrium and persistence of gametic phase in Canadian pigs. BMC Genet. 18, 6 (2017).
Article PubMed PubMed Central Google Scholar
Hall, S. J. Effective population sizes in cattle, sheep, horses, pigs and goats estimated from census and herdbook data. Animal 10, 1778–1785 (2016).
Article CAS PubMed Google Scholar
Ye, S. et al. Imputation from SNP chip to sequence: a case study in a Chinese indigenous chicken population. J. Anim. Sci. Biotechnol. 9, 30 (2018).
Article PubMed PubMed Central Google Scholar
van den Berg, S. et al. Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies. Genet. Sel. Evol. 51, 1–13 (2019).
Google Scholar
Das, S., Abecasis, G. R. & Browning, B. L. Genotype imputation from large reference panels. Annu. Rev. Genomics Hum. Genet. 19, 73–96 (2018).
Article CAS PubMed Google Scholar
Song, H. et al. Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs. Genet. Sel. Evol. 51, 58 (2019).
Article PubMed PubMed Central CAS Google Scholar
Hayes, B. J., Bowman, P. J., Daetwyler, H. D., Kijas, J. W. & van der Werf, J. H. Accuracy of genotype imputation in sheep breeds. Anim. Genet. 43, 72–80 (2012).
Article CAS PubMed Google Scholar
Morris, A. P. & Zeggini, E. An evaluation of statistical approaches to rare variant analysis in genetic association studies. Genet. Epidemiol. 34, 188–193 (2010).
Article PubMed Google Scholar
Lorenz, A. J., Hamblin, M. T. & Jannink, J. L. Performance of single nucleotide polymorphisms versus haplotypes for genome-wide association analysis in barley. PLoS ONE 5, e14079 (2010).
Article PubMed PubMed Central CAS Google Scholar
Brøndum, R. F., Ma, P., Lund, M. S. & Su, G. Short communication: genotype imputation within and across Nordic cattle breeds. J. Dairy Sci. 95, 6795–6800 (2012).
Article PubMed CAS Google Scholar
Contreras-Soto, R. I. et al. A genome-wide association study for agronomic traits in soybean using SNP markers and SNP-based haplotype analysis. PLoS ONE 12, e0171105 (2017).
Article PubMed PubMed Central CAS Google Scholar
Hu, Z. L., Park, C. A., Wu, X. L. & Reecy, J. M. Animal QTLdb: an improved database tool for livestock animal QTL/association data dissemination in the post-genome era. Nucleic Acids Res. 41, D871–D879 (2013).
Article CAS PubMed Google Scholar
Hong, J. K. et al. Single-step genome-wide association study for social genetic effects and direct genetic effects on growth in Landrace pigs. Sci. Rep. 10, 14958 (2020).
Article PubMed PubMed Central Google Scholar
Bijma, P. Estimating indirect genetic effects: precision of estimates and optimum designs. Genetics 186, 1013–1028 (2010).
Article PubMed PubMed Central Google Scholar
Cheng, S. L. et al. Human osteoblasts express a repertoire of cadherins, which are critical for BMP-2-induced osteogenic differentiation. J. Bone Min. Res. 13, 633–644 (1998).
Article CAS Google Scholar
Larue, L. et al. A role for cadherins in tissue formation. Development 122, 3185–3194 (1996).
Article CAS PubMed Google Scholar
Gumbiner, B. M. Regulation of cadherin-mediated adhesion in morphogenesis. Nat. Rev. Mol. Cell Biol. 6, 622–634 (2005).
Article CAS PubMed Google Scholar
Farber, C. R. et al. Identification of quantitative trait loci influencing skeletal architecture in mice: emergence of Cdh11 as a primary candidate gene regulating femoral morphology. J. Bone Min. Res. 26, 2174–2183 (2011).
Article CAS Google Scholar
McClure, M. C. et al. A genome scan for quantitative trait loci influencing carcass, post-natal growth and reproductive traits in commercial Angus cattle. Anim. Genet. 41, 597–607 (2010).
Article CAS PubMed Google Scholar
Sherman, E. L. et al. Fine mapping quantitative trait loci for feed intake and feed efficiency in beef cattle. J. Anim. Sci. 87, 37–45 (2009).
Article CAS PubMed Google Scholar
Piórkowska, K., Żukowski, K., Ropka-Molik, K., Tyra, M. & Gurgul, A. A comprehensive transcriptome analysis of skeletal muscles in two Polish pig breeds differing in fat and meat quality traits. Genet. Mol. Biol. 41, 125–136 (2018).
Article PubMed PubMed Central CAS Google Scholar
Schulte, J. D. et al. Cadherin-11 regulates motility in normal cortical neural precursors and glioblastoma. PLoS ONE 8, e70962 (2013).
Article CAS PubMed PubMed Central Google Scholar
Harmegnies, N. et al. Results of a whole-genome quantitative trait locus scan for growth, carcass composition and meat quality in a porcine four-way cross. Anim. Genet. 37, 543–553 (2006).
Article CAS PubMed Google Scholar
Rothammer, S. et al. Genome-wide QTL mapping of nine body composition and bone mineral density traits in pigs. Genet. Sel. Evol. 46, 68 (2014).
Article PubMed PubMed Central Google Scholar
Reiner, G. et al. Identification of QTL affecting resistance/susceptibility to acute Actinobacillus pleuropneumoniae infection in swine. Mamm. Genome 25, 180–191 (2014).
Article CAS PubMed Google Scholar
Bidanel, J. P. et al. Detection of quantitative trait loci for growth and fatness in pigs. Genet. Sel. Evol. 33, 289–309 (2001).
Article CAS PubMed PubMed Central Google Scholar
Rückert, C. & Bennewitz, J. Joint QTL analysis of three connected F2-crosses in pigs. Genet. Sel. Evol. 42, 40 (2010).
Article PubMed PubMed Central Google Scholar
Reiner, G. et al. Mapping of quantitative trait loci affecting behaviour in swine. Anim. Genet. 40, 366–376 (2009).
Article CAS PubMed Google Scholar
Wang, J. Y. et al. Genome-wide association studies for hematological traits in swine. Anim. Genet. 44, 34–43 (2013).
Article PubMed CAS Google Scholar
Vašák, M. & Meloni, G. Mammalian metallothionein-3: new functional and structural insights. Int. J. Mol. Sci. 18, 1117 (2017).
Palmiter, R. D. Constitutive expression of metallothionein-III (MT-III), but not MT-I, inhibits growth when cells become zinc deficient. Toxicol. Appl. Pharm. 135, 139–146 (1995).
Article CAS Google Scholar
Liu, G. et al. A genome scan reveals QTL for growth, fatness, leanness and meat quality in a Duroc-Pietrain resource population. Anim. Genet. 38, 241–252 (2007).
Article CAS PubMed Google Scholar
Houston, R. D., Haley, C. S., Archibald, A. L. & Rance, K. A. A QTL affecting daily feed intake maps to chromosome 2 in pigs. Mamm. Genome 16, 464–470 (2005).
Article CAS PubMed Google Scholar
Laenoi, W. et al. Quantitative trait loci analysis for leg weakness-related traits in a Duroc × Pietrain crossbred population. Genet. Sel. Evol. 43, 13 (2011).
Article PubMed PubMed Central Google Scholar
Ding, R. et al. Genome-wide association analysis reveals genetic loci and candidate genes for feeding behavior and eating efficiency in Duroc boars. PLoS ONE 12, e0183244 (2017).
Article PubMed PubMed Central CAS Google Scholar
Do, D. N., Strathe, A. B., Jensen, J., Mark, T. & Kadarmideen, H. N. Genetic parameters for different measures of feed efficiency and related traits in boars of three pig breeds. J. Anim. Sci. 91, 4069–4079 (2013).
Article CAS PubMed Google Scholar
Liu, X. The ministry of agriculture has formulated the “national pig genetic improvement program (2009-2020)”. Agri. Tech. Equip. 19, 10–13 (2009).
Zhang, Z., Zhang, H., Chen, Z. M. & Jia-Qi, L. I. Development of correction formula for production traits in swine breeding. Chinese. J. Anim. Sci. 51, 49–54 (2015).
Google Scholar
Madsen, O. & Jensen, J. A user’s guide to DMU. A package for analysing multivariate mixed models. in Version 6, release 5.1 (2012).
Garrick, D. J., Taylor, J. F. & Fernando, R. L. Deregressing estimated breeding values and weighting information for genomic regression analyses. Genet .Sel. Evol. 41, 55 (2009).
Article PubMed PubMed Central Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central CAS Google Scholar
DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).
Article CAS PubMed PubMed Central Google Scholar
Browning, B. L., Zhou, Y. & Browning, S. R. A one-penny imputed genome from next-generation reference panels. Am. J. Hum. Genet. 103, 338–348 (2018).
Article CAS PubMed PubMed Central Google Scholar
Browning, B. L. & Browning, S. R. A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. Am. J. Hum. Genet. 84, 210–223 (2009).
Article CAS PubMed PubMed Central Google Scholar
Gabriel, S. B. et al. The structure of haplotype blocks in the human genome. Science 296, 2225–2229 (2002).
Article CAS PubMed Google Scholar
Utsunomiya, Y. T., Milanesi, M., Utsunomiya, A. T., Ajmone-Marsan, P. & Garcia, J. F. GHap: an R package for genome-wide haplotyping. Bioinformatics 32, 2861–2862 (2016).
Article CAS PubMed Google Scholar
Bland, J. M. & Altman, D. G. Multiple significance tests: the Bonferroni method. BMJ 310, 170 (1995).
Article CAS PubMed PubMed Central Google Scholar
Turner, S. D. qqman: an R package for visualizing GWAS results using Q-Q and manhattan plots. Preprint at bioRxiv https://doi.org/10.1101/005165 (2014).
Aulchenko, Y. S., Ripke, S., Isaacs, A. & van Duijn, C. M. GenABEL: an R library for genome-wide association analysis. Bioinformatics 23, 1294–1296 (2007).
Article CAS PubMed Google Scholar
Dennis, G. Jr. et al. DAVID: database for annotation, visualization, and integrated discovery. Genome Biol. 4, P3 (2003).
Article PubMed Google Scholar
Rivals, I., Personnaz, L., Taing, L. & Potier, M. C. Enrichment or depletion of a GO category within a class of genes: which test? Bioinformatics 23, 401–407 (2007).
Article CAS PubMed Google Scholar
Wang, K. et al. Genome wide association analysis reveals new production trait genes in a male Duroc population. PLoS ONE 10, e0139207 (2015).
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

This study was supported by grants from the Sichuan Science and Technology Program (2020YFN0024 and 2021YFYZ0030), the China Agriculture Research System of MOF and MARA, the National Key R&D Program of China (2018YFD0501204), the National Natural Science Foundation of China (C170102), and the Sichuan Innovation Team of Pig (sccxtd-2021-08).

Author information

These authors contributed equally: Pingxian Wu, Kai Wang.

Authors and Affiliations

Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan, China
Pingxian Wu, Kai Wang, Jie Zhou, Dejuan Chen, Anan Jiang, Li Zhu, Xuewei Li & Guoqing Tang
College of Life Science, Sichuan Agricultural University, Yaan, Sichuan, China
Yanzhi Jiang
National Animal Husbandry Service, Beijing, Beijing, China
Xiaotian Qiu

Authors

Pingxian Wu
View author publications
You can also search for this author in PubMed Google Scholar
Kai Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jie Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Dejuan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Anan Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Yanzhi Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Li Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaotian Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Xuewei Li
View author publications
You can also search for this author in PubMed Google Scholar
Guoqing Tang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.W., G.T., and K.W. performed experiments; P.W. and K.W. analyzed data and prepared figures and tables; G.T. and P.W. edited and revised the manuscript; G.T., P.W., J.Z., D.C., A.J., Y.J., L.Z., X.Q., and X.L. conceived, designed research, and wrote this paper. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Guoqing Tang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Communications Biology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editors: Brooke LaFlamme and George Inglis.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Data 1-2

COMMSBIO-21-0181B-nr-editorial-policy-checklist

Supplementary Information

Description of Additional Supplementary Files

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wu, P., Wang, K., Zhou, J. et al. A combined GWAS approach reveals key loci for socially-affected traits in Yorkshire pigs. Commun Biol 4, 891 (2021). https://doi.org/10.1038/s42003-021-02416-3

Download citation

Received: 20 January 2021
Accepted: 29 June 2021
Published: 20 July 2021
DOI: https://doi.org/10.1038/s42003-021-02416-3
Springer Nature Limited

Associated content

DNA Day 2023

Collection 25 April 2023

A combined GWAS approach reveals key loci for socially-affected traits in Yorkshire pigs

Abstract

Similar content being viewed by others

Introduction

Results

Phenotype statistics

Evaluation of sequencing data and genotype imputation

Single-locus GWAS for eight socially affected traits

For average daily feed intake (ADFI)

For ADG

For the other six traits

Haplotype-based GWAS for eight socially affected traits

For ADFI

For ADG

For D100 and B100

For FCR and RFI

For TPD and FS

Comparing single-locus and haplotype-based GWAS

Discussion

Methods

Ethics declarations

Animals and housing

Phenotypic data

The deregressed EBVs of DGE and SGE

Genomic DNA extraction

Genotyping by Illumina Porcine SNP50K BeadChip

Whole-genome sequencing and SNP calling

Imputation from 50 K chip to WGS

Constructing haplotype loci

Association analysis

Candidate genes and functional analysis

Reporting Summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation