Skip to main content
Log in

An imputed whole-genome sequence-based GWAS approach pinpoints causal mutations for complex traits in a specific swine population

  • Research Paper
  • Published:
Science China Life Sciences Aims and scope Submit manuscript

Abstract

Sequencing-based genome-wide association studies (GWAS) have facilitated the identification of causal associations between genetic variants and traits in diverse species. However, it is cost-prohibitive for the majority of research groups to sequence a large number of samples. Here, we carried out genotype imputation to increase the density of single nucleotide polymorphisms in a large-scale Swine F2 population using a reference panel including 117 individuals, followed by a series of GWAS analyses. The imputation accuracies reached 0.89 and 0.86 for allelic concordance and correlation, respectively. A quantitative trait nucleotide (QTN) affecting the chest vertebrate was detected directly, while the investigation of another QTN affecting the residual glucose failed due to the presence of similar haplotypes carrying wild-type and mutant allelesin the reference panel used in this study. A high imputation accuracy was confirmed by Sanger sequencing technology for the most significant loci. Two candidate genes, CPNE5 and MYH3, affecting meat-related traits were proposed. Collectively, we illustrated four scenarios in imputation-based GWAS that may be encountered by researchers, and our results will provide an extensive reference for future genotype imputation-based GWAS analyses in the future.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ai, H., Ren, J., Zhang, Z., Ma, J., Guo, Y., Yang, B., and Huang, L. (2012). Detection of quantitative trait loci for growth- and fatness-related traits in a large-scale White Duroc×Erhualian intercross pig population. Anim Genet 43, 383–391.

    Article  CAS  PubMed  Google Scholar 

  • Brøndum, R.F., Guldbrandtsen, B., Sahana, G., Lund, M.S., and Su, G. (2014). Strategies for imputation to whole genome sequence using a single or multi-breed reference population in cattle. BMC Genomics 15, 728.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  • Calus, M.P.L., Bouwman, A.C., Hickey, J.M., Veerkamp, R.F., and Mulder, H.A. (2014). Evaluation of measures of correctness of genotype imputation in the context of genomic prediction: A review of livestock applications. Animal 8, 1743–1753.

    Article  CAS  PubMed  Google Scholar 

  • Chang, C.C., Chow, C.C., Tellier, L.C., Vattikuti, S., Purcell, S.M., and Lee, J.J. (2015). Second-generation plink: Rising to the challenge of larger and richer datasets. Gigascience 4, 7.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  • Cho, I.C., Park, H.B., Ahn, J.S., Han, S.H., Lee, J.B., Lim, H.T., Yoo, C.K., Jung, E.J., Kim, D.H., Sun, W.S., et al. (2019). A functional regulatory variant of MYH3 influences muscle fiber-type composition and intramuscular fat content in pigs. PLoS Genet 15, e1008279.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Clark, D.L., Boler, D.D., Kutzler, L.W., Jones, K.A., McKeith, F.K., Killefer, J., Carr, T.R., and Dilger, A.C. (2011). Muscle gene expression associated with increased marbling in beef cattle. Anim Biotech 22, 51–63.

    Article  CAS  Google Scholar 

  • Creutz, C.E., Tomsig, J.L., Snyder, S.L., Gautier, M.C., Skouri, F., Beisson, J., and Cohen, J. (1998). The copines, a novel class of C2 domain-containing, calciumdependent, phospholipid-binding proteins conserved from paramecium to humans. J Biol Chem 273, 1393–1402.

    Article  CAS  PubMed  Google Scholar 

  • Daetwyler, H.D., Capitan, A., Pausch, H., Stothard, P., van Binsbergen, R., Brøndum, R.F., Liao, X., Djari, A., Rodriguez, S.C., Grohs, C., et al. (2014). Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle. Nat Genet 46, 858–865.

    Article  CAS  PubMed  Google Scholar 

  • Das, S., Forer, L., Schönherr, S., Sidore, C., Locke, A.E., Kwong, A., Vrieze, S.I., Chew, E.Y., Levy, S., McGue, M., et al. (2016). Next-generation genotype imputation service and methods. Nat Genet 48, 1284–1287.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Das, S., Abecasis, G.R., and Browning, B.L. (2018). Genotype imputation from large reference panels. Annu Rev Genom Hum Genet 19, 73–96.

    Article  CAS  Google Scholar 

  • Delaneau, O., Howie, B., Cox, A.J., Zagury, J.F., and Marchini, J. (2013). Haplotype estimation using sequencing reads. Am J Hum Genet 93, 687–696.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Ding, X., Jin, Y., Wu, Y., Wu, Y., Wu, H., Xiong, L., Song, X., Liu, S., Fan, W., and Fan, M. (2008). Localization and cellular distribution of CPNE5 in embryonic mouse brain. Brain Res 1224, 20–28.

    Article  CAS  PubMed  Google Scholar 

  • Druet, T., and Georges, M. (2010). A hidden markov model combining linkage and linkage disequilibrium information for haplotype reconstruction and quantitative trait locus fine mapping. Genetics 184, 789–798.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Druet, T., and Georges, M. (2015). Linkphase3: An improved pedigree-based phasing algorithm robust to genotyping and map errors. Bioinformatics 31, 1677–1679.

    Article  CAS  PubMed  Google Scholar 

  • Du, X., Huang, G., He, S., Yang, Z., Sun, G., Ma, X., Li, N., Zhang, X., Sun, J., Liu, M., et al. (2018). Resequencing of 243 diploid cotton accessions based on an updated a genome identifies the genetic basis of key agronomic traits. Nat Genet 50, 796–802.

    Article  CAS  PubMed  Google Scholar 

  • Duan, Y.Y., Ma, J.W., Yuan, F., Huang, L.B., Yang, K.X., Xie, J.P., Wu, G. Z., and Huang, L.S. (2009). Genome-wide identification of quantitative trait loci for pork temperature, pH decline, and glycolytic potential in a large-scale White DurocøChinese Erhualian resource population. J Anim Sci 87, 9–16.

    Article  CAS  PubMed  Google Scholar 

  • Duan, Y., Zhang, H., Zhang, Z., Gao, J., Yang, J., Wu, Z., Fan, Y., Xing, Y., Li, L., Xiao, S., et al. (2018). VRTN is required for the development of thoracic vertebrae in mammals. Int J Biol Sci 14, 667–681.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Fan, Y., Xing, Y., Zhang, Z., Ai, H., Ouyang, Z., Ouyang, J., Yang, M., Li, P., Chen, Y., Gao, J., et al. (2013). A further look at porcine chromosome 7 reveals VRTN variants associated with vertebral number in Chinese and Western pigs. PLoS ONE 8, e62534.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Genomes Project, C. (2015). A global reference for human genetic variation. Nature 526, 68–74.

    Article  CAS  Google Scholar 

  • Guo, Y., Mao, H., Ren, J., Yan, X., Duan, Y., Yang, G., Ren, D., Zhang, Z., Yang, B., Ouyang, J., et al. (2009). A linkage map of the porcine genome from a large-scale White DurocøErhualian resource population and evaluation of factors affecting recombination rates. Anim Genet 40, 47–52.

    Article  PubMed  CAS  Google Scholar 

  • Howie, B.N., Donnelly, P., and Marchini, J. (2009). A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet 5, e1000529.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  • Howie, B., Fuchsberger, C., Stephens, M., Marchini, J., and Abecasis, G.R. (2012). Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet 44, 955–959.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Huang, L., Li, Y., Singleton, A.B., Hardy, J.A., Abecasis, G., Rosenberg, N. A., and Scheet, P. (2009). Genotype-imputation accuracy across worldwide human populations. Am J Hum Genet 84, 235–250.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • International HapMap, C. (2007). A second generation human haplotype map of over 3.1 million SNPs. Nature 449, 851–861.

    Article  CAS  Google Scholar 

  • International HapMap, C. (2010). Integrating common and rare genetic variation in diverse human populations. Nature 467, 52–58.

    Article  CAS  Google Scholar 

  • Johnson, R.C., Nelson, G.W., Troyer, J.L., Lautenberger, J.A., Kessing, B. D., Winkler, C.A., and O’Brien, S.J. (2010). Accounting for multiple comparisons in a genome-wide association study (GWAS). BMC Genomics 11, 724.

    Article  PubMed  PubMed Central  Google Scholar 

  • Lalioti, V.S., Vergarajauregui, S., Villasante, A., Pulido, D., and Sandoval, I.V. (2013). C6orf89 encodes three distinct HDAC enhancers that function in the nucleolus, the golgi and the midbody. J Cell Physiol 228, 1907–1921.

    Article  CAS  PubMed  Google Scholar 

  • Langmead, B., and Nellore, A. (2018). Cloud computing for genomic data analysis and collaboration. Nat Rev Genet 19, 208–219.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Li, Y., Willer, C., Sanna, S., and Abecasis, G. (2009). Genotype imputation. Annu Rev Genom Hum Genet 10, 387–406.

    Article  CAS  Google Scholar 

  • Liu, H.J., Tan, Y.R., Li, M.L., Liu, C., Xiang, Y., and Qin, X.Q. (2011). Cloning of a novel protein interacting with BRS-3 and its effects in wound repair of bronchial epithelial cells. PLoS ONE 6, e23072.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Liu, X., Xiong, X., Yang, J., Zhou, L., Yang, B., Ai, H., Ma, H., Xie, X., Huang, Y., Fang, S., et al. (2015). Genome-wide association analyses for meat quality traits in Chinese Erhualian pigs and a Western Duroc×(Landrace×Yorkshire) commercial population. Genet Sel Evol 47, 44.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  • Luo, W., Cheng, D., Chen, S., Wang, L., Li, Y., Ma, X., Song, X., Liu, X., Li, W., Liang, J., et al. (2012). Genome-wide association analysis of meat quality traits in a porcine large White×Minzhu intercross population. Int J Biol Sci 8, 580–595.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Ma, J., Ren, J., Guo, Y., Duan, Y., Ding, N., Zhou, L., Li, L., Yan, X., Yang, K., Huang, L., et al. (2009). Genome-wide identification of quantitative trait loci for carcass composition and meat quality in a large-scale White Duroc×Chinese Erhualian resource population. Anim Genet 40, 637–647.

    Article  CAS  PubMed  Google Scholar 

  • Ma, J., Yang, J., Zhou, L., Zhang, Z., Ma, H., Xie, X., Zhang, F., Xiong, X., Cui, L., Yang, H., et al. (2013). Genome-wide association study of meat quality traits in a White Duroc×Erhualian F2 intercross and Chinese Sutai pigs. PLoS ONE 8, e64047.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Ma, J., Yang, J., Zhou, L., Ren, J., Liu, X., Zhang, H., Yang, B., Zhang, Z., Ma, H., Xie, X., et al. (2014). A splice mutation in the PHKG1 gene causes high glycogen content and low meat quality in pig skeletal muscle. PLoS Genet 10, e1004710.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  • Marchini, J., and Howie, B. (2010). Genotype imputation for genome-wide association studies. Nat Rev Genet 11, 499–511.

    Article  CAS  PubMed  Google Scholar 

  • McCarthy, S., Das, S., Kretzschmar, W., Delaneau, O., Wood, A.R., Teumer, A., Kang, H.M., Fuchsberger, C., Danecek, P., Sharp, K., et al. (2016). A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet 48, 1279–1283.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Orho-Melander, M., Melander, O., Guiducci, C., Perez-Martinez, P., Corella, D., Roos, C., Tewhey, R., Rieder, M.J., Hall, J., Abecasis, G., et al. (2008). Common missense variant in the glucokinase regulatory protein gene is associated with increased plasma triglyceride and C-reactive protein but lower fasting glucose concentrations. Diabetes 57, 3112–3121.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Park, N., Yoo, J.C., Ryu, J., Hong, S.G., Hwang, E.M., and Park, J.Y. (2012). Copine1 enhances neuronal differentiation of the hippocampal progenitor HIB5 cells. Mol Cells 34, 549–554.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Pe’er, I., Yelensky, R., Altshuler, D., and Daly, M.J. (2008). Estimation of the multiple testing burden for genomewide association studies of nearly all common variants. Genet Epidemiol 32, 381–385.

    Article  PubMed  Google Scholar 

  • Pimentel, E.C.G., Edel, C., Emmerling, R., and Götz, K.U. (2015). How imputation errors bias genomic predictions. J Dairy Sci 98, 4131–4138.

    Article  CAS  PubMed  Google Scholar 

  • R Core Team. (2018). R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing.

    Google Scholar 

  • Ramos, A.M., Crooijmans, R.P.M.A., Affara, N.A., Amaral, A.J., Archibald, A.L., Beever, J.E., Bendixen, C., Churcher, C., Clark, R., Dehais, P., et al. (2009). Design of a high density SNP genotyping assay in the pig using SNPs identified and characterized by next generation sequencing technology. PLoS ONE 4, e6524.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  • Ren, D.R., Ren, J., Ruan, G.F., Guo, Y.M., Wu, L.H., Yang, G.C., Zhou, L. H., Li, L., Zhang, Z.Y., and Huang, L.S. (2012). Mapping and fine mapping of quantitative trait loci for the number of vertebrae in a White Duroc×Chinese Erhualian intercross resource population. Anim Genet 43, 545–551.

    Article  CAS  PubMed  Google Scholar 

  • Ros-Freixedes, R., Whalen, A., Chen, C.Y., Gorjanc, G., Herring, W.O., Mileham, A.J., and Hickey, J.M. (2020). Accuracy of whole-genome sequence imputation using hybrid peeling in large pedigreed livestock populations. Genet Sel Evol 52, 17.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Sanchez, M.P., Govignon-Gion, A., Croiseau, P., Fritz, S., Hozé, C., Miranda, G., Martin, P., Barbat-Leterrier, A., Letaïef, R., Rocha, D., et al. (2017). Within-breed and multi-breed gwas on imputed whole-genome sequence variants reveal candidate mutations affecting milk protein composition in dairy cattle. Genet Sel Evol 49, 68.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  • Scott, L.J., Mohlke, K.L., Bonnycastle, L.L., Willer, C.J., Li, Y., Duren, W. L., Erdos, M.R., Stringham, H.M., Chines, P.S., Jackson, A.U., et al. (2007). A genome-wide association study of type 2 diabetes in finns detects multiple susceptibility variants. Science 316, 1341–1345.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • van Binsbergen, R., Bink, M.C., Calus, M.P., van Eeuwijk, F.A., Hayes, B. J., Hulsegge, I., and Veerkamp, R.F. (2014). Accuracy of imputation to whole-genome sequence data in holstein friesian cattle. Genet Sel Evol 46, 41.

    Article  PubMed  PubMed Central  Google Scholar 

  • Visscher, P.M., Wray, N.R., Zhang, Q., Sklar, P., McCarthy, M.I., Brown, M.A., and Yang, J. (2017). 10 years of gwas discovery: Biology, function, and translation. Am J Hum Genet 101, 5–22.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Wang, K.S., Zuo, L., Pan, Y., Xie, C., and Luo, X. (2015). Genetic variants in the CPNE5 gene are associated with alcohol dependence and obesity in caucasian populations. J Psychiatr Res 71, 1–7.

    Article  PubMed  Google Scholar 

  • Whalen, A., Gorjanc, G., and Hickey, J.M. (2019). Family-specific genotype arrays increase the accuracy of pedigree-based imputation at very low marker densities. Genet Sel Evol 51, 33.

    Article  PubMed  PubMed Central  Google Scholar 

  • Xin, W.S., Zhang, F., Yan, G.R., Xu, W.W., Xiao, S.J., Zhang, Z.Y., and Huang, L.S. (2018). A whole genome sequence association study for puberty in a large Duroc×Erhualian F2 population. Anim Genet 49, 29–35.

    Article  CAS  PubMed  Google Scholar 

  • Xiong, X., Liu, X., Zhou, L., Yang, J., Yang, B., Ma, H., Xie, X., Huang, Y., Fang, S., Xiao, S., et al. (2015). Genome-wide association analysis reveals genetic loci and candidate genes for meat quality traits in Chinese Laiwu pigs. Mamm Genome 26, 181–190.

    Article  PubMed  Google Scholar 

  • Xu, C., Zhang, J., Huang, X., Sun, J., Xu, Y., Tang, Y., Wu, J., Shi, Y., Huang, Q., and Zhang, Q. (2006). Solution structure of human peptidyl prolyl isomerase-like protein 1 and insights into its interaction with SKIP. J Biol Chem 281, 15900–15908.

    Article  CAS  PubMed  Google Scholar 

  • Yan, G., Guo, T., Xiao, S., Zhang, F., Xin, W., Huang, T., Xu, W., Li, Y., Zhang, Z., and Huang, L. (2018). Imputation-based whole-genome sequence association study reveals constant and novel loci for hematological traits in a large-scale swine F2 resource population. Front Genet 9, 401.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Yan, G., Qiao, R., Zhang, F., Xin, W., Xiao, S., Huang, T., Zhang, Z., and Huang, L. (2017). Imputation-based whole-genome sequence association study rediscovered the missing QTL for lumbar number in Sutai pigs. Sci Rep 7, 615.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  • Yan, X.M., Ren, J., Huang, X., Zhang, Z.Y., Ouyang, J., Zeng, W.H., Zou, Z.Z., Yang, S.J., Yang, B., and Huang, L.S. (2009). Comparison of production traits between pigs with and without the Escherichia coli F4 receptors in a White Duroc×Erhualian intercross F2 population. J Anim Sci 87, 334–339.

    Article  CAS  PubMed  Google Scholar 

  • Yang, B., Zhang, W., Zhang, Z., Fan, Y., Xie, X., Ai, H., Ma, J., Xiao, S., Huang, L., and Ren, J. (2013). Genome-wide association analyses for fatty acid composition in porcine muscle and abdominal fat tissues. PLoS ONE 8, e65554.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Yang, G., Ren, J., Li, S., Mao, H., Guo, Y., Zou, Z., Ren, D., Ma, J., and Huang, L. (2008). Genome-wide identification of QTL for age at puberty in gilts using a large intercross F2 population between White Duroc×Erhualian. Genet Sel Evol 40, 529.

    PubMed  PubMed Central  Google Scholar 

  • Yano, K., Yamamoto, E., Aya, K., Takeuchi, H., Lo, P.C., Hu, L., Yamasaki, M., Yoshida, S., Kitano, H., Hirano, K., et al. (2016). Genome-wide association study using whole-genome sequencing rapidly identifies new genes influencing agronomic traits in rice. Nat Genet 48, 927–934.

    Article  CAS  PubMed  Google Scholar 

  • Yoo, J.C., Lim, T., Park, J.S., Hah, Y.S., Park, N., Hong, S.G., Park, J.Y., and Yoon, T.J. (2013). SYT14L, especially its C2 domain, is involved in regulating melanocyte differentiation. J Dermatol Sci 72, 246–251.

    Article  CAS  PubMed  Google Scholar 

  • Zhang, Z.Y., Ren, J., Ren, D.R., Ma, J.W., Guo, Y.M., and Huang, L.S. (2009). Mapping quantitative trait loci for feed consumption and feeding behaviors in a White Duroc×Chinese Erhualian resource population1. J Anim Sci 87, 3458–3463.

    Article  CAS  PubMed  Google Scholar 

  • Zhang, Z., Hong, Y., Gao, J., Xiao, S., Ma, J., Zhang, W., Ren, J., and Huang, L. (2013). Genome-wide association study reveals constant and specific loci for hematological traits at three time stages in a White Duroc×Erhualian F2 resource population. PLoS ONE 8, e63665.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Zhou, W., Fritsche, L.G., Das, S., Zhang, H., Nielsen, J.B., Holmen, O.L., Chen, J., Lin, M., Elvestad, M.B., Hveem, K., et al. (2017). Improving power of association tests using multiple sets of imputed genotypes from distributed reference panels. Genet Epidemiol 41, 744–755.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (31640046 and 31760656) and the National Key Research and Development Program of China (2020YFA0509500).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Junwu Ma or Zhiyan Zhang.

Additional information

Compliance and ethics

The author(s) declare that they have no conflict of interest.

Electronic Supplementary Material

11427_2020_1960_MOESM1_ESM.docx

An Imputed Whole-genome Sequence-based GWAS Approach Pinpoints Causal Mutations for Complex Traits in a Specific Swine Populations

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yan, G., Liu, X., Xiao, S. et al. An imputed whole-genome sequence-based GWAS approach pinpoints causal mutations for complex traits in a specific swine population. Sci. China Life Sci. 65, 781–794 (2022). https://doi.org/10.1007/s11427-020-1960-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11427-020-1960-9

Keywords

Navigation