A cautionary note on the impact of protocol changes for genome-wide association SNP × SNP interaction studies: an example on ankylosing spondylitis
- 385 Downloads
Genome-wide association interaction (GWAI) studies have increased in popularity. Yet to date, no standard protocol exists. In practice, any GWAI workflow involves making choices about quality control strategy, SNP filtering, linkage disequilibrium (LD) pruning, analytic tool to model or to test for genetic interactions. Each of these can have an impact on the final epistasis findings and may affect their reproducibility in follow-up analyses. Choosing an analytic tool is not straightforward, as different tools exist and current understanding about their performance is based on often very particular simulation settings. In the present study, we wish to create awareness for the impact of (minor) changes in a GWAI analysis protocol can have on final epistasis findings. In particular, we investigate the influence of marker selection and marker prioritization strategies, LD pruning and the choice of epistasis detection analytics on study results, giving rise to 8 GWAI protocols. Discussions are made in the context of the ankylosing spondylitis (AS) data obtained via the Wellcome Trust Case Control Consortium (WTCCC2). As expected, the largest impact on AS epistasis findings is caused by the choice of marker selection criterion, followed by marker coding and LD pruning. In MB-MDR, co-dominant coding of main effects is more robust to the effects of LD pruning than additive coding. We were able to reproduce previously reported epistasis involvement of HLA-B and ERAP1 in AS pathology. In addition, our results suggest involvement of MAGI3 and PARK2, responsible for cell adhesion and cellular trafficking. Gene ontology biological function enrichment analysis across the 8 considered GWAI protocols also suggested that AS could be associated to the central nervous system malfunctions, specifically, in nerve impulse propagation and in neurotransmitters metabolic processes.
KeywordsLinkage Disequilibrium Ankylose Spondylitis Major Histocompatibility Complex Class Encode Scheme GWAI Analysis
The research was funded by the Fonds de la Recherche Scientifique (FNRS) (incl. FNRS F.R.F.C. project convention no. 2.4609.11). We thank François Van Lishout and Elena Gusareva from the Systems and Modeling Unit, Montefiore Institute, University of Liege, Belgium for their support and advice. This study makes use of data generated by the Wellcome Trust Case Control Consortium. A full list of the investigators who contributed to the generation of the data is available from www.wtccc.org.uk. Funding for the project was provided by the Wellcome Trust under award no. 076113.
Conflict of interest
The authors declare that they have no competing interests.
- Bush WS, Dudek SM, Ritchie MD (2009) Biofilter: a knowledge-integration system for the multi-locus analysis of genome-wide association studies. Pac Symp Biocomput 2009:368–379Google Scholar
- Cattaert T, Calle ML, Dudek SM, Mahachie John JM, Van Lishout F, Urrea V, Ritchie MD, Van Steen K (2011) Model-based multifactor dimensionality reduction for detecting epistasis in case-control data in the presence of noise. Ann Hum Genet 75:78–89. doi: 10.1111/j.1469-1809.2010.00604.x PubMedCentralPubMedCrossRefGoogle Scholar
- Claushuis D, Cortes A, Bradbury LA, Martin TM, Rosenbaum JT, Reveille JD, Wordsworth P, Pointon J, Evans D, Leo P, Mukhopadhyay P, Brown MA (2012) A genome wide association study of anterior uveiti. In: Annual Scientific Meeting of the American-College-of-Rheumatology (ACR) and Association-of-Rheumatology-Health-Professionals (ARHP). Wiley, Washington, DC, pp S259–S259Google Scholar
- Colin Freeman JM (2012) GTOOL. Oxford University. http://www.well.ox.ac.uk/~cfreeman/software/gwas/gtool.html. Accessed Mar 2014
- Evans DM, Spencer CC, Pointon JJ, Su Z, Harvey D, Kochan G, Oppermann U, Dilthey A, Pirinen M, Stone MA, Appleton L, Moutsianas L, Leslie S, Wordsworth T, Kenna TJ, Karaderi T, Thomas GP, Ward MM, Weisman MH, Farrar C, Bradbury LA, Danoy P, Inman RD, Maksymowych W, Gladman D, Rahman P, Spondyloarthritis Research Consortium of C, Morgan A, Marzo-Ortega H, Bowness P, Gaffney K, Gaston JS, Smith M, Bruges-Armas J, Couto AR, Sorrentino R, Paladini F, Ferreira MA, Xu H, Liu Y, Jiang L, Lopez-Larrea C, Diaz-Pena R, Lopez-Vazquez A, Zayats T, Band G, Bellenguez C, Blackburn H, Blackwell JM, Bramon E, Bumpstead SJ, Casas JP, Corvin A, Craddock N, Deloukas P, Dronov S, Duncanson A, Edkins S, Freeman C, Gillman M, Gray E, Gwilliam R, Hammond N, Hunt SE, Jankowski J, Jayakumar A, Langford C, Liddle J, Markus HS, Mathew CG, McCann OT, McCarthy MI, Palmer CN, Peltonen L, Plomin R, Potter SC, Rautanen A, Ravindrarajah R, Ricketts M, Samani N, Sawcer SJ, Strange A, Trembath RC, Viswanathan AC, Waller M, Weston P, Whittaker P, Widaa S, Wood NW, McVean G, Reveille JD, Wordsworth BP, Brown MA, Donnelly P, Australo-Anglo-American Spondyloarthritis C, Wellcome Trust Case Control C (2011) Interaction between ERAP1 and HLA-B27 in ankylosing spondylitis implicates peptide handling in the mechanism for HLA-B27 in disease susceptibility. Nat Genet. 43: 761–7. doi: 10.1038/ng.873
- Gao W, Sweeney C, Walsh C, Rooney P, McCormick J, Veale DJ, Fearon U (2013) Notch signalling pathways mediate synovial angiogenesis in response to vascular endothelial growth factor and angiopoietin 2. Ann Rheum Dis 72:1080–1088. doi: 10.1136/annrheumdis-2012-201978 PubMedCentralPubMedCrossRefGoogle Scholar
- Grange L (2014) Epistasis in genetic susceptibility to infectious diseases: comparison and development of methods application to severe dengue in Asia. Dissertation, University Paris DiderotGoogle Scholar
- Greene CS, Penrod NM, Williams SM, Moore JH (2009) Failure to replicate a genetic association may provide important clues about genetic architecture. PLoS One 4:e5639. doi: 10.1371/journal.pone.0005639
- Jenisch S, Henseler T, Nair RP, Guo SW, Westphal E, Stuart P, Kronke M, Voorhees JJ, Christophers E, Elder JT (1998) Linkage analysis of human leukocyte antigen (HLA) markers in familial psoriasis: strong disequilibrium effects provide evidence for a major determinant in the HLA-B/-C region. Am J Hum Genet 63:191–199. doi: 10.1086/301899 PubMedCentralPubMedCrossRefGoogle Scholar
- Lopez-Arbesu R, Ballina-Garcia FJ, Alperi-Lopez M, Lopez-Soto A, Rodriguez-Rodero S, Martinez-Borra J, Lopez-Vazquez A, Fernandez-Morera JL, Riestra-Noriega JL, Queiro-Silva R, Quinones-Lombrana A, Lopez-Larrea C, Gonzalez S (2007) MHC class I chain-related gene B (MICB) is associated with rheumatoid arthritis susceptibility. Rheumatology (Oxford) 46:426–430. doi: 10.1093/rheumatology/kel331 CrossRefGoogle Scholar
- Mahachie J (2012) Thesis: Genomic association screening methodology for high-dimensional and complex data structures, University of LiegeGoogle Scholar
- Nischwitz S, Cepok S, Kroner A, Wolf C, Knop M, Muller-Sarnowski F, Pfister H, Roeske D, Rieckmann P, Hemmer B, Ising M, Uhr M, Bettecken T, Holsboer F, Muller-Myhsok B, Weber F (2010) Evidence for VAV2 and ZNF433 as susceptibility genes for multiple sclerosis. J Neuroimmunol 227:162–166. doi: 10.1016/j.jneuroim.2010.06.003 PubMedCrossRefGoogle Scholar
- RCoreTeam (2013) R: a language and environment for statistical computing. ViennaGoogle Scholar
- SNP & Variation Suite (Version 8.x) [Software]. Golden Helix, Inc., Bozeman, MT. Available from http://www.goldenhelix.com
- Sun X, Lu Q, Mukheerjee S, Crane PK, Elston R, Ritchie MD (2014) Analysis pipeline for the epistasis search–statistical versus biological filtering. Front Genet 5:106. doi: 10.3389/fgene.2014.00106
- Westfall PH, Young SS (1993) Resampling-based multiple testing: examples and methods for p-value adjustment. Wiley-Interscience, CanadaGoogle Scholar
- Zhang X, Zou F, Wang W (2008) Fastanova: an efficient algorithm for genome-wide association study. In: Proceedings of the 14th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 821–829Google Scholar