Abstract
Transcriptome-wide association studies (TWAS) have been recently applied to successfully identify many novel genes associated with complex traits. While appealing, TWAS tend to identify multiple significant genes per locus, and many of them may not be causal due to confounding through linkage disequilibrium (LD) among SNPs. Here we introduce a powerful fine-mapping method that prioritizes putative causal genes by accounting for local LD. We apply a weighted adaptive test with eQTL-derived weights to maintain high power across various scenarios. Through simulations, we show that our new approach yielded a well-controlled Type I error rate while achieving higher power and AUC than competing methods. We applied our approach to a schizophrenia GWAS summary dataset and successfully prioritized some well-known schizophrenia-related genes, such as C4A. Importantly, our approach identified some putative causal genes (e.g., B3GAT1 and RGS6) that were missed by competing methods and TWAS. Our results suggest that our approach is a useful tool to prioritize putative causal genes, gaining insights into the mechanisms of complex traits.
Similar content being viewed by others
Availability of data and materials
We make our software FOGS publicly available on a GitHub repository: https://github.com/ChongWu-Biostat/FOGS. TWAS software and eQTL derived weights can be downloaded at http://gusevlab.org/projects/fusion/. The software FOCUS can be obtained at https://github.com/bogdanlab/focus/. The Schizophrenia GWAS summary data (Pardiñas et al. 2018) can be downloaded at http://walters.psycm.cf.ac.uk, and the Lung Health Study data can be obtained at dbGAP (phs000335.v2.p2). The data used in this paper can be downloaded at https://figshare.com/articles/FOGS_data/7636691.
References
Barbeira AN, Dickinson SP, Bonazzola R, Zheng J, Wheeler HE, Torres JM, Torstenson ES, Shah KP, Garcia T, Edwards TL et al (2018) Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun 9:1825
Barbeira AN, Pividori MD, Zheng J, Wheeler HE, Nicolae DL, Im HK (2019) Integrating predicted transcriptome from multiple tissues improves association detection. PLoS Genet 15(1):e1007889
Barfield R, Feng H, Gusev A, Wu L, Zheng W, Pasaniuc B, Kraft P (2018) Transcriptome-wide association studies accounting for colocalization using egger regression. Genet Epidemiol 42(5):418–433
Benner C, Havulinna AS, Järvelin M-R, Salomaa V, Ripatti S, Pirinen M (2017) Prospects of fine-mapping trait-associated genomic regions by using summary statistics from genome-wide association studies. Am J Hum Genet 101(4):539–551
Berisa T, Pickrell JK (2016) Approximately independent linkage disequilibrium blocks in human populations. Bioinformatics 32(2):283–285
Burgess S, Thompson SG (2015) Multivariable mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects. Am J Epidemiol 181(4):251–260
Callicott JH, Egan MF, Mattay VS, Bertolino A, Bone AD, Verchinksi B, Weinberger DR (2003) Abnormal fmri response of the dorsolateral prefrontal cortex in cognitively intact siblings of patients with schizophrenia. Am J Psychiatry 160(4):709–719
Collier DA, Eastwood BJ, Malki K, Mokrab Y (2016) Advances in the genetics of schizophrenia: toward a network and pathway view for drug discovery. Ann NY Acad Sci 1366(1):61–75
GTEx Consortium (2017) Genetic effects on gene expression across human tissues. Nature 550(7675):204–213
DeLong ER, DeLong DM, Clarke-Pearson DL (1988) Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 44(3):837–845
Dodd LE, Pepe MS (2003) Partial auc estimation and regression. Biometrics 59(3):614–623
Fromer M, Roussos P, Sieberts SK, Johnson JS, Kavanagh DH, Perumal TM, Ruderfer DM, Oh EC, Topol A, Shah HR et al (2016) Gene expression elucidates functional impact of polygenic risk for schizophrenia. Nat Neurosci 19(11):1442–1453
Gamazon ER, Wheeler HE, Shah KP, Mozaffari SV, Aquino-Michaels K, Carroll RJ, Eyler AE, Denny JC, Nicolae DL, Cox NJ et al (2015) A gene-based association method for mapping traits using reference transcriptome data. Nat Genet 47(9):1091–1098
Genomes Project Consortium (2015) A global reference for human genetic variation. Nature 526(7571):68–74
Goes FS, McGrath J, Avramopoulos D, Wolyniec P, Pirooznia M, Ruczinski I, Nestadt G, Kenny EE, Vacic V, Peters I et al (2015) Genome-wide association study of schizophrenia in Ashkenazi Jews. Am J Med Genet Part B Neuropsychiatr Genet 168(8):649–659
Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BW, Jansen R, De Geus EJ, Boomsma DI, Wright FA et al (2016) Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet 48(3):245–252
Han B, Eskin E (2011) Random-effects model aimed at discovering associations in meta-analysis of genome-wide association studies. Am J Hum Genet 88(5):586–598
He Y, Xu G, Wu C, Pan W (2018) Asymptotically independent u-statistics in high-dimensional testing. arXiv preprint arXiv:1809.00411
Hu Y, Li M, Lu Q, Weng H, Wang J, Zekavat SM, Yu Z, Li B, Gu J, Muchnik S et al (2019) A statistical framework for cross-tissue transcriptome-wide association analysis. Nat Genet 51(3):568–576
Kichaev G, Pasaniuc B (2015) Leveraging functional-annotation data in trans-ethnic fine-mapping studies. Am J Hum Genet 97(2):260–271
Kwak I-Y, Pan W (2015) Adaptive gene-and pathway-trait association testing with GWAS summary statistics. Bioinformatics 32(8):1178–1184
Li Z, Chen J, Yu H, He L, Xu Y, Zhang D, Yi Q, Li C, Li X, Shen J et al (2017) Genome-wide association analysis identifies 30 new susceptibility loci for schizophrenia. Nat Genet 49(11):1576–1583
MacArthur J, Bowler E, Cerezo M, Gil L, Hall P, Hastings E, Junkins H, McMahon A, Milano A, Morales J et al (2016) The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Res 45(D1):D896–D901
Mancuso N, Freund MK, Johnson R, Shi H, Kichaev G, Gusev A, Pasaniuc B (2019) Probabilistic fine-mapping of transcriptome-wide association studies. Nat Genet 51(4):675–682
McGrath J, Saha S, Chant D, Welham J (2008) Schizophrenia: a concise overview of incidence, prevalence, and mortality. Epidemiol Rev 30(1):67–76
Pan W, Kim J, Zhang Y, Shen X, Wei P (2014) A powerful and adaptive association test for rare variants. Genetics 197(4):1081–1095
Pardiñas AF, Holmans P, Pocklington AJ, Escott-Price V, Ripke S, Carrera N, Legge SE, Bishop S, Cameron D, Hamshere ML et al (2018) Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection. Nat Genet 50(3):381–389
Pasaniuc B, Zaitlen N, Shi H, Bhatia G, Gusev A, Pickrell J, Hirschhorn J, Strachan DP, Patterson N, Price AL (2014) Fast and accurate imputation of summary statistics enhances evidence of functional enrichment. Bioinformatics 30(20):2906–2914
Pirinen M, Donnelly P, Spencer CC et al (2013) Efficient computation with a linear mixed model on large-scale data sets with applications to genetic studies. Ann Appl Stat 7(1):369–390
Raitakari OT, Juonala M, Rönnemaa T, Keltikangas-Järvinen L, Räsänen L, Pietikäinen M, Hutri-Kähönen N, Taittonen L, Jokinen E, Marniemi J et al (2008) Cohort profile: the cardiovascular risk in young finns study. Int J Epidemiol 37(6):1220–1226
Ripke S, Neale BM, Corvin A, Walters JT, Farh K-H, Holmans PA, Lee P, Bulik-Sullivan B, Collier DA, Huang H et al (2014) Biological insights from 108 schizophrenia-associated genetic loci. Nature 511(7510):421–427
Schaid DJ, Chen W, Larson NB (2018) From genome-wide associations to candidate causal variants by statistical fine-mapping. Nat Rev Genet 19:491–504
Sekar A, Bialas AR, de Rivera H, Davis A, Hammond TR, Kamitaki N, Tooley K, Presumey J, Baum M, Van Doren V et al (2016) Schizophrenia risk from complex variation of complement component 4. Nature 530(7589):177–183
Shen X, Pan W, Zhu Y (2012) Likelihood-based selection and sharp parameter estimation. J Am Stat Assoc 107(497):223–232
Spain SL, Barrett JC (2015) Strategies for fine-mapping complex traits. Hum Mol Genet 24(R1):R111–R119
Stančáková A, Civelek M, Saleem NK, Soininen P, Kangas AJ, Cederberg H, Paananen J, Pihlajamäki J, Bonnycastle LL, Morken MA et al (2012) Hyperglycemia and a common variant of gckr are associated with the levels of eight amino acids in 9,369 finnish men. Diabetes 51(1):1895–1902
The Autism Spectrum Disorders Working Group (2017) Meta-analysis of GWAS of over 16,000 individuals with autism spectrum disorder highlights a novel locus at 10q24. 32 and a significant overlap with schizophrenia. Mol Autism 8(21):1–17
Vilhjálmsson BJ, Yang J, Finucane HK, Gusev A, Lindström S, Ripke S, Genovese G, Loh P-R, Bhatia G, Do R et al (2015) Modeling linkage disequilibrium increases accuracy of polygenic risk scores. Am J Hum Genet 97(4):576–592
Wainberg M, Sinnott-Armstrong N, Mancuso N, Barbeira AN, Knowles DA, Golan D, Ermel R, Ruusalepp A, Quertermous T, Hao K et al (2019) Opportunities and challenges for transcriptome-wide association studies. Nat Genet 51(4):592–599
Wright FA, Sullivan PF, Brooks AI, Zou F, Sun W, Xia K, Madar V, Jansen R, Chung W, Zhou Y-H et al (2014) Heritability and genomics of gene expression in peripheral blood. Nat Genet 46(5):430–437
Wu C, Pan W (2018) Integrating eQTL data with GWAS summary statistics in pathway-based analysis with application to schizophrenia. Genet Epidemiol 42(3):303–316
Wu C, Xu G, Pan W (2019) An adaptive test on high-dimensional parameters in generalized linear models. Stat Sinica 29:2163–2186
Xu B, Roos JL, Levy S, Van Rensburg E, Gogos JA, Karayiorgou M (2008) Strong association of de novo copy number mutations with sporadic schizophrenia. Nat Genet 40(7):880–885
Xu Z, Wu C, Wei P, Pan W (2017) A powerful framework for integrating eQTL and GWAS summary data. Genetics 207:893–902
Xue H, Pan W, Initiative ADN, et al. (2019) Some statistical consideration in transcriptome-wide association studies. bioRxiv: 812677
Yang C, Wan X, Lin X, Chen M, Zhou X, Liu J (2018) Comm: a collaborative mixed model to dissecting genetic contributions to complex traits by leveraging regulatory information. Bioinformatics 35(10):1644–1652
Yang J, Ferreira T, Morris AP, Medland SE, Madden PA, Heath AC, Martin NG, Montgomery GW, Weedon MN, Loos RJ et al (2012) Conditional and joint multiple-snp analysis of gwas summary statistics identifies additional variants influencing complex traits. Nat Genet 44(4):369–375
Zhang C-H (2010) Nearly unbiased variable selection under minimax concave penalty. Ann Stat 38(2):894–942
Zhao Q, Wang J, Hemani G, Bowden J, Small DS (2018) Statistical inference in two-sample summary-data mendelian randomization using robust adjusted profile score. arXiv preprint arXiv:1801.09652
Zhou Y, Liang M, Jiang T, Tian L, Liu Y, Liu Z, Liu H, Kuang F (2007) Functional dysconnectivity of the dorsolateral prefrontal cortex in first-episode schizophrenia using resting-state fmri. Neurosci Lett 417(3):297–302
Zhu X, Stephens M (2017) Bayesian large-scale multiple regression with summary statistics from genome-wide association studies. Ann Appl Stat 11(3):1561
Acknowledgements
We thank reviewers for helpful comments. This research was supported by the Minnesota Supercomputing Institute. We appreciate the availability of the dbGaP data.
Funding
This research was supported by NIH Grants R21AG057038, R01HL116720, R01GM113250 and R01HL105397. CW was supported by a First Year Assistant Professor Grant at Florida State University.
Author information
Authors and Affiliations
Contributions
WP conceived the study. CW and WP developed the methods. CW performed the analysis and drafted the manuscript. WP supervised the study. All authors approved/edited the final manuscript.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Wu, C., Pan, W. A powerful fine-mapping method for transcriptome-wide association studies. Hum Genet 139, 199–213 (2020). https://doi.org/10.1007/s00439-019-02098-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00439-019-02098-2