Skip to main content
Log in

A powerful fine-mapping method for transcriptome-wide association studies

  • Original Investigation
  • Published:
Human Genetics Aims and scope Submit manuscript

Abstract

Transcriptome-wide association studies (TWAS) have been recently applied to successfully identify many novel genes associated with complex traits. While appealing, TWAS tend to identify multiple significant genes per locus, and many of them may not be causal due to confounding through linkage disequilibrium (LD) among SNPs. Here we introduce a powerful fine-mapping method that prioritizes putative causal genes by accounting for local LD. We apply a weighted adaptive test with eQTL-derived weights to maintain high power across various scenarios. Through simulations, we show that our new approach yielded a well-controlled Type I error rate while achieving higher power and AUC than competing methods. We applied our approach to a schizophrenia GWAS summary dataset and successfully prioritized some well-known schizophrenia-related genes, such as C4A. Importantly, our approach identified some putative causal genes (e.g., B3GAT1 and RGS6) that were missed by competing methods and TWAS. Our results suggest that our approach is a useful tool to prioritize putative causal genes, gaining insights into the mechanisms of complex traits.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Availability of data and materials

We make our software FOGS publicly available on a GitHub repository: https://github.com/ChongWu-Biostat/FOGS. TWAS software and eQTL derived weights can be downloaded at http://gusevlab.org/projects/fusion/. The software FOCUS can be obtained at https://github.com/bogdanlab/focus/. The Schizophrenia GWAS summary data (Pardiñas et al. 2018) can be downloaded at http://walters.psycm.cf.ac.uk, and the Lung Health Study data can be obtained at dbGAP (phs000335.v2.p2). The data used in this paper can be downloaded at https://figshare.com/articles/FOGS_data/7636691.

References

  • Barbeira AN, Dickinson SP, Bonazzola R, Zheng J, Wheeler HE, Torres JM, Torstenson ES, Shah KP, Garcia T, Edwards TL et al (2018) Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun 9:1825

    PubMed  PubMed Central  Google Scholar 

  • Barbeira AN, Pividori MD, Zheng J, Wheeler HE, Nicolae DL, Im HK (2019) Integrating predicted transcriptome from multiple tissues improves association detection. PLoS Genet 15(1):e1007889

    PubMed  PubMed Central  Google Scholar 

  • Barfield R, Feng H, Gusev A, Wu L, Zheng W, Pasaniuc B, Kraft P (2018) Transcriptome-wide association studies accounting for colocalization using egger regression. Genet Epidemiol 42(5):418–433

    PubMed  PubMed Central  Google Scholar 

  • Benner C, Havulinna AS, Järvelin M-R, Salomaa V, Ripatti S, Pirinen M (2017) Prospects of fine-mapping trait-associated genomic regions by using summary statistics from genome-wide association studies. Am J Hum Genet 101(4):539–551

    CAS  PubMed  PubMed Central  Google Scholar 

  • Berisa T, Pickrell JK (2016) Approximately independent linkage disequilibrium blocks in human populations. Bioinformatics 32(2):283–285

    CAS  PubMed  Google Scholar 

  • Burgess S, Thompson SG (2015) Multivariable mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects. Am J Epidemiol 181(4):251–260

    PubMed  PubMed Central  Google Scholar 

  • Callicott JH, Egan MF, Mattay VS, Bertolino A, Bone AD, Verchinksi B, Weinberger DR (2003) Abnormal fmri response of the dorsolateral prefrontal cortex in cognitively intact siblings of patients with schizophrenia. Am J Psychiatry 160(4):709–719

    PubMed  Google Scholar 

  • Collier DA, Eastwood BJ, Malki K, Mokrab Y (2016) Advances in the genetics of schizophrenia: toward a network and pathway view for drug discovery. Ann NY Acad Sci 1366(1):61–75

    PubMed  Google Scholar 

  • GTEx Consortium (2017) Genetic effects on gene expression across human tissues. Nature 550(7675):204–213

    PubMed Central  Google Scholar 

  • DeLong ER, DeLong DM, Clarke-Pearson DL (1988) Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 44(3):837–845

    CAS  PubMed  Google Scholar 

  • Dodd LE, Pepe MS (2003) Partial auc estimation and regression. Biometrics 59(3):614–623

    PubMed  Google Scholar 

  • Fromer M, Roussos P, Sieberts SK, Johnson JS, Kavanagh DH, Perumal TM, Ruderfer DM, Oh EC, Topol A, Shah HR et al (2016) Gene expression elucidates functional impact of polygenic risk for schizophrenia. Nat Neurosci 19(11):1442–1453

    CAS  PubMed  PubMed Central  Google Scholar 

  • Gamazon ER, Wheeler HE, Shah KP, Mozaffari SV, Aquino-Michaels K, Carroll RJ, Eyler AE, Denny JC, Nicolae DL, Cox NJ et al (2015) A gene-based association method for mapping traits using reference transcriptome data. Nat Genet 47(9):1091–1098

    CAS  PubMed  PubMed Central  Google Scholar 

  • Genomes Project Consortium (2015) A global reference for human genetic variation. Nature 526(7571):68–74

    Google Scholar 

  • Goes FS, McGrath J, Avramopoulos D, Wolyniec P, Pirooznia M, Ruczinski I, Nestadt G, Kenny EE, Vacic V, Peters I et al (2015) Genome-wide association study of schizophrenia in Ashkenazi Jews. Am J Med Genet Part B Neuropsychiatr Genet 168(8):649–659

    CAS  Google Scholar 

  • Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BW, Jansen R, De Geus EJ, Boomsma DI, Wright FA et al (2016) Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet 48(3):245–252

    CAS  PubMed  PubMed Central  Google Scholar 

  • Han B, Eskin E (2011) Random-effects model aimed at discovering associations in meta-analysis of genome-wide association studies. Am J Hum Genet 88(5):586–598

    CAS  PubMed  PubMed Central  Google Scholar 

  • He Y, Xu G, Wu C, Pan W (2018) Asymptotically independent u-statistics in high-dimensional testing. arXiv preprint arXiv:1809.00411

  • Hu Y, Li M, Lu Q, Weng H, Wang J, Zekavat SM, Yu Z, Li B, Gu J, Muchnik S et al (2019) A statistical framework for cross-tissue transcriptome-wide association analysis. Nat Genet 51(3):568–576

    CAS  PubMed  PubMed Central  Google Scholar 

  • Kichaev G, Pasaniuc B (2015) Leveraging functional-annotation data in trans-ethnic fine-mapping studies. Am J Hum Genet 97(2):260–271

    CAS  PubMed  PubMed Central  Google Scholar 

  • Kwak I-Y, Pan W (2015) Adaptive gene-and pathway-trait association testing with GWAS summary statistics. Bioinformatics 32(8):1178–1184

    PubMed  PubMed Central  Google Scholar 

  • Li Z, Chen J, Yu H, He L, Xu Y, Zhang D, Yi Q, Li C, Li X, Shen J et al (2017) Genome-wide association analysis identifies 30 new susceptibility loci for schizophrenia. Nat Genet 49(11):1576–1583

    CAS  PubMed  Google Scholar 

  • MacArthur J, Bowler E, Cerezo M, Gil L, Hall P, Hastings E, Junkins H, McMahon A, Milano A, Morales J et al (2016) The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Res 45(D1):D896–D901

    PubMed  PubMed Central  Google Scholar 

  • Mancuso N, Freund MK, Johnson R, Shi H, Kichaev G, Gusev A, Pasaniuc B (2019) Probabilistic fine-mapping of transcriptome-wide association studies. Nat Genet 51(4):675–682

    CAS  PubMed  PubMed Central  Google Scholar 

  • McGrath J, Saha S, Chant D, Welham J (2008) Schizophrenia: a concise overview of incidence, prevalence, and mortality. Epidemiol Rev 30(1):67–76

    PubMed  Google Scholar 

  • Pan W, Kim J, Zhang Y, Shen X, Wei P (2014) A powerful and adaptive association test for rare variants. Genetics 197(4):1081–1095

    PubMed  PubMed Central  Google Scholar 

  • Pardiñas AF, Holmans P, Pocklington AJ, Escott-Price V, Ripke S, Carrera N, Legge SE, Bishop S, Cameron D, Hamshere ML et al (2018) Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection. Nat Genet 50(3):381–389

    PubMed  PubMed Central  Google Scholar 

  • Pasaniuc B, Zaitlen N, Shi H, Bhatia G, Gusev A, Pickrell J, Hirschhorn J, Strachan DP, Patterson N, Price AL (2014) Fast and accurate imputation of summary statistics enhances evidence of functional enrichment. Bioinformatics 30(20):2906–2914

    CAS  PubMed  PubMed Central  Google Scholar 

  • Pirinen M, Donnelly P, Spencer CC et al (2013) Efficient computation with a linear mixed model on large-scale data sets with applications to genetic studies. Ann Appl Stat 7(1):369–390

    Google Scholar 

  • Raitakari OT, Juonala M, Rönnemaa T, Keltikangas-Järvinen L, Räsänen L, Pietikäinen M, Hutri-Kähönen N, Taittonen L, Jokinen E, Marniemi J et al (2008) Cohort profile: the cardiovascular risk in young finns study. Int J Epidemiol 37(6):1220–1226

    PubMed  Google Scholar 

  • Ripke S, Neale BM, Corvin A, Walters JT, Farh K-H, Holmans PA, Lee P, Bulik-Sullivan B, Collier DA, Huang H et al (2014) Biological insights from 108 schizophrenia-associated genetic loci. Nature 511(7510):421–427

    CAS  PubMed Central  Google Scholar 

  • Schaid DJ, Chen W, Larson NB (2018) From genome-wide associations to candidate causal variants by statistical fine-mapping. Nat Rev Genet 19:491–504

    CAS  PubMed  PubMed Central  Google Scholar 

  • Sekar A, Bialas AR, de Rivera H, Davis A, Hammond TR, Kamitaki N, Tooley K, Presumey J, Baum M, Van Doren V et al (2016) Schizophrenia risk from complex variation of complement component 4. Nature 530(7589):177–183

    CAS  PubMed  PubMed Central  Google Scholar 

  • Shen X, Pan W, Zhu Y (2012) Likelihood-based selection and sharp parameter estimation. J Am Stat Assoc 107(497):223–232

    PubMed  PubMed Central  Google Scholar 

  • Spain SL, Barrett JC (2015) Strategies for fine-mapping complex traits. Hum Mol Genet 24(R1):R111–R119

    CAS  PubMed  PubMed Central  Google Scholar 

  • Stančáková A, Civelek M, Saleem NK, Soininen P, Kangas AJ, Cederberg H, Paananen J, Pihlajamäki J, Bonnycastle LL, Morken MA et al (2012) Hyperglycemia and a common variant of gckr are associated with the levels of eight amino acids in 9,369 finnish men. Diabetes 51(1):1895–1902

    Google Scholar 

  • The Autism Spectrum Disorders Working Group (2017) Meta-analysis of GWAS of over 16,000 individuals with autism spectrum disorder highlights a novel locus at 10q24. 32 and a significant overlap with schizophrenia. Mol Autism 8(21):1–17

    Google Scholar 

  • Vilhjálmsson BJ, Yang J, Finucane HK, Gusev A, Lindström S, Ripke S, Genovese G, Loh P-R, Bhatia G, Do R et al (2015) Modeling linkage disequilibrium increases accuracy of polygenic risk scores. Am J Hum Genet 97(4):576–592

    PubMed  PubMed Central  Google Scholar 

  • Wainberg M, Sinnott-Armstrong N, Mancuso N, Barbeira AN, Knowles DA, Golan D, Ermel R, Ruusalepp A, Quertermous T, Hao K et al (2019) Opportunities and challenges for transcriptome-wide association studies. Nat Genet 51(4):592–599

    CAS  PubMed  PubMed Central  Google Scholar 

  • Wright FA, Sullivan PF, Brooks AI, Zou F, Sun W, Xia K, Madar V, Jansen R, Chung W, Zhou Y-H et al (2014) Heritability and genomics of gene expression in peripheral blood. Nat Genet 46(5):430–437

    CAS  PubMed  PubMed Central  Google Scholar 

  • Wu C, Pan W (2018) Integrating eQTL data with GWAS summary statistics in pathway-based analysis with application to schizophrenia. Genet Epidemiol 42(3):303–316

    PubMed  PubMed Central  Google Scholar 

  • Wu C, Xu G, Pan W (2019) An adaptive test on high-dimensional parameters in generalized linear models. Stat Sinica ​29:2163–2186

    Google Scholar 

  • Xu B, Roos JL, Levy S, Van Rensburg E, Gogos JA, Karayiorgou M (2008) Strong association of de novo copy number mutations with sporadic schizophrenia. Nat Genet 40(7):880–885

    CAS  PubMed  Google Scholar 

  • Xu Z, Wu C, Wei P, Pan W (2017) A powerful framework for integrating eQTL and GWAS summary data. Genetics 207:893–902

    CAS  PubMed  PubMed Central  Google Scholar 

  • Xue H, Pan W, Initiative ADN, et al. (2019) Some statistical consideration in transcriptome-wide association studies. bioRxiv: 812677

  • Yang C, Wan X, Lin X, Chen M, Zhou X, Liu J (2018) Comm: a collaborative mixed model to dissecting genetic contributions to complex traits by leveraging regulatory information. Bioinformatics 35(10):1644–1652

    Google Scholar 

  • Yang J, Ferreira T, Morris AP, Medland SE, Madden PA, Heath AC, Martin NG, Montgomery GW, Weedon MN, Loos RJ et al (2012) Conditional and joint multiple-snp analysis of gwas summary statistics identifies additional variants influencing complex traits. Nat Genet 44(4):369–375

    CAS  PubMed  PubMed Central  Google Scholar 

  • Zhang C-H (2010) Nearly unbiased variable selection under minimax concave penalty. Ann Stat 38(2):894–942

    Google Scholar 

  • Zhao Q, Wang J, Hemani G, Bowden J, Small DS (2018) Statistical inference in two-sample summary-data mendelian randomization using robust adjusted profile score. arXiv preprint arXiv:1801.09652

  • Zhou Y, Liang M, Jiang T, Tian L, Liu Y, Liu Z, Liu H, Kuang F (2007) Functional dysconnectivity of the dorsolateral prefrontal cortex in first-episode schizophrenia using resting-state fmri. Neurosci Lett 417(3):297–302

    CAS  PubMed  Google Scholar 

  • Zhu X, Stephens M (2017) Bayesian large-scale multiple regression with summary statistics from genome-wide association studies. Ann Appl Stat 11(3):1561

    PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank reviewers for helpful comments. This research was supported by the Minnesota Supercomputing Institute. We appreciate the availability of the dbGaP data.

Funding

This research was supported by NIH Grants R21AG057038, R01HL116720, R01GM113250 and R01HL105397. CW was supported by a First Year Assistant Professor Grant at Florida State University.

Author information

Authors and Affiliations

Authors

Contributions

WP conceived the study. CW and WP developed the methods. CW performed the analysis and drafted the manuscript. WP supervised the study. All authors approved/edited the final manuscript.

Corresponding authors

Correspondence to Chong Wu or Wei Pan.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (XLSX 1696 kb)

Supplementary material 2 (PDF 40 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wu, C., Pan, W. A powerful fine-mapping method for transcriptome-wide association studies. Hum Genet 139, 199–213 (2020). https://doi.org/10.1007/s00439-019-02098-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00439-019-02098-2

Navigation