Discovery of shared genomic loci using the conditional false discovery rate approach

Smeland, Olav B.; Frei, Oleksandr; Shadrin, Alexey; O’Connell, Kevin; Fan, Chun-Chieh; Bahrami, Shahram; Holland, Dominic; Djurovic, Srdjan; Thompson, Wesley K.; Dale, Anders M.; Andreassen, Ole A.

doi:10.1007/s00439-019-02060-2

Discovery of shared genomic loci using the conditional false discovery rate approach

Review
Published: 13 September 2019

Volume 139, pages 85–94, (2020)
Cite this article

Human Genetics Aims and scope Submit manuscript

3888 Accesses
81 Citations
5 Altmetric
Explore all metrics

Abstract

In recent years, genome-wide association study (GWAS) sample sizes have become larger, the statistical power has improved and thousands of trait-associated variants have been uncovered, offering new insights into the genetic etiology of complex human traits and disorders. However, a large fraction of the polygenic architecture underlying most complex phenotypes still remains undetected. We here review the conditional false discovery rate (condFDR) method, a model-free strategy for analysis of GWAS summary data, which has improved yield of existing GWAS and provided novel findings of genetic overlap between a wide range of complex human phenotypes, including psychiatric, cardiovascular, and neurological disorders, as well as psychological and cognitive traits. The condFDR method was inspired by Empirical Bayes approaches and leverages auxiliary genetic information to improve statistical power for discovery of single-nucleotide polymorphisms (SNPs). The cross-trait condFDR strategy analyses separate GWAS data, and leverages overlapping SNP associations, i.e., cross-trait enrichment, to increase discovery of trait-associated SNPs. The extension of the condFDR approach to conjunctional FDR (conjFDR) identifies shared genomic loci between two phenotypes. The conjFDR approach allows for detection of shared genomic associations irrespective of the genetic correlation between the phenotypes, often revealing a mixture of antagonistic and agonistic directional effects among the shared loci. This review provides a methodological comparison between condFDR and other relevant cross-trait analytical tools and demonstrates how condFDR analysis may provide novel insights into the genetic relationship between complex phenotypes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Detecting local genetic correlations with scan statistics

Article Open access 01 April 2021

Cross-Phenotype Association Analysis Using Summary Statistics from GWAS

Investigation of multi-trait associations using pathway-based analysis of GWAS summary statistics

Article Open access 04 February 2019

References

Andreassen OA et al (2013a) Improved detection of common variants associated with schizophrenia by leveraging pleiotropy with cardiovascular-disease risk factors. Am J Hum Genet 92:197–209. https://doi.org/10.1016/j.ajhg.2013.01.001
Article CAS PubMed PubMed Central Google Scholar
Andreassen OA, Thompson WK, Dale AM (2013b) Boosting the power of schizophrenia genetics by leveraging new statistical tools. Schizophr Bull. https://doi.org/10.1093/schbul/sbt168
Article PubMed PubMed Central Google Scholar
Andreassen OA et al (2013c) Improved detection of common variants associated with schizophrenia and bipolar disorder using pleiotropy-informed conditional false discovery rate. PLoS Genet 9:e1003455. https://doi.org/10.1371/journal.pgen.1003455
Article CAS PubMed PubMed Central Google Scholar
Andreassen OA et al (2014a) Genetic pleiotropy between multiple sclerosis and schizophrenia but not bipolar disorder: differential involvement of immune-related gene loci. Mol Psychiatry 20:207–214. https://doi.org/10.1038/mp.2013.195
Article CAS PubMed PubMed Central Google Scholar
Andreassen OA et al (2014b) Identifying common genetic variants in blood pressure due to polygenic pleiotropy with associated phenotypes. Hypertension 63:819–826. https://doi.org/10.1161/hypertensionaha.113.02077
Article CAS PubMed Google Scholar
Andreassen OA et al (2014c) Shared common variants in prostate cancer and blood lipids. Int J Epidemiol 43:1205–1214. https://doi.org/10.1093/ije/dyu090
Article PubMed PubMed Central Google Scholar
Baurecht H et al (2015) Genome-wide comparative analysis of atopic dermatitis and psoriasis gives insight into opposing genetic mechanisms. Am J Hum Genet 96:104–120. https://doi.org/10.1016/j.ajhg.2014.12.004
Article CAS PubMed PubMed Central Google Scholar
Bhattacharjee S et al (2012) A subset-based approach improves power and interpretation for the combined analysis of genetic association studies of heterogeneous traits. Am J Hum Genet 90:821–835. https://doi.org/10.1016/j.ajhg.2012.03.015
Article CAS PubMed PubMed Central Google Scholar
Brainstorm C et al (2018) Analysis of shared heritability in common disorders of the brain. Science. https://doi.org/10.1126/science.aap8757
Article Google Scholar
Broce I et al (2018) Immune-related genetic enrichment in frontotemporal dementia: an analysis of genome-wide association studies. PLoS Med 15:e1002487. https://doi.org/10.1371/journal.pmed.1002487
Article CAS PubMed PubMed Central Google Scholar
Broce IJ et al (2019) Dissecting the genetic relationship between cardiovascular risk factors and Alzheimer’s disease. Acta Neuropathol 137:209–226. https://doi.org/10.1007/s00401-018-1928-6
Article CAS PubMed Google Scholar
Bulik-Sullivan B et al (2015a) An atlas of genetic correlations across human diseases and traits. Nat Genet 47:1236–1241. https://doi.org/10.1038/ng.3406
Article CAS PubMed PubMed Central Google Scholar
Bulik-Sullivan BK et al (2015b) LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet 47:291–295. https://doi.org/10.1038/ng.3211
Article CAS PubMed PubMed Central Google Scholar
Cross-Disorder Group of the Psychiatric Genomics C et al (2013) Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs. Nat Genet 45:984–994. https://doi.org/10.1038/ng.2711
Article CAS Google Scholar
Davies G et al (2018) Study of 300,486 individuals identifies 148 independent genetic loci influencing general cognitive function. Nat Commun 9:2098. https://doi.org/10.1038/s41467-018-04362-x
Article CAS PubMed PubMed Central Google Scholar
Desikan RS et al (2015) Polygenic overlap between C-reactive protein, plasma lipids, and alzheimer disease. Circulation 131:2061–2069. https://doi.org/10.1161/CIRCULATIONAHA.115.015489
Article CAS PubMed PubMed Central Google Scholar
Devlin B, Roeder K (1999) Genomic control for association studies. Biometrics 55:997–1004
Article CAS PubMed Google Scholar
Drange OK et al (2019) Genetic overlap between alzheimer’s disease and bipolar disorder implicates the MARK2 and VAC14 genes. Front Neurosci 13:220. https://doi.org/10.3389/fnins.2019.00220
Article PubMed PubMed Central Google Scholar
Efron B (2007) Size, power and false discovery rates. Ann Stat 35:1351–1377
Article Google Scholar
Efron B (2010) Large-scale inference: empirical Bayes methods for estimation, testing, and prediction. Institute of mathematical statistics monographs, vol 1. Cambridge University Press, Cambridge
Book Google Scholar
Efron B, Tibshirani R (2002) Empirical bayes methods and false discovery rates for microarrays. Genet Epidemiol 23:70–86. https://doi.org/10.1002/gepi.1124
Article PubMed Google Scholar
Ellinghaus D et al (2012) Combined analysis of genome-wide association studies for Crohn disease and psoriasis identifies seven shared susceptibility loci. Am J Hum Genet 90:636–647. https://doi.org/10.1016/j.ajhg.2012.02.020
Article CAS PubMed PubMed Central Google Scholar
Ferrari R et al (2017) Genetic architecture of sporadic frontotemporal dementia and overlap with Alzheimer’s and Parkinson’s diseases. J Neurol Neurosurg Psychiatry 88:152–164. https://doi.org/10.1136/jnnp-2016-314411
Article PubMed Google Scholar
Frei O et al (2019) Bivariate causal mixture model quantifies polygenic overlap between complex traits beyond genetic correlation. Nat Commun 10:2417. https://doi.org/10.1038/s41467-019-10310-0
Article CAS PubMed PubMed Central Google Scholar
Giambartolomei C, Vukcevic D, Schadt EE, Franke L, Hingorani AD, Wallace C, Plagnol V (2014) Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet 10:e1004383. https://doi.org/10.1371/journal.pgen.1004383
Article CAS PubMed PubMed Central Google Scholar
Gratten J, Visscher PM (2016) Genetic pleiotropy in complex traits and diseases: implications for genomic medicine. Genome Med 8:78. https://doi.org/10.1186/s13073-016-0332-x
Article CAS PubMed PubMed Central Google Scholar
Grotzinger AD et al (2019) Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits. Nat Hum Behav. https://doi.org/10.1038/s41562-019-0566-x
Article PubMed PubMed Central Google Scholar
Hackinger S, Zeggini E (2017) Statistical methods to detect pleiotropy in human complex traits. Open Biol. https://doi.org/10.1098/rsob.170125
Article PubMed PubMed Central Google Scholar
Han B, Duong D, Sul JH, de Bakker PI, Eskin E, Raychaudhuri S (2016) A general framework for meta-analyzing dependent studies with overlapping subjects in association mapping. Hum Mol Genet 25:1857–1866. https://doi.org/10.1093/hmg/ddw049
Article CAS PubMed PubMed Central Google Scholar
Hernan MA, Robins JM (2006) Instruments for causal inference: an epidemiologist’s dream? Epidemiology (Cambridge, Mass) 17:360–372. https://doi.org/10.1097/01.ede.0000222409.00878.37
Article Google Scholar
Hill WD, Davies G, Group CCW, Liewald DC, McIntosh AM, Deary IJ (2016) Age-dependent pleiotropy between general cognitive function and major psychiatric disorders. Biol Psychiat 80:266–273. https://doi.org/10.1016/j.biopsych.2015.08.033
Article PubMed Google Scholar
Holland D et al (2019) Beyond SNP heritability: polygenicity and discoverability of phenotypes estimated with a univariate gaussian mixture model. bioRxiv. https://doi.org/10.1101/133132
Article Google Scholar
Hu Y et al (2018) Identification of novel potentially pleiotropic variants associated with osteoporosis and obesity using the cFDR method. J Clin Endocrinol Metab 103:125–138. https://doi.org/10.1210/jc.2017-01531
Article PubMed Google Scholar
Karch CM et al (2018) Selective genetic overlap between amyotrophic lateral sclerosis and diseases of the frontotemporal dementia spectrum. JAMA Neurol 75:860–875. https://doi.org/10.1001/jamaneurol.2018.0372
Article PubMed PubMed Central Google Scholar
Lawlor DA, Harbord RM, Sterne JA, Timpson N, Davey Smith G (2008) Mendelian randomization: using genes as instruments for making causal inferences in epidemiology. Stat Med 27:1133–1163. https://doi.org/10.1002/sim.3034
Article PubMed Google Scholar
Le Hellard S et al (2017) Identification of gene loci that overlap between schizophrenia and educational attainment. Schizophr Bull 43:654–664. https://doi.org/10.1093/schbul/sbw085
Article PubMed Google Scholar
LeBlanc M et al (2015) Identifying novel gene variants in coronary artery disease and shared genes with several cardiovascular risk factors. Circ Res. https://doi.org/10.1161/circresaha.115.306629
Article PubMed PubMed Central Google Scholar
Lee SH, Yang J, Goddard ME, Visscher PM, Wray NR (2012) Estimation of pleiotropy between complex diseases using single-nucleotide polymorphism-derived genomic relationships and restricted maximum likelihood. Bioinformatics 28:2540–2542. https://doi.org/10.1093/bioinformatics/bts474
Article CAS PubMed PubMed Central Google Scholar
Lencz T et al (2014) Molecular genetic evidence for overlap between general cognitive ability and risk for schizophrenia: a report from the cognitive genomics consorTium (COGENT). Mol Psychiatry 19:168–174. https://doi.org/10.1038/mp.2013.166
Article CAS PubMed Google Scholar
Liley J, Wallace C (2015) A pleiotropy-informed Bayesian false discovery rate adapted to a shared control design finds new disease associations from GWAS summary statistics. PLoS Genet 11:e1004926. https://doi.org/10.1371/journal.pgen.1004926
Article CAS PubMed PubMed Central Google Scholar
Lin DY, Sullivan PF (2009) Meta-analysis of genome-wide association studies with overlapping subjects. Am J Hum Genet 85:862–872. https://doi.org/10.1016/j.ajhg.2009.11.001
Article CAS PubMed PubMed Central Google Scholar
Liu JZ et al (2013) Dense genotyping of immune-related disease regions identifies nine new risk loci for primary sclerosing cholangitis. Nat Genet 45:670–675. https://doi.org/10.1038/ng.2616
Article CAS PubMed PubMed Central Google Scholar
Lo MT et al (2017) Modeling prior information of common genetic variants improves gene discovery for neuroticism. Hum Mol Genet 26:4530–4539. https://doi.org/10.1093/hmg/ddx340
Article CAS PubMed PubMed Central Google Scholar
Lv WQ et al (2017) Novel common variants associated with body mass index and coronary artery disease detected using a pleiotropic cFDR method. J Mol Cell Cardiol 112:1–7. https://doi.org/10.1016/j.yjmcc.2017.08.011
Article CAS PubMed PubMed Central Google Scholar
Manolio TA et al (2009) Finding the missing heritability of complex diseases. Nature 461:747–753. https://doi.org/10.1038/nature08494
Article CAS PubMed PubMed Central Google Scholar
McLaughlin RL et al (2017) Genetic correlation between amyotrophic lateral sclerosis and schizophrenia. Nat Commun 8:14774. https://doi.org/10.1038/ncomms14774
Article CAS PubMed PubMed Central Google Scholar
Morris AP (2011) Transethnic meta-analysis of genomewide association studies. Genet Epidemiol 35:809–822. https://doi.org/10.1002/gepi.20630
Article PubMed PubMed Central Google Scholar
Mufford M et al (2019) Concordance of genetic variation that increases risk for tourette syndrome and that influences its underlying neurocircuitry. Transl Psychiatry 9:120. https://doi.org/10.1038/s41398-019-0452-3
Article CAS PubMed PubMed Central Google Scholar
Nichols T, Brett M, Andersson J, Wager T, Poline JB (2005) Valid conjunction inference with the minimum statistic. Neuroimage 25:653–660. https://doi.org/10.1016/j.neuroimage.2004.12.005
Article PubMed Google Scholar
O’Reilly PF, Hoggart CJ, Pomyen Y, Calboli FCF, Elliott P, Jarvelin M-R, Coin LJM (2012) MultiPhen: joint model of multiple phenotypes can increase discovery in GWAS. PLOS One 7:e34861. https://doi.org/10.1371/journal.pone.0034861
Article CAS PubMed PubMed Central Google Scholar
Pasaniuc B, Price AL (2017) Dissecting the genetics of complex traits using summary association statistics. Nat Rev Genet 18:117–127. https://doi.org/10.1038/nrg.2016.142
Article CAS PubMed Google Scholar
Pickrell JK, Berisa T, Liu JZ, Ségurel L, Tung JY, Hinds DA (2016) Detection and interpretation of shared genetic influences on 42 human traits. Nat Genet 48:709–717. https://doi.org/10.1038/ng.3570
Article CAS PubMed PubMed Central Google Scholar
Price AL et al (2008) Long-range LD can confound genome scans in admixed populations. Am J Hum Genet 83:132–135. https://doi.org/10.1016/j.ajhg.2008.06.005
Article CAS PubMed PubMed Central Google Scholar
Purcell SM, Wray NR, Stone JL, Visscher PM, O’Donovan MC, Sullivan PF, Sklar P (2009) Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature 460:748–752. https://doi.org/10.1038/nature08185
Article CAS PubMed Google Scholar
Savage JE et al (2018) Genome-wide association meta-analysis in 269,867 individuals identifies new genetic and functional links to intelligence. Nat Genet 50:912–919. https://doi.org/10.1038/s41588-018-0152-6
Article CAS PubMed PubMed Central Google Scholar
Schork AJ et al (2013) All SNPs are not created equal: genome-wide association studies reveal a consistent pattern of enrichment among functionally annotated SNPs. PLoS Genet 9:e1003449. https://doi.org/10.1371/journal.pgen.1003449
Article CAS PubMed PubMed Central Google Scholar
Schork AJ, Wang Y, Thompson WK, Dale AM, Andreassen OA (2016) New statistical approaches exploit the polygenic architecture of schizophrenia—implications for the underlying neurobiology. Curr Opin Neurobiol 36:89–98. https://doi.org/10.1016/j.conb.2015.10.008
Article CAS PubMed Google Scholar
Schwartzman A, Lin X (2011) The effect of correlation in false discovery rate estimation. Biometrika 98:199–214. https://doi.org/10.1093/biomet/asq075
Article PubMed PubMed Central Google Scholar
Shadrin AA et al (2018) Novel loci associated with attention-deficit/hyperactivity disorder are revealed by leveraging polygenic overlap with educational attainment. J Am Acad Child Adolesc Psychiatry 57:86–95. https://doi.org/10.1016/j.jaac.2017.11.013
Article PubMed Google Scholar
Shi H, Mancuso N, Spendlove S, Pasaniuc B (2017) Local genetic correlation gives insights into the shared genetic architecture of complex traits. Am J Hum Genet 101:737–751. https://doi.org/10.1016/j.ajhg.2017.09.022
Article CAS PubMed PubMed Central Google Scholar
Sivakumaran S et al (2011) Abundant pleiotropy in human complex diseases and traits. Am J Hum Genet 89:607–618. https://doi.org/10.1016/j.ajhg.2011.10.004
Article CAS PubMed PubMed Central Google Scholar
Smeland OB et al (2017a) Identification of genetic loci jointly influencing schizophrenia risk and the cognitive traits of verbal-numerical reasoning, reaction time, and general cognitive function. JAMA Psychiatry 74:1065–1075. https://doi.org/10.1001/jamapsychiatry.2017.1986
Article PubMed PubMed Central Google Scholar
Smeland OB et al (2017b) Identification of genetic loci shared between schizophrenia and the big five personality traits. Sci Rep 7:2222. https://doi.org/10.1038/s41598-017-02346-3
Article CAS PubMed PubMed Central Google Scholar
Smeland OB et al (2018) Genetic overlap between schizophrenia and volumes of hippocampus, putamen, and intracranial volume indicates shared molecular genetic mechanisms. Schizophr Bull 44:854–864. https://doi.org/10.1093/schbul/sbx148
Article PubMed Google Scholar
Smeland OB et al (2019) Genome-wide analysis reveals extensive genetic overlap between schizophrenia, bipolar disorder, and intelligence. Mol Psychiatry. https://doi.org/10.1038/s41380-018-0332-x
Article PubMed PubMed Central Google Scholar
Smith GD, Ebrahim S (2003) ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol 32:1–22
Article PubMed Google Scholar
Smoller JW, Andreassen OA, Edenberg HJ, Faraone SV, Glatt SJ, Kendler KS (2018) Psychiatric genetics and the structure of psychopathology. Mol Psychiatry. https://doi.org/10.1038/s41380-017-0010-4
Article PubMed PubMed Central Google Scholar
Sniekers S et al (2017) Genome-wide association meta-analysis of 78,308 individuals identifies new loci and genes influencing human intelligence. Nat Genet 49:1107–1112. https://doi.org/10.1038/ng.3869
Article CAS PubMed PubMed Central Google Scholar
Solovieff N, Cotsapas C, Lee PH, Purcell SM, Smoller JW (2013) Pleiotropy in complex traits: challenges and strategies. Nat Rev Genet 14:483–495. https://doi.org/10.1038/nrg3461
Article CAS PubMed PubMed Central Google Scholar
Stahl EA et al (2019) Genome-wide association study identifies 30 loci associated with bipolar disorder. Nat Genet 51:793–803. https://doi.org/10.1038/s41588-019-0397-8
Article CAS PubMed PubMed Central Google Scholar
Sun L, Craiu RV, Paterson AD, Bull SB (2006) Stratified false discovery control for large-scale hypothesis testing with application to genome-wide association studies. Genet Epidemiol 30:519–530. https://doi.org/10.1002/gepi.20164
Article PubMed Google Scholar
Turley P et al (2018) Multi-trait analysis of genome-wide association summary statistics using MTAG. Nat Genet 50:229–237. https://doi.org/10.1038/s41588-017-0009-4
Article CAS PubMed PubMed Central Google Scholar
van der Meer D et al (2018) Brain scans from 21,297 individuals reveal the genetic architecture of hippocampal subfield volumes. Mol Psychiatry. https://doi.org/10.1038/s41380-018-0262-7
Article PubMed PubMed Central Google Scholar
van der Sluis S, Posthuma D, Dolan CV (2013) TATES: efficient multivariate genotype-phenotype analysis for genome-wide association studies. PLoS Genet 9:e1003235. https://doi.org/10.1371/journal.pgen.1003235
Article CAS PubMed PubMed Central Google Scholar
Visscher PM, Wray NR, Zhang Q, Sklar P, McCarthy MI, Brown MA, Yang J (2017) 10 years of GWAS discovery: biology, function, and translation. Am J Hum Genet 101:5–22. https://doi.org/10.1016/j.ajhg.2017.06.005
Article CAS PubMed PubMed Central Google Scholar
Wang Y et al (2016a) Genetic overlap between multiple sclerosis and several cardiovascular disease risk factors. Mult Scler 22:1783–1793. https://doi.org/10.1177/1352458516635873
Article CAS PubMed PubMed Central Google Scholar
Wang Y et al (2016b) Leveraging genomic annotations and pleiotropic enrichment for improved replication rates in schizophrenia GWAS. PLoS Genet 12:e1005803. https://doi.org/10.1371/journal.pgen.1005803
Article CAS PubMed PubMed Central Google Scholar
Willer CJ, Li Y, Abecasis GR (2010) METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26:2190–2191. https://doi.org/10.1093/bioinformatics/btq340
Article CAS PubMed PubMed Central Google Scholar
Winsvold BS et al (2017) Shared genetic risk between migraine and coronary artery disease: A genome-wide analysis of common variants. PLoS One 12:e0185663. https://doi.org/10.1371/journal.pone.0185663
Article CAS PubMed PubMed Central Google Scholar
Witoelar A et al (2017) Genome-wide pleiotropy between parkinson disease and autoimmune diseases. JAMA Neurol 74:780–792. https://doi.org/10.1001/jamaneurol.2017.0469
Article PubMed PubMed Central Google Scholar
Yokoyama JS et al (2016) Association between genetic traits for immune-mediated diseases and alzheimer disease. JAMA neurology 73:691–697. https://doi.org/10.1001/jamaneurol.2016.0150
Article PubMed PubMed Central Google Scholar
Yokoyama JS et al (2017) Shared genetic risk between corticobasal degeneration, progressive supranuclear palsy, and frontotemporal dementia. Acta Neuropathol 133:825–837. https://doi.org/10.1007/s00401-017-1693-y
Article CAS PubMed PubMed Central Google Scholar
Yoo YJ, Pinnaduwage D, Waggott D, Bull SB, Sun L (2009) Genome-wide association analyses of North American rheumatoid arthritis consortium and Framingham heart study data utilizing genome-wide linkage results. BMC Proc 3(Suppl 7):S103
Article PubMed PubMed Central Google Scholar
Zhu Z et al (2018) Causal associations between risk factors and common diseases inferred from GWAS summary data. Nat Commun 9:224. https://doi.org/10.1038/s41467-017-02317-2
Article CAS PubMed PubMed Central Google Scholar
Zuber V et al (2018) Identification of shared genetic variants between schizophrenia and lung cancer. Sci Rep 8:674. https://doi.org/10.1038/s41598-017-16481-4
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

National Institutes of Health (NS057198; EB00790); National Institutes of Health NIDA/NCI: U24DA041123; the Research Council of Norway (229129; 213837; 248778; 223273; 249711); the South-East Norway Regional Health Authority (2017-112); KG Jebsen Stiftelsen (SKGJ-2011-36).

Author information

Authors and Affiliations

NORMENT Centre, Division of Mental Health and Addiction, Oslo University Hospital and Institute of Clinical Medicine, University of Oslo, Kirkeveien 166, 0424, Oslo, Norway
Olav B. Smeland, Oleksandr Frei, Alexey Shadrin, Kevin O’Connell, Shahram Bahrami & Ole A. Andreassen
Department of Cognitive Science, University of California San Diego, La Jolla, San Diego, CA, 92093, USA
Chun-Chieh Fan & Anders M. Dale
Department of Radiology, University of California of San Diego, La Jolla, San Diego, CA, 92093, USA
Chun-Chieh Fan, Dominic Holland & Anders M. Dale
Department of Family Medicine and Public Health, University of California San Diego, La Jolla, San Diego, CA, 92093, USA
Wesley K. Thompson
Department of Neuroscience, University of California San Diego, La Jolla, San Diego, CA, 92093, USA
Dominic Holland & Anders M. Dale
Center for Multimodal Imaging and Genetics, University of California San Diego, La Jolla, San Diego, CA, 92037, USA
Dominic Holland & Anders M. Dale
Department of Medical Genetics, Oslo University Hospital, Oslo, Norway
Srdjan Djurovic
NORMENT Centre, Department of Clinical Science, University of Bergen, Bergen, Norway
Srdjan Djurovic

Authors

Olav B. Smeland
View author publications
You can also search for this author in PubMed Google Scholar
Oleksandr Frei
View author publications
You can also search for this author in PubMed Google Scholar
Alexey Shadrin
View author publications
You can also search for this author in PubMed Google Scholar
Kevin O’Connell
View author publications
You can also search for this author in PubMed Google Scholar
Chun-Chieh Fan
View author publications
You can also search for this author in PubMed Google Scholar
Shahram Bahrami
View author publications
You can also search for this author in PubMed Google Scholar
Dominic Holland
View author publications
You can also search for this author in PubMed Google Scholar
Srdjan Djurovic
View author publications
You can also search for this author in PubMed Google Scholar
Wesley K. Thompson
View author publications
You can also search for this author in PubMed Google Scholar
Anders M. Dale
View author publications
You can also search for this author in PubMed Google Scholar
Ole A. Andreassen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Olav B. Smeland or Ole A. Andreassen.

Ethics declarations

Conflict of interest

OA.A. has received speaker’s honorarium from Lundbeck and is a consultant for Healthlytix. C.C.F. is under employment of Multimodal Imaging Service, dba Healthlytix, in addition to his research appointment at the University of California, San Diego. A.M.D. is a founder of and holds equity interest in CorTechs Labs and serves on its scientific advisory board. He is also a member of the Scientific Advisory Board of Healthlytix and receives research funding from General Electric Healthcare (GEHC). The terms of these arrangements have been reviewed and approved by the University of California, San Diego in accordance with its conflict of interest policies. Remaining authors have no conflicts of interest to declare.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The condFDR/conjFDR software is available on https://github.com/precimed/pleiofdr as a MATLAB package, under GPL v3 license.

Box 1: Conditional and conjunctional false discovery rate

The ‘enrichment’ seen in the conditional Q–Q plots can be directly interpreted in terms of a Bayesian interpretation of the true discovery rate (TDR = 1 – false discovery rate (FDR)) (Efron 2010). More specifically, for a given p value, under a simple two-group (null and non-null) model, Bayes rule gives the posterior probability of being null as:

$${\text{FDR}}\left( p \right)\, = \,\pi_{0} F_{0} \left( p \right)/F\left( p \right),$$

(1)

where π₀ is the proportion of null SNPs, F₀ is the cumulative distribution function (cdf) of the null SNPs, and F is the cdf of all SNPs, both null and non-null (Efron 2007). Here, we assume the SNP p values are a priori independent and identically distributed. Under the null hypothesis, F₀ is the cdf of the uniform distribution on the unit interval [0,1], so that Eq. (1) reduces to:

$${\text{FDR}}\left( p \right)\, = \,\pi_{0} p/F\left( p \right).$$

(2)

F can be estimated by the empirical cdf q = N_p/Ν, where N_p is the number of SNPs with p values less than or equal to p, and N is the total number of SNPs. Replacing F by q in Eq. (2), we get:

$${\text{Estimated FDR}}\left( p \right)\, = \,\pi_{0} p/q,$$

(3)

which is biased upwards as an estimate of the FDR (Efron and Tibshirani 2002). Replacing π₀ in Eq. (3) with unity gives an estimated FDR that is further biased upward:

$$q^*\, = \,p/q.$$

(4)

If π₀ is close to one, the increase in bias going from Eqs. (3–4) is minimal. The quantity 1 – p/q is, therefore, biased downward, and hence a conservative estimate of the TDR. Referring to the Q–Q plots, we see that q* is equivalent to the nominal p value divided by the empirical quantile, as defined earlier. We can thus read the FDR estimate directly off the Q–Q plot as:

$$- { \log }_{ 10} \left( {q^*} \right)\, = \,{ \log }_{ 10} \left( q \right){-}{ \log }_{ 10} \left( p \right),$$

(5)

i.e., the horizontal shift of the curves in the Q–Q plots from the expected line x = y, with a larger shift corresponding to a smaller FDR. To estimate the conditional FDR of a given SNP, we repeat the above procedure for a subset of SNPs with p values in the secondary GWAS equal to or lower than that observed for the given SNP. Formally, this is given by:

$${\text{FDR}}\left( {p_{1} |p_{2} } \right)\, = \,\pi_{0} \left( {p_{2} } \right)p_{1} /F\left( {p_{1} |p_{2} } \right),$$

(6)

where p₁ is the p value for the first phenotype, p₂ is the p value for the second, and F(p₁ | p₂) is the conditional cdf and π₀ (p₂) the conditional proportion of null SNPs for the first phenotype, given that p values for the second phenotype are p₂ or smaller. The condFDR framework is closely related to the stratified FDR method developed by Sun et al. (2006). Whereas they propose computing FDR separately conditional on membership in pre-defined discrete strata of p values, here, we condition the estimated FDR on a continuous random variable, the SNP p values with respect to a second phenotype.

To identify SNPs jointly associated with two phenotypes using conjunctional FDR, the conditional FDR procedure is repeated after inverting the roles of the primary and secondary phenotypes. Similar to previous conjunction tests for p value statistics (Nichols et al. 2005), the conjunctional FDR estimate is defined as the maximum of both conditional FDR values, which minimizes the effect of a single phenotype driving the common association signal. Formally, the conjunctional FDR is given by:

$$\begin{aligned}& {\text{FDR}}_{{{\text{Phenotype1}}\& {\text{Phenotype2}}}} \left( {p_{1} , \, p_{2} } \right)\,\\ & \quad = \,\pi_{0} F_{0} \left( {p_{1} , \, p_{2} } \right)/F\left( {p_{1} , \, p_{2} } \right)\, \\&\quad+ \,\pi_{1} F_{1} \left( {p_{1} , \, p_{2} } \right)/F\left( {p_{1} , \, p_{2} } \right)\, \\&\quad+ \,\pi_{2} F_{2} \left( {p_{1} , \, p_{2} } \right)/F\left( {p_{1} , \, p_{2} } \right), \end{aligned}$$

(7)

where π₀ is the a priori proportion of SNPs null for both phenotypes simultaneously and F₀(p₁, p₂) is the joint null cdf, π₁ is the a priori proportion of SNPs non-null for the first phenotype and null for the second with F₁(p₁, p₂) the joint cdf of these SNPs, and π₂ is the a priori proportion of SNPs non-null for the second phenotype and null for the first, with joint cdf F₂(p₁, p₂). F(p₁, p₂) is the joint overall mixture cdf for all phenotype 1 and 2 SNPs.

Conditional empirical cdfs provide a model-free method to obtain conservative estimates of Eq. (7). This can be seen as follows: estimate the conjunction FDR by:

$$\begin{aligned} &{\rm{Estimated\, FDR}}_{{{{\rm Phenotype1}}} \& {{{\rm Phenotype2}}}}\\&\quad = {{\text{max}}} \, \left\{ {{\text{Estimated FDR}}_{{\rm Phenotype1} | {\rm Phenotype2}} ,}\right. \\&\quad\left.{{{\rm{ Estimated\, FDR}}}_{{{\rm Phenotype2} | {\rm Phenotype}1}} } \right\}, \end{aligned}$$

(8)

where Estimated FDR_{Phenotype1|Phenotype2} and Estimated FDR_{Phenotype2|Phenotype1} are conservative (upwardly biased) estimates of Eq. (6). Thus, Eq. (8) is a conservative estimate of max {p₁/F(p₁| p₂), p₂/F(p₂|p₁)} = max{p₁F₂(p₂)/F(p₁, p₂), p₂F₁(p₁)/F(p₁, p₂)}, with F₁(p₁) and F₂(p₂) the marginal non-null cdfs of SNPs for phenotypes 1 and 2, respectively. For enriched samples, p values will tend to be smaller than predicted from the uniform distribution, so that F₁(p₁) ≥ p₁ and F₂(p₂) ≥ p₂. Then, max {p₁F₂(p₂)/F(p₁, p₂), p₂F₁(p₁)/F(p₁, p₂)} ≥ [π₀ + π₁ + π₂] max{p₁F₂(p₂)/F(p₁, p₂), p₂F₁(p₁)/F(p₁, p₂)} ≥ [π₀p₁p₂ + π₁p₂F₁(p₁) + π₂p₁F₂(p₂)]/F(p₁, p₂).

Under the assumption that SNPs are independent if one or both are null, reasonable for disjoint samples, this last quantity is precisely the conjunctional FDR given in Eq. (7). Thus, Eq. (8) is a conservative model-free estimate of the conjunctional FDR.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Smeland, O.B., Frei, O., Shadrin, A. et al. Discovery of shared genomic loci using the conditional false discovery rate approach. Hum Genet 139, 85–94 (2020). https://doi.org/10.1007/s00439-019-02060-2

Download citation

Received: 20 November 2018
Accepted: 08 August 2019
Published: 13 September 2019
Issue Date: January 2020
DOI: https://doi.org/10.1007/s00439-019-02060-2

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Discovery of shared genomic loci using the conditional false discovery rate approach

Abstract

Access this article

Similar content being viewed by others

Detecting local genetic correlations with scan statistics

Cross-Phenotype Association Analysis Using Summary Statistics from GWAS

Investigation of multi-trait associations using pathway-based analysis of GWAS summary statistics

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Box 1: Conditional and conjunctional false discovery rate

Rights and permissions

About this article

Cite this article

Navigation

Discovery of shared genomic loci using the conditional false discovery rate approach

Abstract

Access this article

Similar content being viewed by others

Detecting local genetic correlations with scan statistics

Cross-Phenotype Association Analysis Using Summary Statistics from GWAS

Investigation of multi-trait associations using pathway-based analysis of GWAS summary statistics

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Box 1: Conditional and conjunctional false discovery rate

Box 1: Conditional and conjunctional false discovery rate

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation