Abstract
Next generation sequencing based molecular assays have enabled unprecedented opportunities to quantitatively measure genome function. When combined with dense genetic data, quantitative trait loci (QTL) mapping of molecular traits is a fundamental tool for understanding the genetic basis of gene regulation. However, standard computational approaches for QTL mapping ignore the diploid nature of human genomes, testing for association between genotype and the total counts of sequencing reads mapping to both alleles at each genomic feature. In this work, we develop a new phase-aware test for QTL analysis (PhAT-QTL) leveraging the inherent single nucleotide resolution of sequencing reads to associate the alleles of each marker with the allele-specific counts (ASC) at a genomic feature. Through simulations, we show PhAT-QTL achieves increased power relative to standard genotype-based tests as a function of the number of heterozygotes for a given marker, the noise correlation between haplotypes, and the number of samples with detectable allele-specific counts at a genomic feature. Simulations further show that phasing error and error in quantifying ASC results in a loss of power as opposed to bias. Read simulations on varying haplotype structures (simulated from 1000 Genomes phased genomes) demonstrate that PhAT-QTL is able to detect 20% more QTLs while maintaining the same false positive rate as previous approaches. Applied to RNA-sequencing data, PhAT-QTL achieves similar performance as previous phase-aware methods in detecting cis expression QTLs (cis-eQTLs) but at a fraction of the computational cost.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Montgomery, S.B., Sammeth, M., Gutierrez-Arcelus, M., Lach, R.P., Ingle, C., Nisbett, J., Guigó, R., Dermitzakis, E.T.: Transcriptome genetics using second generation sequencing in a Caucasian population. Nature 464, 773–777 (2010)
Pickrell, J.K., Marioni, J.C., Pai, A.A., Degner, J.F., Engelhardt, B.E., Nkadori, E., Veyrieras, J.-B., Stephens, M., Gilad, Y., Pritchard, J.K.: Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature 464, 768–772 (2010)
Lappalainen, T., Sammeth, M., Friedländer, M.R., Hoen, P.A.C.T., Monlong, J., Rivas, M.A., Gonzàlez-Porta, M., Kurbatova, N., Griebel, T., Ferreira, P.G., Barann, M., Wieland, T., Greger, L., van Iterson, M., Almlöf, J., Ribeca, P., Pulyakhina, I., Esser, D., Giger, T., Tikhonov, A., Sultan, M., Bertier, G., MacArthur, D.G., Lek, M., Lizano, E., Buermans, H.P.J., Padioleau, I., Schwarzmayr, T., Karlberg, O., Ongen, H., Kilpinen, H., Beltran, S., Gut, M., Kahlem, K., Amstislavskiy, V., Stegle, O., Pirinen, M., Montgomery, S.B., Donnelly, P., McCarthy, M.I., Flicek, P., Strom, T.M., Consortium, T.G., Lehrach, H., Schreiber, S., Sudbrak, R.: Carracedo, Á., Antonarakis, S.E., Häsler, R., Syvänen, A.-C., van Ommen, G.-J., Brazma, A., Meitinger, T., Rosenstiel, P., Guigó, R., Gut, I.G., Estivill, X., Dermitzakis, E.T.: Transcriptome and genome sequencing uncovers functional variation in humans. Nature. 501, 506–511 (2013)
Battle, A., Mostafavi, S., Zhu, X., Potash, J.B., Weissman, M.M., McCormick, C., Haudenschild, C.D., Beckman, K.B., Shi, J., Mei, R., Urban, A.E., Montgomery, S.B., Levinson, D.F., Koller, D.: Characterizing the genetic basis of transcriptome diversity through RNA-sequencing of 922 individuals. Genome Res. 24, 14–24 (2014)
Li, Y.I., van de Geijn, B., Raj, A., Knowles, D.A., Petti, A.A., Golan, D., Gilad, Y., Pritchard, J.K.: RNA splicing is a primary link between genetic variation and disease. Science 352, 600–604 (2016)
Battle, A., Khan, Z., Wang, S.H., Mitrano, A., Ford, M.J., Pritchard, J.K., Gilad, Y.: Genomic variation. Impact of regulatory variation from RNA to protein. Science 347, 664–667 (2015)
Cheng, C.S., Gate, R.E., Aiden, A.P., Siba, A., Tabaka, M., Lituiev, D., Machol, I., Subramaniam, M., Shammim, M., Hougen, K.L., Wortman, I., Huang, S.-C., Durand, N.C., Feng, T., De Jager, P.L., Chang, H.Y., Lieberman Aiden, E., Benoist, C., Beer, M.A., Ye, C.J., Regev, A.: Genetic determinants of chromatin accessibility and gene regulation in T cell activation across human individuals. bioRxiv 090241 (2016)
Grubert, F., Zaugg, J.B., Kasowski, M., Ursu, O., Spacek, D.V., Martin, A.R., Greenside, P., Srivas, R., Phanstiel, D.H., Pekowska, A., Heidari, N., Euskirchen, G., Huber, W., Pritchard, J.K., Bustamante, C.D., Steinmetz, L.M., Kundaje, A., Snyder, M.: Genetic control of chromatin states in humans involves local and distal chromosomal interactions. Cell 162, 1051–1065 (2015)
Waszak, S.M., Delaneau, O., Gschwind, A.R., Kilpinen, H., Raghav, S.K., Witwicki, R.M., Orioli, A., Wiederkehr, M., Panousis, N.I., Yurovsky, A., Romano-Palumbo, L., Planchon, A., Bielser, D., Padioleau, I., Udin, G., Thurnheer, S., Hacker, D., Hernandez, N., Reymond, A., Deplancke, B., Dermitzakis, E.T.: Population variation and genetic control of modular chromatin architecture in humans. Cell 162, 1039–1050 (2015)
van de Geijn, B., McVicker, G., Gilad, Y., Pritchard, J.K.: WASP: allele-specific software for robust molecular quantitative trait locus discovery. Nat. Methods 12, 1061–1063 (2015)
Sun, W.: A statistical framework for eQTL mapping using RNA-seq data. Biometrics 68, 1–11 (2012)
Kumasaka, N., Knights, A.J., Gaffney, D.J.: Fine-mapping cellular QTLs with RASQUAL and ATAC-seq. Nat. Genet.: Nat. Res. 48, 206–213 (2016)
Munger, S.C., Raghupathy, N., Choi, K., Simons, A.K., Gatti, D.M., Hinerfeld, D.A., Svenson, K.L., Keller, M.P., Attie, A.D., Hibbs, M.A., Graber, J.H., Chesler, E.J., Churchill, G.A.: RNA-Seq alignment to individualized genomes improves transcript abundance estimates in multiparent populations. Genetics 198, 59–73 (2014)
Kang, H.M., Zaitlen, N.A., Wade, C.M., Kirby, A., Heckerman, D., Daly, M.J., Eskin, E.: Efficient control of population structure in model organism association mapping. Genetics 178, 1709–1723 (2008)
Gilmour, A.R., Thompson, R., Cullis, B.R.: Average information REML: an efficient algorithm for variance parameter estimation in linear mixed models. Biometrics 51, 1440–1450 (1995)
O’Connell, J., Gurdasani, D., Delaneau, O., Pirastu, N., Ulivi, S., Cocca, M., Traglia, M., Huang, J., Huffman, J.E., Rudan, I., McQuillan, R., Fraser, R.M., Campbell, H., Polasek, O., Asiki, G., Ekoru, K., Hayward, C., Wright, A.F., Vitart, V., Navarro, P., Zagury, J.-F., Wilson, J.F., Toniolo, D., Gasparini, P., Soranzo, N., Sandhu, M.S., Marchini, J.: A general approach for haplotype phasing across the full spectrum of relatedness. PLoS Genet. 10, e1004234 (2014)
Li, B., Dewey, C.N.: RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform. 12, 323 (2011)
Storey, J.D., Tibshirani, R.: Statistical significance for genomewide studies. Proc. Natl. Acad. Sci. USA 100, 9440–9445 (2003)
GEUVADIS portal. http://www.geuvadis.org/web/geuvadis/rnaseq-project
Weisenfeld, N.I., Kumar, V., Shah, P., Church, D., Jaffe, D.B.: Direct determination of diploid genome sequences. bioRxiv 070425 (2016)
Jain, M., Olsen, H.E., Paten, B., Akeson, M.: The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community. Genome Biol. 17, 239 (2016)
Eid, J., Fehr, A., Gray, J., Luong, K., Lyle, J., Otto, G., Peluso, P., Rank, D., Baybayan, P., Bettman, B., Bibillo, A., Bjornson, K., Chaudhuri, B., Christians, F., Cicero, R., Clark, S., Dalal, R., deWinter, A., Dixon, J., Foquet, M., Gaertner, A., Hardenbol, P., Heiner, C., Hester, K., Holden, D., Kearns, G., Kong, X., Kuse, R., Lacroix, Y., Lin, S., Lundquist, P., Ma, C., Marks, P., Maxham, M., Murphy, D., Park, I., Pham, T., Phillips, M., Roy, J., Sebra, R., Shen, G., Sorenson, J., Tomaney, A., Travers, K., Trulson, M., Vieceli, J., Wegener, J., Wu, D., Yang, A., Zaccarin, D., Zhao, P., Zhong, F., Korlach, J., Turner, S.: Real-time DNA sequencing from single polymerase molecules. Science 323, 133–138 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Subramaniam, M., Zaitlen, N., Ye, J. (2017). PhAT-QTL: A Phase-Aware Test for QTL Detection. In: Cai, Z., Daescu, O., Li, M. (eds) Bioinformatics Research and Applications. ISBRA 2017. Lecture Notes in Computer Science(), vol 10330. Springer, Cham. https://doi.org/10.1007/978-3-319-59575-7_14
Download citation
DOI: https://doi.org/10.1007/978-3-319-59575-7_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59574-0
Online ISBN: 978-3-319-59575-7
eBook Packages: Computer ScienceComputer Science (R0)