Abstract
The alternative splicing pattern of transcription units can be influenced by the genotype of a neighboring locus, which is termed splicing quantitative trait locus (sQTL). Here we report a comprehensive catalog of sQTLs discovered from the public RNA-seq and matched genotype datasets of three European ancestries. Each pair of RNA-seq and genotype dataset was analyzed with IVAS, a locally developed R/Bioconductor package for sQTL discovery. A meta-analysis was applied to the three result sets to reach a consensus of 2525 sQTLs (FDR < 0.05). Among them, nine independent sQTLs overlapped the known signals in the catalog of genome-wide association studies. Interestingly, six of these sQTLs are associated with the alternative exons, whose absence would hamper the protein function by omitting a critical/conserved domain. Altogether, we report the list of candidate sQTLs, and it might be useful for the explanations of trait-associated polymorphisms.
Similar content being viewed by others
References
Allen HL, Estrada K, Lettre G, Berndt SI, Weedon MN, Rivadeneira F, Willer CJ, Jackson AU, Vedantam S, Raychaudhuri S et al (2010) Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 467:832–838
Begum F, Ghosh D, Tseng GC, Feingold E (2012) Comprehensive literature review and statistical considerations for GWAS meta-analysis. Nucleic Acids Res 40:3777–3784
Boyle AP, Hong EL, Hariharan M, Cheng Y, Schaub MA, Kasowski M, Karczewski KJ, Park J, Hitz BC, Weng S et al (2012) Annotation of functional variation in personal genomes using Regulome DB. Genome Res 22:1790–1797
Comuzzie AG, Cole SA, Laston SL, Voruganti VS, Haack K, Gibbs RA, Butte NF (2012) Novel genetic loci identified for the pathophysiology of childhood obesity in the Hispanic population. PLoS One 7:e51954
Consortium TGP, Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, Hurles ME, McVean GA (2010) A map of human genome variation from population scale sequencing. Nature 467:1061–1073. doi:10.1038/nature09534
Deng X, Sabino EC, Cunha-Neto E, Ribeiro AL, Ianni B, Mady C, Busch MP, Seielstad M (2013) Genome wide association study (GWAS) of Chagas cardiomyopathy in Trypanosoma cruzi seropositive subjects. PLoS One 8:e79629
E Z, Wang L, Zhou J (2013) Splicing and alternative splicing in rice and humans. BMB Rep 46:439–447
ENCODE Project Consortium (2004) The ENCODE (ENCyclopedia Of DNA Elements) project. Science 306:636–640
ENCODE Project Consortium (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489:57–74
Goes FS, Hamshere ML, Seifuddin F, Pirooznia M, Belmonte-Mahon P, Breuer R, Schulze T, Nothen M, Cichon S, Rietschel M et al (2012) Genome-wide association of mood-incongruent psychotic bipolar disorder. Transl Psychiatry 2:e180
Gold B, Kirchhoff T, Stefanov S, Lautenberger J, Viale A, Garber J, Friedman E, Narod S, Olshen AB, Gregersen P et al (2008) Genome-wide association study provides evidence for a breast cancer risk locus at 6q22.33. Proc Natl Acad Sci USA 105:4340–4345
Gong C, Xu Y, Fan Y, Liu X, Xiong C, He L, Liu C, Rao S, Xia W, Ding L et al (2016) Functional mechanisms for diabetic nephropathy-associated genetic variants. Genes Genom. doi:10.1007/s13258-016-0415-5
Ioannidis JP, Thomas G, Daly MJ (2009) Validating, augmenting and refining genome-wide association signals. Nat Rev Genet 10:318–329
Kallberg M, Margaryan G, Wang S, Ma J, Xu J (2014) RaptorX server: a resource for template-based protein structure modeling. Methods Mol Biol 1137:17–27
Katz Y, Wang ET, Airoldi EM, Burge CB (2010) Analysis and design of RNA sequencing experiments for identifying isoform regulation. Nat Methods 7:1009–1015
Manolio TA (2009) Cohort studies and the genetics of complex disease. Nat Genet 41:5–6
McDonough CW, Palmer ND, Hicks PJ, Roh BH, An SS, Cooke JN, Hester JM, Wing MR, Bostrom MA, Rudock ME et al (2011) A genome-wide association study for diabetic nephropathy genes in African Americans. Kidney Int 79:563–572
Monlong J, Calvo M, Ferreira PG, Guigo R (2014) Identification of genetic variants associated with alternative splicing using sQTLseekeR. Nat Commun 5:4698
Nicolae DL, Gamazon E, Zhang W, Duan S, Dolan ME, Cox NJ (2010) Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS. PLoS Genet 6:e1000888
Okada Y, Wu D, Trynka G, Raj T, Terao C, Ikari K, Kochi Y, Ohmura K, Suzuki A, Yoshida S et al (2014) Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506:376–381
Olalde I, Sanchez-Quinto F, Datta D, Marigorta UM, Chiang CW, Rodriguez JA, Fernandez-Callejo M, Gonzalez I, Montfort M, Matas-Lalueza L et al (2014) Genomic analysis of the blood attributed to Louis XVI (1754–1793), king of France. Sci Rep 4:4666. doi:10.1038/srep04666
Parsa A, Chang YP, Kelly RJ, Corretti MC, Ryan KA, Robinson SW, Gottlieb SS, Kardia SL, Shuldiner AR, Liggett SB (2011) Hypertrophy-associated polymorphisms ascertained in a founder cohort applied to heart failure risk and mortality. Clin Transl Sci 4:17–23
Schadt EE, Molony C, Chudin E, Hao K, Yang X, Lum PY, Kasarskis A, Zhang B, Wang S, Suver C et al (2008) Mapping the genetic architecture of gene expression in human liver. PLoS Biol 6:e107
Scott LJ, Muglia P, Kong XQ, Guan W, Flickinger M, Upmanyu R, Tozzi F, Li JZ, Burmeister M, Absher D et al (2009) Genome-wide association and meta-analysis of bipolar disorder in individuals of European ancestry. Proc Natl Acad Sci USA 106:7501–7506
Teslovich TM, Musunuru K, Smith AV, Edmondson AC, Stylianou IM, Koseki M, Pirruccello JP, Ripatti S, Chasman DI, Willer CJ et al (2010) Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466:707–713
Wang GS, Cooper TA (2007) Splicing in disease: disruption of the splicing code and the decoding machinery. Nat Rev Genet 8:749–761
Willer CJ, Li Y, Abecasis GR (2010) METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26:2190–2191
Zhang F, Lupski JR (2015) Non-coding genetic variants in human disease. Hum Mol Genet 24:R102–R110
Zhao K, Lu ZX, Park JW, Zhou Q, Xing Y (2013) GLiMMPS: robust statistical model for regulatory variation of alternative splicing using RNA-seq data. Genome Biol 14:R74
Acknowledgments
This work was supported in part by the Industrial Strategic Technology Development Program, 10,040,231, “Bioinformatics platform development for next-generation bioinformation analysis,” funded by the Ministry of trade, industry & energy (MOTIE, Korea). This work was also supported by the National Research Foundation, funded by the Ministry of Science, ICT & Future Planning, Korea (NRF-2012M3A9D1054705).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Seonggyun Han, Hyeim Jung, Kichan Lee, Hyunho Kim and Sangsoo Kim declare no conflict of interest.
Human and animal rights
This article does not contain any studies with human subjects or animals performed by any of the authors.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Han, S., Jung, H., Lee, K. et al. Genome wide discovery of genetic variants affecting alternative splicing patterns in human using bioinformatics method. Genes Genom 39, 453–459 (2017). https://doi.org/10.1007/s13258-016-0466-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13258-016-0466-7