Prioritizing genes for follow-up from genome wide association studies using information on gene expression in tissues relevant for type 2 diabetes mellitus

Parikh, Hemang; Lyssenko, Valeriya; Groop, Leif C

doi:10.1186/1755-8794-2-72

Prioritizing genes for follow-up from genome wide association studies using information on gene expression in tissues relevant for type 2 diabetes mellitus

Research article
Open access
Published: 31 December 2009

Volume 2, article number 72, (2009)
Cite this article

Download PDF

You have full access to this open access article

BMC Medical Genomics Aims and scope Submit manuscript

Prioritizing genes for follow-up from genome wide association studies using information on gene expression in tissues relevant for type 2 diabetes mellitus

Download PDF

Hemang Parikh^1,2,
Valeriya Lyssenko¹ &
Leif C Groop^1,3

6382 Accesses
29 Citations
6 Altmetric
2 Mentions
Explore all metrics

Abstract

Background

Genome-wide association studies (GWAS) have emerged as a powerful approach for identifying susceptibility loci associated with polygenetic diseases such as type 2 diabetes mellitus (T2DM). However, it is still a daunting task to prioritize single nucleotide polymorphisms (SNPs) from GWAS for further replication in different population. Several recent studies have shown that genetic variation often affects gene-expression at proximal (cis) as well as distal (trans) genomic locations by different mechanisms such as altering rate of transcription or splicing or transcript stability.

Methods

To prioritize SNPs from GWAS, we combined results from two GWAS related to T2DM, the Diabetes Genetics Initiative (DGI) and the Wellcome Trust Case Control Consortium (WTCCC), with genome-wide expression data from pancreas, adipose tissue, liver and skeletal muscle of individuals with or without T2DM or animal models thereof to identify T2DM susceptibility loci.

Results

We identified 1,170 SNPs associated with T2DM with P < 0.05 in both GWAS and 243 genes that were located in the vicinity of these SNPs. Out of these 243 genes, we identified 115 differentially expressed in publicly available gene expression profiling data. Notably five of them, IGF2BP2, KCNJ11, NOTCH2, TCF7L2 and TSPAN8, have subsequently been shown to be associated with T2DM in different populations. To provide further validation of our approach, we reversed the approach and started with 26 known SNPs associated with T2DM and related traits. We could show that 12 (57%) (HHEX, HNF1B, IGF2BP2, IRS1, KCNJ11, KCNQ1, NOTCH2, PPARG, TCF7L2, THADA, TSPAN8 and WFS1) out of 21 genes located in vicinity of these SNPs were showing aberrant expression in T2DM from the gene expression profiling studies.

Conclusions

Utilizing of gene expression profiling data from different tissues of individuals with or without T2DM or animal models thereof is a powerful tool for prioritizing SNPs from WGAS for further replication studies.

View this article's peer review reports

Identification of potential genetic causal variants for obesity-related traits using statistical fine mapping

Article 27 July 2023

Diabetes risk loci-associated pathways are shared across metabolic tissues

Article Open access 14 May 2022

Genetic fine mapping and genomic annotation defines causal mechanisms at type 2 diabetes susceptibility loci

Article 09 November 2015

Background

Genome-wide association study (GWAS) offers unbiased ways to examine association of more than a million single nucleotide polymorphisms (SNPs) with disease [1]. Several GWAS have indentified novel genomic regions influencing risk for type 2 diabetes mellitus (T2DM) [2–6]. However, the challenge remains to prioritize SNPs from GWAS for further replication [1]; even if one would try to replicate only the top 1% with strongest genetic evidence 5,000-10,000 SNPs need to be genotyped in replication studies. Most available methods for prioritizing genes for follow-up from GWAS are based upon bibliometric analyses such as Gene Relationships Among Implicated Loci (GRAIL) [7]. In addition, bioinformatic tool such as TEAM (a tool for the integration of expression and linkage and association maps) is designed to integrate linkage, association and expression data, together with functional annotations however currently it is not implemented to handle GWAS data [8].

Genetic variation often influences gene expression by different mechanisms such as altering rate of transcription or splicing or transcript stability [9]. In this study we first have used published gene expression profiling data in key tissues (i.e. pancreas, adipose tissue, liver and skeletal muscle) from humans and animal models of T2DM to identify genes in GWAS which would deserve follow-up in replication studies. Secondly to provide further validation of our approach, we reversed the approach and tested 21 genes (ADAMTS9, CDKAL1, CDKN2B, FTO, GCK, GCKR, HHEX, HNF1B, IGF2BP2, IRS1, JAZF1, KCNJ11, KCNQ1, MTNR1B, NOTCH2, PPARG, SLC30A8, TCF7L2, THADA, TSPAN8 and WFS1) located in vicinity of 26 known SNPs associated with T2DM and related traits [2, 5, 6, 10–13] for their expression in the same data sets.

Methods

Results from two separate GWAS are reported here.

Diabetes Genetics Initiative (DGI)

The DGI included 1,464 T2DM patients and 1,467 normoglycemic controls from Sweden and Finland [2]. These samples were genotyped on the Affymetrix GeneChip^® Human Mapping 500K Array Set which contains ~500,000 SNPs for interrogation. 386,731 autosomal SNPs were included for further analyses which have genotype call frequency > 0.95, the Hardy-Weinberg equilibrium (HWE) P > 10^-6 in controls and a minor allele frequency (MAF) > 0.01 in both populations.

Wellcome Trust Case Control Consortium (WTCCC)

The WTCCC included 1,924 T2DM patients and 2,938 population controls from UK (6). These samples were also genotyped on the Affymetrix GeneChip^® Human Mapping 500K Array Set which contains ~500,000 SNPs for interrogation. 393,453 autosomal SNPs were included for further analyses which have genotype call frequency > 0.95, the HWE P > 10^-4 in the total sample and a MAF > 0.01.

Meta-analysis of T2DM and related traits GWAS

Recently large scale meta-analysis of T2DM and related traits GWAS comprised tens of thousands of individuals and investigated ~2.2 million SNPs (directly genotyped and imputed), followed by well-powered replication studies have robustly identified a total of 26 SNPs (rs4607103, rs12779790, rs7754840, rs10811661, rs2191349, rs9939609, rs560887, rs4607517, rs780094, rs1111875, rs4430796, rs757210, rs1470579, rs2943641, rs7578326, rs864745, rs5219, rs2237892, rs10830963, rs10923931, rs1801282, rs13266634, rs7903146, rs7578597, rs7961581 and rs10010131) located in vicinity of 21 genes (ADAMTS9, CDKAL1, CDKN2B, FTO, GCK, GCKR, HHEX, HNF1B, IGF2BP2, IRS1, JAZF1, KCNJ11, KCNQ1, MTNR1B, NOTCH2, PPARG, SLC30A8, TCF7L2, THADA, TSPAN8 and WFS1) been associated with T2DM and related traits in several population [2, 5, 6, 10–13].

Gene expression profiling data

Results from five gene expression profiling studies from four different tissues (i.e., pancreas, adipose tissue, liver and skeletal muscle) in humans and animal models have been used for the analysis of the putative affect of a SNP on gene expression.

(a) From pancreas

(1) Pancreatic islets from healthy controls and T2DM patients

We included gene expression profiling data of human islets from cadaver islet donors with and without T2DM [14]. We downloaded the raw gene expression data from the Diabetes Genome Anatomy Project (DGAP) database, accession number 103 http://www.diabetesgenome.org/chipperdb/expt.cgi?id=103.

(2) Pancreas from healthy and diabetic rats

We also evaluated the effect of streptozotocin (STZ) induced diabetes on expression of genes of interest in pancreas using previously published microarray data [15]. We downloaded the raw gene expression data from diabetic and healthy untreated rats from the National Center for Biotechnology Information's (NCBI) Gene Expression Omnibus (GEO) database, accession number GSE2470 http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE2470.

(b) From skeletal muscle

(3) Skeletal muscle from healthy controls and T2DM patients

We analyzed the expression of individual genes using our previously published microarray data from human skeletal muscle of healthy controls and T2DM patients [16]. We downloaded the raw gene expression data from the DGAP database, accession number 54 http://www.diabetesgenome.org/chipperdb/expt.cgi?id=54

(4) Skeletal muscle from healthy and diabetic mice

We evaluated the effect of STZ- induced diabetes on expression of genes of interest in skeletal muscle from previously published microarray data [17]. We downloaded the raw gene expression data from diabetic and healthy mice from the DGAP database, accession number 104 http://www.diabetesgenome.org/chipperdb/expt.cgi?id=104

(c) From adipose tissue, liver and skeletal muscle

(5) From Zucker diabetic fatty (ZDF) and Zucker lean control (ZLC) rats

Finally, we also examined the expression of genes of interest in insulin-sensitive tissues like adipose tissue, liver and skeletal muscle of ZDF rats at age of 6 weeks (pre-diabetic) and 12 weeks (diabetic) and compared with age- and sex-matched ZLC rats [18]. We downloaded the normalized gene expression data from the NCBI's GEO database, accession number GSE1080 http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE1080

All studies were conducted according to the principles of the Helsinki Declaration and approved by respective local ethics committees [2, 5, 6, 10–18].

Statistical Analysis

SNPs and gene selection

We used the NCBI's single nucleotide polymorphism database (dbSNP) build 130 to identify genes located in the vicinity of selected SNPs. Homologues of the genes for mouse and rat were identified using the NCBI's HomoloGene release 64. We included only those genes that were evolutionarily conserved in three different species namely human, mouse and rat.

Analysis of microarray data

We used ENTREZ custom chip definition files to regroup the individual probes into consistent probesets and remap them to the correct sets of genes for different Affymetrix arrays [19–21]. The method used for calculating gene expression from Affymetrix array data can have a major impact on the results [22–24]. Hence, we used four different procedures for normalization and summarization which combines the multiple probe intensities for each gene to produce an expression value: (1) by MAS5.0 algorithms to adjust for background noise level using estimates of the distribution of probe intensities, scaling based normalization and summarization based on a Tukey's biweight [25], (2) by the GC-content Robust Multi-array Average (GC-RMA) method [26] with additional background adjustment using sequence information to estimate probe affinity for nonspecific binding, quantile based normalization and summarization based on a multi-array model fit using median polish algorithm, (3) by Probe Logarithmic Intensity Error (PLIER) method (Affymetrix) utilizing both perfect match and mismatch signaling with quantile based normalization and (4) by Robust Multi-array Average (RMA) method [27, 28] which implements model-based background adjustment, quantile based normalization and summarization based on a multi-array model fit using median polish algorithm.

For interclass, unpaired comparisons, we used a two-tailed Student's t-test with equal variance to identify differences in expression (log (base 2) transformed) of individual gene from different gene expression profiling studies (for each of four normalization methods namely MAS5.0, GC-RMA, PLIER and RMA discretely). Due to hypothesis generating nature of this study, we considered only those genes that were significantly altered in diabetes or associated traits with a P < 0.05 in at least one normalization method. For the ZDF and ZLC rat studies, we used fold change values which were calculated by dividing the median of normalized signal channel intensity (Cy5) by the median of normalized control channel intensity (Cy3) to assess the difference between ZDF vs. ZLC. The genes with expression ratios (i.e., ratio of normalized mean signal intensities of ZDF rat gene to that of ZLC) lesser than 0.67 or greater than 1.50 were considered differentially expressed.

Results

To identify T2DM susceptibility loci, we compiled results from two previously reported GWAS which used the Affymetrix GeneChip^® Human Mapping 500K Array Set containing ~500,000 SNPs for interrogation [2, 6]. To provide a framework for the analytical approach, we have used different cut-off of P-values from both GWAS (Table 1). For instance, we selected 1,170 directly genotyped SNPs associated with T2DM with P < 0.05 in both GWAS (Table 1) and 243 genes were located in the vicinity of these SNPs (Table 2). To determine whether any of these genes exhibit altered expression in diabetes or associated traits in humans or rodent organisms, we utilized published gene expression profiling data from pancreas, adipose tissue, liver and skeletal muscle (Methods). Out of 243 genes, we identified 115 genes differentially expressed between diabetic and healthy tissues in these studies (Additional file 1: Supplemental Table S1, Tables 3 and 4 and Figure 1). Out of 115 genes, we identified five genes (namely IGF2BP2, KCNJ11, NOTCH2, TCF7L2 and TSPAN8) for which SNPs located in their vicinity have shown association with T2DM in different populations [2, 5, 6]. Moreover, the results for SNPs selected based on different GWAS P-values thresholds are shown in Table 1 to Table 4.

Table 1 Numbers of SNPs associated with T2DM with different cut-off of P-values in DGI and WTCCC GWAS

Full size table

Table 2 Numbers of genes located in the vicinity of the SNPs (from Table 1) which are associated with T2DM with different cut-off of P-values in DGI and WTCCC GWAS

Full size table

Table 3 Numbers of genes for which expression levels in pancreas, skeletal muscle, adipose tissue or liver were altered in diabetes as compared to controls

Full size table

Table 4 Numbers of genes (out of the genes from Table 3) for which SNPs located in their vicinity have shown association with T2DM in different populations [2, 5, 6]

Full size table

While in the first part we used the GWAS studies to confirm the findings from the expression arrays, in the second part we asked the question whether any of the 26 SNPs associated with T2DM or related traits identified through meta-analysis of GWAS would show differences in expression between diabetic and healthy tissues in the same data sets (Methods). Intriguingly, 12 (57%) (HHEX, HNF1B, IGF2BP2, IRS1, KCNJ11, KCNQ1, NOTCH2, PPARG, TCF7L2, THADA, TSPAN8 and WFS1) out of 21 genes located in vicinity of these SNPs were differentially expressed in tissues from T2DM individuals/animal models as compared to healthy tissues (Additional file 1: Supplemental Table S2 and Figure 2). Out of these 12 genes, eight (HHEX, HNF1B, KCNQ1, NOTCH2, TCF7L2, THADA, TSPAN8 and WFS1) showed differential expression in pancreatic islets, five (HNF1B, IRS1, KCNJ11, NOTCH2 and WFS1) showed differential expression in skeletal muscle, two (IGF2BP2 and PPARG) showed differential expression in adipose tissue and only one (PPARG) showed differential expression in liver (Additional file 1: Supplemental Table S2). This finding further supports the notion that defects in different tissues contribute to the pathogenesis of T2DM.

Discussion

The main objective of this study was to test a novel approach for prioritizing of SNPs from GWAS by combining the results from these studies with publicly available genome-wide expression profiling data from key tissues (i.e. pancreas, adipose tissue, liver and skeletal muscle) of T2DM. We identified five genes namely IGF2BP2, KCNJ11, NOTCH2, TCF7L2 and TSPAN8 for which SNPs located in their vicinity have been consistently associated with T2DM in several populations [2, 5, 6]. More intriguingly, 12 (57%) (HHEX, HNF1B, IGF2BP2, IRS1, KCNJ11, KCNQ1, NOTCH2, PPARG, TCF7L2, THADA, TSPAN8 and WFS1) out of 21 genes located in vicinity of known 26 T2DM or related traits associated SNPs showed different expression in tissues of individuals with or without T2DM or animal models thereof.

Prioritizing SNPs from GWAS for further replication in other populations is a challenging task due to the fact that a large numbers of comparisons are made in parallel. Moreover, effect sizes of individual SNPs are usually very small and would often not reach significance after correction for multiple testing. To address this problem several strategies were applied including meta-analysis of GWAS and bibliometric-bioinformatic approaches like GRAIL [7]. A meta-analysis of GWAS from different populations identified six novel loci associated with T2DM [5]. However, even though top signals are selected based upon the strength of association large scale replication studies are still needed.

GRAIL is a web-tool to examine relationships between genes in different disease associated loci. It is based on finding similarities in the published scientific text among the associated genes. Using GRAIL for the 1,170 SNPs associated with T2DM (Figure 1) we could identify three genes subsequently being associated with T2DM, namely JAZF1, KCNJ11 and TCF7L2 [4]. In comparison, five genes (IGF2BP2, KCNJ11, NOTCH2, TCF7L2 and TSPAN8) were identified using our current approach. Moreover, our approach can be very useful for designing tissue specific functional studies. Also, a more general approach such as GeneMiner (which is a meta-analysis approach that integrates data of heterogeneous origin e.g. DNA microarrays and complementing qualitative data covering several human and mouse tissues related to T2DM) has identified several functional T2DM candidate genes [29]. However, we were able to identify only two (HNF1B and IRS1) out of the 21 genes which are associated with T2DM and related traits [2, 5, 6, 10–13] using GeneMiner in contrast to 12 (57%) using our approach (Additional file 1: Supplemental Table S2).

Importantly, our findings demonstrate that more than 50% of the genes in which genetic variants have been known to increase risk of T2DM showed altered expression in different tissues. The perturbation was highest, as expected, in pancreatic islets, where eight genes i.e. HHEX, HNF1B, KCNQ1, NOTCH2, TCF7L2, THADA, TSPAN8 and WFS1, showed aberrant expression. All of these genetic loci, apart from the less studied TSPAN8, have been implicated in pathways primarily involved in insulin secretion, cell proliferation and regeneration [30]. Of note, genetic variants in the THADA and WFS1 have recently been shown to impair glucagon-like peptide-1-stimulated insulin secretion [31, 32]. Furthermore, many of these loci have also shown effects on insulin sensitivity [33]. In line with this, five genes, i.e. HNF1B, IRS1, KCNJ11, NOTCH2 and WFS1, were also differentially expressed in skeletal muscle. Of all T2DM genes, IRS1 seems to have a clear effect on insulin sensitivity; the T2DM-associated allele was associated with decreased IRS1 protein expression as well as reduced phosphatidylinositol-3-kinase-activity and insulin-stimulated glucose uptake in humans [12].

Defects in fat metabolism and excess fat deposition in the abdominal region play an important role in the pathogenesis of T2DM and obesity [34]. Our findings that expression of the IGFBP2 and PPARG, genes was altered in adipose tissue further supports this notion. PPARG is a nuclear receptor that regulates transcription of genes involved in adipogenesis. A common Pro12Ala polymorphism has been associated with decreased transcriptional activity and increased insulin sensitivity and thereby provides protection against T2DM [35]. Additionally, PPARG also showed differential expression in the liver, where it regulates a number of genes involved in both glucose and lipid metabolism. These results add further support to a role for variation in these genes in the pathogenesis of T2DM.

There are several issues to consider in the interpretation of the results. In the current study, we have only examined cis-regulatory variants and their possible contribution to phenotypic variations leading to T2DM in contrast to trans-regulatory variants. Moreover, different SNP alleles in the vicinity of or within a gene may not alter mRNA level or stability and hence further functional studies are needed to evaluate the biological mechanisms associated with variants in model organisms as well as tissue samples. Future studies integrating genome-wide expression profiles from key tissues of T2DM with GWAS mapping are required for identification of SNPs that are associated with variation in gene expression contributing to T2DM [36].

Conclusions

Taken together these data show the utility of using gene expression profiling studies to prioritize genes from GWAS for further replication efforts.

References

Hirschhorn JN, Daly MJ: Genome-wide association studies for common diseases and complex traits. Nat Rev Genet. 2005, 6: 95-108. 10.1038/nrg1521.
Article CAS PubMed Google Scholar
Saxena R, Voight BF, Lyssenko V, Burtt NP, de Bakker PI, Chen H, Roix JJ, Kathiresan S, Hirschhorn JN, Daly MJ, et al: Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels. Science. 2007, 316: 1331-1336. 10.1126/science.1142358.
Article CAS PubMed Google Scholar
Scott LJ, Mohlke KL, Bonnycastle LL, Willer CJ, Li Y, Duren WL, Erdos MR, Stringham HM, Chines PS, Jackson AU, et al: A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants. Science. 2007, 316: 1341-1345. 10.1126/science.1142382.
Article CAS PubMed PubMed Central Google Scholar
Sladek R, Rocheleau G, Rung J, Dina C, Shen L, Serre D, Boutin P, Vincent D, Belisle A, Hadjadj S, et al: A genome-wide association study identifies novel risk loci for type 2 diabetes. Nature. 2007, 445: 881-885. 10.1038/nature05616.
Article CAS PubMed Google Scholar
Zeggini E, Scott LJ, Saxena R, Voight BF, Marchini JL, Hu T, de Bakker PI, Abecasis GR, Almgren P, Andersen G, et al: Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes. Nat Genet. 2008, 40: 638-645. 10.1038/ng.120.
Article CAS PubMed PubMed Central Google Scholar
Zeggini E, Weedon MN, Lindgren CM, Frayling TM, Elliott KS, Lango H, Timpson NJ, Perry JR, Rayner NW, Freathy RM, et al: Replication of genome-wide association signals in UK samples reveals risk loci for type 2 diabetes. Science. 2007, 316: 1336-1341. 10.1126/science.1142364.
Article CAS PubMed PubMed Central Google Scholar
Raychaudhuri S, Plenge RM, Rossin EJ, Ng AC, Purcell SM, Sklar P, Scolnick EM, Xavier RJ, Altshuler D, Daly MJ: Identifying relationships among genomic disease regions: predicting genes at pathogenic SNP associations and rare deletions. PLoS Genet. 2009, 5: e1000534-10.1371/journal.pgen.1000534.
Article PubMed PubMed Central Google Scholar
Franke L, van Bakel H, Diosdado B, van Belzen M, Wapenaar M, Wijmenga C: TEAM: a tool for the integration of expression, and linkage and association maps. Eur J Hum Genet. 2004, 12: 633-638. 10.1038/sj.ejhg.5201215.
Article CAS PubMed Google Scholar
Schadt EE, Monks SA, Drake TA, Lusis AJ, Che N, Colinayo V, Ruff TG, Milligan SB, Lamb JR, Cavet G, et al: Genetics of gene expression surveyed in maize, mouse and man. Nature. 2003, 422: 297-302. 10.1038/nature01434.
Article CAS PubMed Google Scholar
Lyssenko V, Nagorny CL, Erdos MR, Wierup N, Jonsson A, Spegel P, Bugliani M, Saxena R, Fex M, Pulizzi N, et al: Common variant in MTNR1B associated with increased risk of type 2 diabetes and impaired early insulin secretion. Nat Genet. 2009, 41: 82-88. 10.1038/ng.288.
Article CAS PubMed Google Scholar
Prokopenko I, Langenberg C, Florez JC, Saxena R, Soranzo N, Thorleifsson G, Loos RJ, Manning AK, Jackson AU, Aulchenko Y, et al: Variants in MTNR1B influence fasting glucose levels. Nat Genet. 2009, 41: 77-81. 10.1038/ng.290.
Article CAS PubMed Google Scholar
Rung J, Cauchi S, Albrechtsen A, Shen L, Rocheleau G, Cavalcanti-Proenca C, Bacot F, Balkau B, Belisle A, Borch-Johnsen K, et al: Genetic variant near IRS1 is associated with type 2 diabetes, insulin resistance and hyperinsulinemia. Nat Genet. 2009, 41: 1110-1115. 10.1038/ng.443.
Article CAS PubMed Google Scholar
Yasuda K, Miyake K, Horikawa Y, Hara K, Osawa H, Furuta H, Hirota Y, Mori H, Jonsson A, Sato Y, et al: Variants in KCNQ1 are associated with susceptibility to type 2 diabetes mellitus. Nat Genet. 2008, 40: 1092-1097. 10.1038/ng.207.
Article CAS PubMed Google Scholar
Gunton JE, Kulkarni RN, Yim S, Okada T, Hawthorne WJ, Tseng YH, Roberson RS, Ricordi C, O'Connell PJ, Gonzalez FJ, Kahn CR: Loss of ARNT/HIF1beta mediates altered gene expression and pancreatic-islet dysfunction in human type 2 diabetes. Cell. 2005, 122: 337-349. 10.1016/j.cell.2005.05.027.
Article CAS PubMed Google Scholar
Altirriba J, Barbera A, Del Zotto H, Nadal B, Piquer S, Sanchez-Pla A, Gagliardino JJ, Gomis R: Molecular mechanisms of tungstate-induced pancreatic plasticity: a transcriptomics approach. BMC Genomics. 2009, 10: 406-10.1186/1471-2164-10-406.
Article PubMed PubMed Central Google Scholar
Mootha VK, Lindgren CM, Eriksson KF, Subramanian A, Sihag S, Lehar J, Puigserver P, Carlsson E, Ridderstrale M, Laurila E, et al: PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003, 34: 267-273. 10.1038/ng1180.
Article CAS PubMed Google Scholar
Yechoor VK, Patti ME, Ueki K, Laustsen PG, Saccone R, Rauniyar R, Kahn CR: Distinct pathways of insulin-regulated versus diabetes-regulated gene expression: an in vivo analysis in MIRKO mice. Proc Natl Acad Sci USA. 2004, 101: 16525-16530. 10.1073/pnas.0407574101.
Article CAS PubMed PubMed Central Google Scholar
Suh YH, Kim Y, Bang JH, Choi KS, Lee JW, Kim WH, Oh TJ, An S, Jung MH: Analysis of gene expression profiles in insulin-sensitive tissues from pre-diabetic and diabetic Zucker diabetic fatty rats. J Mol Endocrinol. 2005, 34: 299-315. 10.1677/jme.1.01679.
Article CAS PubMed Google Scholar
Dai M, Wang P, Boyd AD, Kostov G, Athey B, Jones EG, Bunney WE, Myers RM, Speed TP, Akil H, et al: Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data. Nucleic Acids Res. 2005, 33: e175-10.1093/nar/gni179.
Article PubMed PubMed Central Google Scholar
Lu X, Zhang X: The effect of GeneChip gene definitions on the microarray study of cancers. Bioessays. 2006, 28: 739-746. 10.1002/bies.20433.
Article CAS PubMed Google Scholar
Sandberg R, Larsson O: Improved precision and accuracy for microarrays using updated probe set definitions. BMC Bioinformatics. 2007, 8: 48-10.1186/1471-2105-8-48.
Article PubMed PubMed Central Google Scholar
Kibriya MG, Jasmine F, Argos M, Verret WJ, Rakibuz-Zaman M, Ahmed A, Parvez F, Ahsan H: Changes in gene expression profiles in response to selenium supplementation among individuals with arsenic-induced pre-malignant skin lesions. Toxicol Lett. 2007, 169: 162-176. 10.1016/j.toxlet.2007.01.006.
Article CAS PubMed PubMed Central Google Scholar
Millenaar FF, Okyere J, May ST, van Zanten M, Voesenek LA, Peeters AJ: How to decide? Different methods of calculating gene expression from short oligonucleotide array data will give different results. BMC Bioinformatics. 2006, 7: 137-10.1186/1471-2105-7-137.
Article PubMed PubMed Central Google Scholar
Seo J, Hoffman EP: Probe set algorithms: is there a rational best bet?. BMC Bioinformatics. 2006, 7: 395-10.1186/1471-2105-7-395.
Article PubMed PubMed Central Google Scholar
Gautier L, Cope L, Bolstad BM, Irizarry RA: affy--analysis of Affymetrix GeneChip data at the probe level. Bioinformatics. 2004, 20: 307-315. 10.1093/bioinformatics/btg405.
Article CAS PubMed Google Scholar
Wu Z, Irizarry RA: Stochastic models inspired by hybridization theory for short oligonucleotide arrays. J Comput Biol. 2005, 12: 882-893. 10.1089/cmb.2005.12.882.
Article CAS PubMed Google Scholar
Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, Speed TP: Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 2003, 31: e15-10.1093/nar/gng015.
Article PubMed PubMed Central Google Scholar
Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003, 4: 249-264. 10.1093/biostatistics/4.2.249.
Article PubMed Google Scholar
Rasche A, Al-Hasani H, Herwig R: Meta-analysis approach identifies candidate genes and associated molecular networks for type-2 diabetes mellitus. BMC Genomics. 2008, 9: 310-10.1186/1471-2164-9-310.
Article PubMed PubMed Central Google Scholar
Prokopenko I, McCarthy MI, Lindgren CM: Type 2 diabetes: new genes, new understanding. Trends Genet. 2008, 24: 613-621. 10.1016/j.tig.2008.09.004.
Article CAS PubMed Google Scholar
Schafer SA, Mussig K, Staiger H, Machicao F, Stefan N, Gallwitz B, Haring HU, Fritsche A: A common genetic variant in WFS1 determines impaired glucagon-like peptide-1-induced insulin secretion. Diabetologia. 2009, 52: 1075-1082. 10.1007/s00125-009-1344-5.
Article CAS PubMed Google Scholar
Simonis-Bik AM, Nijpels G, van Haeften TW, Houwing-Duistermaat JJ, Boomsma DI, Reiling E, van Hove EC, Diamant M, Kramer MH, Heine RJ, et al: Gene variants in the novel type 2 diabetes loci CDC123/CAMK1D, THADA, ADAMTS9, BCL11A and MTNR1B affect different aspects of pancreatic beta cell function. Diabetes. 2010, 59: 293-301. 10.2337/db09-1048.
Article CAS PubMed Google Scholar
Stancakova A, Kuulasmaa T, Paananen J, Jackson AU, Bonnycastle LL, Collins FS, Boehnke M, Kuusisto J, Laakso M: Association of 18 confirmed susceptibility loci for type 2 diabetes with indices of insulin release, proinsulin conversion, and insulin sensitivity in 5,327 nondiabetic Finnish men. Diabetes. 2009, 58: 2129-2136. 10.2337/db09-0117.
Article CAS PubMed PubMed Central Google Scholar
Despres JP, Lemieux I, Tchernof A, Couillard C, Pascot A, Lemieux S: [Fat distribution and metabolism]. Diabetes Metab. 2001, 27: 209-214.
CAS PubMed Google Scholar
Deeb SS, Fajas L, Nemoto M, Pihlajamaki J, Mykkanen L, Kuusisto J, Laakso M, Fujimoto W, Auwerx J: A Pro12Ala substitution in PPARgamma2 associated with decreased receptor activity, lower body mass index and improved insulin sensitivity. Nat Genet. 1998, 20: 284-287. 10.1038/3099.
Article CAS PubMed Google Scholar
Emilsson V, Thorleifsson G, Zhang B, Leonardson AS, Zink F, Zhu J, Carlson S, Helgason A, Walters GB, Gunnarsdottir S, et al: Genetics of gene expression and its effect on disease. Nature. 2008, 452: 423-428. 10.1038/nature06758.
Article CAS PubMed Google Scholar

Pre-publication history

The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1755-8794/2/72/prepub

Download references

Acknowledgements

We are grateful to Laufey Amundadottir for valuable comments.

This work was supported by grants from the Swedish Knowledge Foundation through the Industrial PhD program in Medical Bioinformatics at the Center for Medical Innovations (CMI) at the Karolinska Institute (HP), The Diabetes Programme at Lund University (HP), Diabetesföreningen in Malmö (HP), The Medical Faculty at Lund University (HP), a Linne grant from the Swedish Research Council (LG), an EC grant (EXGENESIS) (LG) and a grant from the Knut and Alice Wallenberg Foundation (LG).

Author information

Authors and Affiliations

Department of Clinical Sciences, Diabetes and Endocrinology, Lund University, University Hospital Malmö, Malmö, Sweden
Hemang Parikh, Valeriya Lyssenko & Leif C Groop
Laboratory of Translational Genomics, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
Hemang Parikh
Department of Medicine, Helsinki University, Helsinki, Finland
Leif C Groop

Authors

Hemang Parikh
View author publications
You can also search for this author in PubMed Google Scholar
Valeriya Lyssenko
View author publications
You can also search for this author in PubMed Google Scholar
Leif C Groop
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hemang Parikh.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

HP conceived the project, designed the study, analyzed the microarray data, performed statistical analysis and drafted the manuscript. VL performed statistical analysis and drafted the manuscript. LCG designed the project and supervised all phases of the project including writing of the manuscript.

All authors have read and approved the final manuscript.

Electronic supplementary material

12920_2009_541_MOESM1_ESM.DOCX

Additional file 1: Supplemental tables. Supplemental Table S1: Genes for which expression levels in pancreas, skeletal muscle, adipose tissue or liver were altered in diabetes as compared to controls. The table lists differentially expressed genes between diabetic and healthy tissues in different gene expression profiling studies. Supplemental Table S2: T2DM and related traits associated genes for which expression levels in pancreas, skeletal muscle, adipose tissue or liver were altered in diabetes as compared to controls. The table lists differentially expressed genes between diabetic and healthy tissues in different gene expression profiling studies. (DOCX 37 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Parikh, H., Lyssenko, V. & Groop, L.C. Prioritizing genes for follow-up from genome wide association studies using information on gene expression in tissues relevant for type 2 diabetes mellitus. BMC Med Genomics 2, 72 (2009). https://doi.org/10.1186/1755-8794-2-72

Download citation

Received: 29 July 2009
Accepted: 31 December 2009
Published: 31 December 2009
DOI: https://doi.org/10.1186/1755-8794-2-72

Prioritizing genes for follow-up from genome wide association studies using information on gene expression in tissues relevant for type 2 diabetes mellitus

Abstract

Background

Methods

Results

Conclusions

Similar content being viewed by others

Identification of potential genetic causal variants for obesity-related traits using statistical fine mapping

Diabetes risk loci-associated pathways are shared across metabolic tissues

Genetic fine mapping and genomic annotation defines causal mechanisms at type 2 diabetes susceptibility loci

Background

Methods

Diabetes Genetics Initiative (DGI)

Wellcome Trust Case Control Consortium (WTCCC)

Meta-analysis of T2DM and related traits GWAS

Gene expression profiling data

(a) From pancreas

(1) Pancreatic islets from healthy controls and T2DM patients

(2) Pancreas from healthy and diabetic rats

(b) From skeletal muscle

(3) Skeletal muscle from healthy controls and T2DM patients

(4) Skeletal muscle from healthy and diabetic mice

(c) From adipose tissue, liver and skeletal muscle

(5) From Zucker diabetic fatty (ZDF) and Zucker lean control (ZLC) rats

Statistical Analysis

SNPs and gene selection

Analysis of microarray data

Results

Discussion

Conclusions

References

Pre-publication history

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors' contributions

Electronic supplementary material

12920_2009_541_MOESM1_ESM.DOCX

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation