Abstract
Purpose
Metformin is widely used to treat type 2 diabetes mellitus (T2DM) individuals. Clinically, inter-individual variability of metformin response is of significant concern and is under interrogation. In this study, a targeted exome and whole transcriptome analysis were performed to identify predictive biomarkers of metformin response in drug-naïve T2DM individuals.
Methods
The study followed a prospective study design. Drug-naïve T2DM individuals (n = 192) and controls (n = 223) were enrolled. T2DM individuals were administered with metformin monotherapy and defined as responders and non-responders based on their glycated haemoglobin change over three months. 146 T2DM individuals were used for the final analysis and remaining samples were lost during the follow-up. Target exome sequencing and RNA-seq was performed to analyze genetic and transcriptome profile. The selected SNPs were validated by genotyping and allele specific gene expression using the TaqMan assay. The gene prioritization, enrichment analysis, drug-gene interactions, disease-gene association, and correlation analysis were performed using various tools and databases.
Results
rs1050152 and rs272893 in SLC22A4 were associated with improved response to metformin. The copy number loss was observed in PPARGC1A in the non-responders. The expression analysis highlighted potential differentially expressed targets for predicting metformin response (n = 35) and T2DM (n = 14). The expression of GDF15, TWISTNB, and RPL36A genes showed a maximum correlation with the change in HbA1c levels. The disease-gene association analysis highlighted MAGI2 rs113805659 to be linked with T2DM.
Conclusion
The results provide evidence for the genetic variations, perturbed transcriptome, allele-specific gene expression, and pathways associated with metformin drug response in T2DM.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Type 2 diabetes mellitus (T2DM) is growing at a distressing rate across the globe, and with-it financial burden on health care is also increasing. T2DM is a complex multifactorial disease with multiple driving factors, including obesity and lifestyle. Lifestyle interventions alone are not always enough for glycaemic control. Metformin is unanimously prescribed by health professionals across the globe as soon as T2DM is diagnosed [1,2,3]. Metformin is comparatively effective, cheaper, and safe than other anti-diabetic drugs [4, 5]. The initial reports suggest that most individuals tolerate metformin, with ~ 30% showing mild gastrointestinal side effects and ~ 5% of individuals showing severe intolerance [6, 7]. Besides T2DM, metformin is also being used for treatment of polycystic ovarian syndrome (PCOS) [8] and is anticipated to be used in the treatment or prevention of cancer in near future [9].
The proposed mechanism of metformin action includes the mitochondrial complex 1 inhibition, AMPK activation, reduction of cyclic AMP levels, and gut microbe interaction [10,11,12,13]. These mechanisms explain the partial benefits of metformin drug and the exact molecular mechanism remains unclear. At present, the guidelines for the treatment of T2DM employ a one-size-fits-all approach without considering inter-individual variation in drug response [14]. Recent literature reports ~ 50% of T2DM fail to reach glycaemic goals after metformin therapy [15]. There are reports of inter-individual variability in metformin response within T2DM that are suggested to be influenced by allelic variants with modest effects [15, 16]. There are several candidate studies evaluating the role of organic cation transporter (OCT) genes and multidrug and toxin extrusion (MATE) genes [17,18,19,20,21]. Multiple variations in ATM, CPA6, PRPF31, and STAT genes have also been identified in association with metformin response using genome-wide association studies (GWAS) [22,23,24]. However, these variations account for a small proportion of variable responses to metformin therapy. Therefore, there is a need to promote the investigations through various omics strategies to develop biomarkers with high therapeutic efficacy [25].
Transcriptome studies have shown that metformin is involved in differential expression of IgA production in the intestinal immune network, cytokine receptor interaction pathways, and genes involved in insulin production and cholesterol homeostasis [26]. The quantitative in vivo evaluation of metformin response revealed both AMPK-dependent and independent networks in the liver [27]. Metabolite analysis has shown significant differences in metabolites such as citric acid, myoinositol, and hippuric acid levels in non-responder and responder groups for metformin drug [28]. A combined metabolome and transcriptome analysis was performed by Udhane et al. [29]. They showed that metformin affects glucose, androgen, energy metabolism, immune system, cell cycle, and sex steroid biosynthesis [29]. These evaluations focused on studying the effects of metformin on various genes, pathways, and networks.
We conducted a study on the drug naive T2DM individuals in view of highlighting the genetic and transcriptomic features that may influence the metformin treatment. In this study, next-generation sequencing (NGS) [30] was utilized to analyze the genetic variations and transcriptomic alterations, to elucidate the allele-specific expression and its implications on clinical response to metformin in drug-naïve T2DM individuals.
Methods
Ethics statement
The protocols and work conducted in this study were in accordance with the declaration of Helsinki. The ethical approval for conducting the study was obtained from Institutional Ethics Committee, Kasturba Hospital, Manipal, India file number: IEC 501/2016. The study was also registered under the Clinical Trials Registry-India (CTRI) (registration number CTRI/2018/01/011508). Written informed consent was obtained from all participants after explaining the study procedure. Further, the work plan of the current study is portrayed in Fig. 1.
Study subjects
The participants were enrolled from the outpatient department of Dr. TMA Pai Hospital, Udupi, and Kasturba Hospital, Manipal, India. T2DM was defined based on WHO criteria. The details are presented in the supplementary information. A total of 192 T2DM and 223 healthy participants were enrolled. The glycaemic index of the T2DM participants was recorded at the time of enrolment and post metformin monotherapy of 3 months. The response to metformin was defined as the decrease in glycated haemoglobin (HbA1c) level > 1% or > 20 mg/dl decrease in fasting blood glucose level from baseline after three months of therapy. The blood samples (5 ml) were used for isolation of DNA and RNA based on the standard protocols [31].
Targeted exome sequencing
The targeted exome sequencing was performed on Ion torrent PGM platform (ThermoFisher Scientific, US) for the genes selected from the pharmacokinetic and pharmacodynamic pathways of metformin in PharmGKB database (Supplementary Table S1). The variants identified from each sample were further grouped based on responder, non-responder and healthy control status, and compared using Maftools with default settings [32]. The copy number analysis was performed using CNVKit (v0.9.2) with default settings [33]. The detailed method for targeted exome sequencing and analysis is presented in supplementary information. We hypothesized that changes in genetic profile may predict the metformin drug response status of a T2DM individual.
Whole transcriptome sequencing
Whole transcriptome sequencing was performed on 30 RNA samples using poly(A) enrichment of the mRNA using dynabeads mRNA direct micro kit (Invitrogen, US; Thermo Fisher Scientific, Inc., US). The detailed method for whole transcriptome sequencing and analysis is presented in supplementary information. We hypothesized that variations in the gene expression may predict the metformin drug response status of a T2DM individual.
Functional analysis and clinical correlation
The gene ontology analysis was performed on differentially expressed genes (DGE) using Enrichr (http://amp.pharm.mssm.edu/Enrichr/) [34]. The enrichment of bio-pathways through KEGG pathways was confirmed with Cytoscape plugin ClueGO + Cluepedia app [35, 36]. The DGEs were evaluated using DGIdb database (source: http://dgidb.org/, assessed on 01 May 2020).
To identify the relationship between the gene expression levels and metformin response, we performed a Pearson’s correlation analysis between the change in HbA1c levels after 3 months of therapy and the gene expression profiles in responders and non-responders.
Identification of SNPs in the DGE
The list of DGEs after enrichment analysis was subjected to analysis using DisGeNET database [37]. The genes were investigated for associations with diabetes or other diseases. The gene names were used as identifiers and the variations listed in the Evidences for Variant-Disease Associations column were extracted. The R package i.e., networkD3 was used for the visualization of associations of the variants in the genes associated with different diseases.
SNP genotyping and Sanger sequencing
The high frequency SNPs selected from ASE analysis along with two coding SNPs namely rs1050152 and rs272893 in SLC22A4 gene were subjected to genotyping using TaqMan Assay. The DNA samples were randomly validated using Sanger sequencing. The rs1050152 and rs272893 in SLC22A4 gene were selected based on the association analysis.
Allele specific expression (ASE) analysis
The ASE analysis was performed on the RNA samples identified as heterozygous for selected SNPs. The RNA samples were converted into cDNA using High-Capacity cDNA Reverse Transcription Kit (Applied Biosystems, US). The cDNAs were used to conduct TaqMan assay using custom designed probes from Custom TaqMan Assay Design Tool (Thermo Fisher Scientific, US) for cDNA samples. The ratio of VIC (mutant allele) and FAM (wild allele) was log transformed and evaluated for differences between responders and non-responders.
Statistical analysis
The difference between the quantitative variables of the study groups was assessed using student’s t-test and ANOVA. The association analysis of SNPs with metformin drug response was evaluated using SNPstats at p-value threshold of 0.05. The difference in allele specific expression between responders and non-responders was evaluated using student’s t-test.
Results
Study participants
We enrolled a total of 192 T2DM and 223 healthy participants (Table 1). During the study course, 19 participants in the T2DM group discontinued the follow-up, 15 participants discontinued the drug, and 12 participants switched to alternative medicine or therapies such as ayurvedic medicine, homeopathy, etc. The final analysis was performed on 146 T2DM individuals who completed three months of follow-up. Among the 146 T2DM participants, 82 were considered responders, and 64 were considered non-responders based on our criteria.
Polymorphism spectrum in study subjects
The targeted sequencing of 22 genes was performed in drug naïve T2DM individuals (n = 30) and controls (15). On average, 192,652 reads were generated from each T2DM sample, and 206,153 reads were generated from each control sample (Supplementary Table S2). The average percentage quality score of Q20 was observed for an average of 88.8% ± 1.35 and 94.48% ± 0.22 for T2DM and control samples, respectively. The total variants identified in 45 samples from our bioinformatics analysis were 715 in 22 genes (Supplementary Table S3). The maximum number of variations were identified in the SLC22A1 gene, and the least number of variations were identified in the ABCC8 gene (Supplementary Table S3). The summary of variations identified from our analysis is shown in Fig. 2. The transitions were higher than transversions observed in our data in all subjects.
Association analysis for drug response
To identify the genetic variations associated with drug response, we compared non-responders (n = 17) vs responders (n = 13) using MAFtools pipeline at default settings (Fig. 3). The analysis highlighted top mutated genes between responders and non-responders. A significant association of metformin drug response was observed for variations in the SLC22A4 gene with an odds ratio of 0.1 (0.008–0.73) with a p-value of 0.01 (Supplementary Table S4). From the association analysis conducted using MAFtools, we identified top mutated genes in responders and non-responders. Further using manual search, we identified the presence of rs272893 variant in 11 responder samples and 5 non-responder samples and rs1050152 variant in 8 responder samples and 1 non-responder sample. These two variants, rs272893 and rs1050152, were selected for replication in the study cohort.
Copy number variants in drug response genes
The comparison with the baseline for the copy number variations in responders and non-responders samples revealed a unique pattern of copy number gains and losses (Fig. 4) (Supplementary Table S5). IRS1 was found to have copy number gain in 23.5% of non-responders and 46.15% of responders. IRS2 was found to have copy number gain in 47.05% of non-responders and 30.76% of responders. PPARGC1A was found to have a loss of copy number in 17.6% of non-responders, whereas no copy number loss was observed in the responder samples. STK11 was found to have copy number loss in 17.6% of the non-responders and 23.07% of responders.
Characterization of gene expression in metformin responders, non-responders, and healthy controls
The RNA-Seq data generated from metformin responders (n = 10), non-responders (n = 10) and healthy controls (n = 10) showed 55,765 raw features across 30 samples (Supplementary Fig. 1). After initial filtering and data normalization, we obtained 15,434 features across 30 samples (Supplementary Fig. 1). The comparison between three groups was made in combinations such as non-responders vs. responders, responders vs. healthy controls, and non-responders vs. healthy controls. The results are presented in the form of Volcano plots (Supplementary Fig. 2). Using the criteria i.e., FC > 2 (upregulated) or < -2 (downregulated), FDR < 5% and p-value < 0.05, we found 165 protein-coding genes in non-responders vs responders, 435 protein-coding genes in responders vs healthy controls, and 257 protein-coding genes in non-responders vs healthy controls. Out of these genes, 38 protein-coding genes in non-responders vs. responders, 405 protein-coding genes in responders vs. healthy controls and 237 protein-coding genes in non-responders vs. healthy controls were upregulated, and 127 protein-coding genes in non-responders vs. responders, 30 protein-coding genes in responders vs. healthy controls and 20 protein-coding genes in non-responders vs. healthy controls were downregulated (Supplementary Fig. 3).
Gene prioritization and enrichment analysis
For gene prioritization, we evaluated the overlapping upregulated and downregulated protein-coding genes in non-responders vs. responders, responders vs. healthy controls, and non-responders vs. healthy controls (Supplementary Table S6). We observed 29 protein-coding genes upregulated in non-responders when compared with responders or healthy controls (DR_Set_1), 59 protein-coding genes upregulated in both non-responders and responders when compared to healthy controls (T2DM_Set_1), 11 protein-coding genes downregulated in both non-responders and responders when compared healthy controls (T2DM_Set_2) and 105 protein-coding genes upregulated in responders when compared with healthy controls but downregulated in non-responders when compared with responders (DR_Set_2) (Supplementary Table S6).
The genes in the above-mentioned four groups/lists were considered for enrichment analysis and the top 30 biological processes, molecular function, and cellular components, along with their corresponding genes are presented in Supplementary Figs. 4, 5, 6, 7. The network analysis of the selected enriched genes from DR_Set_1 showed pathways such as Ras signalling, tryptophan metabolism, neuroactive ligand-receptor interaction, Hippo signalling, nicotinate, and nicotinamide metabolism, glycine, serine, and threonine metabolism (Supplementary Fig. 4). The network analysis of the selected enriched genes from T2DM_Set_1 showed pathways such as oxidative phosphorylation, Wnt signalling, MAPK signalling, miRNAs in cancer, PI3K-Akt signalling, TGF-beta signalling, cytokine–cytokine receptor interaction, viral myocarditis, ubiquitin-mediated proteolysis (Supplementary Fig. 5). The network analysis of the selected enriched genes from T2DM_Set_2 showed pathways such as NOD-like receptor signalling, lysosome receptor protein, cytokine–cytokine receptor interaction, neuroactive ligand-receptor interaction (Supplementary Fig. 6). The network analysis of the selected enriched genes from DR_Set_2 showed pathways such as Wnt signalling, hippo signalling, calcium signalling, p53 signalling, thyroid hormone signalling, MAPK signalling, RNA polymerase, RNA transport, protein export, spliceosome, ribosome, purine metabolism, retinol metabolism, lipoic acid metabolism, arginine and proline metabolism, amino sugar and nucleotide sugar metabolism, steroid hormone biosynthesis, cytokine–cytokine receptor interaction, bacterial invasion of epithelial cells, vasopressin- regulated water reabsorption, African trypanosomiasis.
DEGs in metformin drug response
The prediction of metformin drug response from the patient’s gene expression profile required comparing responders, non-responders, and healthy controls. From the comparisons, we observed that 29 genes were significantly upregulated in non-responders compared with responders and healthy controls (DR_Set_1). We also observed 105 genes to be upregulated in responders compared to healthy controls and downregulated in responders when compared with non-responders (DR_Set_2). Thus, genes from gene set 1 were considered for predicting non-responder’s status and gene set 4 for prediction of responder’s status.
Out of 29 genes from DR_Set_1, seven genes were enriched from the gene ontology and KEGG pathway analysis (Supplementary Table S7) (Fig. 5). IDO2, NMNAT3, FRMD6, and DMGDH genes from gene set 1 showed maximum log fold change > 4. These genes were selected after enrichment and were shown to be involved in tryptophan metabolism, nicotinate and nicotinamide metabolism, Hippo signalling, and glycine, serine, and threonine metabolism, respectively.
Out of 105 genes from DR_Set_2, 28 genes were enriched from the gene ontology and KEGG pathway analysis (Supplementary Table S7) (Fig. 5). Among these genes, only the HSD11B1 gene showed the maximum log fold change i.e., > 4. HSD11B1 gene played a role in steroid hormone biosynthesis, chemical carcinogenesis and metabolism of xenobiotics by cytochrome P450.
The genes enriched from DR_Set_1 (n = 7) and DR_Set_2 (n = 28) were evaluated for drug-gene interactions. A total of 8 genes were found to interact with 25 different drugs (Supplementary Table S8). Among these eight genes, the NMNAT3 gene belonged to DR_Set_1, whereas RENBP, LIPT1, GDF15, HBA1, EIF4E, OXTR, and HSD11B1 belonged to DR_Set_2. None of these genes showed interactions with diabetes drugs.
The correlation analysis between metformin response and gene expression profiles of responders and non-responders highlight the direct correlation of CD82 (r = 0.4, p-value = 0.04), DCTN6 (r = 0.63, p-value = 0.002), EIF4E (r = 0.68, p-value = 0.0008), GDF15 (r = 0.79, p-value < 0.0001), HBA2 (r = 0.45, p-value = 0.04), LIPT1 (r = 0.64, p-value = 0.002), MAP3K13 (r = 0.49, p-value = 0.02), PRKAR2A (r = 0.58, p-value = 0.007), RENBP (r = 0.61, p-value = 0.004), RPL36A (r = 0.71, p-value = 0.0004) and TWISTNB (r = 0.73, p-value = 0.0002) with change in HbA1c levels after 3 months of metformin therapy (Supplementary Fig. 8).
DGEs in T2DM
From the comparisons made between responders, non-responders, and healthy controls, we observed two gene sets, i.e., T2DM_Set_1 and T2DM_Set_2, with DGEs when responders or non-responders were compared with healthy controls (Fig. 5B). The genes from T2DM_Set_1, namely, CXXC4, GDF2, MID1, and NDUFA4L2, were found to be upregulated with a log fold change > 4 in both responders and non-responders when compared with controls. These were involved in Wnt signalling, cytokine–cytokine receptor interaction, ubiquitin-mediated proteolysis, and oxidative phosphorylation pathways as predicted by Kegg pathways. The genes from T2DM_Set_2, namely, IDO1, LAMP3, and CXCL10, were downregulated with a log fold change value < − 4 in both responders and non-responders when compared with controls and were involved in tryptophan metabolism, lysosome, and cytokine–cytokine receptor interaction pathways.
SNPs in the DGEs associated with diabetes
40 SNPs were identified in 6 out of 14 DGEs T2DM_Set_1 and T2DM_Set_2 with validated associations with 20 different disease conditions or traits (Supplementary Fig. 9). Among these associations, rs113805659 in the MAGI2 gene was associated with non-insulin-dependent diabetes mellitus in African Americans [38].
Allele specific expression analysis (ASE)
The ASE was observed for 9 variants with high frequency namely, IRS2 rs4773092, ACACB rs2075262, CRTC2 rs11264680, CRTC2 rs10559, MTOR rs1135172, MTOR rs1057079, RPTOR rs1567962, RPTOR rs2289764, and SLC22A4 rs272879 (Fig. 6) (Supplementary Table S9). Interestingly, in non-responder samples heterozygous for SLC22A4 rs272879, the wild allele was only was expressed (the allelic ratio was 0) compared to responder and control samples where both wild and mutant alleles were expressed (the allelic ratio was 0.6 and 0.46, respectively). These 9 variations were selected for primer design using TaqMan assay's custom probes. Out of 9 variations, the custom probes for 5 variants namely ACACB rs2075262, MTOR rs1135172, MTOR rs1057079, RPTOR rs2289764, and SLC22A4 rs272879.
Association analysis
The association analysis for metformin drug response was performed between responders (n = 82) and non-responders (n = 64). The association analysis was performed for five variants selected from ASE analysis and two variants selected from targeted exome data (Table 2). Out of the seven variants, the association of SLC22A4 rs1050152 and SLC22A4 rs272893 was significant for metformin drug response (Table 2). The SLC22A4 rs1050152 showed an odds ratio of 0.6 (0.3–1.002), with p-value of 0.04 whereas SLC22A4 rs272893 showed an odds ratio of 0.47 (0.2–0.8), with p-value of 0.005. The Sanger sequencing performed on the randomly selected samples showed 100% concordance with the NGS and genotyping data.
ASE using TaqMan assay
The log-transformed ratio of VIC (mutant allele) and FAM (wild allele) was used for comparing ASE in responder and non-responder samples for ACACB rs2075262, MTOR rs1135172, MTOR rs1057079, RPTOR rs2289764, SLC22A4 rs272879, SLC22A4 rs1050152 and SLC22A4 rs272893 (Table 3). Although the ratio of alleles was skewed towards wild for all the SNPs, there was no significant difference observed in the allelic ratios between responders and non-responders samples.
Discussion
The field of pharmacogenomics offers a fascinating prospect for improvement of patient care via optimization of the choice and dose of the medicine, reduction of adverse event risk, and further implementation of the principles of personalized medicine. In the case of T2DM, the results of the recent pharmacogenomics have been hurdled by various factors. In the present study, we describe the heterogeneity in response to metformin drug within T2DM individuals. The genetic and transcriptome profiles of drug naïve T2DM patients and healthy controls were assessed to identify variations and DGEs in non-responsive and responsive patients. Our analysis revealed SLC22A4 gene to be associated with improved response to the metformin drug. The SLC22A4 is involved in the transport of metformin and oral absorption in the intestine. The presence of variations may lead to lower clearance of the drug. This is in accordance with the previous report of mutant SLC22A4 mice showing high plasma metformin concentration [39]. The SNPs, rs272893, and rs1050152, were present in disproportion in responders versus non-responders and were selected for validation in a larger cohort. The copy number loss was observed in PPARGC1A gene in the non-responders samples. PPARGC1A modulates the gluconeogenic targets such as PCK1 and G6PC [40]. The loss in the copy number of the gene may lead to altered gluconeogenesis in the non-responder individuals.
Our analysis of the gene expression led to the identification of four differential expressed gene sets that were able to differentiate non-responders from responders and T2DM patients from healthy controls. Furthermore, these DGEs were found to be involved in key pathways and processes. The overall results highlight the complex heterogeneity among drug naïve T2DM patients before the administration of metformin drug and have the potential for consideration in the early prediction of disease state and response phenotypes.
The overall validity of existing biomarkers for the prediction of metformin drug is still unclear. Metformin is hydrophilic and has to rely on members of the OCT family for entry inside hepatocytes. OCT1 has been specifically shown to be associated with the therapeutic efficacy of metformin [41]. The targets of metformin include mitochondrial complex I and AMPK, which regulate cellular energy homeostasis [42]. The studies to predict metformin drug response have relied on the existing target and treatment on a cell line or mouse models. Udhane et al. [29] have shown novel links between metformin response and energy metabolism, sex steroid biosynthesis, the cell cycle, and the immune system in H295R cells. Recently, Park et al. have identified metabolites, namely citric acid, myoinositol, and hippuric acid, as markers of metformin response in early-phase T2DM patients [28]. However, we followed the clinical phenotypes' transcriptome patterns, i.e., metformin response before the drug administration, to predict early signatures for drug response predictions. We identified the IDO2 gene as highly upregulated in non-responders compared with responders and healthy controls with a fold change of 6.3 and 4.1, respectively. IDO2 gene plays an essential role in tryptophan amino acid catabolism and leads 95% of the amino acid to the kynurenine pathway [43]. The metabolites of the kynurenine pathway are reported to be elevated in individuals with insulin resistance before the manifestation of hyperglycemia [44].
Moreover, Muzik et al. showed the stabilization of tryptophan metabolism by decreasing contribution from the kynurenine metabolic pathway upon metformin treatment [44]. Higher levels of the IDO2 gene in early T2DM subjects may indicate response features of future metformin therapy. Besides the IDO2 gene, FRMD6, DMGDH, and NMNAT3 genes were also highly upregulated in non-responders and are associated with glycine, serine, and threonine metabolism, hippo signalling, and nicotinate and nicotinamide metabolism, respectively. The role of metformin as an inhibitor of the transactivation of YAP, which is a key effector of the Hippo pathway, is reported [45]. On the contrary, FRMD6 gene overexpression has been shown to activate Hippo signalling and induce translocation of YAP from the nucleus to the cytoplasm [46]. DMGDH gene is known to catalyze the conversion of glycine to sarcosine, and hypoglycemia is reported as a risk for developing diabetes [47]. T2DM adults metabolize nicotine more rapidly [48], NMNAT3 genes overexpression in our data may provide a possible explanation for the nicotine addiction of T2DM, as highlighted before [48]. HSD11B1 gene was another gene that was found upregulated four-fold higher in responders and downregulated four-fold lower in non-responders. Although the expression is higher in abdominal adipose tissue, many genetic abnormalities have been implicated in abnormal glucose metabolism, risk of diabetes, and body fat distribution [49, 50].
The correlation of response to metformin therapy with gene expression highlighted GDF15, TWISTNB, and RPL36A genes. GDF15 is a cytokine from the TGF-β superfamily associated with hyperglycemia and risk of incident diabetes [51]. TWISTNB gene expression has been linked to the risk of Uterine Leiomyoma [52] and early-stage laryngeal cancers [53]. RPL36A gene is involved in tumour cell proliferation and associated with gliomas [54, 55]. We showed that GDF15, TWISTNB and RPL36A gene expression in drug naïve T2DM individuals could be used to predict the response to metformin therapy. The DGEs presented here provide early targets in newly diagnosed T2DM for metformin drug response prediction.
The prediction or prognosis of T2DM, despite noteworthy advancements in genome-wide association studies, is still weak. Previous evaluation of T2DM transcriptomes has highlighted TAF1 and MAFK as potential target genes for impaired fasting glucose and T2DM [56]. CDK5, CDKN2A, THADA, and CAPN10 expression levels were also highlighted in a blood-based analysis of T2DM susceptibility genes [57]. However, our study highlighted CXXC4, GDF2, MID1, and NDUFA4L2 genes to be highly expressed in both responders and non-responders compared to healthy controls. Also, IDO1, LAMP3, and CXCL10 genes are poorly expressed in both responders and non-responders compared to healthy controls. These genes are involved in Wnt signalling, cytokine–cytokine receptor interaction, ubiquitin-mediated proteolysis, oxidative phosphorylation, tryptophan metabolism, and adaptive immunity. Furthermore, our disease gene association analysis highlighted rs113805659 in the MAGI2 gene in relation to diabetes in African Americans [38]. All these pathways and genes have been previously implicated in the risk of T2DM and its complications.
The ASE refers to the preferential expression of one of the two alleles in a hybrid under the influence of regulatory elements or sequences from the genome. We evaluated ASEs for SNPs selected from our exome and transcriptome date. However, the constraints in TaqMan probe design limited our analysis to few SNPs among which rs272893 and rs1050152 were found to be associated with better metformin response. None of the SNPs showed a statistical significance. Interestingly it was observed that the expression of the alleles was skewed towards the wild allele in the majority of the samples evaluated.
Conclusions
The present study provides a valuable resource for the genetic variations and transcriptome profile of drug naïve T2DM individuals. The identification of variations in the SLC22A4 gene holds promise for drug response prediction but needs further evaluation in larger cohorts. The expression markers generated from the transcriptome profile can be further evaluated for their role in T2DM predisposition and metformin drug response.
Availability of data and materials
The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.
References
Nathan DM, Buse JB, Davidson MB et al (2009) Medical management of hyperglycemia in type 2 diabetes: a consensus algorithm for the initiation and adjustment of therapy—a consensus statement of the American Diabetes Association and the European Association for the Study of Diabetes. Clin Diabetes 27:4–16. https://doi.org/10.2337/diaclin.27.1.4
Rodbard HW, Jellinger PS, Davidson JA et al (2009) Statement by an American association of clinical endocrinologists/American college of endocrinology consensus panel on type 2 diabetes mellitus: an algorithm for glycemic control. Endocr Pract 15:540–559. https://doi.org/10.4158/EP.15.6.540
Inzucchi SE, Bergenstal RM, Buse JB et al (2015) Management of hyperglycaemia in type 2 diabetes, 2015: a patient-centred approach. Update to a position statement of the American Diabetes Association and the European Association for the Study of Diabetes. Diabetologia 58:429–442. https://doi.org/10.1007/s00125-014-3460-0
Bailey CJ, Turner RC (1996) Metformin. N Engl J Med 334:574–579. https://doi.org/10.1056/NEJM199602293340906
Nathan DM (2007) Finding new treatments for diabetes—how many, how fast... how good? N Engl J Med 356:437–440. https://doi.org/10.1056/NEJMp068294
Knowler WC, Barrett-Connor E, Fowler SE et al (2002) Reduction in the incidence of type 2 diabetes with lifestyle intervention or metformin. N Engl J Med 346:393–403. https://doi.org/10.1056/NEJMoa012512
Kirpichnikov D, McFarlane S, Sowers J (2002) Annals of internal medicine | metformin: an update. Ann Intern Med 137:25–33
Sam S, Ehrmann DA (2017) Metformin therapy for the reproductive and metabolic consequences of polycystic ovary syndrome. Diabetologia 60:1656–1661. https://doi.org/10.1007/s00125-017-4306-3
Heckman-Stoddard BM, DeCensi A, Sahasrabuddhe VV, Ford LG (2017) Repurposing metformin for the prevention of cancer and cancer recurrence. Diabetologia 60:1639–1647. https://doi.org/10.1007/s00125-017-4372-6
Zhou G, Myers R, Li Y et al (2001) Role of AMP-activated protein kinase in mechanism of metformin action. J Clin Invest 108:1167–1174. https://doi.org/10.1172/JCI13505
Miller RA, Chu Q, Xie J et al (2013) Biguanides suppress hepatic glucagon signalling by decreasing production of cyclic AMP. Nature 494:256–260. https://doi.org/10.1038/nature11808
El-Mir MY, Nogueira V, Fontaine E et al (2000) Dimethylbiguanide inhibits cell respiration via an indirect effect targeted on the respiratory chain complex I. J Biol Chem 275:223–228. https://doi.org/10.1074/jbc.275.1.223
Wu H, Esteve E, Tremaroli V et al (2017) Metformin alters the gut microbiome of individuals with treatment-naive type 2 diabetes, contributing to the therapeutic effects of the drug. Nat Med 23:850–858. https://doi.org/10.1038/nm.4345
American Diabetes Association (2020) 9. Pharmacologic approaches to glycemic treatment: standards of medical care in diabetes-2020. Diabetes Care 43:S98–S110. https://doi.org/10.2337/dc20-S009
Zeitler P, Hirst K, Pyle L et al (2012) A clinical trial to maintain glycemic control in youth with type 2 diabetes. N Engl J Med 366:2247–2256. https://doi.org/10.1056/NEJMoa1109333
Zhou K, Donnelly L, Yang J et al (2014) Heritability of variation in glycaemic response to metformin: a genome-wide complex trait analysis. Lancet Diabetes Endocrinol 2:481–487. https://doi.org/10.1016/S2213-8587(14)70050-6
Tkáč I, Klimčáková L, Javorský M et al (2013) Pharmacogenomic association between a variant in SLC47A1 gene and therapeutic response to metformin in type 2 diabetes. Diabetes, Obes Metab 15:189–191. https://doi.org/10.1111/j.1463-1326.2012.01691.x
Choi JH, Yee SW, Ramirez AH et al (2011) A common 5′-UTR variant in MATE2-K is associated with poor response to metformin. Clin Pharmacol Ther 90:674–684. https://doi.org/10.1038/clpt.2011.165
Shikata E, Yamamoto R, Takane H et al (2007) Human organic cation transporter (OCT1 and OCT2) gene polymorphisms and therapeutic effects of metformin. J Hum Genet 52:117–122. https://doi.org/10.1007/s10038-006-0087-0
Chen L, Pawlikowski B, Schlessinger A et al (2010) Role of organic cation transporter 3 (SLC22A3) and its missense variants in the pharmacologic action of metformin. Pharmacogenet Genomics 20:687–699. https://doi.org/10.1097/FPC.0b013e32833fe789
Jablonski KA, McAteer JB, De Bakker PIW et al (2010) Common variants in 40 genes assessed for diabetes incidence and response to metformin and lifestyle intervention in the diabetes prevention program. Diabetes 59:2672–2681. https://doi.org/10.2337/db10-0543
Rotroff DM, Yee SW, Zhou K et al (2018) Genetic variants in CPA6 and PRPF31 are associated with variation in response to metformin in individuals with type 2 diabetes. Diabetes 67:1428–1440. https://doi.org/10.2337/db17-1164
Zhou K, Bellenguez C, Spencer CCA et al (2011) Common variants near ATM are associated with glycemic response to metformin in type 2 diabetes. Nat Genet 43:117–120. https://doi.org/10.1038/ng.735
Niu N, Liu T, Cairns J et al (2016) Metformin pharmacogenomics: a genome-wide association study to identify genetic and epigenetic biomarkers involved in metformin anticancer response using human lymphoblastoid cell lines. Hum Mol Genet 25:4819–4834. https://doi.org/10.1093/hmg/ddw301
Pawlyk AC, Giacomini KM, McKeon C et al (2014) Metformin pharmacogenomics: current status and future directions. Diabetes 63:2590–2599. https://doi.org/10.2337/db13-1367
Ustinova M, Silamikelis I, Kalnina I et al (2019) Metformin strongly affects transcriptome of peripheral blood cells in healthy individuals. PLoS ONE 14:e0224835. https://doi.org/10.1371/journal.pone.0224835
Stein BD, Calzolari D, Hellberg K et al (2019) Quantitative in vivo proteomics of metformin response in liver reveals AMPK-dependent and -independent signaling networks. Cell Rep 29:3331-3348.e7. https://doi.org/10.1016/j.celrep.2019.10.117
Park JE, Jeong GH, Lee IK et al (2018) A pharmacometabolomic approach to predict response to metformin in early-phase type 2 diabetes mellitus patients. Molecules 23:1579. https://doi.org/10.3390/molecules23071579
Udhane SS, Legeza B, Marti N et al (2017) Combined transcriptome and metabolome analyses of metformin effects reveal novel links between metabolic networks in steroidogenic systems. Sci Rep 7:8652. https://doi.org/10.1038/s41598-017-09189-y
Schwarz UI, Gulilat M, Kim RB (2019) The role of next-generation sequencing in pharmacogenetics and pharmacogenomics. Cold Spring Harb Perspect Med 9:a033027. https://doi.org/10.1101/cshperspect.a033027
Green MR, Sambrook J (1989) Molecular cloning
Mayakonda A, Lin DC, Assenov Y et al (2018) Maftools: Efficient and comprehensive analysis of somatic variants in cancer. Genome Res 28:1747–1756. https://doi.org/10.1101/gr.239244.118
Talevich E, Shain AH, Botton T, Bastian BC (2016) CNVkit: genome-wide copy number detection and visualization from targeted DNA sequencing. PLoS Comput Biol 12:e1004873. https://doi.org/10.1371/journal.pcbi.1004873
Kuleshov MV, Jones MR, Rouillard AD et al (2016) Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res 44:W90–W97. https://doi.org/10.1093/nar/gkw377
Bindea G, Mlecnik B, Hackl H et al (2009) ClueGO: a cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics 25:1091–1093. https://doi.org/10.1093/bioinformatics/btp101
Bindea G, Galon J, Mlecnik B (2013) CluePedia cytoscape plugin: pathway insights using integrated experimental and in silico data. Bioinformatics 29:661–663. https://doi.org/10.1093/bioinformatics/btt019
Piñero J, Ramírez-Anguita JM, Saüch-Pitarch J et al (2020) The DisGeNET knowledge platform for disease genomics: 2019 update. Nucleic Acids Res 48:D845–D855. https://doi.org/10.1093/nar/gkz1021
Divers J, Palmer ND, Langefeld CD et al (2017) Genome-wide association study of coronary artery calcified atherosclerotic plaque in African Americans with type 2 diabetes. BMC Genet 18:105. https://doi.org/10.1186/s12863-017-0572-9
Nakamichi N, Shima H, Asano S et al (2013) Involvement of carnitine/organic cation transporter OCTN1/SLC22A4 in gastrointestinal absorption of metformin. J Pharm Sci 102:3407–3417. https://doi.org/10.1002/jps.23595
Rena G, Pearson ER, Sakamoto K (2013) Molecular mechanism of action of metformin: old or new insights? Diabetologia 56:1898–1906. https://doi.org/10.1007/s00125-013-2991-0
Shu Y, Brown C, Castro RA et al (2008) Effect of genetic variation in the organic cation transporter 1, OCT1, on metformin pharmacokinetics. Clin Pharmacol Ther 83:273–280. https://doi.org/10.1038/sj.clpt.6100275
Pryor R, Cabreiro F (2015) Repurposing metformin: an old drug with new tricks in its binding pockets. Biochem J 471:307–322. https://doi.org/10.1042/bj20150497
Unluturk U, Erbas T (2015) Diabetes and tryptophan metabolism. In: Engin A, Engin AB (eds) Tryptophan metabolism: implications for biological processes, health and disease. Springer International Publishing, Cham, pp 147–171
Muzik O, Burghardt P, Yi Z et al (2017) Successful metformin treatment of insulin resistance is associated with down-regulation of the kynurenine pathway. Biochem Biophys Res Commun 488:29–32. https://doi.org/10.1183/09031936.00063810.The
Yuan X, Wei W, Bao Q et al (2018) Metformin inhibits glioma cells stemness and epithelial-mesenchymal transition via regulating YAP activity. Biomed Pharmacother 102:263–270. https://doi.org/10.1016/j.biopha.2018.03.031
Moleirinho S, Patrick C, Tilston-Lünel AM et al (2013) Willin, an upstream component of the hippo signaling pathway, orchestrates mammalian peripheral nerve fibroblasts. PLoS ONE 8:e60028. https://doi.org/10.1371/journal.pone.0060028
Adeva-Andany M, Souto-Adeva G, Ameneiros-Rodríguez E et al (2018) Insulin resistance and glycine metabolism in humans. Amino Acids 50:11–27. https://doi.org/10.1007/s00726-017-2508-0
Keith RJ, Riggs DW, Conklin DJ et al (2017) Nicotine metabolism in adults with type 2 diabetes. Nicotine Tob Res 21:846–849. https://doi.org/10.1093/ntr/ntx214
de Nascimento FV, Piccoli V, Beer MA et al (2015) Association of HSD11B1 polymorphic variants and adipose tissue gene expression with metabolic syndrome, obesity and type 2 diabetes mellitus: a systematic review. Diabetol Metab Syndr 7:38. https://doi.org/10.1186/s13098-015-0036-1
Devang N, Satyamoorthy K, Rai PS et al (2017) Association of HSD11B1 gene polymorphisms with type 2 diabetes and metabolic syndrome in South Indian population. Diabetes Res Clin Pract 131:142–148. https://doi.org/10.1016/j.diabres.2017.07.011
Bao X, Borné Y, Muhammad IF et al (2019) Growth differentiation factor 15 is positively associated with incidence of diabetes mellitus: the Malmö diet and cancer-cardiovascular cohort. Diabetologia 62:78–86. https://doi.org/10.1007/s00125-018-4751-7
Anjum S, Sahar T, Nigam A, Wajid S (2019) Transcriptome analysis of mRNA in uterine leiomyoma using next-generation RNA sequencing. Anticancer Agents Med Chem 19:1703–1718. https://doi.org/10.2174/1871520619666190409102855
Reddy RB, Khora SS, Suresh A (2019) Molecular prognosticators in clinically and pathologically distinct cohorts of head and neck squamous cell carcinoma—a meta-analysis approach. PLoS ONE 14:e0218989. https://doi.org/10.1371/journal.pone.0218989
Kim JH, You KR, Kim IH et al (2004) Over-expression of the ribosomal protein L36a gene is associated with cellular proliferation in hepatocellular carcinoma. Hepatology 39:129–138. https://doi.org/10.1002/hep.20017
Li S, Shi J, Gao H et al (2017) Identification of a gene signature associated with radiotherapy and prognosis in gliomas. Oncotarget 8:88974–88987. https://doi.org/10.18632/oncotarget.21634
Cui Y, Chen W, Chi J, Wang L (2016) Comparison of transcriptome between type 2 diabetes mellitus and impaired fasting glucose. Med Sci Monit Int Med J Exp Clin Res 22:4699–4706. https://doi.org/10.12659/MSM.896772
Christodoulou M-I, Avgeris M, Kokkinopoulou I et al (2019) Blood-based analysis of type-2 diabetes mellitus susceptibility genes identifies specific transcript variants with deregulated expression and association with disease risk. Sci Rep 9:1512. https://doi.org/10.1038/s41598-018-37856-1
Acknowledgements
We thank Technology Information, Forecasting and Assessment Council-Centre of Relevance and Excellence (TIFAC-CORE) in Pharmacogenomics, Government of India for the infrastructure and Ms. Sandhya Salian, Mr. Rajesh Rao, and Mr. Harish U for technical support.
Funding
Open access funding provided by Manipal Academy of Higher Education, Manipal. The study was supported by the Science and Engineering Research Board (Grant No. EMR/2015/002384/BHS), and Manipal Academy of Higher Education, Manipal.
Author information
Authors and Affiliations
Contributions
Work and concept were initiated by PSR. Material preparation, data collection and analysis were performed by MV, ARS, SM, NBP, PJ, SKN, SU. The manuscript was written by MV. All the authors read and approved the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Ethical approval
The study was approved by the Institutional Ethics Committee, Kasturba Hospital, Manipal (IEC 501/2016). The study was also registered under the Clinical Trials Registry—India (CTRI) (registration number CTRI/2018/01/011508).
Research involving human participants and/or animals
The protocols and work conducted in this study were in accordance with the declaration of Helsinki.
Consent to participate
All participants provided written informed consent.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
40618_2022_1989_MOESM2_ESM.pdf
Supplementary Fig. 1. RNA-Seq data normalization (a) Density of logcpm values for raw reads, (b) Density of logcpm values for normalised reads, (c) Logcpm for individual samples before data normalization and (d) Logcpm for individual samples after data normalization (PDF 196 KB)
40618_2022_1989_MOESM3_ESM.pdf
Supplementary Fig. 2. RNA-Seq differential expression (a) Multidimensional scaling (MDS) plot showing distribution of control, non-responders and responders samples, (b) Volcano plot showing log fold change (logFC) on the x-axis and -log10 (P-value) on the y-axis of gene expression alterations found using edgeR. Genes with FDR > 0.05 red dots (PDF 122 KB)
40618_2022_1989_MOESM4_ESM.pdf
Supplementary Fig. 3. Differentially expressed genes in control, non-responders and responders samples (a) Upregulated and (b) Downregulated (PDF 69 KB)
40618_2022_1989_MOESM5_ESM.pdf
Supplementary Fig. 4. Gene Ontology (GO) analysis of genes in Metformin_Set_1. (a) Biological process, (b) Molecular function, (c) Cellular component and (d) Enrichment for GO groups (PDF 490 KB)
40618_2022_1989_MOESM6_ESM.pdf
Supplementary Fig. 5. Gene Ontology (GO) analysis of genes in T2D_Set_1 (a) Biological process, (b) Molecular function, (c) Cellular component and (d) Enrichment for GO groups (PDF 500 KB)
40618_2022_1989_MOESM7_ESM.pdf
Supplementary Fig. 6. Gene Ontology (GO) analysis of genes in T2D_Set_2. (a) Biological process, (b) Molecular function, (c) Cellular component and (d) Enrichment for GO groups (PDF 475 KB)
40618_2022_1989_MOESM8_ESM.pdf
Supplementary Fig. 7. Gene Ontology (GO) analysis of genes in Metformin_Set_2. (a) Biological process, (b) Molecular function, (c) Cellular component and (d) Enrichment for GO groups (PDF 394 KB)
40618_2022_1989_MOESM9_ESM.pdf
Supplementary Fig. 8. Association of gene expression with change in HbA1c levels after 3 months of metformin therapy (a) Correlation between gene expression and change in HbA1c levels; (b) Change in HbA1c against GDF15; (c) Change in HbA1c against TWISTNB and (d) Change in HbA1c against RPL36A. (NRC, normalized read count) (PDF 798 KB)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Vohra, M., Sharma, A.R., Mallya, S. et al. Implications of genetic variations, differential gene expression, and allele-specific expression on metformin response in drug-naïve type 2 diabetes. J Endocrinol Invest 46, 1205–1218 (2023). https://doi.org/10.1007/s40618-022-01989-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40618-022-01989-y