Evidence that the pituitary gland connects type 2 diabetes mellitus and schizophrenia based on large-scale trans-ethnic genetic analyses

Cai, Lei; Sun, Yanlan; Liu, Yonglin; Chen, Wenzhong; He, Lin; Wei, Dong-Qing

doi:10.1186/s12967-022-03704-0

Evidence that the pituitary gland connects type 2 diabetes mellitus and schizophrenia based on large-scale trans-ethnic genetic analyses

Research
Open access
Published: 03 November 2022

Volume 20, article number 501, (2022)
Cite this article

Download PDF

You have full access to this open access article

Journal of Translational Medicine Aims and scope Submit manuscript

Evidence that the pituitary gland connects type 2 diabetes mellitus and schizophrenia based on large-scale trans-ethnic genetic analyses

Download PDF

Lei Cai ORCID: orcid.org/0000-0003-0935-1870¹^na1,
Yanlan Sun^1,2^na1,
Yonglin Liu³^na1,
Wenzhong Chen⁴,
Lin He^1,5 &
…
Dong-Qing Wei²

2798 Accesses
3 Citations
2 Altmetric
Explore all metrics

Abstract

Background

Previous studies on European (EUR) samples have obtained inconsistent results regarding the genetic correlation between type 2 diabetes mellitus (T2DM) and Schizophrenia (SCZ). A large-scale trans-ethnic genetic analysis may provide additional evidence with enhanced power.

Objective

We aimed to explore the genetic basis for both T2DM and SCZ based on large-scale genetic analyses of genome-wide association study (GWAS) data from both East Asian (EAS) and EUR subjects.

Methods

A range of complementary approaches were employed to cross-validate the genetic correlation between T2DM and SCZ at the whole genome, autosomes (linkage disequilibrium score regression, LDSC), loci (Heritability Estimation from Summary Statistics, HESS), and causal variants (MiXeR and Mendelian randomization, MR) levels. Then, genome-wide and transcriptome-wide cross-trait/ethnic meta-analyses were performed separately to explore the effective shared organs, cells and molecular pathways.

Results

A weak genome-wide negative genetic correlation between SCZ and T2DM was found for the EUR (r_g = − 0.098, P = 0.009) and EAS (r_g =- 0.053 and P = 0.032) populations, which showed no significant difference between the EUR and EAS populations (P = 0.22). After Bonferroni correction, the r_g remained significant only in the EUR population. Similar results were obtained from analyses at the levels of autosomes, loci and causal variants. 25 independent variants were firstly identified as being responsible for both SCZ and T2DM. The variants associated with the two disorders were significantly correlated to the gene expression profiles in the brain (P = 1.1E-9) and pituitary gland (P = 1.9E-6). Then, 61 protein-coding and non-coding genes were identified as effective genes in the pituitary gland (P < 9.23E-6) and were enriched in metabolic pathways related to glutathione mediated arsenate detoxification and to D-myo-inositol-trisphosphate.

Conclusion

Here, we show that a negative genetic correlation exists between SCZ and T2DM at the whole genome, autosome, locus and causal variant levels. We identify pituitary gland as a common effective organ for both diseases, in which non-protein-coding effective genes, such as lncRNAs, may be responsible for the negative genetic correlation. This highlights the importance of molecular metabolism and neuroendocrine modulation in the pituitary gland, which may be responsible for the initiation of T2DM in SCZ patients.

Genome-wide association study and trans-ethnic meta-analysis identify novel susceptibility loci for type 2 diabetes mellitus

Article Open access 29 April 2024

Evidence for genetic contribution to the increased risk of type 2 diabetes in schizophrenia

Article Open access 23 November 2018

Heritability and genome-wide association analyses of fasting plasma glucose in Chinese adult twins

Article Open access 18 July 2020

Background

As human economic development as progressed, both schizophrenia (SCZ) and type 2 diabetes mellitus (T2DM), complex polygenic inherited disorders, have become growing challenges that, to date, lack effective solutions [1, 2]. Accumulating evidence from clinical samples demonstrates that the prevalence of T2DM in patients with SCZ is elevated 2 to 3 times compared with the general population, whereas the aetiology for the co-occurrence of SCZ and T2DM is multifactorial [3]. Recent studies have shown that drug-naive patients with their first episode of SCZ have an increased risk of T2DM [4, 5]. Moreover, the increased risk of T2DM is more apparent in young adults with SCZ [3, 6]. Therefore, a better understanding of the genetic relationship between and common genetic basis of SCZ and T2DM is pivotal for providing insights into the treatment and prevention of these diseases.

Since inherited factors rarely correlate with confounders and exhibit no reverse causation, several studies with limited sample sizes have investigated the involved genes common to both SCZ and T2DM and have reported negligible genetic correlations between SCZ and T2DM [7, 8]. This conflicts with a weak genome-wide negative correlation between SCZ and T2DM (rg = − 0.07 and P = 0.002) identified in a forthcoming article with a large-scale sample size of European (EUR) subjects[9]. These inconsistent genetic analysis results may be because the use of limited sample sizes and certain analytical methods potentially result in underpowered correlation analyses, produce bias, and overestimate the results. Moreover, genome-wide association studies (GWASs) involving different population groups can provide samples from global populations to address some of the existing Eurocentric bias, which enhances the ability to identify disease associations and ensures that the findings are mostly relevant to all populations [10]. Thus, a large-scale trans-ethnic genetic analysis can provide new and cross-validated evidence by employing a range of complementary approaches.

In this study, based on GWAS summary data from European (EUR) and East Asian (EAS) populations including a total of 1,466,906 subjects, multiple complementary genomic analysis approaches were utilized to explore the genetic basis for T2DM and SCZ at different levels, such as the whole-genome, autosomes, loci and causal variants. We aimed to provide more evidence of the genetic basis for the comorbidity of these two diseases. First, in addition to performing a linkage disequilibrium (LD) score regression analysis (LDSC) to estimate the genome-wide correlation of SCZ with T2DM, a stratified autosome-based LDSC was used to estimate autosome correlation. Second, Heritability Estimation from Summary Statistics (HESS) method was performed to estimate the locus-level genetic correlation. Third, based on the causal variants of each disease, polygenic overlap and Mendelian randomization (MR) analyses were performed to examine the genetic link between these two diseases. Furthermore, to identify the basic mechanisms underlying the comorbidity of SCZ and T2DM, a genome-wide cross-trait/ethnic meta-analysis was performed to identify the pleiotropic genes shared between SCZ and T2DM and to determine the common effective organs and blood cell types. Finally, a cross-trait/ethnic meta-analysis based on transcriptome-wide association study (TWAS) data was carried out to explore the canonical pathways in the effective organs (Figure S1).

Data and methods

GWAS data sets for SCZ and T2DM

GWAS data were collected from the databases of the Psychiatric Genomics Consortium (PGC) and the DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) consortium upon request. The EAS GWAS T2DM dataset included 433, 540 subjects from 23 projects, and the EUR T2DM dataset contained 898, 130 subjects from 32 projects. The EAS GWAS SCZ dataset included 58, 140 subjects, and the EUR SCZ dataset contained 77, 096 subjects[10,11,12,13]. The detailed demographic characteristics and quality controls are summarized in Supplementary Material part 1.1.

The quality of the GWAS datasets was controlled by applying the following data filters: variants with INFO ≥ 0.80 if they existed were filtered in; variants with consistent alleles among each dataset were checked to adjust two situations: palindromic alleles and opposite alleles. In total, 8, 335, 938 variants for SCZ_EAS and 9, 745, 488 for SCZ_EUR, 11, 825, 585 for T2DM_EAS and 13, 583, 104 for T2DM_EUR were considered for the next analysis.

Genetic correlation analysis

First, the heritability of each disorder (single-trait) and the genome-wide correlation (r_g) between SCZ and T2DM in either the EAS or EUR samples were estimated using linkage disequilibrium (LD) score regression software (LDSC, v1.0.1) and the precomputed LD scores for each population as a reference, which were obtained from the 1000 Genomes (1kG) project phase 3 [7, 14]. Prior to analysis, we filtered out those SNPs that were within the major histocompatibility complex (MHC) but were not within HapMap3 or had a MAF < 5% within the 1kG EUR or EAS reference samples. Furthermore, Fisher’s Z score transformed from r_g was calculated to compare the significance of the difference in the genetic correlations between the EAS and EUR samples (Supplementary materials part 1.2). Moreover, partitioned LDSC analysis was performed to estimate the genetic correlation of these two diseases for each autosome.

Second, the Heritability Estimation from Summary Statistics software package (HESS, v0.5.3-beta) was applied to explore the local-level heritability of each disorder and the genetic correlation between SCZ and T2DM within independent LD blocks obtained from the 1kG reference panel in three steps: S1, preparing the LD block and eigenvalues; S2, estimating the local SNP-heritability of each trait; and S3, estimating the local genetic covariance and standard error [15]. A total of 1, 443 and 1, 702 approximately independent LD blocks for the EAS and EUR samples, respectively, were checked as genome partition loci by HESS [16]. The local genetic correlation was calculated with the following formula:

$${r}_{L}=\frac{{cov}_{L}}{\sqrt{{{h}_{L}^{2}\left(SCZ\right){ h}_{L}^{2} \left(T2D\right)}_{ }}}$$

(1)

Here, cov_L is the local genetic covariance obtained from the third step of HESS, and ${\text{h}}_{L}^{2}$(SCZ) and ${\text{h}}_{L}^{2}$(T2DM) are the estimated local heritability of each disease obtained from the second step.

Finally, to qualify the polygenic overlap of these two disease, the total number of shared and trait-specific causal variants between the two diseases was estimated using MiXeR v1.3 with default parameters [17]. To avoid taking infinitesimally small effects, the presented numbers of causal variants accounted for more than 22.6% of their total estimate and jointly accounted for 90% of the heritability of the SNP in each disease.

Mendelian randomization analysis

To obtain reliable and noteworthy results, bidirectional MR analyses were performed with multiple MR methods based on different assumptions about horizontal pleiotropy. First, GCTA v1.93.3beta2 software was used to analyze the bidirectional causal links between SCZ and T2DM with the generalized summary-data-based Mendelian randomization (GSMR) method [18] with the following parameters: P ≤ 5 × 10^− 8 as the GWAS threshold to select variants for clump analysis; r² ≤ 0.05 as the LD threshold to identify independent SNPs based on the 1kG Project (phase 3) population reference; P = 0.01 as the threshold for heterogeneity in dependent instruments (HEIDI) outlier analysis to remove horizontal pleiotropic SNPs; and 10 as the minimum number of significant and independent instrumental SNPs required for the MR analysis. Then, three more methods, i.e., inverse variance weighting (IVW), maximum likelihood (ML), and weighted median (WMe), were utilized to explore putative causal relationships between SCZ and T2DM in the EUR population using the R package TwoSampleMR with the following parameters: P ≤ 5 × 10^− 8 and r² ≤ 0.05 [19]. MR-Egger regression and MR-PRESSO models with the corresponding R packages were used to determine directional pleiotropy [20].

Genome-wide cross-trait/ethnic meta-analysis

The Cross Phenotype Association (CPASSOC) method [21] was employed to identify shared variants between SCZ and T2DM. This method allows the presence of heterogeneous effects across traits and provides statistical S_Het and P values weighted by sample size. The Z score for each variant for SCZ or T2DM from each population was used as the input source data for the cross-trait/ethnic meta-analysis. A significance level of P = 5 × 10^− 8 was applied as in the GWAS.

Among the genome-wide cross-trait/ethnic significant SNPs, independent cross-trait significant SNPs that met the following two criteria were prioritized: (1) the SNP was not identified as significant in the single-trait GWAS, and (2) the SNP was independent with LD r² < 0.05 within 1,000-kb windows based on the 1kG population reference, evaluated by LD clumping using PLINK v1.970.

Positional gene mapping

To map and prioritize genes, MAGMA gene analysis was performed with the SNP-wide mean model using the 1kG Phase 3 population reference [22]. During the analysis, genes within 100 kb of each candidate SNP were mapped and prioritized, which were in LD with genome-wide significant SNPs at the adjusted r² threshold using the Functional Mapping and Annotation (FUMA) GWAS web tool [23]. Furthermore, to identify the tissue specificity of the SCZ and T2DM cross-traits, MAGMA gene property analyses in FUMA were performed to test correlations between tissue specific gene expression profiles and trait-gene associations based on the full distribution of SNP P values.

Cell type-specific analysis

To determine the effective cell type in human peripheral blood mononuclear cells (PBMCs) for both SCZ and T2DM, 10x genomics’ single-cell RNA-seq (scRNA-seq) data were extracted [24]. Based on the regression model with SNPs, MAGMA gene-property analysis was performed to test the cell type-specificity of phenotypes with GWAS summary statistics using the FUMA platform [23].

Transcriptome-wide cross-trait/ethnicity meta-analysis

The Functional Summary-based Imputation (FUSION) package was used to perform TWAS analysis [25]. The pituitary gland-related expression weights were prepared with the aid of the FUSION website and were then integrated with the GWAS data to identify the gene expression associated with either disease in either population.

Then, association analysis was performed on SubSets (ASSET v2.4.0), which can exhaustively explore all possible subsets of inputs to identify the strongest association signal in both positive and negative directions [26]. The above TWAS data and sample size information for SCZ_EAS, T2DM_EAS, SCZ_EUR and T2DM_EUR were input as trait 1 to trait 4, and the two-sided statistic was generated with the default setting parameters. Finally, we took the beta and P values for each gene to use in the subsequent Ingenuity Pathway analysis (IPA).

IPA analysis

With the above effective genes in the pituitary gland identified for both SCZ and T2DM, IPA software (Ingenuity Systems; Qiagen China Co., Ltd.) was employed to perform the core analysis on the measurement of expression logOR as previously described [27].

Statistical analyses

All statistical analyses were performed using R 4.1.1 and/or Python 2.7/3.7 in the Linux environment, which was run in the π 2.0 cluster supported by the Center for High Performance Computing at Shanghai Jiao Tong University. Detailed descriptions of the genetic correlation analysis, MR analysis and GWCTM are provided in the Supplementary Materials. P values < 0.05 were considered statistically significant, and multiple tests were adjusted by the Bonferroni method to reduce the risk of type I statistical error.

Results

Genetic correlations between SCZ and T2DM

The results of the single-trait LDSC showed that the genome-wide SNP heritability was 44.22 ± 2.33% and 45.06 ± 1.69% for SCZ, and 7.98 ± 0.49% and 4.45 ± 0.27% for T2DM, in the EAS and EUR samples, respectively. The intercepts of the LD score regression were ≤ 1.003 and 1.05 separately in the EAS and EUR samples, indicating slight bias from population stratification and cryptic relatedness[28]. A negative genetic correlation between SCZ and T2DM was found (r_g = − 0.053 and − 0.098, P = 0.032 and 0.009, for the EAS and EUR samples, respectively, Table 1), with no significant difference between the two populations based on Fisher’s Z-transformation method (Z score = 1.23, P = 0.22)[29]. Only in the EUR samples did the negative genetic correlation of SCZ with T2DM remain Bonferroni significant (P < 0.05/2 = 0.025). The intercept of genetic covariance between SCZ and T2DM for each population was ≤ 0.01, indicating negligible sample overlap between these two diseases in the current analysis[7].

Table 1 Genetic correlation and polygenic overlap analyses of the SCZ and T2D

Full size table

Furthermore, the partitioned genetic correlation analysis results demonstrated that in the EUR samples, chromosomes 1, 3, and 13 had significant correlations (r_g = − 0.22, − 0.25, and − 0.31, and P = 0.0042, 0.0033, and 0.039, respectively), and in the EAS samples, chromosomes 10 and 2 had significant correlations (r_g =- 0.20 and − 0.23, and P = 0.035 and 0.044, respectively, Table S1 and Fig. 1 A). Nevertheless, only chromosomes 1 and 3 in the EUR samples remained Bonferroni significant (P < 0.05/2 = 0.025).

The results of the local genetic correlation analysis with HESS showed that in the EUR samples, 157 loci had a correlation with a P value less than 0.05; and in the EAS samples there were 23 loci (Table S2). Nevertheless, only chr18:51554175–55,213,838 (P = 2.03 × 10^− 6) and chr6:63552888–65,765,742 (P = 4.36 × 10^− 6) in the EUR samples remained Bonferroni significant [P < 0.05/(2 × 1702) = 1.47 × 10^− 5]. Furthermore, the number of loci containing the GWAS significant SNPs that were specific to SCZ, specific to T2DM, related to both diseases and related to neither were 69, 62, 5 and 1566, respectively, in the EUR samples, and 14, 150, 2 and 1277, respectively, in the EAS samples. Additionally, the genetic correlations of both the SCZ- and T2DM-specific loci largely had negative values, which supported the genome-wide results from the LDSC analysis (Fig. 1B and C). SCZ- or T2DM-specific loci, rather than common loci were more likely to have a negative maximum genetic correlation in the EUR samples than those in the EAS samples.

The polygenic overlapping analysis results also supported the negative correlation of SCZ and T2DM effect sizes within the shared causal variants, with ρ = − 0.24 ± 0.057 and − 0.28 ± 0.17, and r_g = − 0.049 ± 0.0083 and − 0.076 ± 0.0066 for the EAS and EUR populations, respectively (Table 1; Fig. 1D). Furthermore, SCZ and T2DM had a low polygenic overlap, sharing only approximately 0.7 K of the 7.9 K causal variants (8.9%) and 1.2 K of the 9.9 K causal variants (12.1%) for the EAS and EUR populations, respectively. However, common causal variants accounted for 85.7% of the T2DM causal variants in EUR populations.

Mendelian randomization analysisThe results of the MR analyses based on the four methods (GSMR, IVW, ML, and WMe) indicated that SCZ may have a genetically negative causal effect on T2DM in the EUR samples with 66 instrumental variants (P = 2.84 × 10^− 7, 3.18 × 10^− 4, 3.34 × 10^− 7 and 0.014 for GSMR, IVW, ML and WMe, respectively, Fig. 2 A and Table S3). However, the Bonferroni-corrected P value from the WMe method was 0.056. In these analyses, the Mendelian randomization-Egger (MR-Egger) and Mendelian Randomization Pleiotropy RESidual Sum and Outlier (MR-PRESSO) tests did not support the existence of pleiotropic effects biasing the estimates of the causal effects of SCZ on T2DM in the EUR samples (MR Egger intercept, − 0.01; P = 0. 23; P value for the outlier test∈[0.23,1]).

Genome-wide cross-trait/ethnic meta-analysis

A total of 24, 627 genome-wide significant SNPs were found with the CPASSOC method, which were located on almost all the autosomes (Fig. 3 A). Furthermore, 1, 313 SNPs were identified that had not been reported as significant variants in either of the previous SCZ or T2DM GWASs (Figure S2). Among these 1, 313 SNPs, 25 SNPs were independent variants responsible for the comorbidity of SCZ and T2DM (Table S4, Fig. 3B).