Introduction

Alzheimer’s disease (AD) is the preeminent cause for dementia in the elderly and causes 50–75% dementia types [1,2,3]. In the populations of Caucasian ancestry, large-scale genome-wide association studies (GWASs) have identified some common genetic risk factors for AD [4]. In addition to the APOE gene (encoding apolipoprotein E) as a major AD risk factor, recent GWASs of AD in Caucasian populations also identified several new genetic loci, including: bridging integrator 1 (BIN1), complement receptor 1 (CR1), clusterin (CLU), phosphatidylinositol binding clathrin assembly protein (PICALM), membrane-spanning 4-domains, subfamily A, member 4 (MS4A4)/membrane-spanning 4-domains, subfamily A, member 6E (MS4A6E), CD2-associated protein (CD2AP), CD33 molecule (CD33), EPH receptor A1 (EPHA1) and ATP-binding cassette transporter A7 (ABCA7) [4,5,6,7]. These genetic loci largely stimulate downstream analysis, for example the identification of relevant gene expression regulators [8,9,10,11,12,13,14,15], the construction gene expression regulatory networks [16,17,18,19,20], and so on.

Rs744373 is a single nucleotide polymorphism (SNP) that locates upstream of BIN1 gene. In populations of Caucasian ancestry, rs744373 polymorphism was consistently confirmed to be significantly associated with AD risk with P = 3.16 × 10 − 10 [21], P = 2.6 × 10 − 14 [6], P = 2.13 × 10 − 09 [22], P = 2.9 × 10 − 07 [23] and P = 1.1 × 10 − 04 [24]. Recently, the association has also been extensively investigated in East Asian populations. However, besides the positive associations, many studies have also identified negative results. Tan et al. did not report any significant association when analyzd 1224 Chinese individuals (612 cases and 612 controls) using allele test (P = 0.217) and genotype test (P = 0.547, 0.263 and 0.397 for dominant, recessive and additive logistic genetic models) [25]. The result from Li et al. was also negative [26]. Wang et al. identified a significant result in population from East China (P = 0.038), but not southwest China (P = 0.874). When combining the two parts of populations, they still did not identify any significant association (P = 0.187) [27]. In Brazilian Chinese population, Ramos et al. analyzed 241 individuals (82 cases and 159 controls) and didn’t find any significant results (P = 0.660 for dominant model and P = 0.547 for recessive model) [28]. Ohara et al. did not report significant association (P = 0.06 for additive model) when analyzed 825 AD cases and 2934 controls from Japan [29]. In 2013, we conducted a meta-analysis using all currently available samples (2022 AD cases and 4209 controls) and the results were significant (P = 1.19 × 10 − 02, 7.08 × 10 − 03 and 5.75 × 10 − 03 for the dominant model, recessive model and additive model) [30]. Another subsequent meta-analysis with more samples (11,832 AD cases and 18,133 controls) obtained a consistent result with us [3].

Given the inconsistent findings in East Asian populations, we believe that the relatively small sample sizes, as well as the genetic heterogeneity of AD susceptibility loci among different populations, may be important factors in the untrustworthiness of the results. In this study, we aimed to collect more studies and samples than before and obtain more statistically significant results by performing genetic heterogeneity test and meta-analysis of the rs744373 polymorphism in the Caucasians, East Asians, and pooled populations.

Materials and methods

Literature acquisition

In order to find all available association studies, we searched the PubMed database (https://www.ncbi.nlm.nih.gov/pubmed) and AlzGene database (http://www.alzgene.org/) with the Keywords “Alzheimer’s disease”, “Bridging Integrator 1” or “BIN1”. We also searched Google Scholar (http://scholar.google.com/) to acquire the articles citing the studies obtained in the PubMed and AlzGene databases. The literature acquisition was updated on December 12, 2017. In addition, we collected as much data as we could by directly contacting with authors. These datasets were not published due to not significant results, etc., and were not included in the previous meta-analysis of rs744373 polymorphism with AD.

Inclusion criteria

The studies inclusion criteria contained: (1) being a case-control study; (2) investigating the association between rs744373 polymorphism and AD; (3) being conducted in East Asian or Caucasian populations; (4) providing the numbers of rs744373 genotypes or sufficient data to calculate them or (5) providing an OR with 95% confidence interval (CI) and the P-value or sufficient data to calculate them.

Data extraction

The information was extracted from each study contained: (1) author names; (2) publication year; (3) the sample’s ethnicity; (4) the numbers of cases and controls; (5) the genotyping platform; (6) the frequencies of rs744373 genotypes or sufficient data to calculate them or (7) the OR with 95% CI or sufficient data to calculate them.

Genetic model

Since not all studies provided exact genotype numbers, we investigated the association between rs744373 polymorphism and AD risk in this meta-analysis primarily using the additive genetic model. We selected allele C as effect allele and T as reference allele, the additive model can be described as C allele versus T allele [31].

Comparison of MAF and OR in Caucasians and east Asians

We compared the minor allele frequency (MAF), which is the frequency of rs744373 allele C in this study, and the OR values between the Caucasian populations and East Asian populations. We used the t-test to investigate whether there were differences in the OR values and MAF values between these two populations. We used program R (http://www.r-project.org/) to perform the t-test and calculate the OR and MAF values that not available in the original articles.

Heterogeneity test

We used the Cochran’s Q test to investigate genetic heterogeneity among different studies. Cochran’s Q test approximately follows a chi-square distribution and its degree of freedom is k-1 (k represents the number of studies included in this studies). Statistics I2 can also use to measure the genetic heterogeneity, which is calculated as:

\( {\mathrm{I}}^2=\frac{\mathrm{Q}-\left(\mathrm{k}-1\right)}{\mathrm{Q}}\times 100\% \)

The statistics I2 is in the range of 0–100%, and we divided it into four parts: 0–25%, 26–50%, 51–75%, 76–100%, which respectively represent low, moderate, large and extreme heterogeneity [30]. We conducted Cochran’s Q test in East Asians, Caucasians, and pooled Populations respectively. All calculations of P-value and I2 value were completed using the program R (http://www.r-project.org/). We choose P < 0.05 or I2 > 50% as discriminant criterion for significant result of heterogeneity test.

Meta-analysis

In the meta-analysis, we used fixed effect model (Mantel–Haenszel) or random effect model (DerSimonian–Laird) to calculate the overall OR. And which model to choose depends on whether the genetic heterogeneity is significant or not. If the P-value of Cochran’s Q test was less than 0.05, and I2 value was greater than 50%, we selected the random effect model, otherwise we selected the fixed effect model. The signification of overall OR was measured by Z test.

Sensitivity analysis and publication Bias analysis

To further test the stability of our results, we conducted a sensitivity analysis by sequentially removing each study in the meta-analysis at a time. We used funnel plots to evaluate the potential publication bias. A symmetrical inverted funnel indicated the results were no bias, and an asymmetrical inverted funnel indicated bias results [4]. Begg’s test and Egger’s test was used to evaluate the asymmetry of the funnel plot [4]. The significant level was 0.01. All statistical tests above were also performed using the program R (http://www.r-project.org/).

Results

Literature search and data description

We obtained 126 articles by searching the PubMed database. Eighty-eight articles were excluded because they were (1) not Case-Control design, (2) not analyzed in East Asian or Caucasian populations, (3) not related with AD, (4) meta-analysis or (5) review articles. We further excluded 24 articles because they did not investigate the association between rs744373 polymorphism and AD or not provide sufficient data. The remaining 14 articles met the analysis requirements. According to the same criteria, we also obtained two articles from the AlzGene database. In addition, we had found one article by searching Google Scholar. We applied for two datasets of two articles (studies) by contacting the author directly. Finally, 37 studies in 19 articles, including 22,395 AD cases and 48,773 control samples, were included in this meta-analysis. More detailed information about selecting studies was described in Fig. 1. The main characteristics of included studies were described in Table 1.

Fig. 1
figure 1

A figure Flow diagram of article inclusion or exclusion

Table 1 Main characteristic of included studies

Comparison of MAF and OR between Caucasian and east Asian

There were 11 studies belong to East Asian populations. The MAF values of rs744373, OR values and other information of these 11 studies listed in the top 11 rows in Table 1. The other studies listed in the last 26 rows in Table 1 belonged to Caucasian populations. By using the t-test to compare the MAF values between Caucasians and East Asians, we found a significant result with t = 5.89 and P = 1.53 × 10− 6. However, the result of comparison of OR values did not indicate a very significant distinction between the two populations (t = 1.75 and P = 0.11).

Heterogeneity test

We conducted heterogeneity test of rs744373 polymorphism in different populations, and identified significant genetic heterogeneity in Caucasians (P = 0.001, I2 = 52.3%), East Asians (P = 0.001, I2 = 65.1%) and pooled populations (P = 1.03 × 10− 5, I2 = 57.2%). Detailed results were described in Table 2 and Fig. 2.

Table 2 The results of genetic heterogeneity test and meta-analysis of rs744373 polymorphism in East Asian and Caucasian populations
Fig. 2
figure 2

Forest plot for the meta-analysis of the association between rs744373 and AD under the additive model. “OR” is the abbreviation of Odds Ratio. “Beta” indicates the ln (OR). “se” is the standard error of Beta. “Weight” represents the weight of each study when calculating the overall OR. The genetic heterogeneity test results (I2 and its P-value) and the meta-analysis results (overall OR and 95% CI) in pooled populations are listed at the bottom of the figure. The results for subgroup analysis are also listed by the grey font

Meta-analysis

Based on the results of Heterogeneity test, we used random effect model to calculate the overall OR values in East Asians, Caucasians and pooled populations, respectively. Meta-analysis results indicated significant correlation in Caucasians with P = 3.38 × 10− 08, OR = 1.16, 95% CI 1.10–1.22, and in pooled populations with P = 5 × 10− 07, OR = 1.12, and 95% CI 1.07–1.17 (Table 2). However, we did not find any association between rs744373 polymorphism and AD in East Asian populations with P = 0.39, OR = 1.06, and 95% CI 0.95–1.15. The detailed results and forest diagram were described in Table 2 and Fig. 2.

Sensitivity analysis and publication Bias analysis

Using sensitivity analysis, we identified that the results of meta-analysis remained largely unchanged by excluding any one study (Table 3). The symmetrical inverted funnel in the funnel plot suggested no publication bias of the results (Begg’s test, P = 0.471; Egger’s test, P = 0.428). Funnel diagram was described in Fig. 3.

Table 3 The results of sensitivity analysis
Fig. 3
figure 3

Analysis of publication bias regarding the rs744373 polymorphism in AD. The symmetrical inverted funnel shows that there is not publication bias

Discussion

GWASs showed that SNPs located in upstream of BIN1, particular rs744373, are strongly associated with AD risk [41]. The expression quantitative trait loci (eQTL) analysis identified a pronounced association between rs744373 and the expression of BIN1 in brain tissue [3]. BIN1 gene have diverse functions, including endocytosis, trafficking, immune response, apoptosis, and tau metabolism, that are thought have potential roles in AD pathological mechanism [41, 42]. To some extent, investigating the association between rs744373 and AD risk is helpful for understanding the role of BIN1 in AD pathogenesis.

Based on the significant association between rs744373 polymorphism and AD risk identified by the GWASs in Caucasian populations, many recent studies had also explored this association in East Asian populations, as described in the introduction. However, the findings of the association studies in East Asian were always inconsistent. Considering a relatively small sample size may result in less statistical power, we collected 37 studies involving 22,395 AD cases and 48,773 controls for the meta-analysis. To the best of our knowledge, this was the largest sample size by far.

By meta-analysis of the 37 studies, we obtained significant association between rs744373 polymorphism and AD risk in pooled populations (P = 5 × 10− 07, OR = 1.12, and 95% CI 1.07–1.17) and also in Caucasian populations (P = 3.38 × 10− 08, OR = 1.16, 95% CI 1.10–1.22). The results were consistent with the previous studies. However, in East Asian populations, our results showed a significant genetic heterogeneity of rs744373 polymorphism (P = 0.001, I2 = 65.1%) and the meta-analysis did not show a significant association between rs744373 polymorphism with AD risk by using a random effect model (P = 0.393, OR = 1.057, and 95% CI 0.95–1.15). The insignificant publication bias results and the consistent sensitivity analysis results showed that our results were reliable.

To confirm the findings that were obtained by additive genetic model, we further used the dominant model (CC + CT versus TT) and recessive model (CC versus CT + TT) to investigate the association of rs744373 polymorphism with AD risk based on genotype data of 33,184 samples (12,717 AD cases and 20,467 controls). As same as the results of additive model, we obtained significant association between rs744373 and AD in pooled populations (P = 3.95 × 10− 11, OR = 1.17, 95% CI 1.12–1.23 for dominant model and P = 1.35 × 10− 05, OR = 1.19, 95% CI 1.10–1.29 for recessive model), as well as in Caucasian populations (P = 5.99 × 10− 11, OR = 1.20, 95% CI 1.14–1.27 for dominant model and P = 1.00 × 10− 05, OR = 1.26, 95% CI 1.14–1.39 for recessive model). We also obtained negative results in East Asian populations (P = 0.391, OR = 1.06, 95% CI 0.93–1.21 for dominant model and P = 0.806, OR = 1.03, 95% CI 0.81–1.31 for recessive model). The consistent results among the three kinds of genetic models demonstrated the reliability of our results. The data was described in Additional file 1 and the detailed results were described in Table 2, Table 4, Fig. 4, Fig. 5 and Additional file 1. The information about the samples and publication bias was described in Additional file. In summary, this large-scale meta-analysis highlighted the significant association between rs744373 polymorphism and AD in Caucasian populations but not in the East Asian populations.

Table 4 The genetic heterogeneity test and meta-analysis of rs744373 polymorphism using the additive model, dominant model and recessive model in East Asian and Caucasian populations
Fig. 4
figure 4

Forest plot for the meta-analysis of the association between rs744373 and AD under the dominant model

Fig. 5
figure 5

Forest plot for the meta-analysis of the association between rs744373 and AD under the recessive model

Researchers have begun to focus on AD genetic heterogeneity between different races and ethnicities since the end of the last century [43]. They found that the frequency variations in ApoE subtypes existed among nine populations include Caucasians and East Asians [43]. Besides the most consistent genetic risk factor ApoE for Sporadic AD, some studies have also reported many genetic risk factors that appear distinct AD susceptibility between Caucasian and East Asian populations. For instance, following genes were proven to be only associated with AD risk in Caucasian populations but not in East Asian populations: Triggering Receptor Expressed On Myeloid Cells 2 (TREM2) [44, 45], Solute Carrier Family 24 Member 4 (SLC24A4) [46], NME/NM23 Family Member 8 (NME8) [47], GRB2 Associated Binding Protein 2 (GAB2) [48], Myocyte Enhancer Factor 2C (MEF2C) [49], Inositol Polyphosphate-5-Phosphatase D (INPP5D) [50], CLU [51], ABCA7, CD2AP, and EPHA1 [25], Fermitin Family Member 2 (FERMT2) [52]. Hence, the complex difference among different ethnicities and races probably cause the genetic heterogeneity of AD between Caucasians and East Asians.

Our samples of East Asian ancestry mainly came from Chinese, Japanese and Koreans populations. On the one hand, these samples may not be able to represent the East Asian populations completely. On the other hand, the specific differences in sample collection processes of different studies would lead to genetic heterogeneity among different populations. Considering these limitations, we believe that a large sample size GWAS in East Asian population is very necessary.

Conclusions

Until now, the genetic association between BIN1 rs744373 and AD risk in East Asian populations is still not deterministic. In the study, we conducted a meta-analysis with the largest sample size so far (22,395 AD cases and 48,773 controls). The meta-analysis results under the additive, dominant and recessive model indicated a significant association between rs744373 and AD risk in Caucasian populations but not in East Asian populations. The consistent results of sensitivity analysis, as well as the negative results of publication bias analysis, demonstrated the reliability of our findings. We believe that the greater statistically powerful results are helpful for the understanding of AD genetic risk factors in East Asian populations.