Introduction

Alzheimer’s Disease (AD) accounts for 60–80% of all dementia cases (http://www.alz.org). By 2050, the number of people with AD is predicted to increase from 5.4 million to between 11 and 16 million in the U.S alone, and it is estimated that dementia will cost $2 trillion by 2030 worldwide (http://www.alz.org). Despite these alarming numbers, there is no effective strategy to identify pre-symptomatic disease, which might be the only stage of the disease’s trajectory where we could intervene.

Genomics and genetics approaches have made great progress in revealing the mechanisms underlying Alzheimer’s disease. Genome-wide association studies (GWAS) and meta-analyses have identified 23 statistically significant AD associated genes1. In total 39 AD risk genes have been identified so far2,3, including APOE, APP, TRIP4, ABCA7, and SORL1. These genes highlight the importance of various pathways involved in AD, such as immune response and inflammation, cell migration, lipid transport and endocytosis, hippocampal synaptic function and other cell regulatory processes, along with the role of tau and amyloid protein1. The majority of published gene expression studies have been performed using post-mortem brain tissues and as such have focused on the later stages of the advanced disease4,5,6,7. A key need is to explore how these changes in the brain relate to changes in the blood. The availability of gene expression data from brain tissue and blood cells now make it possible to compare these two tissues, and holds out the possibility of identifying in the blood a panel of predictive biomarkers that are mechanistically associated with this disease in the brain.

No single biomarker, e.g., gene or protein, is likely be a reliable biomarker for early AD. Previous studies have therefore used machine learning (ML) to build multi-biomarker models for clinical diagnosis and prediction of AD based on measurement of RNA, protein, and lipid levels in blood samples8,9. Support Vector Machine (SVM) and random forest (RF) models have proved predictive in distinguishing between cognitively normal, mild cognitive impairment (MCI), i.e. prodromal AD, and subjects with AD using gene expression10,11 and blood analytes12. Pathway-based classification approaches for blood-based AD diagnosis have also been used, with age and APOE4 status of the subjects included as covariates (these are the two known biggest risk factors13). However, as these studies were performed on a variety of platforms with different initial feature sizes and relatively small sample size, very few potential biomarkers have so far been identified or replicated in larger cohort study14.

Our study has two parts. The first was a system analysis to identify differentially expressed genes (DEGs) and pathways in a large-scale human blood dataset, and integrate these with results from brain tissue to comprehensively explore the correlations between blood and brain. The second part was to apply ML techniques to identify a panel of potential predictive biomarkers in the blood, and to see whether gene expression in the blood can be used as a biomarker for AD diagnosis.

Methods

Microarray gene expression profile in human blood

Two independent human whole blood normalized mRNA gene expression datasets were downloaded from GEO (http://www.ncbi.nlm.nih.gov/geo/): GSE63060 and GSE63061 from the AddNeuroMed Cohort15. We merged these two normalized datasets (generated by different Illumina platforms) using the inSilicoMerging R package16, and then extracted 143 patients with AD, 77 MCIs and 104 controls subjects (CTL) from GSE63060; 102 patients with AD, 65 MCIs and 78 CTLs from GSE63061 with Western European and Caucasian ethnicity respectively. Probesets without annotation (Entrez_Gene_ID) were filtered out, which left 22756 probesets corresponding to 16928 unique genes. The limma R package17 was then applied and adjusted by age and gender to identify DEGs (a) between AD patients and CTLs, (b) between MCI patients and CTL groups, and (c) between AD and MCI patients. These comparisons were carried out in the two GEO datasets and in the merged one (referred to as the merged discovery dataset) separately. We focused on this merged discovery dataset for downstream analysis with the Benjamini-Hochberg adjusted p-value, i.e. BH.pval of 0.01 used as the significance level for DEG identification.

In order to evaluate the DEGs identified in our above discovery dataset, two additional datasets were downloaded for analysis. Firstly, the whole blood gene expression dataset (GSE6613) was download from GEO. The Affymetrix U133A CEL profiles were normalized by RMA18 method implemented in affy R package. Probesets were filtered out if (1) they were not annotated or were multiply annotated; or (2) they were present in less than 10 percent of the samples as determined by applying the MAS5 present/absent call algorithm (affy R package). DEGs were identified by applying limma with age and gender adjusting. Nominal pval < 0.01 was used for significance because we observed that no DEG could pass multiple testing (BH.pval > 0.05, see discussion section). This dataset includes samples for AD, MCI, CTL, as well as Parkinson disease (PD). We excluded PD samples after data normalisation.

The second evaluation blood gene expression dataset was downloaded from the Alzheimer’s Disease Neuroimaging Initiative website (ADNI, http://www.adni-info.org/). The ADNI was launched in 2003 as a public-private partnership led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers and clinical and neuropsychological assessment can be combined to measure the progression of MCI and early AD. In our study, we focused on the ADNI2 Caucasian population with disease status according to baseline diagnosis. This cohort has APOE4 information for each individual participant. Limma was applied to each APOE4 group (APOE4 = 0, APOE4 = 1, APOE4 = 2), adjusting for age, gender, RIN, RNA purity ratio A260/280 and A260/230 separately to detect DEGs between patients with AD and CTL, early MCI (EMCI) and CTL, late-MCI (LMCI) and CTL. A nominal p-value of <0.01 was used for significance since no DEG could pass multiple testing (see discussion section). We present results on the APOE4 = 1 group because there were similar numbers of cases for each disease status in this group, but very few AD cases in the other two APOE4 groups.

Microarray gene expression profile in human brain

The GSE84422 dataset includes human post-mortem brain samples taken from 19 brain regions for an AD study6. The cohort used is totally independent to the above blood cohorts. Gene expression profiles of 17 brain regions were generated by both Affymetrix U133A and U133B platforms, and profiles for other two regions were generated by the U133plus2 platform. We processed the raw CEL files as above, identified DEGs for each platform separately adjusted by age, gender, post-mortem interval (PMI) and pH values using limma, as applied in the original study6, and merged them together afterwards to obtain 19 lists of DEGs. Nominal pval < 0.01 was applied for significance, again since no DEG could pass multiple testing (i.e. BH.pval > 0.05). We only analysed definite AD and CTLs in the Caucasian ethnic group. Supplementary Table 1 indicates the sample size in each comparison group including the cases for blood datasets.

To clarify, within our study, DEGs either refer to array probesets, when we discuss DEGs within the same data cohorts, or unique genes (Entrez_Gene_ID), when we compare results from different cohorts for blood and brain.

Pathway analysis for DEGs

We performed pathway analysis on the identified DEGs using commercial Ingenuity® Pathway Analysis (IPA®, QIAGEN Redwood City, www.qiagen.com/ingenuity) software. We chose as significant those canonical pathways with BH.pval < 0.01.

Gene-based Analysis of GWAS data

The International Genomics of Alzheimer’s Project (IGAP) Consortium reported a large-scale of AD GWAS dataset1. The gene-based analysis tool MAGMA19 was applied to the IGAP stage 1 whole genome summary statistics (including 17,008 AD and 37,154 CTLs), with the 1000 genomes European reference panel used to perform the joint SNP gene-based GWAS study. We searched for single-nucleotide polymorphisms (SNPs) within 20 kb up/downstream of each gene (NCBI37.3). Two significance levels were applied, nominal pval < 0.01 and Bonferroni BF.pval < 0.05 to identify significant genes in GWAS, which we refer to as MAGMA genes. The qvalue package in R was also applied.

Biomarker discovery by machine learning

We attempted to identify blood biomarkers and classification models trained/learned from the GSE63060 dataset and tested in GSE63061, and vice versa. Data were adjusted for age and gender by a robust regression model (applying the rlm function in MASS R package); the model residual was further centred and scaled to a mean of zero and standard variation of one across all subjects in each dataset for those common probesets. We used the least absolute shrinkage and selection operator (LASSO) regression feature selection method20, implemented in the glmnet R package, to investigate the prediction performance of different ML approaches, including SVM, RF and logistic Ridge Regression (RR) models with a voting strategy to detect optimal biomarkers and classification models to discriminate AD patients from control subjects. The voting strategy of majority outcomes from the above three ML algorithms was applied to determine the final predictive outcome. The LASSO approach shrank most of the coefficients of variables that have no or less discriminatory power to zero, while variables with non-zero coefficients remained in the final LASSO model representing the joint discriminatory power to separate patients with AD and controls subjects21. An optimal penalty factor lambda was tuned during the cross-validation process. We repeated such LASSO regression with 5-fold cross-validation (CV) 100 times, and the subset of features with the best CV area under the curve (AUC) value for receiver operating characteristic (ROC), or most frequently selected on the training dataset, was kept as the selected biomarker panel (feature set). However, if the number of variables selected was less than two, then the feature set with sub-optimal AUC would be selected. Feature set selected by LASSO initially started from the full feature pool, i.e., 22756 common probesets between GSE63060 and GSE63061. For SVM and RF, we used the default setting when calculating the predict accuracy. For RR, we calculated the optimal cut-off from training with optimal AUC and accuracy, and then applied this cut-off to prediction in testing. Prediction performances of the classifiers were evaluated by AUC, test accuracy (ACC), sensitivity (Sens), and specificity (Spec). For comparison, the area under precision-recall curve (AUPR) were calculated as well using PRROC R package. ROC curves were plotted using the ROCR R package22. All this work was conducted by in-house R programs.

Results

Differentially expressed genes in blood were also found in the brain

DEGs identified in the blood merged discovery dataset included 4980 (4276 unique genes) and 6739 (5746 unique genes) probesets for AD and MCI respectively (Supplementary Fig. 1), with 4158 common probesets representing 3601 unique genes. Only 82 probesets (76 unique genes) were identified as DEGs comparing AD to MCI, and only three of these 82 were DEGs in both AD and MCI (Supplementary Fig. 2 and Supplementary Table 2). It was observed that DEGs in AD (AD-DEGs) are likely to be DEGs in MCI (MCI-DEGs) with a highly significant enrichment (OR = 29.1, 95%CI 26.7–31.7, pval < 1.0E-16, Fisher test). In addition, those common DEGs shared the same regulatory directions in both AD and MCI (Supplementary Fig. 2), i.e., 2018 of them were up-regulated both in AD and MCI, while 2140 of them were down-regulated in both AD and MCI. Moreover, those common DEGs have larger changes in MCI compared to controls than in AD compared to controls (wilcox.test p.val < 2.2e-16). It is interesting that this observation holds for all the DEGs in MCI and AD (wilcox.test p.val < 4.06e-7). In addition, both AD-DEGs and MCI-DEGs in blood were significantly associated (absolute Pearson correlation |r| > 0.5) with Braak pathological stage (OR > 1.4, pval < 8.62E-14) or frontal atrophy (OR > 1.2, pval < 8.4E-06) in the brain subjects with AD when mapped to the data in Zhang’s brain study4 (Supplementary Fig. 3). Furthermore, 789 AD-DEGs in blood were also DEGs identified by our previous meta-analysis in brain prefrontal cortex (PFC) region7 with significant enrichment (OR = 1.48, 95%CI 1.34–1.62, pval < 6.28E-16), and 77.9% of them showed the same regulation direction between blood and brain (pval < 2.2E-16, sign test). Similarly, we observed that 998 MCI-DEGs in blood are also DEGs in the brain of AD patients with significant enrichment (OR = 1.39, 95%CI 1.27–1.51, pval = 4.90E-13). Peters et al. recently identified 1497 genes as being differentially expressed with chronological age23, and we observed that AD-DEGs or MCI-DEGs in blood were likely to be ageing-associated genes (OR > 2.00, pval < 2.93E-36 for both, Supplementary Fig. 3). AD-DEGs in brain PFC region7 were also enriched with these ageing-associated genes, although with a slightly lower level of enrichment (OR = 1.8, 95%CI 1.6–2.1, pval < 2.2E-16).

Table 1 lists the top 20 DEGs common to both AD and MCI, the top 10 AD-only DEGs, and the top 10 MCI-only DEGs in blood (see Supplementary Table 2 for the whole list).

Table 1 The top DEGs in blood and their relationships with AD brain.

Validation using Gene expression in other blood datasets

Among the 374 DEGs identified in the GSE6613 validation dataset (see Methods and Supplementary Table 2), 357 were included in the merged discovery dataset. Although DEGs identified in the discovery dataset had an enrichment of DEGs identified in GSE6613 (OR = 2.37, 95%CI 1.91–2.95, pval = 8.35E-15; and OR = 2.78, 95%CI 2.24–3.46, pval = 2.74E-21, for AD and MCI respectively), only three of the top DEGs listed in Table 1 were re-discovered in GSE6613, namely WDFY3, TCIRG1, and NEMF/SDCCAG1.

In the ADNI2 dataset, we identified 416, 630, and 157 DEGs (unique genes) for AD, early MCI (EMCI) and late MCI (LMCI) disease status respectively (see Supplementary Table 2). Both AD-DEGs and MCI-DEGs identified in the merged discovery cohort were enriched with DEGs identified in ADNI2 AD (OR = 1.88, 95%CI 1.53–2.33, pval = 6.11E-09; OR = 2.02, 95%CI 1.65–2.48, 9.67E-12, for AD and MCI respectively, Supplementary Fig. 4). None of the top DEGs listed in Table 1 were re-discovered in the ADNI2 AD dataset. However, HELZ was identified as an early MCI-DEG in the sub-cohort of ADNI2 with APOE4 = 1 genotype. This gene had a 12% up-regulation in both blood of AD and blood of MCI in the merged discovery dataset. An exome sequencing study revealed that variants in HELZ are associated with intellectual disability24. HELZ functions as a RNA helicases, and RNA helicases are involved in almost every RNA related process, including transcription, splicing, ribosome biogenesis, translation and degradation. Therefore, HELZ may have associations with the pathogenesis of neurodegenerative disease including AD25.

Pathway Analysis shows large overlap between blood and brain

For the up-regulated AD-DEGs in blood, 119 significant canonical pathways were identified, including iNOS Signalling (BH.pval = 9.77E-7, ratio = 21/43); B-Cell Receptor Signalling (BH.pval = 3.55E-6, ratio = 48/178); JAK/Stat Signalling (BH.pval = 3.55E-6, ratio = 29/83); and Production of Nitric Oxide and Reactive Oxygen Species in Macrophages (PNOROS, BH.pval = 3.55E-6, ratio = 50/192). For the down-regulated AD-DEGs, only eight significant pathways were identified (Fig. 1 and Supplementary Table 3) including EIF2 Signalling (BH.pval = 3.98E-15, ratio = 64/210); Oxidative Phosphorylation (BH.pval = 2.00E-14, ratio = 39/92); Mitochondrial Dysfunction (BH.pval = 1.58E-11, ratio = 47/152); and Protein Ubiquitination (BH.pval = 7.92E-11, ratio = 63/254), Similarly, we identified 63 and nine significant canonical pathways for up- and down-regulated blood MCI-DEGs respectively. A total of 53 and seven significant pathways were overlapping between AD and MCI for up- and down-regulated DEGs respectively including the top pathways mentioned above. Therefore, 83.3% (60 out of 72) significant pathways identified in MCI were also identified in AD. In our previous gene expression meta-analysis, we identified 168 significant pathways in the brain PFC region7, and 60.1% of these (101 out of 168) were identified in either blood AD or blood MCI, including PNOROS (BH.pval = 1.26E-12, ratio = 44/180), NFkB Signalling (BH.pval = 1.26E-11, ratio = 41/173), iNOS Signalling (BH.pval = 5.37E-7,ratio = 15/44), Mitochondrial Dysfunction (BH.pval = 2.24E-06, ratio = 37/172), and Oxidative Phosphorylation (BH.pval = 4.27E-4, ratio = 24/110). Some pathways were only identified in either blood AD or blood MCI, but not in brain PFC region with AD, such as EIF2 Signalling, Protein Ubiquitination, and mTOR Signalling (see Supplementary Table 3).

Figure 1
figure 1

Significant pathways identified by IPA in the blood dataset. IPA was applied to the DEGs identified from the merged blood datasets (GSE63060 and GSE63061). We show the top ten significant pathways identified for the up-regulated DEGs (red bar) and pathways identified for the down-regulated DEGs (green bar). The dark blue curve shows the ratio between the number of DEGs and the total number of genes in each of these pathways (entire list of IPA pathways is in Supplementary Table 3). (a) Significant pathways for AD-DEGs. Top ten significant pathways identified for the up-regulated DEGs and eight pathways identified for the down-regulated DEGs in AD. (b) Significant pathways for MCI-DEGs. Top ten significant pathways identified for the up-regulated DEGs and nine pathways identified for the down-regulated DEGs in MCI.

Differentially expressed genes not uniform across brain regions

In total, we identified 5552 AD-DEGs (unique genes) in 19 brain sub-regions (Supplementary Table 4), with the numbers of DEGs varying from 14 (Precentral Gyrus) to 1904 (Superior Temporal Gyrus), and an average of 453 DEGs in each region. With such a divergent distribution across 19 brain regions (Table 2), we did not identify any super genes which were DEGs in all 19 brain regions. Two genes (AKAP9, NEBL) were identified as DEGs in eight brain regions, and 3640 DEGs were identified from only a single region. 1048 of these DEGs (18.9%) were identified in our previous meta-analysis in brain PFC region (OR = 1.78, 95%CI 1.64–1.94, pval < 3.53E-42). Figure 2 illustrates the DEGs in these 19 brain regions and the overlap with AD-DEGs or MCI-DEGs in blood. Among these 19 brain regions, Prefrontal Cortex (PC), Occipital Visual Cortex (OVC), and Dorsolateral Prefrontal Cortex (DPC) are the top three regions with the highest proportion of brain DEGs mapped to blood. Only 15% of brain DEGs in hippocampus (HIP) were identified as AD-DEGs in blood. In addition, the mappings of brain AD-DEGs to blood AD-DEGs and brain AD-DEGs to blood MCI-DEGs, were highly associated (R = 0.80, pval < 3.33E-05, Pearson test, Table 2).

Table 2 Numbers of DEGs identified in brain regions and their overlapping in blood.
Figure 2
figure 2

Number of DEGs common to both the blood and the different brain regions. Overlap between DEGs (up-regulated and down-regulated) identified in the merged blood datasets and DEGs identified in each of the 10 brain regions is shown as an arc, the area of which is proportional to the number of overlapping DEGs (see full name of brain region in Table 2).

Gene-based GWAS reveals potential new risk genes

In total, 18229 genes were identified in the IGAP stage 1 GWAS dataset by MAGMA, including all of the 39 GWAS risk genes in AD except INPP5D. Sixty seven MAGMA genes passed BF.pval < 0.05, including 17 AD risk genes, and 15 AD-DEGs and 20 MCI-DEGs in blood (Table 3). Among them, MS4A6A, MS4A4A, ABCA7, HLA-DRA, MTSS1L, NDUFS3, and CD2AP were identified as DEGs in the brain PFC region in our previous brain meta-analysis; Thirteen of them were differentially expressed in at least one brain region. ABCA7 showed 17%-, 19%-, and 13% significant expression fold changes in blood of AD, blood of MCI and brain of AD respectively; this gene may thus be a potential biomarker for early diagnosis. MS4A6A showed >10% down-regulation in blood, and >43% up-regulation in brain; NDUFS3 was >10% down-regulated, and HMHA1 >9% up-regulated in blood and brain. Although HMHA1 is not a risk gene in AD, it has been reported that methylation sites in this gene have a strong relationship to ABCA7 and AD pathologies26. In addition, BCL3, a proto-oncogene candidate, might be a potential novel risk gene for AD, because it was 27% up-regulated in AD brain and identified as a DEG in both AD blood and MCI blood. Supplementary Table 5 indicates the 751 IGAP MAGMA genes (nominal pval < 0.01) and the most significant SNPs in their 20kbp up/downstream regions. We identified 281 and 119 genes at 0.05 or 0.01 significance level respectively when FDR testing was applied.

Table 3 Results of gene-based GWAS analysis.

DEGs in blood did not show any enrichment for these IGAP MAGMA genes at the stringent significance level (BF.pval > 0.05). However, if we apply nominal pval < 0.01 for MAGMA (751 genes identified), both AD-DEGs and MCI-DEGs in blood show enrichment in IGAP genes (OR = 1.33, 95%CI 1.11–1.61, pval = 2.45E-03; OR = 1.36, 95%CI 1.14–1.62, pval = 5.33E-04, respectively). We previously identified 3124 AD-DEGs in the brain PFC region7, and those DEGs had enriched MAGMA genes either for BF.pval < 0.05 or nominal pval < 0.01 (OR = 2.27, 95%CI 1.23–4.02, pval = 5.67E-03; OR = 1.23, 95%CI 1.00–1.51, pval = 4.64E-02 respectively). These results revealed the significant associations between genomics and gene expression in AD.

Creation of potential biomarker panels by machine learning

Our aim here was to identify a set of biomarkers and classification models (classifiers) which can discriminate patients with AD from healthy control subjects, e.g. 143 patients with AD from 104 controls in GSE63060 or 102 patients with AD from 78 controls in GSE63061. We trained classifiers in one dataset and tested them in the other dataset (see Methods).

Figure 3a illustrates an optimal six-feature panel (named Full6set) that was identified by measuring area under the curve (AUC) performance for SVM, RR and RF (0.875, 0.874, 0.849 respectively). The voted AUC (the average of the three AUCs) was 0.866 with 0.783 (95%CI: 0.716–0.841) accuracy for voting outcome. The Full6set contains six probesets: ILMN_2097421 (MRPL51), ILMN_2189933 (RPL36AL), ILMN_1695645 (CETN2), ILMN_1703617 (AHSA1), ILMN_2237746 (ING3), and ILMN_1939297 (GALNT4). In Fig. 3b, an optimal four-feature panel (named Full4set) was identified containing ILMN_1784286 (NDUFA1), ILMN_2097421 (MRPL51), ILMN_2189933 (RPL36AL) and ILMN_2189936 (RPL36AL). SVM, RR and RF classification models had similar testing AUC performance (0.86, 0.86, 0.857) and accuracy (0.773, 0.765, 0.785) respectively. The voting strategy yielded the average AUC of 0.859 and accuracy of 0.781 (95%CI: 0.725–0.831) with balanced sensitivity (0.776) and specificity (0.788). See Supplementary Table 6 for further details.

Figure 3
figure 3

Classification performance of biomarker panels. Different machine learning models were trained in one blood dataset (GSE63060 or GSE63061) and tested in the other (GSE63061 or GSE63060). Results shown from the different ML models in (a,c) all use the same panel of six features (panel Full6set), while ML models in (b,d) use one with four features (panel Full4set). Full6set contains six probesets, i.e. ILMN_2097421 (MRPL51), ILMN_2189933 (RPL36AL), ILMN_1695645 (CETN2), ILMN_1703617 (AHSA1), ILMN_2237746 (ING3), and ILMN_1939297 (GALNT4). Full4set contains four probesets: ILMN_1784286 (NDUFA1), ILMN_2097421 (MRPL51), ILMN_2189933 (RPL36AL) and ILMN_2189936 (RPL36AL). The AUC of vote is the average testing AUCs of SVM, RR, and RF models. See Supplementary Table 6 for detailed performance.

All features in Full6set and Full4set were down-regulated DEGs in the blood merged discovery dataset, except GALNT4 which was an up-regulated DEG (Supplementary Table 2); the two common features, ILMN_2097421 (MRPL51) and ILMN_2189933 (RPL36AL), were the top DEGs in the blood but not in the brain. In order to test the robustness of the classification models and features used, we swapped the training dataset and testing dataset, i.e. we trained classification models in GSE63060 using Full4set then tested in GSE63061, and we trained models in GSE63061 using Full6set and tested in GSE63060. Their testing performances are illustrated in Fig. 3c,d, and Supplementary Table 6. The robustness of the selected features was also tested by random selection (Supplementary Fig. 6). The models using Full6set demonstrated similar classification performances to the models using Full4set. Voting AUC for Full6set models were 0.866 and 0.864 in the two testing datasets (GSE63060 and GSE63061 respectively) with an average of 0.865. For Full4set models, the values were 0.859, 0.875 with an average of 0.867. Moreover, when we used the models trained from AD vs. controls to discriminate MCI from controls, most of the MCI (>72%) were predicted to be AD (Supplementary Table 7). Supplementary Fig. 7 shows the boxplots and swarm plots of each of the features in Full4set where MCI samples were also included, which demonstrates that each of the features had good classification performance.

Discussion

In this study, we observed that in blood samples more DEGs were identified comparing MCI to controls than comparing AD to controls. This suggests that the trajectory from control to MCI to AD is surely not linear. In addition, under the current classification of MCI there are many clinical entities, not all evolving to AD in the same way or time (some MCI even revert to control). Therefore, it is possible that the increased differences we observed between MCI and controls reflect the MCI’s dynamic and heterogeneous state. On the contrary, overt AD is a more stable clinical entity with possibly a more defined gene expression signature. We also observed that AD-DEGs tended to have the same regulation direction as the MCI-DEGs in blood (only a few genes were identified as DEGs comparing AD to MCI samples), and the majority of those AD-DEGs that overlapped in the blood and brain showed consistent directions of regulation, suggesting the biomarkers to be investigated in blood can be potential early diagnostic signatures. Our study shows evidence for a role of ribosomal dysfunction. In blood, the top 10 up- and down-regulated AD-DEGs were also identified as MCI-DEGs, and included ribosomal protein genes such as MRPL51, RPL36AL, and RPS25. Ribosome dysfunction is an early event in AD27, and the abnormal tau-ribosomal interactions in tauopathy lead to a decrease in RNA translation28. Two recent studies reported that reducing ribosomal protein S6 kinase 1 expression improves spatial memory and synaptic plasticity in a mouse model of AD29, and there are striking overlaps between non-steroidal anti-inflammatory (NSAID) drugs-induced changes and gene expression in the blood of AD patients in the ribosome and oxidative phosphorylation pathways30. A novel mutation discovered in the gene NDUFA1 may also lead to a progressive mitochondrial complex I specific neurodegenerative disease31. TYK2 and STAT3 were identified as up-regulated DEGs in both blood and brain (Supplementary Table 2). Tyk2/Stat3 signalling mediates beta-amyloid-induced neuronal cell death in AD32. TYK2 encodes a member of the tyrosine kinase specifically for the Janus kinases (JAKs) protein families, and inhibition of JAK1/JAK3 may provide an efficient therapeutic agent for the treatment of inflammatory diseases33 which might benefit AD patients as well since inflammation drives progression of AD34. It is interesting to note that TCIRG1 showed a greater than 20% up-regulation in blood of AD, blood of MCI and brain of AD. Mutations in this gene can cause lower absolute neutrophil count and may be responsible for infantile malignant osteopetrosis (IMO) disease35,36. However, its role in AD or dementia is not yet proven, and it may be related to neutrophil function and immunity.

We observed that DEGs in blood have a high potential to be identified as DEGs in brain prefrontal cortex region (PFC) through enrichment analysis. Table 2 shows that DEGs in brain PFC, Superior Temporal Gyrus (STG), Inferior Temporal Gyrus (ITG) regions are commonly DEGs in blood. Few DEGs were identified in brain hippocampus (HIP) region due to the large shrinkage in HIP that radically reduces gene expressions, and these DEGs have a low likelihood of being identified as DEGs in blood. It is well known that the hippocampus, a critical region for learning and memory, is especially vulnerable to damage at early stages of AD, hippocampal volume is one of the best AD biomarkers for diagnosis. The brain temporal cortex including STG, ITG, HIP, etc. plays a critical role in cognitive processes, language comprehension, memory formation and recall6. Functional segmentation analysis revealed that AD patients exhibit stronger hippocampus-PFC functional connectivity37. Actually 27.8% of all the DEGs in brain (1544/5552) are also DEGs in AD blood with a significant enrichment (OR = 1.27, 95%CI: 1.18–1.38, pval = 9.8e-10, Fisher test); 2154 DEGs in brain are also DEGs in MCI blood with an enrichment (OR = 1.44, 95%CI 1.34–1.55, pval = 2.2e-16, Fisher test). This shows that gene expression in the blood is a strong representation of gene expressions in the brain.

It has been revealed that mitochondrial dysfunction and oxidative phosphorylation were identified in AD/MCI blood, AD brain and ageing brain, showing the relevance of mitochondrial function in AD38. In our present study, we also found strong evidence for dysregulation of the mitochondrial and oxidative phosphorylation pathways in the blood of patients with AD and MCI.

IGAP provides a powerful data resource for the study of AD and it has been explored by several research teams39,40. To our knowledge, our study is the first to integrate IGAP with datasets from the blood of AD, blood of MCI and brain of AD. Moreover, recent trans-ethnic GWAS identified five novel AD risk genes41 and three of them (TPBG, PFND1/HBEGF, BZRAP1-AS1) were MAGMA genes in our study. Fourteen out of 39 previously identified risk genes of AD were identified as DEGs in at least one brain region of this disease, including MAPT, APP, PSEN1 and ABCA7. Genes simultaneously differentially expressed in several brain regions may be AD-relevant risk genes. For example, AKAP9 was identified as a DEG in eight brain regions including the hippocampus, and two rare mutations in this gene were recently discovered as AD-associated loci by whole exome sequencing42.This gene is also at the significance border in blood (BH.pval = 0.033 and 0.012 for AD and MCI respectively). Moreover, Low et al. discovered that variants of NEBL are relevant to atrial fibrillation (AF) susceptibility43, and NEBL was identified as a DEG in eight brain regions with AF recognized as a risk factor for cognitive decline and dementia44.

Discovering biomarkers in blood for the diagnosis of AD at the earliest and mildest stages is always clinically required and would be hugely beneficial. Recently, Nakamura and colleagues demonstrated the ability of amyloid-β precursor protein APP669–711/Aβ1–42 and Aβ1–40/Aβ1–42 ratios, and their composites in plasma to predict brain amyloid-β burden with very high performances45. Despite the relatively expensive IP-MS measurement method used, their results bring new hope for blood biomarker-based early diagnosis for AD.

In this study, we identified an optimal classification panel of four features, Full4set, by the LASSO feature selection approach. By applying classifiers with Full4set, 75.4% and 72.7% of MCI were predicted as AD in GSE63061 and GSE63060 respectively (Supplementary Table 7). All features in Full4set were DEGs in blood, and this small feature size panel may have the potential to be applied in Point-of-Care (PoC) diagnostic devices that will be developed and validated in the future.

Our study has a number of limitations. For the two blood datasets (GSE63060 and GSE63061), which are the main focus of this study, we applied multiple testing for DEGs identification. However, for the two validation blood datasets and the brain multiple regions dataset, no DEGs could pass the multiple-testing (BH.pval > 0.05), i.e. no significant genes were identified after allowing for multiple testing. We therefore were forced to apply nominal p-value with a more stringent significance level (<0.01) for DEG detection. The sample sizes used in previous transcriptomic and proteomic studies of AD were generally small, particularly in post-mortem brain studies. Therefore, there was a limited power to identify dysfunctional genes. We observed that most of our DEGs had small effect size, and the small sample sizes (particularly in the brain studies) gave us low statistical powers which resulted in a high level of false positives for DEG detection when nominal p-values were applied. Applying multiple testing may lose information, and alternative network-based approaches could be applied for biomarker discovery4,46. In addition, more accurate and sensitive techniques are required to measure such gene expressions, for instance, droplet digital polymerase chain reaction (ddPCR)47 and RNA-seq48. Aside from sample size, another limitation is that the classification effect of any genetic risk factors was not taken into account due to lack of information availability, e.g. for APOE which may be the most important genetic risk factors for AD49. This may be a major limitation as the presence of the APOE4 allele has been shown to influence the classification algorithms based on medical imaging and cerebrospinal fluid (CSF) biomarkers50 (and by our unpublished works). Moreover, our classification model only included gene transcript information and the effect from ageing and gender was adjusted during the data pre-processing. Finally, although AUC-ROC together with Sensitivity/Specificity are frequently used as performance measurements in biomedical research, for example recently in Nakamura and colleagues’ study45, it has been reported that Precision/Recall and Area Under Precision Recall (AUPR) can provide more information in imbalanced dataset51. We had applied ROC with class-weight adjustment in our model training process, and so we compared these results to those obtained using AUPR to assess the effect of data imbalance (please see Supplementary Fig. 8 and Table 6). In general, AUPR values are a bit lower than AUC-ROC values indicating the effect of data imbalance in our case, and there might have be rooms to improve classification performance by applying AUPR in the feature selection process.

In conclusion, our study revealed that genes differentially expressed in the blood were likely to be differentially expressed in the brain and with the same regulation direction. Common pathways were identified and found to be shared among brain AD, blood AD and ageing brain. We also identified a four-feature panel classification model that discriminated between AD patients and controls with promising performances. A larger cohort study is now necessary to validate the reproducibility of this model’s results perhaps using target-based transcriptional measurement.