Changes in expression of VGF, SPECC1L, HLA-DRA and RANBP3L act with APOE E4 to alter risk for late onset Alzheimer’s disease

Branciamore, Sergio; Gogoshin, Grigoriy; Rodin, Andrei S.; Myers, Amanda J.

doi:10.1038/s41598-024-65010-7

Article
Open access
Published: 28 June 2024

Volume 14, article number 14954, (2024)

Scientific Reports

Sergio Branciamore¹,
Grigoriy Gogoshin¹,
Andrei S. Rodin¹ &
…
Amanda J. Myers^2,3,4,5

Abstract

While there are currently over 40 replicated genes with mapped risk alleles for Late Onset Alzheimer’s disease (LOAD), the Apolipoprotein E locus E4 haplotype is still the biggest driver of risk, with odds ratios for neuropathologically confirmed E44 carriers exceeding 30 (95% confidence interval 16.59–58.75). We sought to address whether the APOE E4 haplotype modifies expression globally through networks of expression to increase LOAD risk. We have used the Human Brainome data to build expression networks comparing APOE E4 carriers to non-carriers using scalable mixed-datatypes Bayesian network (BN) modeling. We have found that VGF had the greatest explanatory weight. High expression of VGF is a protective signal, even on the background of APOE E4 alleles. LOAD risk signals, considering an APOE background, include high levels of SPECC1L, HLA-DRA and RANBP3L. Our findings nominate several new transcripts, taking a combined approach to network building including known LOAD risk loci.

Introduction

Apolipoprotein E (APOE) is a major risk factor in both early onset Alzheimer’s disease (EOAD, onset before 65) and late onset Alzheimer’s disease (LOAD, onset after 65)¹. The APOE locus has three common haplotypes (APOE E2, APOE E3 and APOE E4) defined by two single nucleotide polymorphisms (rs7412 and rs429358). While the E4 haplotype is a major driver of risk¹, the E2 haplotype appears to be protective². We have found that considering APOE background can help to map additional factors involved in genetic risk³. These factors could either act in conjunction with APOE to increase or decrease risk or, conversely, act independently of the APOE locus. It is in this context that we decided to map how the APOE E4 risk allele can affect ‘omics profiles.
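The haplotype definition above can be illustrated with a minimal sketch. It assumes phased, forward-strand alleles at the two SNPs and uses the standard allele assignments (E2: T/T, E3: T/C, E4: C/C at rs429358/rs7412); the function names are hypothetical and are not taken from the study's pipeline.

```python
# Standard forward-strand allele pairs (rs429358, rs7412) for the common haplotypes.
HAPLOTYPE = {("T", "T"): "E2", ("T", "C"): "E3", ("C", "C"): "E4"}

def apoe_haplotype(rs429358: str, rs7412: str) -> str:
    """Return the APOE haplotype carried on one phased chromosome."""
    try:
        return HAPLOTYPE[(rs429358, rs7412)]
    except KeyError:
        # (C, T) is a rare combination that is not one of the three common haplotypes.
        raise ValueError(f"rare or invalid allele combination: {rs429358}/{rs7412}")

def e4_status(hap_a: str, hap_b: str) -> int:
    """Binary coding: 1 if at least one E4 haplotype is present, else 0."""
    return int("E4" in (hap_a, hap_b))

# Example: an E3/E4 individual is coded as APOE E4 positive.
print(e4_status(apoe_haplotype("T", "C"), apoe_haplotype("C", "C")))
```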

Our prior work^4,5,6,7 has involved analyzing expression profiles to determine the interplay of DNA, RNA, and protein in LOAD risk. We have used Bayesian network (BN) probabilistic / causal modeling to infer potential causality and we uncovered a potentially predictive transcript (in other words, a "hit") that replicated in two separate sets of data as well as cell culture models⁵. The choice of BN modeling (over alternative network methodologies, e.g., Markov networks and correlation-based co-expression networks) is motivated by a number of factors. First, BNs can (owing to their directed acyclic graph structure) filter out the numerous spurious (indirect, or transitive) dependencies inherent to multicollinear data rich in mutually dependent variables⁸, such as expression datasets. Second, BNs can model non-additive higher-order relationships, both linear and non-linear. Third, BNs can incorporate mixed data types (such as, in our case, expression data plus APOE haplotypes) into a unified analysis framework. Finally, and importantly, BN modeling is an explainable and segmentable statistical/machine learning approach, wherein each fragment of the network can be mechanistically and biologically interpreted as a probabilistic dependency and potential directional causality at high levels of granularity, and can be further scrutinized and validated in silico using “conventional” statistical tools. By concentrating on a variable of interest (e.g., LOAD phenotype) and its immediate neighborhood in the network, we can perform a classification task with clearly defined local conditional probability tables, something that the dimensionality reduction / clustering / visualization methods, such as UMAP⁹ or t-SNE¹⁰, cannot directly address.

Here, we sought to extend our prior BN work using a few modifications. First, we included the APOE E4 risk haplotype in our analysis. Second, we prioritized data inputs using mutual information (MI) as a variable selection filter, as opposed to differential expression (DE). MI measures the information gained about the state of one variable from knowledge of the state of another variable. Rather than rely on statistically significant differences in means (DE), MI estimates the overall dependence between two variables. Mixed MI (MMI) can deal with both quantitative and qualitative variables in the same space. Additionally, relationships can be given ranks based on the MMI cumulative distribution function (CDF), with only inputs showing high levels of MMI with the variable of interest (here diagnosis of LOAD) included. The final difference from our prior work is that we have used a different platform for BN modeling, BNOmics^11,12,13. BNOmics uses a novel hybrid (constraint-based + search-and-score) heuristic algorithm to increase scalability by significantly shrinking the search space, thus making a series of full BN modeling experiments with varying hyper-parameters (e.g., discretization schedules) computationally feasible (days instead of weeks) in our setting, and improving convergence and robustness. In addition, difficulties with handling both qualitative and quantitative measures in the same analysis (i.e., the mixed-variable problem) are solved by the novel MMI metric and by our use of adaptive maximum entropy-based discretization. This allows for robust handling of mixed type variables and their linear and non-linear interactions within a single analysis framework.
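To make the idea of MI as a dependence measure concrete, the following is a minimal sketch of the plug-in (empirical frequency) estimator for two discrete variables. It is a simplified stand-in and not the MMI metric implemented in BNOmics, which also handles continuous variables; the toy data and the function name are hypothetical.

```python
from collections import Counter
from math import log2

def mutual_information(xs, ys):
    """Plug-in mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    if n == 0 or n != len(ys):
        raise ValueError("inputs must be non-empty and equal in length")
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    # sum over observed cells of p(x,y) * log2( p(x,y) / (p(x) p(y)) )
    return sum(
        (c / n) * log2(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )

# Hypothetical toy data: expression bin (0 = low, 1 = medium, 2 = high) and diagnosis (0/1).
expr = [0, 0, 0, 1, 1, 1, 2, 2, 2, 2]
dx = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
print(mutual_information(expr, dx))
```

Unlike a difference in means, this quantity is zero only when the two variables are empirically independent, so it also responds to non-monotonic relationships across bins.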

In summary, we used BNOmics to analyze our Human Brainome datasets⁵ including the APOE E4 risk haplotype in the model, and also performed separate analyses in the group of individuals who were APOE E4 positive (Group 1) and APOE E4 negative (Group 2). Input data was selected by the transcripts which best explained LOAD status (scored as presence (1) / absence (0) of pathologically confirmed LOAD), and expression profiles were discretized to maximize both biological interpretability and rigor and reproducibility (by ensuring convergence in the search space). We focused on searching for de novo hits that were protective against APOE E4 risk as well as those that contributed to APOE E4 risk.

Subjects and methods

Human brain tissue samples

Sample sets are from The Human Brainome series⁵. Our KRONOSII series was obtained from 21 National Alzheimer’s Coordinating Center (NACC) brain banks and from the Miami Brain Bank as previously described^4,7,14. Additional cohorts were obtained in the same manner as the original US series. Our criteria for inclusion were as follows: self-defined ethnicity of European descent (in an attempt to control for the known allele frequency differences between ethnic groups), neuropathologically confirmed AD or no neuropathology present, and age of death greater than 65. Neuropathological diagnosis was defined by board-certified neuropathologists according to the standard NACC protocols¹⁵. Samples derived from subjects with a clinical history of stroke, cerebrovascular disease, Lewy bodies, or comorbidity with any other known neurological disease were excluded. Alzheimer’s disease or control neuropathology was confirmed by plaque and tangle assessment with 45% of the entire series undergoing Braak staging¹⁶.

The RUSH series includes deceased subjects from two large, prospectively followed cohorts maintained by investigators at Rush University Medical Center in Chicago, IL: The Religious Orders Study (ROS) and the Memory and Aging Project (MAP). The ROS cohort, established in 1994, consists of Catholic priests, nuns, and brothers from 40 groups in 12 states who were at least 55 years of age and free of known dementia at the time of enrollment. The MAP cohort, established in 1997, consists of men and women primarily from retirement facilities in the Chicago area who were at least 53 years of age and free of known dementia at the time of enrollment. All participants in ROS and MAP sign an informed consent agreeing to annual detailed clinical evaluations and cognitive tests, and the rate of follow-up exceeds 90%. Similarly, participants in both cohorts signed an Anatomical Gift Act donating their brains at the time of death. The overall autopsy rate exceeds 85%. The ROS and MAP cohorts were analyzed jointly since they were designed to be combined, are maintained by a single investigative team, and a large set of phenotypes collected are identical in both studies¹⁷. More detailed information regarding the two cohorts can be found in previously published literature¹⁸.

The series included 475 samples in the KRONOSII set (257 controls, 218 cases; age range 65–105; 58% female) and 306 samples in the replicate set (162 controls, 144 cases; age range 66–104; 63% female). Samples were chosen by the diagnostic criteria as stated above as well as the availability of high quality RNA. All samples were assessed for ancestry using principal components analysis and the 1000G dataset¹⁹. Samples in our study cluster with European ancestry (CEU, FIN, GBR, IBS populations).

Ethics statement

The research for this project falls under 45 CFR 46.101(b)(4). Human Subjects exemption #4 covers research involving the collection or study of existing data, documents, records, pathological specimens, or diagnostic specimens, if these sources are publicly available or if the information is recorded by the investigator in such a manner that subjects cannot be identified, directly or through identifiers linked to the subjects. In both series, samples were de-identified before receipt, and the study met human studies institutional review board and HIPAA regulations. Consents for biobanking are obtained from the source brain banks. This work is declared not human-subjects research and is IRB exempt under regulation 45 CFR 46. See the Acknowledgements section for a list of individual sites that contributed samples to this effort.

Data collection

Sample data was adjusted for several biological covariates (gender, age at death and cortical region) and several methodological covariates (institute source of sample, post-mortem interval, detection and hybridization date). Adjustment methodologies were as in the prior report⁵.

Data analysis-networks

Our focus was on the construction of high-dimensional multimodal Bayesian networks (BNs). As before⁵, we analyzed the KRONOSII and the RUSH datasets separately. Our analysis pipeline is shown in Fig. 1. APOE status was defined as either having no APOE E4 alleles (APOE negative) or at least one APOE E4 allele (APOE positive).

All analyses were performed using the BNOmics platform. BNOmics is a highly scalable open-source universal-purpose BN modeling and visualization software developed by us previously^11,12,20. Each analysis involved the following:

MMI. The goal of the MMI-based variable selection/ranking was to select a more compact set of transcripts for subsequent BN analysis. We selected two series of 2000 top expression transcripts from the KRONOSII and RUSH data, respectively, as 2000 constituted a reasonable compromise between the computational efficiency of the full BN construction (hours to 1–2 days for each BN) and low probability of false negatives (given the smooth and monotonic nature of the MMI curves and the fact that the resulting Markov blankets of LOAD phenotype contained no more than 10–30 variables).

DISCRETIZATION. In order for our BN analyses to proceed, expression data first must be discretized. We used the multinomial model with the maximum entropy-based discretization, as the linear Gaussian model is not always a good fit with the transcript expression data²¹. We discretized the continuous transcript expression data into 3 bins (“high–medium–low”), which showed stable BN convergence and was also the most convenient discretization schedule from the biological interpretation standpoint. To ensure that our discretization was robust and did not create artifacts such as false dependencies or independencies, we additionally plotted the full distribution using the ggstatsplot package in R²² for each of our main targets (continuous values) against disease risk.
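For a fixed number of bins, the entropy of a discretized variable is maximal when the bins are equally populated. The sketch below therefore uses equal-frequency (tertile) binning as the simplest instance of a maximum entropy discretization into three bins. This is an assumption made for illustration: the adaptive procedure used in BNOmics may differ, and `tertile_bins` is a hypothetical helper.

```python
def tertile_bins(values):
    """Assign each value to bin 0 (low), 1 (medium) or 2 (high) by rank.

    Ties are broken by input order, so the three bins are as equally
    populated as the sample size allows.
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    bins = [0] * n
    for rank, i in enumerate(order):
        bins[i] = 3 * rank // n  # rank in [0, n) maps to 0, 1 or 2
    return bins

# Hypothetical normalized expression values for one transcript.
print(tertile_bins([5.1, 0.2, 3.3, 9.0, 1.1, 7.7]))  # [1, 0, 1, 2, 0, 2]
```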

BNOmics. We used a hybrid (constraint-based + search-and-score) algorithm with MDL scoring function previously described in^11,12,20, with 20 restarts. BN edges are directed; however, a directed edge in the BN does not necessarily imply either directional causality or the biological hierarchy. Rather, edge directionalities describe the dependency structure between the variables (the factorization of the joint probability of the variables in the BN) that is most likely given the data, largely resolving multicollinearity issues. To identify the most salient transcripts, Markov neighborhoods (MNs) of the primary variable of interest (LOAD status, denoted as “DX” node in the networks) were used. MN is a simplification of the more rigorous concept of the Markov blanket (MB) that includes only the variables that are immediately adjacent to the variable/node of interest (such as DX) in the network. MB, consisting of the node’s “parents”, “children”, and other “parents” of the node’s “children”, is a superset of MN; given the MB of the variable of interest, it is independent of the remaining variables in the network, allowing us to concentrate on a small subset of variables to identify and highlight potential hits.
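The distinction between the Markov neighborhood and the Markov blanket can be sketched on a toy directed acyclic graph. The graph below is hypothetical (node names A to D are placeholders and do not correspond to any edge reported in this study), and the DAG is represented simply as a mapping from each node to its set of parents.

```python
def markov_neighborhood(parents, node):
    """Parents and children of `node` (the immediately adjacent variables)."""
    pa = set(parents.get(node, ()))
    children = {v for v, ps in parents.items() if node in ps}
    return pa | children

def markov_blanket(parents, node):
    """Parents, children, and the other parents of the children of `node`."""
    children = {v for v, ps in parents.items() if node in ps}
    co_parents = {p for c in children for p in parents[c]} - {node}
    return markov_neighborhood(parents, node) | co_parents

# Hypothetical toy DAG: A -> DX -> B <- C -> D
toy_dag = {"A": set(), "DX": {"A"}, "B": {"DX", "C"}, "C": set(), "D": {"C"}}

print(sorted(markov_neighborhood(toy_dag, "DX")))  # ['A', 'B']
print(sorted(markov_blanket(toy_dag, "DX")))       # ['A', 'B', 'C']
```

In this toy graph C enters the blanket only because it shares the child B with DX, while D lies outside the blanket and is conditionally independent of DX given A, B and C.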

Data analysis-dataset similarity

To examine global similarity in transcript profiles, for each dataset \(D\), KRONOSII (\(K\)) and RUSH (\(R\)), we computed the Mixed Mutual Information (MMI)^13,23 between the level of expression of each transcript \(i\) (\(g_i\)) and the corresponding binary variable associated with the LOAD status (DX) as \(MMI^{D}(g_i, DX)\). To simplify notation, we define \(I_i^{D} = MMI^{D}(g_i, DX)\). Subsequently, we define two transcript subsets \(r_p\) and \(k_p\) as \(r_p = \{g_i : I_i^{R} > x_R\}\) and \(k_p = \{g_i : I_i^{K} > x_K\}\), with \(x_D\) corresponding to the \(p\) percentile value in the \(MMI^{D}\) distribution. We used the Jaccard Index (JI) to measure the similarity between \(r_p\) and \(k_p\). JI is defined as the size of the intersection divided by the size of the union of the sample sets. In our analysis we have computed JI using different cut-offs of MMI as \(JI(G,p) = \frac{|r_p \cap k_p|}{|r_p \cup k_p|}\), with \(G\) being the set of all transcript labels in KRONOSII and RUSH. Similar to a common practice in transcript-set enrichment analysis, a hypergeometric distribution was used to compute the probability (\(P_{val}\)) of observing two subsets sampled randomly without replacement from \(G\) of size equal to \(r_p\) and \(k_p\) and with JI greater than or equal to that observed for the corresponding \(r_p\) and \(k_p\). APOE genetic similarity between the two datasets was determined by looking at haplotype frequencies.
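The Jaccard index and the associated hypergeometric tail probability can be sketched as follows. Because the two subset sizes are fixed, JI increases monotonically with the size of the intersection, so the probability of a JI at least as large as observed equals the probability of an overlap at least as large as observed. The function and argument names are hypothetical, and the sketch assumes both subsets are drawn from the same label set of size `n_total`.

```python
from math import comb

def jaccard(a: set, b: set) -> float:
    """Size of the intersection divided by the size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def overlap_p_value(n_total: int, size_r: int, size_k: int, n_shared: int) -> float:
    """P(X >= n_shared) for X ~ Hypergeometric(n_total, size_r, size_k).

    X is the overlap between two random subsets of sizes size_r and size_k
    drawn without replacement from n_total labels.
    """
    upper = min(size_r, size_k)
    tail = sum(
        comb(size_r, x) * comb(n_total - size_r, size_k - x)
        for x in range(n_shared, upper + 1)
    )
    return tail / comb(n_total, size_k)

# Hypothetical toy example: two 5-label subsets of a 10-label universe.
r = {"g1", "g2", "g3", "g4", "g5"}
k = {"g3", "g4", "g5", "g6", "g7"}
print(jaccard(r, k), overlap_p_value(10, len(r), len(k), len(r & k)))
```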

Data analysis-visualizations

MBs of the variable of interest (LOAD status / DX) were visualized as parts of the full BNs. Notably, these are the fragments of the full BNs, and not the small BNs built from the further reduced variable sets. The MBs included dependencies between the peripheral variables. The MMI CDFs were plotted for KRONOSII and RUSH, and specific hits were mapped to determine how much explanatory weight they had in each dataset according to a univariate MMI criterion. To dissect the effects (and their directions) of top BN-identified hits, two-level Conditional Probability Trees (CPTs) were created using the local conditional probability tables. CPT visualizations are intuitively similar to the conventional decision trees.
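A two-level conditional probability tree is a nested display of a local conditional probability table. The sketch below tabulates P(DX = 1 | APOE, expression bin) from raw records and prints it as a tree. The records are hypothetical toy data, not values from either cohort, and the coding (APOE 0/1, bins 0 to 2) follows the conventions used in the figures.

```python
from collections import defaultdict

def conditional_probability_table(records):
    """records: iterable of (apoe, expr_bin, dx). Returns {(apoe, expr_bin): P(dx = 1)}."""
    cases = defaultdict(int)
    totals = defaultdict(int)
    for apoe, expr_bin, dx in records:
        totals[(apoe, expr_bin)] += 1
        cases[(apoe, expr_bin)] += dx
    return {key: cases[key] / totals[key] for key in sorted(totals)}

# Hypothetical toy records: (APOE E4 status, expression bin, diagnosis).
toy_records = [
    (0, 2, 0), (0, 2, 0), (0, 2, 0), (0, 2, 1),
    (1, 0, 1), (1, 0, 1), (1, 0, 1), (1, 0, 0),
    (1, 2, 1), (1, 2, 0),
]

cpt = conditional_probability_table(toy_records)
for apoe in sorted({a for a, _ in cpt}):
    print(f"APOE = {apoe}")
    for (a, expr_bin), p in cpt.items():
        if a == apoe:
            print(f"  expression bin = {expr_bin}: P(LOAD) = {p:.2f}")
```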

Results

Figure 2 depicts the MB of LOAD diagnosis (DX) for the 3-bin discretized expression data plus APOE genotype for KRONOSII and RUSH. We obtained converging structures in both sets. The top hits in KRONOSII were APOE E4 presence/absence, VGF Nerve Growth Factor Inducible (VGF), Sperm Antigen with Calponin Homology and Coiled-Coil Domains 1 Like (SPECC1L), and Major Histocompatibility Complex, Class II, DR Alpha (HLA-DRA). The top hits in RUSH were APOE, Kinesin Family Member 5A (KIF5A) and Ornithine Decarboxylase (ODC1). Violin plots of normalized, but not discretized profiles for all top targets are given in Supplementary Fig. 1 demonstrating our MB results (dependencies) were not due to possible discretization artifacts. Of note, for the small set of top explanatory hits for LOAD, only APOE E4 presence/absence replicated between the two datasets.

To investigate why there was little replication, we compared the expression input from KRONOSII and RUSH and examined the haplotype frequencies for APOE E4. We used the JI to compare the MMI expression datasets and found that at high levels of mutual information (> 60%), as we had in our top 2000 MMI input sets, the KRONOSII and RUSH expression datasets overlap, as measured by the JI (red line, Supplementary Fig. 2A). This overlap is significantly greater than expected under a random distribution of the JI (Supplementary Fig. 2B). From this we conclude that, as in our prior work⁵, the KRONOSII and RUSH expression datasets on their own are globally similar.

In this study, we are interested in including genetics in our modeling, specifically the APOE haplotype. To examine whether this was a factor impeding reproducibility, we plotted the frequencies of the main APOE haplotypes in each dataset. Of note, in Supplementary Fig. 3 there is a very low number of APOE E4 carriers in the RUSH cohort. This effectively reduces the number of samples contributing to mapping APOE E4 effects to ~ 40% of that in KRONOSII (KRONOSII APOE E4 positive controls n = 56; RUSH APOE E4 positive controls n = 22; KRONOSII APOE E4 positive cases n = 150; RUSH APOE E4 positive cases n = 57). This difference in APOE haplotype frequencies was not just due to sampling, given the KRONOSII set is larger (n = 475) than the RUSH dataset (n = 306). Examining APOE frequencies in additional RUSH samples (n = 458) where expression was not profiled due to tissue availability and/or quality, the APOE haplotype frequencies were very similar to the current RUSH set (APOE E22 current set = 1%; expanded set = 0.9%; APOE E23 current set = 13.6%; expanded set = 12.9%; APOE E33 current set = 58.5%; expanded set = 59.8%; APOE E24 current set = 2.7%; expanded set = 2.6%; APOE E34 current set = 23.1%; expanded set = 22.1%; APOE E44 current set = 1%; expanded set = 1.3%).
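The APOE E4 positive counts quoted above can be tallied directly. The short calculation below uses only the sample sizes given in the text; the dictionary layout and variable names are illustrative.

```python
# Counts as reported in the text (APOE E4 positive controls and cases, cohort totals).
counts = {
    "KRONOSII": {"controls": 56, "cases": 150, "total": 475},
    "RUSH": {"controls": 22, "cases": 57, "total": 306},
}

e4_positive = {name: c["controls"] + c["cases"] for name, c in counts.items()}
e4_fraction = {name: e4_positive[name] / c["total"] for name, c in counts.items()}
rush_vs_kronos = e4_positive["RUSH"] / e4_positive["KRONOSII"]

print(e4_positive)   # {'KRONOSII': 206, 'RUSH': 79}
print({k: round(v, 3) for k, v in e4_fraction.items()})
print(round(rush_vs_kronos, 3))
```

On these numbers, RUSH contributes 79 APOE E4 positive samples against 206 in KRONOSII, roughly 38% as many, and APOE E4 carriers make up about 26% of RUSH compared with about 43% of KRONOSII.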

This reduction in power can clearly be seen in Fig. 2, where the explanatory weights (edge strengths) for diagnosis and APOE in RUSH are ~ 40% of those in KRONOSII (RUSH APOE E4 explanatory weight = 21.24; KRONOSII APOE E4 explanatory weight = 58.19). While proportionality of the BN scoring function (MDL, in our case) to the sample size makes direct inter-dataset comparisons and transfer learning between the datasets (cohorts) difficult, we believe it is likely that this reduction in power due to fewer E44 haplotypes in the RUSH cohort is contributing to the lack of reproducibility; generally, we are seeing much larger effects in KRONOSII. We therefore decided to focus on the KRONOSII dataset for uncovering effects. We discuss the limitations of network comparisons in the Discussion section below.

To attempt to validate the results in KRONOSII, we split the dataset into APOE E4 positive individuals and APOE E4 negative individuals and ran these two groups separately. While not as ideal as a completely independent dataset, this approach does serve as a validation of effects given that MMI predictions and BN structures are produced independently from the prior runs using the entire KRONOSII series. Figure 3 shows the MB for the two separate groups of APOE E4 positive and APOE E4 negative samples in KRONOSII. Again, VGF is a top hit. In the KRONOSII group without the APOE risk haplotype, VGF and Homo sapiens unc-13 homolog B (UNC13B) had the greatest explanatory weight for pathologically confirmed LOAD. In the KRONOSII group with the APOE risk haplotype, RAN Binding Protein 3 like (RANBP3L) had the greatest explanatory weight for pathologically confirmed LOAD. Violin plots of normalized, but not discretized profiles for all top targets are given in Supplementary Fig. 4.

While MBs indicate patterns of dependencies and strengths, they do not indicate how the dependency is manifested, e.g., whether the transcript is highly expressed with APOE E4 haplotypes, or vice versa. We were interested to see how our case and control groups split by the level of expression and copies of APOE E4 haplotype. Figures 4, 5, 6 and 7 and Supplementary Figs. 5, 6 show conditional probability trees (CPT), which are decision tree-like visualizations of the conditional probability tables for each of the main hits. We found that VGF, SPECC1L, HLA-DRA and RANBP3L gave strong consistent signals and acted in concert with APOE E4 haplotypes. While UNC13B and NEU4 gave strong signals in the network, their signals were less consistent in direction.

High levels of VGF and no APOE E4 alleles were most commonly found in the group of controls, with only 8% of the samples in this set having a pathologically confirmed diagnosis of LOAD (Fig. 4 top box, APOE = 0, VGF = 2). The greatest proportion of cases was found in the group of individuals with APOE E4 haplotypes and low levels of VGF (Fig. 4 bottom box, APOE = 1, VGF = 0). Of interest, there is a risk reduction in individuals with high levels of VGF, even if they possess APOE E4 alleles (Fig. 4 yellow highlight, APOE = 1, VGF = 2), indicating that high levels of VGF can be protective on an APOE background. This is a significant effect (comparing high VGF expression and APOE E4 presence with low VGF expression and APOE E4 presence: OR = 2.2810; 95% CI 1.2217–4.2588; p-value = 0.0096) with an effective reduction in probability of ~ 40%.
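The reported odds ratio, confidence interval and p-value can be checked for internal consistency. The sketch below assumes the interval is a standard Wald interval on the log odds ratio scale (an assumption, since the test used is not stated here), recovers the standard error from the interval width, and computes a two-sided normal p-value. The function name is hypothetical.

```python
from math import erfc, log, sqrt

def p_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Back out the z statistic and two-sided p-value from an OR and its 95% CI.

    Assumes a Wald interval: log(OR) +/- z_crit * SE.
    """
    se = (log(ci_high) - log(ci_low)) / (2 * z_crit)
    z = log(odds_ratio) / se
    p = erfc(abs(z) / sqrt(2))  # two-sided tail of the standard normal
    return z, p

z, p = p_from_or_ci(2.2810, 1.2217, 4.2588)
print(round(z, 3), round(p, 4))
```

Under this assumption the recovered p-value is approximately 0.0096, in agreement with the value quoted in the text.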

For risk effects, we have mapped SPECC1L, HLA-DRA and RANBP3L. All of these elevate risk with increasing expression in a linear fashion with the APOE E4 haplotype. SPECC1L (Fig. 5) and HLA-DRA (Fig. 6) were seen without splitting the KRONOSII cohort by APOE E4 haplotypes (see Fig. 2). RANBP3L (Fig. 7A) was only ranked in the network when we split the cohort by APOE E4 haplotype (see Fig. 3). To investigate this effect, we traced the path of RANBP3L in the full cohort (Fig. 7B). RANBP3L was "missed" in the full dataset because it is more than one step removed from the diagnosis (DX) signal and, in terms of network informative content, it acts through VGF. VGF is our strongest signal, and thus, by splitting the sample into APOE E4 negatives, where VGF has the most effect given it is protective, and APOE E4 positives, where VGF has less of an effect, we are able to uncover RANBP3L.

As stated previously, UNC13B (Supplementary Fig. 5) and NEU4 (Supplementary Fig. 6) did not give a consistent signal in the CPT analysis. For UNC13B, consistency in expression is only seen in the APOE E4 haplotype absent set (APOE = 0 branch, top of figure). Note that in multinomial (and therefore multi-bin) BN analysis with more than two bins, relationships do not have to be linear; therefore, “consistency” in this sense indicates relationships with sustained “low–medium–high” or “high–medium–low” directions. There is a modest increase in the probability of LOAD with high UNC13B expression if there are no APOE E4 haplotypes present. This is logical given these hits were only uncovered in the split analysis under the condition where the input KRONOSII dataset had no APOE E4 alleles (Fig. 3A). NEU4 was also found in the MB of the network built from individuals without APOE E4 haplotypes; however, unlike UNC13B there was no real relationship between high levels of expression or low levels of expression and risk. This suggests that, first, the minimum description length (MDL) scoring function in BNs tends towards higher sensitivity in general and, second, multinomial models capture differences at the distribution level that might not directly translate into the sustained (linear) directions. Therefore, “borderline” (lower) edge strengths in the MBs should be further scrutinized using the local probability tables (visualized as CPTs) and validated using traditional statistical univariate tests.

Finally, we were interested in searching RUSH for the main effects we found in KRONOSII and in comparing the level of MMI (a univariate metric that does not account for multicollinearity and does not distinguish between direct and transitive dependencies) with diagnosis for each transcript. We considered the RUSH cohort as well because, while the reduced number of APOE E4 samples limits the power to map APOE E4 contingent structures, we may still be able to map effects considering MMI, which is a direct relationship between the hits and LOAD diagnosis. Figure 8 graphs the MMI CDFs for KRONOSII and RUSH, considering our main hits. As can be seen, VGF and APOE E4 haplotype (“APOE_genetic”) have extremely high explanatory signals in the KRONOSII data (solid lines), with the VGF signal actually exceeding APOE in KRONOSII. VGF is also high in the RUSH dataset (dotted lines), but not strong enough to overtake the main APOE signal. Interestingly, RANBP3L is also an MMI hit in both datasets, whereas our other hits only appear to have strong effects in KRONOSII. Finally, we tested APOE expression levels for MMI with DX. APOE expression is not a robust signal in either set. This matches our finding that there is no relationship between APOE haplotypes and APOE expression when just plotting expression without considering MMI (Supplementary Fig. 7).

Discussion

In this study we have demonstrated the high utility of the BN-driven computational systems biology approach for studying LOAD. The principal advantages of BN modeling outlined in the Introduction above (removal of spurious relationships; mixed data types handling; no linearity or normality assumptions; combination of both network visualization/interpretation and classification, at high levels of granularity) proved to be instrumental in our analysis. In this, we concur with the recent LOAD literature^24,25,26. We believe BNs to be a robust, flexible and fitting tool, around which further multiscale network modeling in the LOAD context should be centered. The main impediment to a wider BN modeling application is the limited scalability due to the high computational complexity; however, this is now less of an issue due to increasingly better hardware and continuing progress in both algorithmic development and software implementation^11,24,27. Notably, it is precisely this high computational complexity that enables filtering out spurious correlations and transitive dependencies, inherent to datasets with complex correlation structure, which is the case with LOAD. This distinguishes BN modeling from the more straightforward (i.e., relying on pairwise correlations) network-centric approaches currently predominant in the LOAD computational analyses, such as protein–protein interaction (PPI) networks, gene co-expression and regulatory networks, and multi-scale networks^28,29,30,31. Concurrently, BN modeling, although a foundational machine learning (ML) method, emphasizes intrinsic interpretability, which contrasts favorably with the majority of the high predictive performance-oriented ML/DL (deep learning) methods that have recently been gaining traction in the LOAD space^31,32,33, as DL explainability is largely limited to the ex post facto feature attribution and/or broad DL layer-level interpretation. Overall, we see BNs as a “happy medium” methodology, complementing both conventional network-centric approaches and emerging DL techniques in a LOAD computational analysis toolkit.

Our most robust hit was VGF. High levels of VGF are protective against the development of Alzheimer's disease generally, and on an APOE E4 risk background there is approximately a 50% reduction in the proportion of individuals with LOAD when there are high levels of VGF (see Fig. 4, yellow box). This hit was first implicated in LOAD using SELDI-TOF–MS to find novel biomarkers in patient CSF³⁴. This initial CSF result has now been validated in the Alzheimer's Disease Neuroimaging Initiative (ADNI)³⁵. Differences in VGF profiles have also been mapped in brain tissues^24,36, further implicating this hit. VGF (non-acronymic) is a granin-like neuropeptide precursor protein which is processed by prohormone convertases³⁷. Precursor cleavage results in several downstream peptides with various functions including inflammation³⁸, pain³⁸, reproduction³⁹, energy metabolism/feeding⁴⁰, and circadian rhythms⁴¹. Central to all these functions is the crucial role that VGF plays in the secretory pathway along with other members of the granin family⁴². The secretory pathway has long been implicated in Alzheimer's disease pathogenesis with the first findings coming from studies of EOAD, where the Swedish mutation results in differences in Amyloid precursor protein (APP) sorting⁴³. APOE pathogenesis in LOAD also involves pathway regulation; however, in this case APP pathogenesis is likely caused by faster endocytosis through APOE E4 binding of the LRP1 receptor⁴⁴. It remains to be seen whether VGF may block this process directly, or whether the protection from overexpressed VGF is merely due to its growth factor properties.

For all the other hits we mapped, increases in expression caused increases in risk for LOAD development. SPECC1L colocalizes with tubulin and actin. Deficiencies in SPECC1L protein expression can lead to deficits in vertebrate facial morphogenesis⁴⁵. SPECC1L is thought to be a regulator of adherens junction stability and remodeling of the actin cytoskeleton⁴⁶. Loss of SPECC1L increases staining of adherens junctions⁴⁷, and therefore, increased expression of SPECC1L, as seen in our study, may be involved in risk via breakdowns in blood brain barrier (BBB) integrity. APOE also acts on BBB integrity. Specifically, the E4 containing forms of apoE increase age-dependent breakdowns in the BBB in mouse models⁴⁸.

HLA-DRA is a member of the HLA class II set of proteins, which sit on the surface of antigen presenting cells and act in pathogen recognition. Upregulation of HLA class II proteins is a marker of activated microglia in LOAD⁴⁹ and AD patients have a higher load of CD4⁺HLA-DR⁺ and CD8⁺HLA-DR⁺ lymphocytes⁵⁰. In large, multisite genome-wide association screens, variants near the HLA-DRB5-DRB1 cluster just downstream of HLA-DRA are consistently found to be associated with LOAD risk^51,52. HLA is known to be involved in immune inflammatory response⁵³, as well as CNS plasticity and signal transmission for the HLA MHC class I proteins⁵⁴. In the context of APOE genetic risk, prior work has focused on class I proteins, showing protective effects of HLA-DRB haplotypes⁵⁵; however, this is counter to the GWAS results, where HLA-DRB and HLA-DQA/B are risk factors^51,52. Further work has suggested that HLA-antigen incongruence resulting in persistent immune activation can lead to risk for AD⁵⁶, which is in concordance with our finding of HLA-DRA activation. It remains to be seen how APOE haplotypes, HLA-DRB haplotypes and HLA-DRA expression activation interact.

RANBP3L is a nuclear pore protein which enables nuclear protein translocation through the nuclear pore complex (NPC). RANBP3L specifically acts on BMP-specific SMAD 1/5/8 proteins and terminates BMP signaling by blocking nuclear import⁵⁷. Reductions in SMAD signaling have been linked to AD pathogenesis in both neurons and glia^58,59. Prior work on RANs in AD showed reduced expression of RAN, RANBP1, RANBP2, RANBP5, RANBP9 and RANBP10 in AD tissues, and proposed that the toxic forms of APP knocked down RANs and thus reduced import into the nucleus⁶⁰. RANBP3L was not measured in that screen. In our series we also see statistically significant reductions in RAN, RANBP1, RANBP6, and RANBP9, which is consistent with the prior report⁵; however, none of these downregulated hits were crucial to the APOE effect. In our study the crucial effect is an upregulation of RANBP3L, which blocks SMAD signaling by blocking nuclear import. It remains to be seen how APOE E4 acts in concert with RANBP3L.

Our results complement other recent studies exploring multiscale network analyses in the LOAD context. Guo et al.⁶¹ used multiscale network analyses applied to large-scale human postmortem brain transcriptomic data of LOAD from two cohorts to dissect the interplay of APOE, sex, and LDL receptor related protein 10 (LRP10) as a key LOAD driver. Pan et al.⁶² converged on a VGF/DUSP4 (Dual-Specificity Protein Phosphatase 4) network. Neff et al.⁶³ identified, via multiscale network analysis, LOAD subtype-specific drivers such as GABRB2, LRP10, MSN, PLP1, and ATP6V1A. These and other recent studies illustrate the growing power and the emerging promise of multiscale and integrative network approaches to reveal and subtype novel LOAD biology^64,65,66,67,68, and underline the ability of such approaches to dissect the interactions between APOE, VGF and emerging novel LOAD targets.

A notable limitation of our BN approach is the unbounded nature of the conventional BN scoring criteria (MDL/BIC, AIC). MDL scales approximately linearly with sample size and depends on the set of variables included and on the global network context, so edge strengths are not commensurable across BNs and direct inter-cohort comparisons are difficult. This incongruity contributed to our difficulties in reconciling the KRONOS and RUSH cohort results. An additional limitation is the absence of a principled, rigorous power analysis framework for BN modeling in the genetic epidemiology context. In the future, we plan to explore alternative, commensurate network scoring criteria and edge strength measures.
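
The sample-size dependence of MDL can be illustrated with a minimal Python sketch. It is not the BNOmics implementation: the two-node binary network (X → Y), the toy counts, the function names and the definition of edge strength as an MDL difference are all assumptions made for illustration. The same joint distribution is scored once on a toy cohort and once on that cohort duplicated.

```python
import math
from collections import Counter


def log_likelihood(pairs, with_edge):
    """Maximum-likelihood log-likelihood of (x, y) pairs under X -> Y, or under X and Y independent."""
    n = len(pairs)
    count_x = Counter(x for x, _ in pairs)
    count_y = Counter(y for _, y in pairs)
    count_xy = Counter(pairs)
    # Contribution of the root node X: sum of log P(x).
    ll = sum(c * math.log(c / n) for c in count_x.values())
    if with_edge:
        # Y depends on X: sum of log P(y | x).
        ll += sum(c * math.log(c / count_x[x]) for (x, _), c in count_xy.items())
    else:
        # Y has no parent: sum of log P(y).
        ll += sum(c * math.log(c / n) for c in count_y.values())
    return ll


def mdl(pairs, with_edge):
    """MDL/BIC-style score: negative log-likelihood plus (k / 2) * log(N). Lower is better."""
    n = len(pairs)
    k = 3 if with_edge else 2  # free parameters for two binary variables
    return -log_likelihood(pairs, with_edge) + 0.5 * k * math.log(n)


def edge_strength(pairs):
    """Illustrative edge strength: MDL improvement gained by adding the edge X -> Y."""
    return mdl(pairs, False) - mdl(pairs, True)


# Hypothetical toy cohort (N = 100) and the same cohort duplicated (N = 200).
cohort_n = [(0, 0)] * 40 + [(0, 1)] * 10 + [(1, 0)] * 20 + [(1, 1)] * 30
cohort_2n = cohort_n * 2

for name, cohort in (("N", cohort_n), ("2N", cohort_2n)):
    print(name, "MDL:", round(mdl(cohort, True), 2),
          "edge strength:", round(edge_strength(cohort), 2))
```

Although the dependence between X and Y is identical in the two toy cohorts, both the network score and the edge strength roughly double with the sample size, which is why raw MDL-based edge strengths from cohorts of different size cannot be compared directly.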

In conclusion, our study seeks to define the effect of APOE risk haplotypes on downstream expression while remaining agnostic to genomic location or biological hypothesis. We have used MI and BN modeling to define novel hits which can either increase risk or offer protection against developing LOAD in the context of risk conferred by the APOE E4 haplotype. This study extends prior work in that we sample all expression profiles, rather than looking only at genomic locations closest to APOE, which is a common technique in GWAS follow-up or cis expression profiling. We additionally perform network analysis in the context of risk genotypes, which is also not widely done, as most network profiling focusses on continuous variable data only (i.e. expression). Network analysis, and specifically BN modeling, offers distinct advantages for the reasons stated above. Finally, we have demonstrated that power is a crucial component of these screens, with reproducibility only seen where there are enough alleles/haplotypes to capture effects. Further work will involve an understanding of additional genetic risk hits beyond APOE as well as compound modeling, where power permits. This will be facilitated by a commensurate BN scoring criterion, allowing for robust comparisons, transfer learning and power estimation frameworks.
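
As a rough illustration of relating a discrete genotype to continuous expression through mutual information, the sketch below computes a simple plug-in MI estimate after binning expression into tertiles. This is a simplification and not the mixed discrete-continuous estimator used in the study; the carrier labels, expression values and function names are hypothetical.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Plug-in mutual information (in nats) between two equal-length sequences of discrete labels."""
    n = len(xs)
    count_x, count_y = Counter(xs), Counter(ys)
    count_xy = Counter(zip(xs, ys))
    return sum((c / n) * math.log(c * n / (count_x[x] * count_y[y]))
               for (x, y), c in count_xy.items())


def tertile_bins(values):
    """Discretize a continuous variable into 0 (low), 1 (mid), 2 (high) by rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        bins[i] = min(2, 3 * rank // len(values))
    return bins


# Hypothetical toy data: APOE E4 carrier status (0/1) and expression of one transcript.
e4_carrier = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1]
expression = [1.0, 1.2, 0.9, 1.1, 2.0, 1.3, 2.4, 2.6, 2.2, 1.4, 2.8, 2.5]

mi_expression = mutual_information(e4_carrier, tertile_bins(expression))
print("MI(carrier status; binned expression) in nats:", round(mi_expression, 3))
```

Binning discards information and the plug-in estimate is biased in small samples, which is one motivation for estimators designed for discrete-continuous mixtures, such as the one cited in the reference list.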

Conclusion

Taking a combined approach to network building including the APOE E4 locus, our findings nominate VGF, SPECC1L, HLA-DRA and RANBP3L as novel expression loci involved in the pathogenesis of LOAD. Of particular interest, high levels of VGF are protective on an APOE E4 background.

Data availability

The datasets generated and/or analyzed during the current study are available on the Laboratory of Functional Neurogenomics website (https://xzmxbgsv808roffneicreq.on.drv.tw/www.lfun/LFUN/LFUN/INDEX.html). Relevant code and software are available directly from the authors, or as part of the BNOmics package, at https://bitbucket.org/77D/bnomics.

References

Corder, E. H. et al. Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer’s disease in late onset families. Science 261, 921–923 (1993).
Reiman, E. M. et al. Exceptionally low likelihood of Alzheimer’s dementia in APOE2 homozygotes from a 5000-person neuropathological study. Nat. Commun. 11, 667 (2020).
Reiman, E. M. et al. GAB2 alleles modify Alzheimer’s risk in APOE epsilon4 carriers. Neuron 54, 713–720 (2007).
Myers, A. J. et al. A survey of genetic human cortical gene expression. Nat Genet 39, 1494–1499 (2007).
Petyuk, V. A. et al. The human brainome: network analysis identifies HSPA2 as a novel Alzheimer’s disease target. Brain https://doi.org/10.1093/brain/awy215 (2018).
Piehowski, P. D. et al. Sources of technical variability in quantitative LC-MS proteomics: Human brain tissue sample analysis. J. Proteome Res. 12, 2128–2137 (2013).
Webster, J. A. et al. Genetic control of human brain transcript expression in Alzheimer disease. Am. J. Hum. Genet. 84, 445–458 (2009).
Pearl, J. Probabilistic reasoning in intelligent systems: Networks of plausible inference (Morgan Kaufmann, 1988).
McInnes, L. & Healy, J. UMAP: Uniform manifold approximation and projection for dimension reduction. ArXiv e-prints 1802.03426 (2018).
van der Maaten, L. J. P. & Hinton, G. E. Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008).
Gogoshin, G., Boerwinkle, E. & Rodin, A. S. New Algorithm and software (BNOmics) for inferring and visualizing Bayesian networks from heterogeneous big biological and genetic data. J. Comput. Biol. 24, 340–356 (2017).
Gogoshin, G., Branciamore, S. & Rodin, A. S. Synthetic data generation with probabilistic Bayesian networks. Math. Biosci. Eng. 18, 8603–8621 (2021).
Wang, X., Branciamore, S., Gogoshin, G., Ding, S. & Rodin, A. S. New analysis framework incorporating mixed mutual information and scalable Bayesian networks for multimodal high dimensional genomic and epigenomic cancer data. Front. Genet. 11, 648 (2020).
Corneveaux, J. J. et al. Association of CR1, CLU and PICALM with Alzheimer’s disease in a cohort of clinically characterized and neuropathologically verified individuals. Hum. Mol. Genet. 19, 3295–3301 (2010).
Beekly, D. L. et al. The national Alzheimer’s coordinating center (NACC) database: An Alzheimer disease database. Alzheimer Dis. Assoc. Disord 18, 270–277 (2004).
Braak, H. & Braak, E. Staging of Alzheimer’s disease-related neurofibrillary changes. Neurobiol. Aging 16, 271–278 (1995).
Bennett, D. A., Wilson, R. S., Boyle, P. A., Buchman, A. S. & Schneider, J. A. Relation of neuropathology to cognition in persons without cognitive impairment. Ann. Neurol. 72, 599–609 (2012).
Bennett, D. A. et al. Overview and findings from the Rush Memory and Aging Project. Curr. Alzheimer Res. 9, 646–663 (2012).
Patterson, N., Price, A. L. & Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).
Rodin, A. S. et al. Dissecting response to cancer immunotherapy by applying Bayesian network analysis to flow cytometry data. Int. J. Mol. Sci. 22, 2316 (2021).
de Torrente, L. et al. The shape of gene expression distributions matter: how incorporating distribution shape improves the interpretation of cancer transcriptomic data. BMC Bioinf. 21, 562 (2020).
Patil, I. Visualizations with statistical details: The ‘ggstatsplot’ approach. J. Open Sour. Softw. 6, 3167 (2021).
Gao, W., Kannan, S., Oh, S. & Viswanath, P. Estimating mutual information for discrete-continuous mixtures. ArXiv [Preprint]. Available online at: https://arxiv.org/abs/1709.06212 (2018).
Beckmann, N. D. et al. Multiscale causal networks identify VGF as a key regulator of Alzheimer’s disease. Nat. Commun. 11, 3942 (2020).
Bu, Y. & Lederer, J. Integrating additional knowledge into the estimation of graphical models. Int. J. Biostat. 18, 1–17 (2021).
Zhang, Y., Wang, J., Liu, X. & Liu, H. Exploring the role of RALYL in Alzheimer’s disease reserve by network-based approaches. Alzheimers Res. Ther. 12, 165 (2020).
Wang, L., Audenaert, P. & Michoel, T. High-dimensional bayesian network inference from systems genetics data using genetic node ordering. Front. Genet. 10, 1196 (2019).
Ghiassian, S. D., Menche, J. & Barabasi, A. L. A DIseAse MOdule Detection (DIAMOnD) algorithm derived from a systematic analysis of connectivity patterns of disease proteins in the human interactome. PLoS Comput. Biol. 11, e1004120 (2015).
Fan, L. Y. et al. Integrating single-nucleus sequence profiling to reveal the transcriptional dynamics of Alzheimer’s disease, Parkinson’s disease, and multiple sclerosis. J. Transl. Med. 21, 649 (2023).
Conte, F. & Paci, P. Alzheimer’s disease: insights from a network medicine perspective. Sci. Rep. 12, 16846 (2022).
Zhang, Z. et al. A review and analysis of key biomarkers in Alzheimer’s disease. Front. Neurosci. 18, 1358998 (2024).
Beebe-Wang, N. et al. Unified AI framework to uncover deep interrelationships between gene expression and Alzheimer’s disease neuropathologies. Nat. Commun. 12, 5369 (2021).
Jemimah, S. & AlShehhi, A. c-Diadem: a constrained dual-input deep learning model to identify novel biomarkers in Alzheimer’s disease. BMC Med. Genom. 16, 244 (2023).
Carrette, O. et al. A panel of cerebrospinal fluid potential biomarkers for the diagnosis of Alzheimer’s disease. Proteomics 3, 1486–1494 (2003).
Llano, D. A., Devanarayan, P. & Devanarayan, V. VGF in cerebrospinal fluid combined with conventional biomarkers enhances prediction of conversion from MCI to AD. Alzheimer Dis. Assoc. Disord. 33, 307–314 (2019).
Barranco, N. et al. Dense core vesicle markers in CSF and cortical tissues of patients with Alzheimer’s disease. Transl. Neurodegener 10, 37 (2021).
Trani, E. et al. Isolation and characterization of VGF peptides in rat brain. Role of PC1/3 and PC2 in the maturation of VGF precursor. J. Neurochem. 81, 565–574 (2002).
Soliman, N., Okuse, K. & Rice, A. S. C. VGF: a biomarker and potential target for the treatment of neuropathic pain?. Pain Rep. 4, e786 (2019).
Ferri, G. L. et al. The “VGF” protein in rat adenohypophysis: sex differences and changes during the estrous cycle and after gonadectomy. Endocrinology 136, 2244–2251 (1995).
Watson, E. et al. Analysis of knockout mice suggests a role for VGF in the control of fat storage and energy expenditure. BMC Physiol. 9, 19 (2009).
Wisor, J. P. & Takahashi, J. S. Regulation of the vgf gene in the golden hamster suprachiasmatic nucleus by light and by the circadian clock. J. Comp. Neurol. 378, 229–238 (1997).
Bartolomucci, A. et al. The extended granin family: structure, function, and biomedical implications. Endocr. Rev. 32, 755–797 (2011).
Haas, C., Hung, A. Y., Citron, M., Teplow, D. B. & Selkoe, D. J. beta-Amyloid, protein processing and Alzheimer’s disease. Arzneimittelforschung 45, 398–402 (1995).
Bu, G. Apolipoprotein E and its receptors in Alzheimer’s disease: pathways, pathogenesis and therapy. Nat. Rev. Neurosci. 10, 333–344 (2009).
Saadi, I. et al. Deficiency of the cytoskeletal protein SPECC1L leads to oblique facial clefting. Am. J. Hum. Genet. 89, 44–55 (2011).
Bhoj, E. J. et al. Phenotypic spectrum associated with SPECC1L pathogenic variants: new families and critical review of the nosology of Teebi, Opitz GBBB, and Baraitser-Winter syndromes. Eur. J. Med. Genet. 62, 103588 (2019).
Wilson, N. R. et al. SPECC1L deficiency results in increased adherens junction stability and reduced cranial neural crest cell delamination. Sci. Rep. 6, 17735 (2016).
Zlokovic, B. V. Cerebrovascular effects of apolipoprotein E: implications for Alzheimer disease. JAMA Neurol. 70, 440–444 (2013).
McGeer, P. L., McGeer, E. G. & Yasojima, K. Alzheimer disease and neuroinflammation. J. Neural Transm. Suppl. 59, 53–57 (2000).
Lueg, G. et al. Clinical relevance of specific T-cell activation in the blood and cerebrospinal fluid of patients with mild Alzheimer’s disease. Neurobiol. Aging 36, 81–89 (2015).
Lambert, J. C. et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat. Genet. 45, 1452–1458 (2013).
Kunkle, B. W. et al. Genetic meta-analysis of diagnosed Alzheimer’s disease identifies new risk loci and implicates Abeta, tau, immunity and lipid processing. Nat. Genet. 51, 414–430 (2019).
Wang, Z. X., Wan, Q. & Xing, A. HLA in Alzheimer’s disease: Genetic association and possible pathogenic roles. Neuromolecular Med. 22, 464–473 (2020).
Elmer, B. M. & McAllister, A. K. Major histocompatibility complex class I proteins in brain development and plasticity. Trends Neurosci. 35, 660–670 (2012).
James, L. M. et al. The effects of human leukocyte antigen DRB1*13 and apolipoprotein E on age-related variability of synchronous neural interactions in healthy women. EBioMedicine 35, 288–294 (2018).
James, L. M. & Georgopoulos, A. P. At the root of 3 “long” diseases: Persistent antigens inflicting chronic damage on the brain and other organs in gulf war illness, long-COVID-19, and chronic fatigue syndrome. Neurosci. Insights 17, 26331055221114816 (2022).
Chen, F. et al. Nuclear export of smads by RanBP3L regulates bone morphogenetic protein signaling and mesenchymal stem cell differentiation. Mol. Cell Biol. 35, 1700–1711 (2015).
von Bernhardi, R., Cornejo, F., Parada, G. E. & Eugenin, J. Role of TGFbeta signaling in the pathogenesis of Alzheimer’s disease. Front. Cell. Neurosci. 9, 426 (2015).
Wu, X. et al. Photoactivation of TGFbeta/SMAD signaling pathway ameliorates adult hippocampal neurogenesis in Alzheimer’s disease model. Stem Cell Res. Ther. 12, 345 (2021).
Mastroeni, D. et al. Reduced RAN expression and disrupted transport between cytoplasm and nucleus; a key event in Alzheimer’s disease pathophysiology. PLoS One 8, e53349 (2013).
Guo, L. et al. Sex specific molecular networks and key drivers of Alzheimer's disease. Mol. Neurodegener 18, 39 (2023).
Pan, A. L. et al. Dual-specificity protein phosphatase 4 (DUSP4) overexpression improves learning behavior selectively in female 5xFAD mice, and reduces beta-amyloid load in males and females. Cells 11, 3880 (2022).
Neff, R. A. et al. Molecular subtyping of Alzheimer's disease using RNA sequencing data reveals novel mechanisms and targets. Sci. Adv. 7, eabb5398 (2021).
Wang, Q. et al. A public resource of single cell transcriptomes and multiscale networks from persons with and without Alzheimer's disease. bioRxiv. https://doi.org/10.1101/2023.10.20.563319 (2023).
Han, S. W. et al. miR-129-5p as a biomarker for pathology and cognitive decline in Alzheimer's disease. Alzheimers Res. Ther. 16, 5 (2024).
Duarte, M. L. et al. Multiomics analyses identify proline endopeptidase-like protein as a key regulator of protein trafficking, a pathway underlying Alzheimer's disease pathogenesis. Mol. Pharmacol. 104, 1–16 (2023).
Kim, J. P. et al. Integrative co-methylation network analysis identifies novel DNA methylation signatures and their target genes in Alzheimer's disease. Biol. Psychiatry 93, 842–851 (2023).
Liu, A. et al. Identifying candidate genes and drug targets for Alzheimer's disease by an integrative network approach using genetic and brain region-specific proteomic data. Hum. Mol. Genet. 31, 3341–3354 (2022).

Acknowledgements

This manuscript is dedicated to the memory of our dear friends and colleagues Christopher B. Heward and Jason J. Corneveaux. Requiescat in pace. Funding for the Laboratory of Functional Neurogenomics is via the National Institute on Aging (PI: Myers; AG069008). Research reported in this publication was also supported by the National Cancer Institute under grant number P30CA033572 (City of Hope Cancer Center Support) and the National Library of Medicine under grant number R01LM013138 (PI: Rodin). Additional support by the Susumu Ohno Chair in Theoretical Biology (held by A.S.R.) and the Susumu Ohno Distinguished Investigator Fellowship (to G.G.) is kindly acknowledged. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. We thank the patients and their families for their selfless donations. Many data and biomaterials were collected from several National Institute on Aging (NIA) and National Alzheimer’s Coordinating Center (NACC, grant #U01 AG016976) funded sites. Amanda J. Myers, PhD (University of Miami, Department of Psychiatry) and John A. Hardy, PhD (Reta Lila Weston Institute, University College London) collected and prepared the series. Marcelle Morrison-Bogorad, PhD, Tony Phelps, PhD and Walter Kukull, PhD are thanked for helping to co-ordinate this collection. The directors, pathologists and technicians involved include: National Institute on Aging: Ruth Seemann; Johns Hopkins Alzheimer's Disease Research Center (NIA grant #AG05146): Juan C. Troncoso, MD, Dr. Olga Pletnikova; University of California, Los Angeles (NIA grant #P50 AG16570): Harry Vinters, MD, Justine Pomakian; The Kathleen Price Bryan Brain Bank, Duke University Medical Center (NIA grant #AG05128, NINDS grant #NS39764, NIMH MH60451, also funded by Glaxo Smith Kline): Christine Hulette, MD, Director, John F. Ervin; Stanford University: Dikran Horoupian, MD, Ahmad Salehi, MD, PhD; Massachusetts Alzheimer’s Disease Research Center (P50 AG005134): E. Tessa Hedley-Whyte, MD, MP Frosch, MD, Karlotta Fitch; University of Michigan (NIH grant P50-AG053760): Dr. Roger Albin, Lisa Bain, Eszter Gombosi; University of Kentucky (NIH #AG05144): William Markesbery, MD, Sonya Anderson; Mayo Clinic, Jacksonville: Dennis W. Dickson, MD, Natalie Thomas; Washington University, St Louis Alzheimer’s Disease Research Center (NIH #P50AG05681): Dan McKeel, MD, John C. Morris, MD, Eugene Johnson, Jr., PhD, Virginia Buckles, PhD, Deborah Carter; University of Washington, Seattle (NIH #P50 AG05136): Thomas Montine, MD, PhD, Aimee Schantz, MEd.; Boston University Alzheimer’s Disease Research Center (NIH grant P30-AG13846): Ann C. McKee, Carol Kubilus; Banner Sun Health Research Institute Brain Donation Program of Sun City, Arizona (NIA #P30 AG19610; Arizona Alzheimer’s Disease Core Center, Arizona Department of Health Services, contract 211,002, Arizona Alzheimer’s Research Center; Arizona Biomedical Research Commission, contracts 4001, 0011, 05_901 and 1001 to the Arizona Parkinson's Disease Consortium; Michael J. Fox Foundation for Parkinson’s Research): Thomas G. Beach, MD, PhD, Lucia I. Sue, Geidy E. Serrano; Emory University: Bruce H. Wainer, MD, PhD, Marla Gearing, PhD; University of Texas, Southwestern Medical School: Charles L. White, III, M.D., Roger Rosenberg, Marilyn Howell, Joan Reisch; Rush University Medical Center, Rush Alzheimer's Disease Center (NIH #AG10161): David A. Bennett, M.D., Julie A. Schneider, MD, MS, Karen Skish, MS, PA (ASCP) MT, Wayne T Longman; University of Miami Brain Endowment Bank (supported in part by HHSN-271–2013-00030C and the McGowan Endowment): Deborah C. Mash, MD, Margaret J Basile, Mitsuko Tanaka; Oregon Health & Science University: Randy Wotljer, PhD.
Additional tissues include samples from the following sites: Newcastle Brain Tissue Resource (funding via the Medical Research Council, local NHS trusts and Newcastle University): C.M. Morris, MD, Ian G McKeith, Robert H Perry; MRC London Brain Bank for Neurodegenerative Diseases (funding via the Medical Research Council): Simon Lovestone, MD, PhD, Safa Al-Sarraj, MD, Claire Troakes; The Netherlands Brain (funding via numerous sources including Stichting MS Research, Brain Net Europe, Hersenstichting Nederland Breinbrekend Werk, International Parkinson Fonds, Internationale Stiching Alzheimer Onderzoek): Inge Huitinga, MD, Marleen Rademaker, Michiel Kooreman; Institut de Neuropatologia, Servei Anatomia Patologica, Universitat de Barcelona: Isidre Ferrer, MD, PhD, Susana Casas Boluda.

Author information

Authors and Affiliations

Department of Computational and Quantitative Medicine, Beckman Research Institute of the City of Hope, Duarte, CA, 91010, USA
Sergio Branciamore, Grigoriy Gogoshin & Andrei S. Rodin
Department of Cell Biology, University of Miami Miller School of Medicine, Miami, FL, 33136, USA
Amanda J. Myers
Institute for Data Science and Computing, University of Miami Miller School of Medicine, Miami, FL, 33136, USA
Amanda J. Myers
Interdepartmental Program in Neuroscience, University of Miami Miller School of Medicine, Miami, FL, 33136, USA
Amanda J. Myers
Interdepartmental Program in Human Genetics and Genomics, University of Miami Miller School of Medicine, Miami, FL, 33136, USA
Amanda J. Myers

Contributions

S.B. performed all of the computational analysis, wrote the code for the computational analysis, interpreted the results and wrote the paper. G.G. wrote the code for the computational analysis and reviewed the paper. A.S.R. conceived the project, interpreted the results and wrote the paper. A.J.M. conceived the project, supplied the data, interpreted the results and wrote the paper. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Andrei S. Rodin or Amanda J. Myers.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figures.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Branciamore, S., Gogoshin, G., Rodin, A.S. et al. Changes in expression of VGF, SPECC1L, HLA-DRA and RANBP3L act with APOE E4 to alter risk for late onset Alzheimer’s disease. Sci Rep 14, 14954 (2024). https://doi.org/10.1038/s41598-024-65010-7

Received: 28 November 2023
Accepted: 16 June 2024
Published: 28 June 2024
DOI: https://doi.org/10.1038/s41598-024-65010-7
Springer Nature Limited
