Introduction

Alzheimer’s disease (AD) is a debilitating and progressive neurodegenerative disorder that results in severe cognitive and behavioral impairments [1]. AD follows a continuum, starting from an asymptomatic phase, to mild cognitive impairment (MCI), and eventually progressing to the final stage of dementia [2]. It is characterized by the degeneration of brain tissue, particularly in the hippocampus [3] and entorhinal cortex [4], which are closely associated with initial progressive memory loss. Currently, most research focuses on the β-amyloid (Aβ) and tau proteins, which are major components of amyloid plaques and neurofibrillary tangles. Cerebrospinal fluid (CSF) Aβ and tau have demonstrated high diagnostic and predictive accuracy for AD [5, 6]. However, the clinical failures of some drugs targeting Aβ clearance suggest that these biomarkers do not fully capture the complex and multifactorial pathogenic processes occurring in different stages of AD [1, 7]. In reality, although Aβ deposition and tau-related hyperphosphorylation are core mechanisms of AD pathogenesis, other molecular changes have been reported throughout the AD continuum [8, 9].

Proteomics, utilizing liquid chromatography-tandem mass spectrometry (LC-MS/MS), is an unbiased and powerful technique that enables the identification and quantification of proteins based on peptide-level amino acid sequence information. This technique has led to the discovery of numerous proteins capable of identifying AD [10], unveiling the mitochondrial characteristics of AD [11], and revealing changes in energy metabolism associated with glial cell activation [12]. However, most studies primarily focus on identifying AD and unraveling its pathogenic mechanisms, with limited research on using proteomics to predict the conversion from non-AD to AD. Moreover, it may be more effective and rational to utilize differential proteins spanning all stages of the AD disease spectrum for predicting the conversion to AD. To address this, the Alzheimer’s Disease Neuroimaging Initiative (ADNI) has undertaken a high-throughput detection and quantification of proteins in CSF samples using LC-MS/MS to identify differential proteins across the AD continuum [13,14,15]. Unlike previous studies, the ADNI study emphasizes the expression of AD-associated proteins across all stages, ranging from cognitively normal (CN), subjective memory complaint (SMC), early MCI (EMCI), late MCI (LMCI), and progressing to AD. This comprehensive approach allows for the observation of dynamic proteomic changes throughout the entire AD development process.

In the ADNI study, a comprehensive analysis was conducted on the entire AD continuum, measuring sixty-five peptides. Our approach involved initially screening for differential peptides that are common across the other four stages in comparison to AD. Subsequently, the accuracy of predicting the conversion to AD in non-AD individuals by these differential peptides was assessed. Furthermore, the potential mechanisms through which these differential peptides impact cognitive function were investigated. This study makes a significant contribution to understanding the predictive value of shared differential peptides in the AD continuum and sheds light on their involvement in the early stages of AD.

Methods

Participants

The data used in this study were obtained from the ADNI database, accessible at adni.loni.usc.edu. ADNI was initiated in 2003 under the leadership of Principal Investigator Michael W. Weiner, MD, as a collaborative effort between the public and private sectors. The database comprises a wide range of data, including CSF biomarkers, magnetic resonance imaging, positron emission tomography, and neuropsychological assessments. These data were collected from individuals across various cognitive states, including those with CN, SMC, EMCI, LMCI, and AD. For further details, please refer to www.adni-info.org. All research sites participating in ADNI have received approval from their respective local Institutional Review Boards, and all participants or their authorized representatives have provided written informed consent. The ethical approval for ADNI 1, GO, 2, and 3 can be found on ClinicalTrials.gov with the following identifiers: NCT00106899, NCT01078636, NCT01231971, and NCT02854033, respectively.

In this study, A total of 773 participants with complete CSF proteomics data were initially identified from the “Emory University CSF Targeted MS SET2 [ADNI1, GO, 2, 3].csv” file in the ADNI database. These participants consisted of 186 CN, 127 SMC, 79 EMCI, 249 LMCI, and 132 AD. The diagnostic criteria for each group have been previously described in detail [16, 17]. In summary, CN individuals were defined as having Mini-Mental State Examination scores between 24 and 30, a Clinical Dementia Rating of 0, no depression, no mild cognitive impairment, and no dementia. SMC individuals were defined as having a score of ≥ 16 on the first 12 items of the Cognitive Change Index. LMCI individuals adjusted the scores for measuring objective memory loss in combination with the level of education, compared to EMCI individuals. The diagnosis of AD was based on the National Institute of Neurological and Communicative Disorders and Stroke/Alzheimer’s Disease and Related Disorders Association criteria. Subsequently, baseline demographic information, CSF biomarkers, hippocampal and entorhinal volumes, and cognitive function data were extracted from the “ADNIMERGE-Key ADNI tables merged into one table.csv” and “UPENN CSF Biomarker Master [ADNI1, GO, 2, 3].csv” files. Additionally, follow-up data on the progression from non-AD to AD were also obtained from the “ADNIMERGE-Key ADNI tables merged into one table.csv”.

Measurement of CSF-targeted proteomics

Baseline proteomics data was collected from participants enrolled in the ADNI cohort. The targeted proteomics analysis of CSF samples was conducted by the Department of Neurology, Emory University School of Medicine, using mass spectrometry. The methods employed were previously described in the study and involved the analysis of peptide ratios for sixty-five peptides associated with forty-seven proteins [18]. In brief, the CSF samples underwent a series of processing steps, including reduction, alkylation, denaturation, and enzymatic digestion using Lys-C and trypsin. The resulting peptides were then analyzed using a standard flow Agilent 1290 Infinity II liquid chromatography system coupled with a Thermo Fisher Scientific TSQ Altis Triple Quadrupole mass spectrometer. Isotopically labeled peptide standards were added for relative quantification, and the total area ratios for the targeted peptides were reported. For more detailed information on the protein names corresponding to the peptides and the specific detection procedures, please refer to the following link: https://ida.loni.usc.edu/download/files/study/f28033a7-1734-437f-84f9-c4fd2ae2460f/file/adni/ADNI_MethodsReport_EmoryRevised_20221019.pdf.

Measurement of CSF Aβ42, tau, and P-tau

The concentrations of baseline CSF Aβ42, tau, and P-tau in samples from ADNI participants were measured using the multiplex xMAP Luminex platform (Luminex Corp, Austin, TX, USA) and the INNOBIA AlzBio3 kit (Fujirebio, Ghent, Belgium). The CSF samples were analyzed at the laboratory of the University of Pennsylvania.

Measurement of hippocampal and entorhinal volumes

Baseline head images of the participants were acquired using T1-weighted sagittal 3D magnetization-prepared rapid gradient-echo sequences on a 3 Tesla MRI scanner. The images were then processed using FreeSurfer 5.1 to extract the hippocampal and entorhinal volume. FreeSurfer is a widely used software package for automated segmentation and volumetric measurements of brain structures from images.

Neuropsychological assessment

Despite the presence of multiple cognitive assessment scales in the ADNI study, the baseline cognitive function of the participants was evaluated using the 13-item cognitive subscale of the Alzheimer’s Disease Assessment Scale (ADAS-Cog 13), considering the completeness of scoring data and the comprehensiveness of cognitive function assessment. This assessment measures various cognitive domains, including memory, attention, language, orientation, and executive function. The total score of ADAS-Cog 13 ranges from 0 to 85, with higher scores indicating more severe cognitive impairment.

Endpoints

Follow-up was conducted on participants who were diagnosed with CN, SMC, EMCI, and LMCI at baseline. The endpoint event was defined as the conversion to AD. For each participant, the number of months from the first diagnosis of AD was extracted from the database. If a participant was not diagnosed with AD at their last follow-up, the number of months from baseline to the last follow-up was recorded. This allowed us to track the progression of the participants over time and assess the rate of conversion to AD.

Statistical analysis

All statistical analyses were performed using SPSS software (version 26.0; IBM SPSS) and R (version 4.2.3) with a significance level of P < 0.05 on both sides, unless otherwise stated. Descriptive data were presented as mean and standard deviation, median and interquartile range, or numbers and percentages. Differences in numerical variables among multiple groups were assessed using one-way ANOVA or Kruskal-Wallis test, depending on the normality of the data. Multiple imputation techniques were used to fill missing values. The details regarding the missing covariates can be found in Supplementary Table 1.

To assess the differential expression of sixty-five peptides, the Mann-Whitney U test was used and P values were corrected for multiple testing using the false discovery rate method of Benjamini-Hochberg (P < 0.05). This analysis identified peptides with significantly altered abundance levels between different groups, and volcano plots were generated using GraphPad Prism 8.0. Receiver operator characteristic (ROC) curves were employed to evaluate the diagnostic value of peptides, as well as CSF Aβ42, tau, and P-tau among different groups. Positive predictive value (PPV) and negative predictive value (NPV) were calculated based on the ratio of cases in the positive and negative groups to reflect the prevalence of the disease. Additionally, DeLong’s test was used to compare the diagnostic accuracy of these biomarkers among different groups.

Univariable Cox proportional hazards model was used to evaluate the association between those variables and AD. Considering the limited number of instances for the outcome variable, variables with a p-value less than 0.05 were included in the multivariable Cox proportional hazards models. Hippocampal and entorhinal volumes, as well as ADAS-Cog 13, were not included in the analysis as they were considered outcomes of disease progression. CSF peptides and Aβ42, tau, and P-tau levels were log10 transformed for standardization. To validate the robustness of our findings, participants with incomplete data were excluded, and the univariable and multivariable Cox regression analysis were conducted using complete datasets. Kaplan-Meier survival analysis was performed using the log-rank test to compare the differences in survival rates among CN, SMC, EMCI, and LMCI. The C-statistic was used to assess the incremental discriminative value of differential peptides in predicting the conversion to AD.

Furthermore, mediation analysis was conducted using the PROCESS macro, version 3.5, to investigate the indirect effects of peptides on cognitive function through hippocampal and entorhinal volumes. Total, direct, and indirect effects were calculated using bootstrapping with 5000 iterations. The analysis included peptides that showed significant differences in the Cox proportional hazards analysis and were log10 transformed for standardization.

To validate the robustness of our findings, we conducted sensitivity analyses. Participants with incomplete data were excluded, and the differential expression of sixty-five peptides, the univariable Cox regression, and multivariable Cox regression analyses were conducted using complete datasets.

Results

Identification of AD-associated peptides

LC-MS/MS was utilized to measure the levels of sixty-five peptides in 773 subjects. The distribution of participants among different groups was as follows: 186 in the CN group, 127 in the SMC group, 79 in the EMCI group, 249 in the LMCI group, and 132 in the AD group. Figure 1 illustrates the differences in peptide abundance between the various non-AD groups and the AD group, displaying nine differential peptides across the AD continuum. As the severity of cognitive impairment increased, the number of differential peptides decreased (Fig. 1B, D, E). Interestingly, the SMC group had the highest number of differential peptides compared to the AD group (Fig. 1C).

Fig. 1
figure 1

Volcano plots of differential protein expression between every two groups. Blue dots represented peptides with low expression, red dots represented peptides with high expression, and gray dots represented peptides with no differential expression. Nine peptides that showed differential expression were presented across the AD continuum. Abbreviations: CN, cognitively normal; SMC, subjective memory complaint; EMCI, early mild cognitive impairment; LMCI, late mild cognitive impairment; AD, Alzheimer’s disease

The role different peptides may play in different stages of AD development can vary. To achieve the best predictive effect with the minimum number of differential peptides, we selected nine peptides that showed differential expression in all different stages indicated in Fig. 1 for further analysis. Among these peptides, four peptides showed relatively downregulated expression: ADQDTIR, EPVAGDAVPGPK, GLQEAAEER, and VAELEDEK. These peptides correspond to the proteins neuronal pentraxin receptor (NPTXR), neurosecretory protein VGF (VGF), VGF, and neuronal pentraxin-2 (NPTX2), respectively. On the other hand, five peptides showed relatively upregulated expression: LNVTEQEK, NLLSVAYK, VISSIEQK, VSFELFADK, and VVSSIEQK. These peptides correspond to the proteins Enolase 1 (ENO1), 14-3-3 beta/alpha, 14-3-3 beta/alpha, peptidyl-prolyl cis-trans isomerase (PPIase) A, and 14-3-3 protein zeta/delta, respectively. Additionally, in Aβ-positive participants, there were five common differential peptides (VAELEDEK, VSFELFADK, VVSSIEQK, NLLSVAYK, and LNVTEQEK), which were also part of the previously identified nine differential peptides. This indicated differences in certain peptides between Aβ-positive and Aβ-negative participants. Table 1 illustrates significant differences among groups in demographic characteristics and AD biomarkers.

Table 1 Demographics and clinical characteristics (n = 773)

Longitudinal analysis: newly developed AD events during follow-up

Follow-up was conducted on the four non-AD groups (CN, SMC, EMCI, LMCI) to document the time of conversion to AD. Among the 641 non-AD participants, 61 were lost to follow-up, leading to a final analysis of 580 participants. The median follow-up time was 36 months (range: 24 to 48). Over the follow-up period, a total of 146 participants converted to AD.

The survival analysis using the Cox regression model showed that gender, married status, ApoE ε4, baseline diagnosis, Log Aβ42, Log P-tau, low-expressed peptides (Log ADQDTIR, Log EPVAGDAVPGPK, Log GLQEAAEER and Log VAELEDEK), and high-expressed peptides (Log LNVTEQEK, Log NLLSVAYK, Log VISSIEQK, Log VSFELFADK and Log VVSSIEQK) were univariable predictors of the endpoint event (conversion to AD). Furthermore, the multivariable Cox regression analysis revealed that Log VAELEDEK (hazard ratio [HR] = 0.05; 95% CI = 0.01–0.44; P = 0.007), Log VSFELFADK (HR = 0.01; 95% CI = 0.00–0.09; P = 0.001), and Log VVSSIEQK (HR = 38.93; 95% CI = 2.62–577.48; P = 0.008) remained significant predictors of the conversion to AD, along with gender, baseline diagnosis, Log Aβ42, and Log P-tau (Table 2). Furthermore, the initial combined model of Aβ42 and P-tau exhibited a C-statistic of 0.82 in predicting the progression to AD. However, the incorporation of these three additional peptides resulted in an improved C-statistic of 0.85 (P < 0.001).

Table 2 Univariable and multivariable Cox regression analysis for conversion to AD as the outcome measure in non-AD patients (n = 580)

In the multivariable Cox regression model, with CN as the reference group, it was observed that as the severity of baseline cognitive impairment increased from EMCI to LMCI, the risk of conversion to AD also increased. Specifically, the risk of conversion to AD was 5.44 times higher for individuals with EMCI and 10.06 times higher for individuals with LMCI compared to CN. However, individuals with SMC did not show a significant increase in the risk of progressing to AD compared to CN. Considering that there was no significant difference in the progression to AD between CN and SMC, in the Kaplan-Meier survival analysis, CN and SMC were combined into one group. The results showed that baseline diagnosis was a useful predictor for the progression to AD, with the risk of developing AD ranked highest for baseline diagnosis of LMCI, followed by EMCI, and CN combined with SMC (P-all < 0.001, Fig. 2).

Fig. 2
figure 2

Kaplan-Meier survival curves of non-conversion to AD. There were statistically significant differences between the CN and SMC vs. EMCI, CN and SMC vs. LMCI, and EMCI vs. LMCI groups. Abbreviations: CN, cognitively normal; SMC, subjective memory complaint; EMCI, early mild cognitive impairment; LMCI, late mild cognitive impairment; AD, Alzheimer’s disease. ***P < 0.001

Association between targeted peptides and AD biomarkers

We analyzed whether these peptides correlated with other CSF markers of AD pathology in this cohort. Linear regression model analysis of CSF Aβ42, P-tau, and targeted peptides, controlling for age, gender, education, and ApoE ε4 as possible confounders, is shown in Table 3 for 773 subjects. CSF Aβ42 showed significant positive correlations with VAELEDEK and VSFELFADK, but significant negative correlations with VVSSIEQK (P-all < 0.001). Additionally, CSF P-tau exhibited a significant positive correlation with VVSSIEQK, while CSF Aβ42 was significantly negatively correlated with VSFELFADK (P-all < 0.001).

Table 3 Multivariable linear regression between cerebrospinal fluid AD biomarkers and targeted peptides (n = 773)

Diagnostic value of targeted peptides in group differentiation

Utilizing ROC curve analysis, we evaluated the diagnostic value of the three peptides selected based on Cox regression in distinguishing the AD group from the four non-AD groups (CN, SMC, EMCI, and LMCI). Table 4 illustrates statistically significant differences in the diagnostic performance of CSF Aβ42, tau, P-tau, and the three peptides across the various groups. No significant difference was observed in the accuracy of CSF Aβ42 and VVSSIEQK in distinguishing between the control group and the AD group (P = 0.480). In distinguishing between SMC and AD, as well as EMCI and AD, CSF Aβ42 exhibited a higher area under the curve (AUC) than all peptides (P-all < 0.05). Nevertheless, there was no significant difference in AUC between AD-related biomarkers and peptides for differentiating between LMCI and AD (P-all > 0.05).

Table 4 ROC analysis results of cerebrospinal fluid AD biomarkers and peptides for differentiating various diagnoses (n = 773)

Finally, we assessed whether the addition of the three peptides in combination could enhance the accuracy of classical AD biomarkers in diagnosing AD. The combination of Aβ42 and P-tau (AUC = 0.827, 95% confidence interval [CI] = 0.781–0.867, P < 0.001, sensitivity 77.27%, and specificity 78.26%) did not surpass the performance of the combination of the three peptides (AUC = 0.872, 95% CI = 0.831–0.907, P < 0.001, sensitivity 85.61%, and specificity 76.88%) in differentiating between CN and AD (P = 0.084). However, upon adding the three peptides to the combination of Aβ42 and P-tau, the AUC increased to 0.900, significantly higher than the AUC of the combination of Aβ42 and P-tau alone (P < 0.001). This suggests that proteomics is comparable to classical AD biomarkers in predicting AD and can even enhance the predictive performance of classical AD biomarkers when combined with proteomics.

Mediation effect of proteomics on cognition via neuroimaging

Mediation analysis was employed to examine the indirect effects of peptides on cognitive function (ADAS-Cog 13) through the hippocampus and entorhinal regions (Fig. 3). This analysis focused on three peptides that showed significant differences in the multivariable Cox regression: VAELEDEK, VSFELFADK, and VVSSIEQK. We found that VAELEDEK, VSFELFADK, and VVSSIEQK exhibited mediated effects through the hippocampus with proportions of 53.8%, 39.6%, and 38.0%, respectively. Similarly, VAELEDEK, VSFELFADK, and VVSSIEQK also exhibited mediated effects through the entorhinal region, with proportions of 49.8%, 36.3%, and 30.3%, respectively. Additionally, after adjusting for age, gender, education, and ApoE ε4 status, ADAS-Cog 13 exhibited significant positive correlations with VVSSIEQK, but significant negative correlations with VAELEDEK and VSFELFADK (P-all < 0.05).

Fig. 3
figure 3

Mediation effects of peptides on cognitive function via hippocampal and entorhinal volumes. A VAELEDEK effect on ADAS-Cog 13 mediated by hippocampal volume. B VAELEDEK effect on ADAS-Cog 13 mediated by entorhinal volume. C VSFELFADK effect on ADAS-Cog 13 mediated by hippocampal volume. D VSFELFADK effect on ADAS-Cog 13 mediated by entorhinal volume. E VVSSIEQK effect on ADAS-Cog 13 mediated by hippocampal volume. F VVSSIEQK effect on ADAS-Cog 13 mediated by entorhinal volume. Abbreviations: ADAS-Cog 13, 13-item cognitive subscale of the Alzheimer’s Disease Assessment Scale

Sensitivity analyses

After excluding participants with incomplete data, no significant changes were observed in baseline characteristics (Supplementary Table 2). Furthermore, multivariable Cox regression analysis from the full dataset revealed that Log VAELEDEK, Log VSFELFADK, and Log VVSSIEQK remained significant predictors of conversion to AD (Supplementary Table 3). Additionally, in validation of the selected peptides in Aβ-positive patients following the ATN framework, we observed that both peptides (VAELEDEK and VSFELFADK) remained robust predictors of conversion to AD, while VVSSIEQK did not demonstrate predictive capability for conversion to AD (Supplementary Table 4).

Discussion

Among the sixty-five candidate biomarkers investigated, three peptides (VAELEDEK, VSFELFADK, and VVSSIEQK) were identified as significant predictors of conversion to AD across the entire AD continuum. Incorporating these three peptides into the predictive model, in combination with Aβ42 and P-tau, improved the accuracy of predicting AD conversion. Furthermore, these three peptides exhibited strong discriminatory power in distinguishing AD from non-AD individuals. Importantly, this study revealed that the impact of these peptides on cognitive function was mediated through changes in hippocampal and entorhinal volumes. In a prior clinical investigation, a linear correlation between Aβ ratios and VAELEDEK and VVSSIEQK was unveiled [17]. Expanding upon these observations, it was ascertained that these two peptides, alongside the recently identified VSFELFADK, harbor promising clinical utility in forecasting the progression to AD.

In this comprehensive proteomics study, we observed a gradual reduction in the number of peptides with differential expression as the disease progressed from CN to AD. This indicates the existence of distinct differences among the five stages of disease progression, which gradually diminish as the disease advances. The observed alterations in protein regulation throughout the course of AD may contribute to the decrease in the number of peptides with differential expression [19]. Different from previous study that used dozens of proteins as a panel to predict AD [20], this study initially screened nine peptides from a pool of sixty-five peptides that exhibited differential expression across the entire AD continuum. These selected peptides have predictive and clinical value for patients at any stage of the disease. Notably, significant changes in forty-seven peptides were observed in individuals with SMC compared to AD. This supports the notion that SMC represents an earlier stage in the continuum of AD progression compared to MCI [21]. Moreover, early pharmacological interventions have shown promise in improving memory and executive function in individuals with SMC [22]. However, the current diagnostic criteria for SMC primarily rely on questionnaire scales and lack objective biochemical markers. Our findings provide novel evidence supporting the efficacy of proteomics in differentiating SMC from AD.

Building upon the findings of the longitudinal study, the peptides VAELEDEK, VSFELFADK, and VVSSIEQK independently demonstrated predictive capabilities for the occurrence of AD conversion as an endpoint event. Furthermore, incorporating these three peptides into the reference model markedly enhances the C-statistic for the conversion to AD. Specifically, these peptides correspond to NPTX2, PPIase A, and 14-3-3 protein zeta/delta, respectively. In a clinical study with a limited sample size, a significant correlation was observed between the rate of NPTX2 changes and cognitive decline [23]. Another study, longitudinally assessing cognitive scores at multiple time points, concluded that baseline NPTX2 levels were linked to cognitive decline [24]. Currently, there is limited clinical research exploring the association between PPIase A, 14-3-3 protein zeta/delta, and AD. A study investigating various isoforms of 14-3-3 protein zeta/delta in the frontal cortex of 12 postmortem AD patients found no significant difference in expression levels compared to control subjects [25]. However, a separate study, measuring multiple 14-3-3 protein isoforms in the cerebrospinal fluid of 52 AD patients, reported that zeta/delta exhibited the highest AUC among all isoforms [26], aligning with our findings. Distinguishing itself from prior studies, this study derived advantages from a larger sample size, recorded AD conversion as a survival outcome in non-AD patients, and adjusted for classical AD-related biomarkers and baseline diagnostic results in a multivariable Cox regression analysis. These methodological enhancements bolstered the reliability of the conclusions. Additionally, the incorporation of proteomics into the AD-related biomarker model augmented the C-statistic, emphasizing the predictive utility of proteomics in evaluating the risk of AD development.

The potential mechanisms underlying the roles of these three CSF biomarkers in AD are as follows. NPTX2, interacting with alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid-type glutamate receptors, mediates excitatory synapse maturation [27]. Conversely, NPTX1 restricts excitatory synaptic plasticity [28]. Disruption of the dynamic balance between NPTX2 and NPTX1 may lead to impaired synaptic function. Furthermore, C1q participates in the toxicity induced by soluble Aβ oligomers on synapses and long-term potentiation in the hippocampus [29], while reduced expression of NPTX2 activates complement C1q, leading to neurotoxicity, which may be one of the mechanisms contributing to cortical atrophy [30]. PPIase A, previously studied primarily in the context of viruses, belongs to the PPIase family. Cyclophilin A, another member of this family, has been extensively investigated and is known to regulate brain vascular integrity, contributing to neurodegenerative changes [31]. The potential mechanism of PPIase A in AD may be associated with its role in selenium transport to the brain [32]. Impairment of selenium transport may result in increased deposition of amyloid-beta plaques in the brain [33]. While research on the association between PPIase A and AD remains limited, Pin1, as the only known PPIase, has been shown to mediate dendritic spine loss induced by Aβ42 [34]. Both this study and other clinical research [17] indicate a linear correlation between CSF Aβ42 or Aβ ratios and NPTX2, PPIase A, further corroborating previous findings. 14-3-3 proteins, including various isoforms, are highly expressed in the brain, particularly at synapses, and serve as regulatory factors in synaptic transmission and plasticity [35]. They can also co-localize with tau in neurofibrillary tangles, being associated with tau deposition [36], and a clinical study has confirmed their association with cortical atrophy [37]. Although research on the association between 14-3-3 protein zeta/delta and neuronal damage in AD models is lacking, it has been found in a mouse model of spinal muscular atrophy that 14-3-3 protein zeta/delta can activate the MEK/ERK pathway and regulate neuronal survival by interacting with B-Raf [38]. These potential mechanisms offer insights into the potential involvement of these biomarkers in the pathogenesis and progression of AD.

A novel finding of this study was the identification of the mediating role of the hippocampus and entorhinal cortex in the effects of these three proteins on cognitive function, with a moderate mediating effect ranging from 30 to 50%. This suggests that these proteins predominantly exert their biological effects through these specific brain structures. Mediation analysis is a valuable tool used to identify potential variables in causal pathways [39]. By uncovering intermediate variables, it helps elucidate the mechanisms underlying the causal factors. In future studies, interventions targeting these intermediate variables could be explored to influence the outcomes. Previous research utilizing linear mixed models has shown that NPTX2 predicts medial temporal lobe atrophy and decline in memory in AD patients [24]. Our study expands on these findings by presenting a comprehensive causal model. We specifically selected the hippocampus and entorhinal cortex as the mediating brain regions due to their significant impact on cognitive function and their well-established associations with cognition in the field of AD research [4, 40]. This study provides new theoretical evidence supporting interventions targeting these three proteins to potentially impact the hippocampus and entorhinal cortex, thereby potentially slowing down cognitive decline.

This study is notable for including a sufficient number of proteomics samples and conducting a comprehensive analysis of the entire process of AD onset. It provides valuable insights into the potential mechanisms underlying the association between proteomics and AD through longitudinal analyses. However, there are a few limitations that should be acknowledged. Firstly, this study assessed baseline peptides and did not account for the rate of peptide changes, which may also be relevant to AD onset. Secondly, there were some missing values for certain variables. These missing values were handled through deletion during the analysis process, but it is worth noting that the number of missing values was small. Lastly, the follow-up duration in this study was relatively short, spanning only three years. Due to this limitation, we may not have been able to observe longer-term disease progression or effects. Therefore, future studies could consider extending the follow-up period to more comprehensively assess the relevant outcomes. Despite these limitations, the overall results of this study can be considered reliable.

In summary, this study emphasizes that three peptides independently predict the risk of developing AD. The combination of these three peptides with the classical AD biomarker model enhances predictive accuracy. Additionally, these three peptides also demonstrate high accuracy in distinguishing AD from non-AD cases. Finally, we found that the effects of these three peptides on cognitive function are primarily mediated through the volume of the hippocampus and entorhinal cortex. Future research should explore the therapeutic potential of targeting these three peptides to slow down cognitive decline in individuals at risk of developing AD.