Background

Frontotemporal dementia (FTD) is a group of neurodegenerative diseases where the most common phenotypes are behavioural variant FTD (bvFTD) and primary progressive aphasias (PPA). There is a great heterogeneity in FTD, both in terms of clinical symptoms, underlying genetic causes, and neuropathological findings. Over the past years, effort has been put into explaining the diversity by searching for fluid biomarkers that reflect different aspects of FTD [1]. Most efforts have focused on finding biomarkers in cerebrospinal fluid (CSF) and a few promising candidates have been found, such as neurofilament light chain (NEFL) and neuronal pentraxin 2 (NPTX2) [2, 3]. However, the use of CSF biomarkers is limited by the invasive nature of the sampling procedure and restricted availability. Therefore, a reliable blood-based biomarker would be extremely valuable. A well-known blood-based biomarker in genetic FTD is progranulin (GRN), which is reduced in individuals with loss-of-function mutations in the gene with the same name [4]. While serum or plasma GRN levels can be used to confirm mutations in GRN, they do not correlate with clinically important metrics such as age at onset [4]. Previous studies have also identified glial fibrillary acidic protein (GFAP), tau and NEFL as possible plasma-based biomarkers, where GFAP is elevated in symptomatic GRN mutation carriers, tau is elevated in sporadic FTD and in symptomatic MAPT mutation carriers, and NEFL is elevated in both genetic and sporadic FTD [5,6,7]. However, none of the proteins are specific for FTD since increased levels have been observed in other neurological diseases [8, 9]. A large screen of plasma proteins in FTD and Alzheimer disease (AD) found a panel of 12 proteins that discriminated between the two diseases. However, these proteins were associated with AD pathology and no differences were found between FTD cases and controls [10]. Further studies are therefore needed to find biomarkers that are FTD specific.

Here, we present an exploratory plasma profiling study of 158 proteins in 693 participants in the well-described genetic FTD cohort. To our knowledge, a plasma proteomic analysis of this magnitude has not been done in genetic FTD before. We aimed to investigate differences in plasma protein levels between both symptomatic and presymptomatic mutation carriers compared to non-carrier family members who serve as controls. Our findings indicate alterations in plasma protein levels between symptomatic mutation carriers and non-carrier controls, as well as gene specific differences in GRN mutation carriers.

Materials and methods

Cohort

All clinical data and samples included in the study were collected within the Genetic frontotemporal dementia initiative (GENFI) between 2012 and 2019 [11]. Variables included were age at sampling, sex, mutation group (symptomatic mutation carriers, SMC; presymptomatic mutation carriers, PMC; or non-carrier controls, NC), genetic group (chromosome 9 open reading frame 72, C9orf72; progranulin, GRN; or microtubule associated protein tau, MAPT), clinical phenotype, and age at onset. In total, baseline plasma samples from 701 participants were collected including 141 SMC (63 C9orf72, 50 GRN, and 28 MAPT), 283 PMC (97 C9orf72, 135 GRN and 51 MAPT) and 277 NC. Carriers of FTD-causing variants in other genes were not included. Clinically, the SMC most frequently presented with bvFTD (n = 102), followed by PPA (n = 25), FTD with concomitant amyotrophic lateral sclerosis (ALS) (n = 5) and other FTD-related phenotypes (n = 5).

Sample collection according to GENFI protocol

Blood samples (n = 701) were collected at 20 different sites in Europe and Canada in ethylenediaminetetraacetic acid (EDTA) tubes. Samples were centrifuged at 2200 × g for 5 min at 22 °C and the supernatant plasma was transferred to 0.5 ml polypropylene cryotubes and stored at -80 °C until analysis.

Suspension bead array assay

The plasma samples were diluted and labelled with a tenfold molar excess of biotin (NHS-PEG4-biotin. 21329, Thermo Scientific), heat treated, and subsequently mixed with an antibody suspension bead array as described in detail previously [12, 13]. A streptavidin conjugated fluorophore (Streptavidin R-Phycoerythrin Conjugate, Invitrogen) enabled the detection of the proteins, and the readout was performed on a Flexmap 3D instrument (Luminex corporation). Binding events were displayed as signal intensity. Published as well as internal unpublished work were used to guide the selection of target proteins (n = 163) which was based on previously identified promising targets, proteins involved in suggested pathological processes of neurodegeneration and proteins with enriched expression in brain compared to other tissue [14,15,16,17]. The majority of the antibodies (n = 156) were selected from the Human Protein Atlas project (www.proteinatlas.org) and the remaining seven were obtained from other providers (M067-3 from MBL International; MA1-70053, PA5-34943, 34–1000 from Invitrogen Antibodies; MAB2037-SP, AF2420, AF3154 from R&D Systems). The mean coefficient of variance per 384-well plate (n = 3) was less than 10%, and 97% of the antibodies had an individual coefficient of variance below 20%. The inter-assay correlations were high (rho > 0.8 for 154 antibodies). After quality control analysis, five antibodies were excluded due to high correlation to a negative control (rho > 0.6) resulting in 158 protein targets for further analysis (Supplementary Table 1).

Statistical analysis

Data pre-processing

All data pre-processing, analysis and illustrations were performed in R Studio version 2022.2.3.492 using R version 4.2.1 [18]. The data was normalised in two steps to diminish the effects of time delay during readout and potential differences between plates [19]. Prior to statistical analysis, the data was log2- and z-transformed via mean centring and unit variance scaling. Outlier samples with a median protein level three standard deviations higher or lower than the median for the whole cohort were excluded from the analysis (n = 8 samples removed). A residual adjustment approach was used to deal with the potential confounding effect of healthy ageing on protein levels [20]. The effect of healthy ageing on protein levels was estimated in the NC via linear mixed effect models using protein levels as the response variable, age as a fixed effect and collection site as a random intercept (lmer, lme4, [21]). For each subject in the overall cohort (including each of the NC, PMC, and SMC groups), the adjusted protein levels were then obtained through the following:

$${Protein}_{adj.}=Protein- \beta ( Age-\overline{Age} )$$

where Proteinadj. is the age adjusted protein level, Protein is the original protein level, β is the age-associated beta coefficient, Age is the subject’s age and \(\overline{Age}\) is the mean age in the entire cohort.

Demographic statistics

The participants’ ages followed a normal distribution, evaluated by visual assessment of normal probability plot and histogram. Differences in age between SMC, PMC and NC were assessed by one-way ANOVA and Tukey’s HSD post hoc test. Pearson’s Chi-squared test was used to investigate differences in sex between SMC, PMC and NC. P-values below 0.05 were considered significant.

Protein profile analysis

Differences in protein levels between SMC and NC were examined via binomial generalised linear mixed effects models using clinical status (i.e. SMC or NC) as the response variable, protein levels and sex as fixed effects with a random intercept based on collection site (glmer, lme4, [21]). One model per protein was built. The same method was used to assess differences in protein levels between SMC and presymptomatic mutation carriers (PMC), PMC versus NC, as well as to analyse gene specific differences e.g., SMC carrying a GRN mutation (SMC-GRN) vs NC. Log2 fold changes were calculated by subtracting the median log2 transformed protein levels in NC or PMC from the median log2 transformed protein levels in SMC. In contrast, the effects of age and sex on the protein levels in mutation carriers as well as in non-carriers were estimated via generalised linear mixed-effects models using protein levels as the response; age and sex as fixed effects with a random intercept based on collection site (lmer, lme4, [21]). P-values were calculated using the Satterthwaite’s degrees of freedom method (lmerTest, [22]). Multiple testing corrections were made via the Benjamini–Hochberg method for controlling false-discovery rates and an adjusted p-value of 0.05 was considered significant. Only adjusted p-values are reported unless clearly stated otherwise. Protein – protein correlations were calculated using Spearman's rank correlation coefficient. Protein clusters are based on hierarchical clustering using Ward’s clustering criterion.

Results

Cohort

In total, plasma results from 693 participants were included in the statistical analysis, and demographic data of the cohort is presented in Table 1. The age was not significantly different between NC and PMC (p-value = 0.06), but SMC were older than both NC and PMC (p-value = 7.13 × 10–41). There were more females in NC (55%) and PMC (62%) compared to SMC (42%) (NC vs SMC: p-value = 1.3 × 10–2; PMC vs SMC: p-value = 1.1 × 10–4).

Table 1 Demographic data of the cohort

Altered plasma protein levels in symptomatic mutation carriers

When comparing plasma protein levels in SMC to NC, we found that 13 proteins were elevated in SMC (Fig. 1A, Table 2 and Supplementary Fig. 1). In the comparison between SMC versus PMC, we found that 10 proteins were elevated in SMC (Fig. 1B, Table 2). There were six overlapping proteins i.e., six proteins had elevated levels in SMC both in the comparison to NC as well as in the comparison to the protein levels in PMC. An overview of how these proteins correlate with each other can be found in Fig. 2. When stratifying by genetic group, rabphilin 3a (RPH3A) was increased in SMC-GRN compared to NC (p-value = 1.3 × 10–3, odds ratio = 1.915) whereas progranulin, as expected, was decreased (p-value = 9.3 × 10–6, odds ratio = 0.152). No proteins had significantly different levels in the comparison between SMC-C9 and NC nor in the comparison between SMC-MAPT and NC (data not shown).

Fig. 1
figure 1

Volcano plots of plasma protein levels showing -log10 (p-values) of the log2(fold change) for comparisons between different groups. Plasma protein level differences between A) all SMC and NC, B) all SMC and all PMC. Each protein is represented by a gray dot and are coloured red if the protein levels are increased in the SMC compared to the comparison group (NC, or PMC). Dotted horizontal line = adjusted p-value 0.05, dotted vertical line = log2 fold change 0

Table 2 Comparison of plasma protein levels in symptomatic mutation carriers versus non-carriers and presymptomatic mutation carriers
Fig. 2
figure 2

Plots of protein – protein correlations. Protein order is based on hierarchical clustering. Correlation plot of the 17 proteins with elevated levels in symptomatic mutation carriers (SMC) compared to non-carriers (NC) or presymptomatic mutation carriers (PMC). The colour scale indicates Spearman's rank correlation coefficient from dark blue (-1) to bright red (1)

Next, we investigated the correlation between age and protein levels of the altered proteins in mutation carriers. Among the 17 unique proteins with elevated levels in SMC (SMC vs NC or SMC vs PMC), 13 had a significant correlation with age (Table 3).

Table 3 Correlations between age and protein levels in mutation carriers

When analysing sex differences for the 17 proteins elevated in SMC, two proteins were found to be increased in men compared to women in the mutation carriers: apolipoprotein E (APOE, p-value = 1.73 × 10–2, β = 0.267) and apolipoprotein C1 (APOC1, p-value = 1.73 × 10–2, β = 0.235). No sex differences were found among the NC for either protein (p-value = 1.49 × 10–1 and p-value = 7.4 × 10–1, respectively).

Finally, we explored if any of the 158 proteins included in this study were found at different levels in SMC with bvFTD compared to SMC with PPA but found no significant differences (data not shown).

Plasma protein levels in the presymptomatic stage

We also investigated the possibility to detect differences in protein levels already at the presymptomatic stage of FTD by first comparing PMC to NC and then stratifying by gene. The only difference found in these comparisons was decreased levels of GRN in PMC-GRN compared to NC (p-value = 4.44 × 10–3). However, the proteins neurofilament medium chain (NEFM), neuronal pentraxin 2 (NPTX2) and chitinase 3 like 1 (CHI3L1) showed trends of being elevated in PMC-GRN compared to NC (unadjusted p-value = 3.1 × 10–3, unadjusted p-value = 4.8 × 10–3 and unadjusted p-value = 4.6 × 10–3, respectively), though these differences were not significant after adjustment for multiple testing. None of the three proteins showed any correlation with age when analysed in PMC-GRN alone or in PMC-GRN together with SMC-GRN (Supplementary Table 2). NPTX2 was, however, elevated in SMC compared to NC and CHI3L1 was just above the significance threshold in the same comparison (p-value = 5.1 × 10–2). Neither were significant in the comparison between SMC and PMC (both p-values > 0.3).

Discussion

We performed extensive protein profiling of plasma from a genetic FTD cohort, collected within the GENFI study. We found 13 significantly increased plasma proteins in patients with genetic FTD compared to non-carrier controls and 10 proteins that were significantly increased compared to presymptomatic mutation carriers. Six of these proteins were significantly different in both comparisons, indicating that they likely are associated with symptom onset rather than the presence of one of the pathogenic mutations. These six proteins were also significantly correlated with increased age in mutation carriers, after correcting for healthy ageing, which further strengthens their association with symptom onset.

In contrast, four proteins, increased in SMC vs NC, were not correlated with age, nor elevated in the SMC vs PMC comparison, suggesting that they may be elevated already before symptom onset. One of these proteins NPTX2, is of particular interest. NPTX2, a synaptic protein, which has previously been shown to be reduced in CSF from patients with FTD and is potentially one of the first protein biomarkers to become abnormal in genetic FTD [3, 23, 24]. We have, as of yet, no explanation to why NPTX2 is reduced in CSF and elevated in plasma, or if the NPTX2 detected in plasma is brain derived. However, finding elevated levels of NPTX2 in plasma from SMC suggests that NPTX2 could work as a plasma-based biomarker.

We found two proteins, GRN and RPH3A, that differed in SMC-GRN compared to NC, while no proteins were observed at different levels in neither SMC-MAPT nor SMC-C9, compared to NC. A reduction of GRN in progranulin mutation carriers is of course expected since all known pathogenic FTD-related GRN mutations lead to haploinsufficiency. On the other hand, RPH3A was elevated in SMC-GRN. RPH3A is involved in presynaptic vesicle trafficking and has been implicated to play a role in synaptic dysfunction in other neurodegenerative diseases [25, 26]. In addition to the findings in SMC-GRN, we observed some indications of differences already in the presymptomatic stages in GRN mutation carriers. While not statistically significant after adjustment for multiple testing, the differences are still noteworthy since the proteins, NEFM, NPTX2 and CHI3L1, all have been reported as biomarker candidates in CSF [16, 27]. NPTX2 was also elevated in all SMC compared to NC and CHI3L1 was just above the threshold for significance while neither of these two proteins were elevated in SMC when compared to PMC. Taken together this indicates that these two proteins might be upregulated already at the presymptomatic stage. However, further studies are needed to establish if these proteins indeed are related to presymptomatic changes in GRN mutation carriers or if it is a spurious finding.

Biological sex is a known risk factor for several types of dementia, with female sex being a risk factor for AD and male sex being more common in FTD [28, 29]. In light of this, we analysed if any of the proteins identified in the current study exhibited any sex specific patterns. While we could not determine any significant interactions between sex and mutation status for these proteins (data not shown), two proteins were significantly correlated with sex in the mutation carrier group, but not in controls, suggesting a potential biological effect.

We acknowledge several limitations in this study. The focus was on genetic FTD and samples from patients with other neurodegenerative diseases were not included in the analysis. Follow-up studies with comparisons to for example AD and ALS will elucidate the importance of altered plasma proteins in FTD in relation to other diseases as well as sporadic FTD. The suspension bead array technique is a method for analysing multiple proteins simultaneously, which is useful in an exploratory study like this. However, a high-throughput antibody-based single-binder assay can have reduced sensitivity, which may limit the detection of low abundant proteins and require further validation of antibody specificity. In addition, we used a targeted approach, and the protein analysis was thus limited by the protein selection as well as the availability of antibodies.

Conclusions

To our knowledge, this is the first large scale plasma protein profiling specifically in genetic FTD. A reliable fluid biomarker could aid for example in diagnosing FTD at an early stage or in selecting individuals for upcoming clinical trials. Blood-based biomarkers would have the advantage of being easy to access and widely available compared to CSF-biomarkers. Here, we have presented an exploratory study providing proteins, including a previous CSF-biomarker, that are of interest for future investigations as potential biomarkers.