Introduction

Neuroaxonal damage can be caused by a wide spectrum of initiating factors including ischemia, inflammation, compression, trauma and other disease processes. This injury can initiate a cascade of events leading to a variety of neurodegenerative diseases. Therefore, there is substantial interest in identifying biomarkers of neuroaxonal damage, especially early in the life course prior to the onset of disease. Recent attention has focused on blood-based measurements of neurofilament light chain (NfL) as technological advancements have enabled sensitive and high-throughput measurements of blood NfL, measured in plasma or serum. NfL is released into the extracellular space following neuroaxonal injury and can be assayed from serum, plasma (blood) and cerebrospinal fluid (CSF). Moreover, blood NfL levels correlate with CSF NfL among individuals with neurodegenerative diseases [1, 2] and are elevated in Alzheimer’s disease [3,4,5,6] and other neurodegenerative diseases [7,8,9].

Recently there have been several reports that have analyzed the association of plasma NfL levels with mortality in population-based studies [10,11,12,13]. Longitudinal data from the Healthy Aging in Neighborhoods of Diversity Across the Life Span (HANDLS) study found that annualized change in plasma NfL was associated with mortality only in women in this diverse cohort of middle-aged (mean age around 48 years) African American and White adults (n = 694) [10]. Furthermore, a one standard deviation elevation in plasma NfL at the baseline time point was associated with an increased risk for all-cause mortality [10]. In both men and women serum NfL levels were associated with all-cause mortality in adults aged > 65 or older from the Memory and Morbidity in Augsburg Elderly (MEMO) study [11]. In cross-sectional analyses in this study serum NfL levels were associated with neuropsychological test and brain atrophy scores [11].

In independent cohorts of nonagenarians (n = 180) and in centenarians (n = 135), higher plasma NfL levels were associated with lower survival probability [12]. More recently, a study conducted among 294 African Americans (baseline ages 49–65 y), found that after 14–15 years of follow up, baseline serum.

NfL levels were significantly higher in the decedent group (86.1 ± 65.7 pg/ml vs. 50.1 ± 28.0 pg/ml, p < 0.001) [13]. In multivariable-adjusted binomial logistic regression models, which included age, sex, education, baseline smoking status, BMI, and total comorbidities (0–11), serum NfL levels retained its predictive value for all-cause mortality, and sensitivity analyses included other covariates did not alter this key finding [13].

Collectively, these data suggest that blood-based NfL may be an indicator of neural deterioration that may hold predictive value for mortality but more studies are warranted to decipher the relationship of this biomarker to mortality in non-demented adults. Furthermore, given differences in associations of NfL with mortality in men and women, it is important to examine differences overall and by sex and account for cardiometabolic factors that may influence or mediate NfL levels [14, 15]. In fact, NfL has been linked to both the body mass index (BMI) [14, 16] and kidney disease [17,18,19] in previous studies. Studies show that NfL is associated with brain MRI diffusion-weighted metrics such as fractional anisotropy and mean diffusivity [18,19,20,21,22]. In turn, these metrics were previously shown to be linked with serum vitamin D3 status or 25-hydroxyvitamin D3 [25(OH)D3), serum folate and vitamin B-12, elevated blood pressure, various measures of insulin resistance and hyperglycemia including glycated hemoglobin (HbA1c), dyslipidemia and various co-morbidities [23,24,25]. In addition, given its strong association with mortality risk, self-rated health may act as a mediator between NfL and mortality.

The present study examines the association between serum NfL and all-cause mortality in a national sample of community-dwelling US adults aged 20–85 years at baseline and followed for up to 6 years until end of 2019. Associations were tested overall and by sex. As a secondary objective, the study also tests potential interactive and mediating effects of the body mass index (BMI), and other related measures of cardio-metabolic and general health and selected nutritional biomarkers on this relationship.

Materials and methods

Database

The NHANES consists of a series of surveys by the National Center for Health Statistics, a division of the Centers for Disease Control and Prevention (CDC), to evaluate the health and nutritional status of civilian, noninstitutionalized U.S. children and adults and to determine the burden of major diseases and their risk factors [26, 27]. In the NHANES, stratified multistage cluster sampling is used with oversampling [28] of specific groups. Demographic, socioeconomic, and nutritional data are collected through in-person interviews, physical examinations and laboratory tests [29]. Recent waves of NHANES data have over-sampled low-income persons, adolescents 12–19 years, individuals ≥ 60 years of age, African Americans, and Mexican Americans. Beginning in 2007–2008, NHANES oversampled all Hispanics, instead of only Mexican American, low-income persons, individuals ≥ 60 years of age, and African American individuals [28]. In addition, the “Asian” group was oversampled after 2011. The original study was approved by an Institutional Review Board with informed consent provided by all study participants.

Study sample

The NHANES has been a continuous surveillance system, since 1999. For this study, the initial sample consisted of 10,175 NHANES participants from a single wave (2013–2014). Participants who were < 20 years of age (n = 4,406) were excluded, retaining 5,769 with age range of 20-85y at baseline examination. Of this subgroup, 3,698 were excluded for having missing data on serum NfL. Thus, the final sample consisted of 2,071 adults, ≥ 20 years of age, with complete data on serum NfL. No other exclusions were made for most analyses and all other covariates included in our analyses were imputed, using chained equations [30, 31] with details provided under “Statistical analysis” section (Fig. 1). On average, covariates aside from NfL were missing on 3–4% of the final sample of N = 2,071.

Fig. 1
figure 1

Participant flowchart: NHANES 2011–2014. Abbreviations: NfL = Neurofilament Light Chain; NHANES = National Health and Nutrition Surveys

Mortality linkage

The National Center for Health Statistics (NCHS) has linked data collected from several NCHS population surveys with death certificate records from the National Death Index (NDI). In compliance with requirements to protect the confidentiality of the NCHS survey participants, restricted-use versions of the linked mortality files are made available only through the NCHS Research Data Center (RDC). To complement the restricted-use files and increase data access, NCHS also developed public-use versions of the linked mortality files for the 1999–2014 National Health and Nutrition Examination Survey (NHANES) among others. The public-use linked mortality files include a limited set of variables for adult participants only. To reduce the participant disclosure risk, the public-use versions of the NCHS linked mortality files were subjected to data perturbation techniques. Synthetic data were substituted for follow-up time and underlying cause of death for select records. Information regarding vital status was not perturbed. The public-use linked mortality file provides mortality follow-up data from the date of survey participation through December 31, 2019. Detailed description of the linkage methodology and analytic guidelines can be found on the NCHS Data.

Linkage webpage: https://www.cdc.gov/nchs/data/datalinkage/LMF2015_Methodology_Analytic_Considerations.pdf

Serum neurofilament light chain measurement

Eligible NHANES 2013–2014 participants who were aged 20–75 years consented to store their blood samples for future research and consisted of individuals who had stored surplus or pristine serum samples. Using a highly sensitive methods, NfL was measured by immunoassay using acridinium ester (AE) chemiluminescence and paramagnetic particles. This assay can be conducted on an existing, high-throughput, automated platform (Attelica). Strict quality assurance was applied and subsample weights were generated to analyze the data properly. The lower and upper limits of quantification were 3.9 pg/mL and 500 pg/mL, respectively. Other details are provided elsewhere (https://wwwn.cdc.gov/Nchs/Nhanes/2013-2014/SSSNFL_H.htm).

Covariates

Socio-demographic characteristics were defined as follows: age (in y), sex (M/F), race (Non-Hispanic White, Non-Hispanic Black, Mexican American or other Hispanic, and Other race/ethnicities including Non-Hispanic Asian), level of education (< 9th grade or High school, 9–11th Grade, HS graduate, GED or equivalent, some college or Associate’s degree, ≥ College Graduate), marital status (married and/or living with partner vs. other) and poverty status as indicated by the poverty-income ratio [PIR] (< 100%, ≥ 100% to < 200% and ≥ 200%). Lifestyle characteristics were focused on smoking status (non-smoker vs. ex-smoker vs. current smoker), alcohol consumption ≥ 12 glasses (in past 12 months) [Y/N], ever use marijuana, cocaine, heroin or methamphetamine (Y/N). Physical activity in the past 30 days (moderate activity or vigorous activity, work or leisure; walk or bicycle) using a set of self-reported responses based on the short form of the International Physical Activity Questions [32] in terms of frequency (# of days) per week and number of minutes per day. Those were then combined to generate MET.min/week for each category of physical activity intensity. Finally, the MET.min/week values were added together. The NHANES 2013–14 included a dietary component consisting of two 24-h recalls administered by trained Mobile Examination Center (MEC) interviewers. The U.S. Department of Agriculture (USDA) by computerized Automated Multiple Pass Method was used to collect dietary intake data [33]. The first dietary recall interview was collected in-person in the MEC and the second interview was collected by telephone 3 to 10 days later. Nutrient intakes were estimated by linking dietary intake with corresponding USDA’s Food and Nutrient Database for Dietary Studies databases [34, 35]. The average daily nutrients intake from the two 24-h recalls was used in our current analysis. In our present study, the formula reported by Mellen et al. [36] was used to calculate the DASH diet score. This score is divided into nine target nutrients, specifically total fat, saturated fat, protein, fiber, cholesterol, calcium, magnesium, sodium and potassium. Micronutrient goals were expressed per 1000 kcal. The total DASH score was based on the sum of all nutrient targets met. If the participant achieved the DASH target for a nutrient, a value 1 was assigned. A value of 0.5 was given if the intermediate target was achieved, while a value of zero was assigned if neither target was met. Each of the nine components of the DASH total score were considered as separate exposures in our current study, in addition to the total score. In addition to dietary intakes, 3 nutritional biomarkers were included among potential confounding covariates, namely RBC folate, serum vitamin D3 [25(OH)D3] and serum vitamin B-12. Those are described in more detail in supplementary methods 1. Health characteristics were considered among potential mediators/moderators in the association between serum NfL and all-cause mortality. Those included body mass index (BMI) categories, self-rated health (SRH), a co-morbidity index and several measures of cardiometabolic health: systolic and diastolic blood pressure (mm Hg., average of 3 readings), glycated hemoglobin (HbA1c), total cholesterol and urinary albumin:creatinine ratio, ACR (Loge transformed with outliers excluded), (supplementary methods 1). The co-morbidity index was a binary [Y/N] variable for any of several cardio-vascular and cancer morbidities, namely congestive heart failure, coronary heart disease, angina/angina pectoris, heart attack, stroke or cancer/malignancy. SRH was operationalized by one question: “Would you say your health in general is- excellent, very good, good, fair or poor?” and further dichotomized as “excellent/very good/good” as the referent category (“0”) vs. “fair/poor” coded as “1”.

Statistical methods

We used Stata release 17 [37] to perform all descriptive and inferential analyses, accounting for sampling design complexity by including sampling weights, primary sampling units and strata in parts of the analyses. Aside from outcome and exposures, data was imputed using chained equations or MICE (5 imputations, 10 iterations) [30, 31], with most covariates having < 10% missing data compared to the final eligible sample (i.e. N = 2,071). MICE is a statistical technique that iteratively generates multiple imputations for each missing value in a dataset [30, 31]. This is accomplished by fitting predictive models to the observed data and assigning missing values progressively, one variable at a time, using information from other variables [30, 31]. A predictive model is created for each incomplete variable, using observed values of other variables to approximate the missing values [30, 31]. The imputation procedure is continued for a set number of iterations, typically until convergence is reached [30, 31]. At each iteration, missing values are updated using the most recent imputations, allowing imputation models to be refined [30, 31]. In our study, this is done for 10 iterations per imputation for a total of 5 imputations. Main Stata commands used included mi impute, mi passive and mi estimate in all the analyses. Study design complexity and survival time setting was specified using mi stset and mi svyset among others. Only potentially confounding and mediating/moderating covariates were imputed within the final selected sample with complete exposure and outcome data. The overall analytic sample was characterized at baseline using means and proportions. A series of bivariate and multivariable regression models were constructed to evaluate whether baseline characteristics varied according to sex, while accounting for sampling design complexity. To examine associations among plasma NfL exposures, we estimated a series of Cox proportional hazard regression models with sequential covariate adjustment. Time on study (months) was used as the initial underlying scale and was used to generate age at death accounting for initial age at baseline assessment in the NHANES. Sex-specific Kaplan–Meier survival curves were presented for binary NfL exposures (> vs. ≤ median) based on distributions in the final selected sample that has a cutoff of 2.51 for NfL on the Loge transformed scale, while examining time on study as the analytic time variable. In Cox proportional hazards models, heterogeneity by sex of the association between NfL exposures and mortality was tested through addition of two-way interaction terms (NfL × sex) in separate models. The same models were also stratified by sex. Similarly, and as a secondary analysis, interaction by baseline age was tested for the main exposure, overall and within each gender group. The general modeling strategy consisted of a basic model, adjusted for age, sex, race and PIR (Model 1), to which other lifestyle and health-related covariates (listed in the Covariates section) were subsequently added (Model 2).

BMI, measures of cardio-metabolic health (SBP, DBP, total cholesterol, HbA1c and ACR), co-morbidity index, self-rated health and nutritional biomarkers (RBC folate, serum 25(OH)D3, and serum vitamin B-12) were separately assessed as mediating/interactive factors in the total effect of NfL exposures on all-cause mortality in the full Model 2. All other covariates in Model 2 as previously described were considered potential confounders. Continuous potential mediators were transformed into standardized z-scores, while indices were coded as 0 = no, 1 = yes/any, for ease of interpretation. Specifically, the overall effect of each main exposure on all-cause mortality (Y), in the presence of a mediator with which the exposure a may interact (M), was decomposed into four distinctive components, using the following general form of the model, accounting for potentially confounding covariates c [38]. Therefore, in all analyses, exposure levels were a = 1 and a’ = 0.

$$\begin{gathered} E\left[ {Y|\left( {a,m,c} \right)} \right] = \theta_{0} + \theta_{1} a + \theta_{2} m + \theta_{3} a*m + \theta_{c} c \hfill \\ E\left[ {M|\left( {a,c} \right)} \right] = \beta_{0} + \beta_{1} a + \beta_{c}^{T} c \hfill \\ \end{gathered}$$
  1. (i)

    Neither mediation nor interaction or controlled direct effect (CDE): E[CDE | c] = θ1 (aa’). This component is interpreted as the effect of the exposure on the outcome, not related to interaction or mediation [38].

  2. (ii)

    Interaction alone (and not mediation) or interaction reference (INTref): E[INTref | c] = θ30 + β1a’ + βcT c)(aa’). This component is interpreted as the interaction effect of the exposure on the outcome in the presence of the mediator, when the presence of the outcome is not necessary for the presence of the mediator [38].

  3. (iii)

    Both mediation and interaction or mediated interaction (INTmed): E[INTmed | c] = θ3 β1 (a–a’)(a–a’). This component is interpreted as the interaction effect of the exposure on the outcome in the presence of the mediator, when the presence of the outcome is necessary for the presence of the mediator [38].

  4. (iv)

    Only mediation (but not interaction), or pure indirect effect (PIE): E(PIE | c] = (θ2 β1 + θ3 β1a’)(a-a’). This component is interpreted as the effect of the mediator on the outcome when the exposure is necessary for the presence of the mediator [38].

The total effect of the exposure on the outcome (TE = CDE + INTref + INTmed + PIE), is the summation of those four partitioned components that explain the total variance between the exposure and the outcome [38]. This four-way decomposition unifies methods that attribute effects to interactions and methods that examine mediation, and this method has recently been introduced in Stata, allowing to estimate four-way decomposition using parametric or semi-parametric regression models. Importantly, the Med4way command [39] [https://github.com/anddis/med4way] was used to test mediation and interaction of the total effects of NfL exposure on mortality with several mediators/effect modifiers, using Cox PH models for the outcome and linear or logistic regression models for each mediator/effect modifier. In this study, four-way decomposition was applied to the total sample. A logit link was specified for the mediating variable equation when mediators were binary. Total effects were interpreted as hazard ratios on the Loge scale based on Cox proportional hazards models, per SD of exposures if the exposure was continuous and for “exposed” vs. “unexposed” if the exposure was binary. These total effects were then decomposed into four components. An effect size that would result in a hazard ratio > 1.5 was considered as moderate-to-strong. Unlike all other analyses, four-way decomposition models did not account for sampling design complexity.

In all models, we adjusted for sample selectivity due to missing exposure and outcome data, relative to the initially recruited sample, using a two-stage Heckman selection strategy [40]. Initially, we predicted an indicator of selection with socio-demographic factors, namely, age, race, sex and PIR using probit regression, which yielded an inverse mills ratio (IMR)—a function of the probability of being selected given those socio-demographic factors. Subsequently, we estimated our Cox proportional hazards regression models adjusted for the IMR in addition to afore-mentioned covariates [40, 41].

Two supplementary analyses were also carried out. In a first supplementary analysis, all variables of interest including potential mediators/moderators and exogenous variables were compared across exposure levels (above vs. below median) using bivariate linear and multinomial logistic models on multiple imputed data. The second supplementary analysis specifically examined the association between each potential moderator with all-cause mortality (multivariable-adjusted Cox PH model with only main effect of the potential moderator: Model A); tested interactions between Loge transformed and z-scored NfL and those variables of interest in relation to all-cause mortality (multivariable-adjusted Cox PH model with 2-way interaction between NfL and each potential moderator: Model B); and associations of potential mediators/moderators with NfL while adjusting for exogenous covariates (multiple linear regression models, multiple-imputed data: Model C). In Model B, interaction was tested on the multiplicative scale. Conversion to the additive scale can be implied with equations shown below for the Cox proportional hazards model and the relative excess risk due to interaction (RERI), when both exposures and moderators are binary [42,43,44]. This sub-analysis was carried out with exposure (LnNfL, z-scored) and continuous moderators transformed into below and above median binary variables, as well as the remaining binary potential moderators. RERIHR formula is shown below with IR being the estimated incidence rates within each group, conditioning simultaneously on exposure and potential moderator [42,43,44]. For each combination of exposure and potential moderator level, an excess relative risk was computed as shown below [42,43,44]. The attributable proportion (AP) statistic is the proportion of risk for the doubly exposed (+ + or 11) interaction that is due to the risk that is above additive. The synergy index (SI) statistic is the excess risk expressed as a ratio rather than a difference, with a value > 1 indicating synergy or super-additive effects [42,43,44]. Exogenous covariates included age, sex, race/ethnicity, PIR, education, smoking, ever drug use, alcohol use, DASH, total caloric intake, physical activity, household size and marital status. All Stata codes used and secondary Output in this study will be provided under the following repository: baydounm/NHANES_NfL_mortality (github.com).

$$\lambda (t;X,M,C) = \lambda_{0} (t)e^{{\beta_{1} X + \beta_{2} M + \beta_{3} XM + \sum\limits_{k = 1}^{n} {\gamma_{k} C_{k} } }}$$
$$\begin{gathered} RERI_{HR} = e^{{\beta_{1} + \beta_{2} + \beta_{3} }} - e^{{\beta_{1} }} - e^{{\beta_{2} }} + 1 = HR(1,1;c) - HR(1,0;c) - HR(0,1;c) + 1 \hfill \\ = \frac{{IR_{11} - IR_{10} - IR_{01} + IR_{00} }}{{IR_{00} }} \hfill \\ \end{gathered}$$
$$ERR = IR_{am} /IR_{00} - {\text{1, where}}\;a\;{\text{and}}\;m\;{\text{are exposure and potential moderator's values as }} 0 \, \left( - \right){\text{ or 1 }}\left( + \right).$$
$$\begin{gathered} AP = RERI_{HR} /IR_{11} \hfill \\ SI = \left( {IR_{11} - 1} \right)/\left[ {\left( {IR_{10} - 1} \right) + \left( {IR_{01} - 1} \right)} \right] \hfill \\ \end{gathered}$$

Results

Our analytic sample consisted of 2,071 participants aged 20-85y with complete data on serum NfL. Mean age at exam in the total sample was 45.1 y, and 84 deaths occurred over a mean follow-up time of 70.6 months (SD 10.4, IQR 65–78 months). Table 1 shows that on average, Loge transformed serum NfL was 2.54 and was significantly higher among men compared to women (2.59 vs. 2.48, P = 0.001). Men were more likely to have < HS education compared to women (5.2% vs. 3.4%, P = 0.003), while women attained some college education at a greater proportion than men (36.2% vs. 30.7%). Women were more likely than men to be unmarried/unpartnered (37.7% vs. 32.3%, P = 0.008), and to live below the poverty line (PIR < 100%: 19.8% vs. 16.5%, P = 0.042). There were greater odds of being an ex-smoker (28.4% vs. 16.6%, P < 0.001) or current smoker (21.9% vs. 20.7%, P = 0.017) among men compared to women, as opposed to being a never smoker (49.7% vs. 62.7%). Similarly, men were more likely than women to be alcohol consumers over the past 12 months and drug ever users (P < 0.001). Nevertheless, physical activity measured using metabolic equivalents × minutes/week (Met.min.wk−1), was greater among men compared to women (2,652 vs. 1,496, P = 0.005). Sex differences in dietary factors, nutritional biomarkers, cardiometabolic factors and measures of general health status were also detected. Specifically, while caloric intake was greater among men, DASH total score, reflecting diet quality, was higher among women (2.26 vs. 2.02, P = 0.003). Women had higher mean BMI, total cholesterol, urinary ACR, glycated hemoglobin, serum 25(OH)D3 and B-12, as well as higher RBC folate compared with men, while the reverse was true for SBP and DBP. No sex differences were detected in self-rated health and co-morbidity measures .

Table 1 Study sample characteristics, overall and by sex: NHANES 2013–2014

Our key findings are presented in Fig. 2 and Tables 2 and 3. Figure 2 shows a set of Kaplan–Meier survival curves across two groups of Loge NfL levels (above vs. below median). Results indicated that mortality risk was greater in the above median group compared to the group below median NfL, particularly in the overall sample (P = 0.010), with trends observed within each sex group (P < 0.10). When looking at Loge NfL as a continuum as shown in Table 2, each SD was associated with a significantly increased mortality risk (HR 1.88, 95% CI 1.60–2.20, P < 0.001) in the reduced model adjusted for age, sex, race, and poverty income ratio; a finding that was attenuated with addition of lifestyle and health-related factors (which included among other HbA1c and ACR); (HR 1.67, 95% CI 1.41–1.98, P < 0.001). This attenuation was mainly observed in men (HR 1.90 (P = 0.001) in Model 1 vs. 1.70 (P = 0.004) in Model 2), while the reverse pattern was observed among women (HR 1.65 (P = 0.002) in Model 1 vs. 1.83 in Model 2 (P = 0.021)). Little evidence of heterogeneity was found by age, especially after further adjustment for lifestyle and health-related covariates.

Fig. 2
figure 2

Survival probability above and below median NfL plasma levels (Loge transformed), overall and by sex: Kaplan–Meier survival curves: NHANES 2011–2014. Abbreviations: Neurofilament Light Chain; NHANES = National Health and Nutrition Surveys

Table 2 Plasma NfL (Loge transformed, z-scored) and its relation to all-cause mortality, overall, by sex and with interaction by age, NHANES 2013–2014: Cox PH hazards models
Table 3 Plasma NfL (Loge transformed, z-scored) and its relation to all-cause mortality, overall and by sex: mediating and interactive effects of cardiometabolic and general health factors and of nutritional biomarkers using four-way decomposition: NHANES 2013–2014

Table 3 shows main findings derived from four-way decomposition models, which decomposed each total effect (TE) per mediator/moderator of the specified NfL exposure on all-cause mortality into components attributed to mediation alone (Pure Indirect Effect [PIE]), interaction alone (Reference Interaction [INTref]), to both mediation and interaction (Mediated Interaction [INTmed]), and neither mediation nor interaction (Controlled Direct Effect [CDE]). All total effects observed in the overall sample were statistically significant at a type I error of 0.05, and the direct effect was also significant and in the same direction as the total effect for all intermediate variables. For some of those intermediate variables, some indication of mediation and/or moderation was observed. Most notably, the IM component was statistically significant for HbA1c (INTmed =  + 0.032, P = 0.027 with a total effect of + 0.512 or a HR = 1.66, P = 0.020), indicating that 7% of this total effect of NfL on all-cause mortality, adjusting for the remaining covariates, is due to mediated interaction by HbA1c. Another notable finding is the pure mediation detected for 25(OH)D3 (PIE = − 0.036, P = 0.031; TE =  + 0.611; P = 0.002). This suggests that 25(OH)D3 may be positively associated with NfL in this overall sample, while being inversely related to the all-cause mortality outcome or vice versa. Our additional analysis confirmed the first pattern of association (Supplementary Table 1). Other findings indicated a trend towards pure mediation (RBC folate), and reference and mediated interaction (ACR and co-morbidity), (Table 3).

Supplementary Table 1 presents study sample characteristics according to NfL levels (above vs. below median). Elevated NfL was associated with older age, a higher proportion of NH White adults, smaller mean household sizes, greater proportion of current smokers, higher means of systolic blood pressure, glycated hemoglobin, urinary ACR, total cholesterol, serum vitamin D3, poorer general health and presence of co-morbidities.

Supplementary Table 2A shows additional findings for associations between potential mediators/moderators, all-cause mortality and the main exposure, Loge transformed NfL. All key continuous variables (i.e. exposures and potential mediators/moderators) were z-scored. Models were fully adjusted for socio-demographic, lifestyle and health-related factors. Our findings suggested that there was a synergistic interaction between HbA1c and NfL exposure and that HbA1c was directly associated with both NfL and all-cause mortality. In contrast, ACR was linked to both NfL and all-cause mortality, though NfL was the only predictor for mortality in a model that included the 2-way interaction term, between NfL and ACR along with the main effects. Vitamin D status was an independent predictor of reduced mortality from NfL, with no interaction detected between the two variables, confirming our previous finding of pure though inconsistent mediation. Similarly, NfL’s association with all-cause mortality was stronger among individuals without cardiovascular or cancer-related co-morbidities and is an independent predictor of mortality from self-rated health. Finally, there was synergism detected between NfL and vitamin B-12 in relation to all-cause mortality, with B-12 positively associated with all-cause mortality, but not with NfL.

Examining these relationships on an additive scale with binary exposures and potential moderators (Supplementary Table 2B), the AP index was statistically significant indicative of a relatively higher proportion due to synergy for LnACR (above vs. below median) in its interaction with LnNfL(above vs. below median) in relation to all-cause mortality (e.g. first imputation: LnNfL(below median, 0), LnACR(above median, 1): excess relative risk (ERR) = (IR01/IR00)-1 =  + 0.085 ± 0.65, p = 0.89; LnNfL(above median, 1), LnACR(below median, 0): ERR = (IR10/IR00)-1 = 0.207 ± 0.654, p = 0.73; LnNfL(above median, 1), LnACR(above median, 1): ERR = (IR11/IR00)-1 =  + 2.45 ± 1.82, p = 0.019; RERI: 2.16 ± 1.16, p = 0.064; AP: 0.625 ± 0.213, p = 0.003; SI: 8.38). The remaining four imputations gave similar results and the full Output is provided on github: baydounm/NHANES_NfL_mortality (github.com).

Discussion

We investigated the relationship between serum NfL and all-cause mortality in a national sample of US adults aged 20-85y at baseline and followed for up to 6 years until the end of 2019. Furthermore, we tested these associations both overall and stratified by sex. Moreover, our study is also among the first to test potential interactive and mediating effects of BMI, other measures of cardio-metabolic risk, general health status and selected nutritional biomarkers on the main NfL-mortality relationship. Results indicated that mortality risk was greater in the above median NfL group compared to the group below median NfL, particularly in the overall sample (P = 0.010), with trends observed within each gender group (P < 0.10). When looking at Loge NfL as a continuum, each SD was associated with a markedly increased mortality risk (HR 1.88, 95% CI 1.60–2.20, P < 0.001) in the reduced model adjusted for age, sex, race, and poverty income ratio; a finding that was attenuated with addition of lifestyle and health-related factors (HR 1.67, 95% CI 1.41–1.98, P < 0.001). Four-way decomposition indicated that there was, among others, mediated interaction between NfL and HbA1c and pure mediation with 25(OH)D3 in predicting all-cause mortality, in models adjusted for all other covariates. The former suggests that there was significant interaction effect of the higher NfL exposure on mortality risk with higher HbA1c, when increased mortality risk is necessary for elevated HbA1c. The latter suggests an inconsistent type of mediation in which TE and CDE were positive, while PIE was negative [39, 45, 46]. Specifically, NfL was positively associated with 25(OH)D3 which was then inversely associated with mortality risk. More generally, the significant PIE suggests that the effect of the mediator on the outcome when the exposure is necessary for the presence of the mediator is an inverse one, suggesting that NfL may be potentially in part protective against mortality risk through a mechanism that elevates 25(OH)D3. Furthermore, ACR interacted synergistically with NfL in relation to mortality risk both on the additive and multiplicative scales.

Emerging evidence indicates that elevated plasma NfL levels are associated with a multitude of neurological diseases including sporadic and familial AD [3, 5, 6], frontotemporal degeneration [47], multiple sclerosis [8], traumatic brain injury [7], Parkinson’s disease [2] and other neurological disorders [9]. Importantly, higher blood NfL levels precede the onset of clinical symptoms of AD [48, 49]. Plasma NfL levels may also have clinical value in middle-aged dementia-free adults. For example, in White adults and in those > 50 years, plasma NfL was associated with a faster decline on normalized mental status scores [41]. However, data from the Multidomain Alzheimer’s Preventive Trial (MAPT) indicated that plasma NfL was associated with cognitive scores and executive function only in older adults (median age 75 years) with MCI but not in individuals without cognitive impairment [50]. Therefore, blood NfL levels are associated with neurological diseases, but we are only beginning to understand the relationship between blood NfL and cognition in population-based studies and when and if NfL can have prognostic value in this context.

Recent attention has focused on characterizing blood NfL levels not only in neurological diseases and disorders but also with the associated adverse health outcomes. For example, there have been several reports that have associated plasma NfL levels with mortality due to stroke [51, 52], sporadic Creutzfeldt-Jakob disease [53], and spontaneous subarachnoid and intracerebral hemorrhages [54, 55]. In these studies, the follow-up periods were ~ 30 days with the exception of the Creutzfeldt-Jakob disease study which had a 14.8-month follow-up period. Thus, in these cases of acute neuroaxonal damage higher blood NfL levels were associated with short-term mortality. There have been a few recent reports that have examined whether blood NfL levels could predict all-cause mortality in population-based studies. Three recent studies found that plasma NfL was associated with all-cause mortality in centenarians and nonagenarians [12], among older (> 65 years) adults [11] and among African American middle-aged adults followed for over a decade [13]. In the MEMO study of elderly adults, there were no sex differences reported [11]. This is in contrast to a recent study of diverse middle-aged (mean age around 48 years) African American and White adults [10]. In this study, plasma NfL was associated with mortality only in women [10]. Here, we have added to these current studies by examining serum NfL in a diverse, representative sample of the US that ranged in age from 20 to 85 years. Individuals with levels of NfL in the above median group have a greater mortality risk compared to the group below median NfL. These findings are for the overall sample and only a trend when examined within each sex group. In terms of exploring possible associations through potentially mediating and/or moderating factors, in the HANDLS cohort, there was a possible antagonistic interaction between hsCRP and NfL, indicating that NfL is a better prognostic indicator at normal hsCRP values [10]. Moreover, there was some evidence of synergistic interaction between HbA1c and annualized change in plasma NfL (δNfL) in determining mortality risk, overall [10]. This latter finding is in line our present study, even though we only had data on a single point NfL exposure. Thus, our study further elucidates the relationship between serum NfL and all-cause mortality in the general population. Poor glycemic control and not the duration of diabetes was also found in a previous study to be independently associated with sNfL. Specifically, HbA1c levels were positively correlated to sNfL levels [β(SE)adj = 1.85 (0.64), p < 0.01] [56]. This finding was corroborated by other studies among urban adults and in a comparable national sample [57, 58]. Our novel finding of pure inconsistent mediation with 25(OH)D3 is not supported by previous studies indicating that vitamin D supplementation had no detectable longitudinal effect on serum NfL in both animal and human studies [59, 60].

Our current study has several strengths, including the use of data from a large nationally representative sample of US adults with a wide age range (20–85 years) and a multi-racial/ethnic composition. In addition, this is among few studies to examine the association between serum NfL and mortality among community-dwelling, generally healthy US adults. The continuous NDI linkage to mortality data from baseline till end of 2019 allowed for enough statistical power to test our hypotheses. We used a number of advanced statistical techniques to conduct our analyses, including Cox proportional hazards models accounting for sampling design complexity and adjusting for potentially confounding covariates in multiple imputed data, 2-stage Heckman selection models to adjust for selection bias, and four-way decomposition models to tease out mediation from moderation.

Study findings should be interpreted in light of several limitations. First, even though cohort study designs are considered as the gold standard in epidemiological literature, cause-and-effect relationships cannot be clearly established in the context of observational studies. Second, sample sizes were likely too small to be able to detect interactions among variables of interest, including biomarkers and NfL in relation to all-cause mortality, and especially within sex groups. Thus, four-way decomposition models could not be constructed for men and women, separately, and its statistical power may be limited given the small number of deaths that occurred within the short period of time. The same limitation applied to interactions on the additive scale with binary exposures and potential moderators. Moreover, complete-subject analyses which depend on availability of data on key variables may have resulted in selection bias. Third, the role of chance cannot be ruled out given the large number of statistical tests being performed and thus our statistically significant findings should be interpreted in light of the multiplicity of statistical testing. Fourth, although many covariates were included in the multivariable models, residual confounding cannot be ruled out as an explanation for observed relationships. For example, total caloric intake was assessed using 24-h recall which may not be adequate for the determination of usual food consumption. Fifth, changes in NfL over time may be more important than NfL at a specific point in time. These data were not available in this wave of NHANES. Sixth, ultrasensitive protein assays have enabled accurate measurement of neuronal damage markers like NFL in blood [61]. Despite the inability to distinguish central from peripheral nerve degeneration, these biomarkers are effective in identifying patients with peripheral neuropathy and disease activity in rapidly progressive peripheral neuropathy [61]. However, their use in slowly progressive diseases without significant axonal loss remains uncertain [61]. Finally, four-way decomposition models rely on a strong set of assumptions, specifically that of no unmeasured confounding, which cannot be explicitly ruled out or tested empirically [62].

In conclusion, we report that serum NfL levels measured at the baseline MEC exam can predict all-cause mortality among both men and women. These findings merit further exploration in other large samples of adults, which will add to its usefulness as a potential prognostic marker at varying degrees of cardio-metabolic risk, particularly in terms of HbA1c levels. Our finding of pure inconsistent mediation through 25(OH)D3 merits further examination both in terms of replication and explanation through mechanistic studies. The same applies to the apparent synergism between NfL and urinary ACR in relation to mortality risk. It remains crucial to identify biomarkers for all-cause mortality, given the alarming increase in its rate among younger and middle-aged adults [63]. While, we did not observe any sex-specific findings, future studies should continue to examine heterogeneity in the NfL-mortality association across sex groups, given the sex difference in both NfL levels with age, and mortality. Together, this accumulating evidence suggests that serum levels of NfL are not only markers of neuropathology but can also predict mortality risk and thus may be useful for interventions aimed at elongating lifespan.