Background

Lung function is an important predictor of morbidity and mortality even among adults without chronic respiratory disease [1, 2]. Across the general population, lifetime lung function trajectories have been categorized as persistently poor, worsening, preserved impaired, preserved good, or preserved ideal lung health [3]. As lung function trajectory over the lifespan is a major determinant of the development of future chronic lung disease [3], the early identification of modifiable risk factors is critical.

Emerging evidence suggests that high quality plant-centered diets are associated with improved respiratory health. For example, a diet high in fruits and vegetables has been demonstrated to be associated with improved lung function among individuals without respiratory disease [4]. In another study of smokers without respiratory disease, greater adherence to a Western diet pattern, with higher consumption of red and cured meats and sweets, and lower consumption of fruits, vegetables, legumes, and fish, was associated with increased risk of impaired lung function [5]. Diets high in fruits and vegetables were associated cross-sectionally with a lower prevalence of current wheeze in children [6] and higher FEV1 [4] in adults. In a meta-analysis of mostly cross-sectional studies of fruit and vegetable intake on prevalent wheeze and asthma severity, fruit intake was negatively associated with prevalent wheeze and asthma severity, and vegetable intake was negatively associated with prevalent asthma [7]. While there is an established relationship between long-term consumption of a nutritionally-rich plant-centered diet and cardiovascular mortality throughout adulthood [8], its longitudinal association with lung health has not been explored. To address this gap, we tested the hypothesis that the consumption of a nutritionally-rich plant-centered dietary pattern is associated with improved lung function trajectory across early and middle adulthood among participants from The Coronary Artery Risk Development in Young Adults (CARDIA) Study.

Methods

Study design, setting and participants

CARDIA is a prospective multi-center cohort study of 5,115 young adults from four United States cities: Birmingham, Alabama; Chicago, Illinois; Minneapolis, Minnesota; and Oakland, California. Participants were 18–30 years old at baseline and were followed for 30 years with 71% retention at year 30. There were no exclusion criteria. Participants were randomly selected and recruited by telephone from census tracts in Minneapolis and Chicago, by telephone exchanges within the Birmingham city limit, and from lists of the Kaiser-Permanente Health Plan membership in Oakland and Berkeley [9]. The study protocol has been published elsewhere [10].

Assessment of plant-centered diet quality

Diet was assessed at years 0, 7, and 20 using the validated interviewer-administered CARDIA diet history. Adherence to a nutritionally-rich plant-centered diet was captured using the validated A Priori Diet Quality Score (APDQS), which weights 46 food groups rated a priori as beneficial, neutral, or adverse on the basis of current understanding of their known associations with cardiovascular risk. Each food group is divided into quintiles of consumption, and then scores of 0 (quintile 1) to 4 (quintile 5) are assigned to the beneficially rated food groups, whiles scores of 4 (quintile 1) to 0 (quintile 5) are assigned to the adversely related food groups. Higher scores, indicating better diet quality, are driven mainly by intake of nutritionally-rich plant foods. Plant-based foods such as fruits, avocado, green and yellow vegetables, and whole grains contribute to a higher score and are scored positively, whereas negatively scored foods include refined carbohydrates, red meats, processed meats, soft drinks, and high-fat dairy products. While the main contributors to a higher score are plant foods, certain animal products, including nonfried fish and poultry, also contribute, in recognition of the nutritious value of some non-plant-based foods. Details of the APDQS have been previously described [11].

Assessment of outcome variables

The primary outcome was lung function, including forced expiratory volume in 1 s (FEV1), forced vital capacity (FVC), and FEV1/FVC ratio. Lung function was measured at years 0, 2, 5, 10, 20, and 30 using standard procedures per European Respiratory Society and American Thoracic Society guidelines [12]. Extensive quality control of the measurement devices was carried out during each exam as well as between examinations, using waveform analysis to check comparability when a different device was used in one exam than in another [13]. Annualized lung function decline was calculated by subtracting spirometry parameters at year 30 from those parameters at each participant’s estimated peak lung function and dividing by the difference in years, as has been done previously in this cohort [14]. If year 30 data were not available, year 20 data were used. Time of peak lung function was defined separately for FVC and for FEV1 as the maximum among observations across all exams. Obstructive lung physiology was defined as a ratio of FEV1 to FVC < 0.7.

Other covariates

Demographics and clinical data included age (years), sex, maximal educational attainment (highest grade completed), race (Black, White), cigarette smoking and pack year history, height, weight, field center, and total daily energy intake. Smoking status was assessed yearly. Previous studies of CARDIA participants have shown strong correlation between self-reported cigarette smoking and year 0 cotinine measurements [15]. Cardiorespiratory fitness (assessed as treadmill time in seconds at years 0, 7, and 20) and history of asthma were included in sensitivity analyses.

Statistical analysis

Baseline descriptive statistics were reported according to quintiles of APDQS and whether the participant completed the study.

We identified the year of study at which each individual’s maximum lung function was achieved and examined annualized change in lung function from peak through year 30 (year 20 if year 30 was missing). We ran a simple linear regression, adjusting for sex, education, race, smoking status (measured at every annual follow-up), pack-year history (years 0, 2, 5, 7, 10, 15, and 20), height, BMI, total energy intake (averaged over the time period prior to spirometry), and site, in which the independent variable of interest was APDQS (average of all available diet data at years 0, 7, and 20). As a sensitivity analysis, we additionally adjusted for current asthma and cardiorespiratory fitness, which we conceptualized as confounders with influence on both diet and lung function. Cardiorespiratory fitness has been previously shown to be associated with lung function trajectory [17]. In order to account for potential misclassification of diet quintiles due to drop out, we performed a purely prospective sensitivity analysis using the year 0 APDQS rather than average.

In a more comprehensive analysis, we used a mixed linear model (SAS PROC MIXED) to evaluate associations between APDQS and pulmonary function and its changes, including FEV1, FVC, and FEV1/FVC. We used a serial averaging approach for APDQS, where dietary data is remeasured or carried forward and serially averaged up to and including each measurement of spirometry, which allows for minimization of random within-person error, better reflects the cumulative, long-term effect of diet, and preserves sample size. Specifically, year 0, 2, and 5 lung function were paired with year 0 APDQS; year 10 lung function was paired with the average of year 0 and 7 APDQS; and years 20 and 30 lung function were paired with the average of years 0, 7, and 20 APDQS. Pulmonary function, including FEV1, FVC and FEV1/FVC, was estimated per one standard deviation difference in APDQS (SD = 13 points) in repeated measures regression models, adjusting for the same covariates as in the simple linear regression. We adjusted models for age squared as prior work has shown lung function to have a quadratic decline [16]. If data on exposures and outcomes of interest were missing, participants were excluded. For continuous covariates (height and cardiorespiratory fitness), mean values were assigned if data were missing. To account for patients who had missing spirometry because of death before year 30, we evaluated the slope of FEV1% predicted among participants who died before year 30 by adding the interaction of death status with time to the main model. To characterize the change in pulmonary function and APDQS, we incorporated APDQS*time terms. To further understand the inverse APDQS* time interaction, we used annualized decline from peak lung function at each exam year as the dependent variables in additional separate linear regressions with overall average APDQS and the same covariates. Given that smokers may have important differences in both dietary pattern and susceptibility to environmental influences on lung function, we also tested for an interaction between smoking status and diet.

Lifetime trajectories of percent predicted lung function were generated using a group-based trajectory modeling approach (SAS PROC TRAJ), previously described by Washko et al. [3], which fits a mixture model via maximum likelihood. Participants were assigned a priori to one of the five trajectories derived from the model as persistently poor, worsening, preserved impaired, preserved good, or preserved ideal lung health [3], then stratified by APDQS quintiles.

For the outcome of airflow obstruction, Cox proportional-hazards regression models were created for incident obstructive lung physiology according to quintiles of the APDQS, stratified by smoking status (current, former, or never). Hazard ratios were adjusted for the same covariates. All analyses were conducted using SAS version 9.4 (SAS Institute Inc., Cary, NC).

Results

Study population

A total of 3,787 participants were included in this study, as shown in Table 1. We excluded participants who withdrew consent (n = 1), lacked outcome measures (n = 1243), or lacked exposure measures at Y0 (n = 132), leaving 3787 participants as a final sample. Ten participants (0.3%) and 24 (0.6%) were missing data on height and cardiorespiratory fitness, respectively. At enrollment, compared with the participants in the lowest APDQS quintile, those in the highest quintile were older, were more likely to be female and White, had higher maximal educational attainment, lower BMI, and lower energy intake, were less likely to be a current smoker, and had higher baseline FEV1 and FVC. Among all participants, 2627 (69.4%) had dietary information at all three timepoints, 967 (25.5%) had 2 measurements, and 193 (5.1%) had 1 measurement. Compared to participants who were excluded, participants who completed the study had a better diet, were older, and less likely to smoke, as shown in Additional file 1: Table S1. Dietary intake strongly tracked over time. For example, Year 0 APDQS had a correlation about 0.63 and 0.58 with Year 7 and Year 20 APDQS, respectively. The correlation between Year 7 and Year 20 was 0.64.

Table 1 Baseline characteristics (Year 0) of participants according to quintiles of the Year 0 APDQS

Lung function trajectory groups

The groups of participants with preserved good and preserved ideal lung health had a greater proportion of highest quintile APDQS (21% and 22%, respectively) than the group with persistently poor lung health (12%), whereas the group with persistently poor lung health had the highest proportion (34%) of lowest quintile APDQS (Fig. 1).

Fig. 1
figure 1

Relative distribution of APDQS quintiles among different lung function trajectories (FEV1% predicted). Only participants with year 30 data and at least one other timepoint (n = 3097) were included to ensure that trajectories reflected lung function changes into middle age. Quintile 5 APDQS was more represented in participants with preserved ideal and preserved good lung health, whereas participants with persistently poor lung health were more likely to have scores in quintile 1. The median APDQS scores were 52, 59.7, 66, 72.5, and 82 for quintiles 1, 2, 3, 4, and 5 respectively

APDQS and lung function

In multivariable repeated measures regression models, there were significant associations between APDQS and annual changes in both FVC and FEV1 (Table 2). Individuals in the lowest (poorest diet quality) APDQS quintile had declines in FEV1 that were 1.6 ml/year greater than individuals in the highest (best diet quality) quintile (35.0 vs. 33.4 ml/year; ß ± SE per 1 SD change APDQS, 0.94 ± 0.36, p = 0.009) and declines in FVC that were 2.4 ml/year greater than those in the highest quintile (37.0 vs 34.6 ml/year; ß ± SE per 1 SD change APDQS, 1.71 ± 0.46, p < 0.001). APDQS was not significantly associated with FEV1/FVC. The association was not different between never and ever smokers (pint = 0.07 for FVC and 0.32 for FEV1). In sensitivity analyses where current asthma diagnosis and cardiorespiratory fitness were further adjusted, results remained similar. When exposure was defined as year 0 APDQS, a weaker and marginally significant association was shown for FVC and no association was seen with FEV1. (Additional file 2: Table S2).

Table 2 Association between APDQS average and annual changes in pulmonary function measures

Those with fewer FEV1 measures had a higher death rate. In addition, those who died during follow-up had a faster decline in lung function than those who survived. Nevertheless, in a sensitivity analysis, accounting for those who died before year 30 and their trend in FEV1 or FVC predicted did not substantially alter the associations between APDQS and change in FEV1 or FVC (ß ± SE 0.91 ± 0.36 (p = 0.01) and 1.67 ± 0.46 (p < 0.001), respectively). Lastly, there was no significant difference in the development of incident obstructive lung physiology across quintiles of APDQS and smoking status in any of the models (Additional file 3: Table S3).

At cross-sectional analyses at each time point across young and middle adulthood, higher APDQS was associated with higher FEV1 and FVC (Fig. 2 and Table 3) after covariate adjustment. In the repeated measures analysis, p = 0.0092 for an association of FVC with APDQS that is weaker later in the study than earlier. This inverse interaction is described in separate linear regression analyses of lung function decline at each exam year according to average APDQS, as shown in Table 4.

Fig. 2
figure 2

Mean pulmonary function measures (A FEV1% predicted, B FVC % predicted) according to quintiles of the APDQS at each CARDIA exam year. Linear mixed effect models fitted to the repeated measures of lung function, with random intercepts and fixed slopes of APDQS × time interaction. N = 3787 (no. of observations = 20,134). Covariates included current age, time variables (Years 0, 2, 5, 10, 20, and 30), and time interactions with race (Black and White), sex, center (Birmingham, Chicago, Minneapolis, and Oakland), maximal educational attainment, height, total energy intake (Years 0, 7, 20), BMI (Years 0, 2, 5, 7, 10, 15, and 20), and lifetime pack-years of smoking (Years 0, 2, 5, 7, 10, 15, and 20). All covariates were time-varying except for race, sex, and height. 1 SD was 13

Table 3 Cross-sectional associations between the APDQS and pulmonary function measures at each exam year
Table 4 Association between APDQS average and annual changes in pulmonary function measures from peak at each exam year per 1 SD APDQSa,b

Discussion

In this 30-year follow-up longitudinal study, we found that a nutritionally-rich plant-centered diet was associated with significantly better lung function both cross-sectionally and longitudinally, even after adjustment for demographic and lifestyle factors influencing lung health. We found a difference of 1.6 ml/year decline in FEV1 when comparing participants in the 1st and 5th quintiles of APDQS. Putting this into context, one cohort study estimated the excess FEV1 decline from every 10 pack years of smoking at 2.5 ml/year [18], while current asthma has been associated with excess FEV1 decline of 3.7–9 ml/year [18,19,20]. Importantly, in our study smoking status did not significantly modify the benefits of consuming a nutritionally-rich plant-centered diet. Consistent with these findings, a higher proportion of high quality APDQS was observed in participants with preserved good and preserved ideal lung health trajectories than in patients with worsening or persistently poor lung health. Consequently, consumption of a typical American diet that is nutrient poor and rich in processed, calorie-dense foods may substantially contribute to the population burden of excess lung function decline and associated morbidity and mortality throughout adulthood.

Cross-sectional analysis at each time spirometry was measured demonstrated substantial changes in lung function according to diet. We found that an increase of 13 points in APDQS would be associated with a 47.18 ml increase in FEV1 for a 25-year-old Caucasian man of average height using Hankinson prediction equations. This is consistent with results of prior cross-sectional work: for example, the study by Tabak et al. [4] showed a difference of just 110 to 169 ml before energy adjustment and from 53 to 118 ml after energy adjustment comparing fruit and vegetable consumption above and below median. It is worth noting that differences in lung function between APDQS quintiles were apparent at the beginning of the study period and remained relatively stable through the study’s completion, raising the possibility that higher lifetime lung function observed among those in the top quintiles of plant-centered diet intake may in part be a function of reaching and/or sustaining optimal peak lung health after adolescence, and hence may reflect nutritional exposures occurring in the prenatal or childhood life stages and altering respiratory programming [21]. Intriguingly, the effect of diet lung function decline from peak was strongest early in the study period, again supporting the hypothesis that while diet may contribute to reaching and sustaining lung health, it is unable to overcome pathologic processes that take over later in the lifecourse. Research on critical windows is necessary to develop dietary recommendations for both children and adults to prevent adverse long-term respiratory outcomes.

Mechanistically, plant-centered diets rich in fruits and vegetables contain antioxidants (vitamin C, flavonoids, and carotenoids), which attenuate oxidative stress and may play a role in the pathogenesis of COPD [22]. In addition, dietary fiber, a key component of plant-based foods, has been shown to attenuate inflammatory responses [23], possibly through alterations in the gut microbiome and increased production of anti-inflammatory metabolites such as short-chain fatty acids [24]. In a study of mice exposed to cigarette smoke, a high fiber diet decreased interleukin-6 and interferon-gamma in bronchoalveolar lavage and serum samples, attenuated development of emphysema, and was protective against alveolar destruction [25]. An analysis of plant-based flavonoids and age-related decline in lung function from the Veteran’s Administration Normative Aging Study found that anthocyanins, a subclass of flavonoids found primarily in berries, were strongly associated with less age-related decline in FEV1 and FVC for the highest quartile of intake compared with the lowest [26]. On a micronutrient level, a nine-year longitudinal study of participants in Nottingham, England found that higher intake of vitamin C—abundant in fruits and vegetables—was associated with a lower rate of FEV1 decline by 50.8 mL per 100 mg of vitamin C [27]. Thus, individual components of a nutritionally-rich plant-centered diet, including vitamins, minerals, fibers, and phytochemicals may work synergistically to provide beneficial effects on lung function [28].

No association was observed between diet and FEV1/FVC. This is perhaps because of limitations of power, with compounding of measurement error in representing a derived quantity; it could also owe to the relative youth of the study participants, with most obstructive lung physiology developing later in life. Finally, it may be that the underlying inflammatory pathobiology of poor diet affects FEV1 and vital capacity equally, and future research is needed to elucidate the link between cell and tissues effects of inflammation and mechanical changes in lung physiology.

Strengths and limitations

Our study has several important strengths. Follow-up was over an extended period, with excellent retention and repeated spirometry over 30 years, as well as repeated diet information collected through an interviewer-administered diet history. By capturing adults in early to middle adulthood, we gained insight into early influences on lung function decline. The APDQS provides pragmatic, achievable pathways to healthy eating, reinforcing fruit and vegetable intake without excluding animal products. Smoking status was assessed annually with previous evaluations demonstrating a high degree of correlation between self-report of smoking status and cotinine concentrations [15].

There are a few limitations of the study worth noting. While we carefully adjusted for relevant covariates, including accounting for both the time-varying presence and extent of smoking behaviors, residual confounding remains a possibility, mediated by neighborhood level factors such as air pollution and secondhand smoke. Future work should pay careful attention to neighborhood as well as individual influences on lung function. Diet questionnaires, while administered by trained interviewers, were ultimately self-reported and subject to recall bias. The CARDIA cohort comprised only White and Black participants, limiting generalizability to other races/ethnicities. The small number of years with complete diet data limits assessment of change in diet over time, which might influence outcomes. The study had 71% retention at year 30, and participants with missing data may have been prognostically different. Participants for whom year 20 lung function data were used may not have had time to develop significant lung function decline, since lung function is usually maintained to around age 40; however, this would be expected to bias toward the null. Finally, the trajectory analysis is limited by its descriptive nature and smaller numbers of participants in the highest and lowest quintiles of APDQS. Further analyses could elucidate which specific foods contributed most to the primary outcome, the impact of dietary changes over the life course, and whether results differed by sex. Replication in an independent cohort would strengthen causal inferences. Since our study was not powered to determine the effect of plant-centered diet quality in smokers, future work should pay close attention to this vulnerable subgroup.

Conclusions

After adjustment for demographic and lifestyle factors impacting lung health, we found that adherence to a nutritionally-rich plant-centered diet was associated with cross-sectional differences in FEV1 and FVC and a significant, modest attenuation in FEV1 and FVC decline.