Associations between exploratory dietary patterns and incident type 2 diabetes: a federated meta-analysis of individual participant data from 25 cohort studies

Jannasch, Franziska; Dietrich, Stefan; Bishop, Tom R. P.; Pearce, Matthew; Fanidi, Anouar; O’Donoghue, Gráinne; O’Gorman, Donal; Marques-Vidal, Pedro; Vollenweider, Peter; Bes-Rastrollo, Maira; Byberg, Liisa; Wolk, Alicja; Hashemian, Maryam; Malekzadeh, Reza; Poustchi, Hossein; Luft, Vivian C.; de Matos, Sheila M. Alvim; Kim, Jihye; Kim, Mi Kyung; Kim, Yeonjung; Stern, Dalia; Lajous, Martin; Magliano, Dianna J.; Shaw, Jonathan E.; Akbaraly, Tasnime; Kivimaki, Mika; Maskarinec, Gertraud; Le Marchand, Loïc; Martínez-González, Miguel Ángel; Soedamah-Muthu, Sabita S.; Wareham, Nicholas J.; Forouhi, Nita G.; Schulze, Matthias B.

doi:10.1007/s00394-022-02909-9

Associations between exploratory dietary patterns and incident type 2 diabetes: a federated meta-analysis of individual participant data from 25 cohort studies

Original Contribution
Open access
Published: 01 June 2022

Volume 61, pages 3649–3667, (2022)
Cite this article

Download PDF

You have full access to this open access article

European Journal of Nutrition Aims and scope Submit manuscript

Associations between exploratory dietary patterns and incident type 2 diabetes: a federated meta-analysis of individual participant data from 25 cohort studies

Download PDF

Franziska Jannasch ORCID: orcid.org/0000-0003-3478-4758^1,2,3,
Stefan Dietrich^1,4,
Tom R. P. Bishop⁵,
Matthew Pearce⁵,
Anouar Fanidi⁵,
Gráinne O’Donoghue⁶,
Donal O’Gorman⁷,
Pedro Marques-Vidal⁸,
Peter Vollenweider⁸,
Maira Bes-Rastrollo^9,10,11,
Liisa Byberg¹²,
Alicja Wolk^12,13,
Maryam Hashemian^14,15,
Reza Malekzadeh¹⁴,
Hossein Poustchi¹⁶,
Vivian C. Luft¹⁷,
Sheila M. Alvim de Matos¹⁸,
Jihye Kim¹⁹,
Mi Kyung Kim¹⁹,
Yeonjung Kim²⁰,
Dalia Stern²¹,
Martin Lajous²¹,
Dianna J. Magliano²²,
Jonathan E. Shaw²²,
Tasnime Akbaraly^23,24,
Mika Kivimaki²⁴,
Gertraud Maskarinec²⁵,
Loïc Le Marchand²⁵,
Miguel Ángel Martínez-González^9,10,11,26,
Sabita S. Soedamah-Muthu^27,28,
EPIC-InterAct Consortium,
Nicholas J. Wareham⁵,
Nita G. Forouhi⁵ &
…
Matthias B. Schulze^1,2,3

3788 Accesses
5 Citations
7 Altmetric
Explore all metrics

Abstract

Purpose

In several studies, exploratory dietary patterns (DP), derived by principal component analysis, were inversely or positively associated with incident type 2 diabetes (T2D). However, findings remained study-specific, inconsistent and rarely replicated. This study aimed to investigate the associations between DPs and T2D in multiple cohorts across the world.

Methods

This federated meta-analysis of individual participant data was based on 25 prospective cohort studies from 5 continents including a total of 390,664 participants with a follow-up for T2D (3.8–25.0 years). After data harmonization across cohorts we evaluated 15 previously identified T2D-related DPs for association with incident T2D estimating pooled incidence rate ratios (IRR) and confidence intervals (CI) by Piecewise Poisson regression and random-effects meta-analysis.

Results

29,386 participants developed T2D during follow-up. Five DPs, characterized by higher intake of red meat, processed meat, French fries and refined grains, were associated with higher incidence of T2D. The strongest association was observed for a DP comprising these food groups besides others (IRR_pooled per 1 SD = 1.104, 95% CI 1.059–1.151). Although heterogeneity was present (I² = 85%), IRR exceeded 1 in 18 of the 20 meta-analyzed studies. Original DPs associated with lower T2D risk were not confirmed. Instead, a healthy DP (HDP1) was associated with higher T2D risk (IRR_pooled per 1 SD = 1.057, 95% CI 1.027–1.088).

Conclusion

Our findings from various cohorts revealed positive associations for several DPs, characterized by higher intake of red meat, processed meat, French fries and refined grains, adding to the evidence-base that links DPs to higher T2D risk. However, no inverse DP–T2D associations were confirmed.

Metabolic syndrome and dietary patterns: a systematic review and meta-analysis of observational studies

Article 07 September 2016

Link of dietary patterns with metabolic syndrome: analysis of the National Health and Nutrition Examination Survey

Article Open access 20 March 2017

Association between dietary patterns and prediabetes, undetected diabetes or clinically diagnosed diabetes: results from the KORA FF4 study

Article Open access 30 October 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

A large number of prospective studies have evaluated dietary patterns (DP) in relation to the risk of developing type 2 diabetes mellitus (T2D) [1]. While evidence for a priori, also called hypothesis-driven, DPs like the Mediterranean diet is convincing [2], evidence for DPs derived by exploratory methods using the data at hand is inconsistent [1]. Several studies reported associations of study-specific exploratory DPs with higher T2D risk, some of them labelled “Western” [3,4,5,6,7,8,9,10,11]. Such DPs frequently included red meat [3,4,5,6,7, 9, 10], refined grains [3,4,5,6,7,8,9,10], sugary drinks [3, 7, 8, 10] or French fries [3,4,5,6, 8,9,10]. However, the composition of these exploratory DPs still differs in other food groups (FG) besides those mutual ones (1) and the food groups per se can comprise different food items based on the study specific assessment and dietary habits. In addition, similarly labelled DPs (e.g. “Western”) showed heterogeneous associations with T2D risk [1, 12]. Thus, the exploratory nature of DPs results in study-specific observations rather than generalizable findings. So far, little effort has been made to assess the actual generalizability of DP–T2D associations. This limits the accumulation of consistent evidence from cohort studies on DP associations with T2D—thus, evidence from exploratory DPs to inform dietary recommendations has been sparse.

A solution to overcome the limitation of study-specific findings is to replicate the association of DPs with T2D in independent populations. So far, only one study investigated the generalizability of T2D-associations with DPs derived by principal component analysis (PCA). However, this study was restricted to European populations participating in the EPIC-InterAct consortium with the aim to replicate only those T2D-associated DPs which were derived in country-specific analyses within this consortium [13]. In addition to PCA, patterns derived by reduced rank regression, were also replicated [14,15,16]. The main principle for those replication approaches is the reconstruction of pattern variables based on the reported pattern structure. In this context, it has been proposed to derive so-called simplified DP variables to construct less population-dependent DP variables with a content approximately similar to that of original exploratory DPs. It has been shown that the DP variables, calculated with this method, correlated highly with the original DP and reflected variation in intake of individual components well [14, 16, 17]. Hence, this approach seems well suited to replicate study-specific associations of exploratory DPs in independent study populations. To date, however, this method has not been used to examine exploratory DPs in relation to T2D across populations from different continents of the world.

To overcome the research gap of investigating the generalizability of DP–T2D associations using the approach of simplified DPs, the present study aimed 1) to investigate the association of previously reported T2D-associated DPs [1] with incident T2D and 2) to evaluate, if two DPs of overlapping FGs (“mainly healthy” and “mainly unhealthy”), also previously identified in the same systematic review [1], are associated with incident T2D. For this purpose, the InterConnect collaboration project offers a well-suited research platform for federated meta-analyses of harmonized individual level study data from 25 cohorts [18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33] across different continents and adjusting for a common set of potential confounders across studies [34,35,36]. As another advantage, this approach allowed the inclusion of cohorts that have relevant data, but never published on the topic before.

Methods

Study populations

InterConnect was an EU-FP7 funded project which aimed to optimise the use of existing data by enabling cross-cohort analyses within consortia without pooling of data at a central location ("http://www.interconnect-diabetes.eu/") [34]. For the current study, the InterConnect Data Discovery registry (http://www.interconnect-diabetes.eu/data-discovery/) and literature was screened to identify cohorts with suitable data like study populations representing the general population without prevalent T2D, dietary intake information (amount, frequency), incident T2D as outcome (self-report, objective measures), and information on the covariates age, sex, smoking, body mass index (BMI), waist circumference or waist-hip ratio, physical activity, alcohol consumption, education or occupation, family history of diabetes, other health exposures (cardiovascular diseases, history of previous illness). Of 103 identified cohorts, 25 collaborating cohorts (Table S1) contributed data to this project [18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33, 37]. The Zutphen Elderly study also contributed data, but was excluded due to a too low number of cases [37]. Other reasons for non-participation (Fig. S1) were failed contact (n = 46), no interest in research question (n = 10), insufficient data (n = 15) or no study capacity (n = 6). The collaborating cohorts [18,19,20,21,22,23,24,25,26,27,28,29,30,31,32, 38] included 13 cohorts from Europe, eight from the Americas (North and South America), three from Western Pacific (Australia, Republic of Korea), and one from the Eastern Mediterranean (Iran). All cohorts obtained ethical review board approval at the host institution and informed consent from participants.

Dietary assessment and construction of dietary patterns

Dietary intake was assessed by food frequency questionnaires (FFQ) in most cohorts, by dietary history interview and a 24-h recall in one cohort each (Table S1). For the present study food intake encoded in g/day was used. Some cohorts provided only standard portion sizes and frequency of consumed food items, which were converted into g/day. For some US cohorts, where information on portion size was not available, variable-specific standard portion sizes sourced from the United States Department of Agriculture [39] were used.

The dietary data of all cohorts were then harmonized to form a set of food groups. For this purpose, the FGs used in the published DPs associated with T2D risk were compared. Based on this, a set of FGs was defined to be used across all published DPs (Tables 1, S2 and S3). If for a specific food item, which was used in the original DP, no intake information was available in other included studies, it was omitted. Then the respective study-specific food items were added in each InterConnect cohort to form the corresponding harmonized FG (Excel Table S6). Subsequently, DPs were constructed based on the harmonized FGs. The structure of DPs was defined based on the findings of our previous systematic review [1], thus reflecting a) DPs found to be significantly associated with T2D risk in at least one cohort study (13 individual DPs) and b) two DPs reflecting DPs with overlapping food composition: the DP reflecting the overlap of “mainly healthy” food groups was composed of fruits, vegetables, legumes, poultry and fish, while the DP of “ mainly unhealthy” food groups was composed of refined grains, French fries, red meat, processed meat, high-fat dairy products and eggs. Thus, 15 DPs in total were constructed. To calculate individual DP scores for study participants, the approach of simplified DPs [17] was used. In PCA-derived DPs, all food groups contribute with a respective factor loading to the overall pattern structure. The simplification approach considers only those FGs with strong contribution to the respective DP (factor loading (FL) ≥ 0.2) in the original DPs. Details of which FGs were combined to calculate the respective simplified DP scores are shown in Tables 1, S2 and S3. These FGs were standardized according to the distribution in each participating study, respectively. Then, simplified DP scores were calculated by summing up the selected FGs without any weighting (in original DP the respective FL is the weighting) and by also considering negative algebraic signs for those FGs with negative FL from the original publication. Finally, study-specific simplified DP scores were also standardized to allow meta-analysis across cohorts [17].

Table 1 Risk estimates for T2D from the original studies, where DPs were derived in and composition of simplified pattern variables used for the analyses in InterConnect

Full size table

Ascertainment of incident T2D

To minimize potential variations due to varying diagnosis criteria of T2D incidence across cohorts, two harmonized outcomes were defined [40]. As primary outcome, clinically incident T2D was defined when any one or more of the following criteria were fulfilled: (1) ascertained by linkage to a registry or medical record; (2) confirmed antidiabetic medication usage; (3) self-report of physician diagnosis or antidiabetic medication, verified by any of the following: (a) at least one additional source from 1 or 2 above, (b) biochemical measurement (glucose or HbA1c), (c) a validation study with high concordance. As secondary outcome with less strict criteria, we defined incident T2D, when any of the following criteria were fulfilled: (1) ascertained by linkage to a registry or medical record; (2) confirmed antidiabetic medication usage; (3) self-report of physician diagnosis or antidiabetic medication or (4) biochemical measurement (glucose or HbA1c).

Assessment of covariates

We defined a set of potential confounders to be used in analyses based on: (1) frequent usage in the studies of the 13 published T2D-associated DPs and (2) availability across all participating InterConnect cohorts (Table S4). The final set of confounders included: age at baseline (years), sex, body mass index (BMI) (kg/m²), physical activity (PA, cohort specific items were used), education (cohort specific items were used), smoking (never, former, current smoker), alcohol consumption (g/day), hypertension (yes/no), and energy intake (kcal/day). The recorded data of confounders of the respective InterConnect cohorts were used and harmonized across all cohorts, if possible (Table S5). All cohorts provided age in years, BMI in kg/m², hypertension as yes or no. Smoking was harmonized as never, former, and current smoker, energy intake into kcal/day and alcohol into g/day. In the Golestan Cohort Study from Iran alcohol consumption was used as never or ever drinker. Study-specific coding was used for PA and education because harmonization was not feasible due to extensive differences in codes (Table S5).

Statistical analysis

All analyses were conducted using R within the DataSHIELD federated meta-analysis programming library [35]. For analysis, participants with the following criteria were excluded: T2D, myocardial infarction, stroke or cancer at baseline to avoid reverse causation, extreme energy intake (men < 800 kcal or > 4200 kcal, women < 500 kcal or > 3500 kcal), missing follow up time, missing confounders, and more than 10% missing food items. In total, 46.9% of the participants of the InterConnect cohorts were excluded (Table 2). Baseline characteristics were calculated stratified by cohorts. Normally distributed variables were presented as mean and standard deviation (SD), not normally distributed as median and interquartile range (IQR), and categorical variables as relative percentages.

Table 2 Characteristics of analyzed data^a of the participating 25 InterConnect cohorts

Full size table

Incidence rate ratios (IRRs) and 95% confidence intervals (CI) were estimated to test for the associations between 1 standard deviation (SD) increase in DP scores and incident T2D in each cohort separately, using Piecewise Poisson regression adjusted for age, sex, BMI, PA, education, smoking, alcohol consumption, hypertension and energy intake. The Piecewise Poisson regression is available in the DataSHIELD library and has been shown to represent a close approximation to the Cox Proportional Hazards regression [41]. For the European Prospective Investigation into Cancer and Nutrition (EPIC)-InterAct cohorts a weighting was applied that is analogous to Prentice weighting (weights of 1 for all cases and weights of \(\frac{\#\mathrm{ non}-\mathrm{cases in whole cohort}}{\#\mathrm{ non}-\mathrm{cases in subcohort}}\) for non-cases) to account for the case-cohort design in survival analyses, when using the piecewise Poisson method [42].

Pooled IRR were estimated using random-effects meta-analysis models and were visualized with forest plots. Heterogeneity was assessed using I², p value of chi-square test and tau² statistic. For each DP a statistical model for the primary and the secondary outcome was calculated. For sensitivity analysis we calculated a second set of the 13 DPs by considering only FGs with FL ≥ 0.4 in the original publication to identify those strongly contributing to the DP. Moreover, a sensitivity analysis with exclusion of certain component FGs was conducted to estimate if few FGs were mainly driving the association from the UDP3, which showed the strongest association with T2D. To account for characteristics potentially explaining heterogeneity between the cohorts, meta-regressions were calculated with the pooled IRR as dependent variable and age, BMI, follow-up time and region as the independent variables. For this, the metareg function within the metafor package (version 3.02) in R was used.

Results

In the present analysis, data from 390,664 participants across 25 cohorts with a median follow-up time ranging from 3.8 to 25.0 years were included (Table 2). Four cohorts included only women (EPIC-InterAct-France, Mexican Teachers' Cohort (MTC), Swedish Mammography Cohort (SMC), Women's Health Initiative Observational Study (WHI-OS)) and two only men (Cohort of Swedish Men (COSM), Puerto Rico Heart Health Program (PRPHH)). Participants from Coronary Artery Risk Development in Young Adults (CARDIA) study, MTC and Seguimiento University of Navarra (SUN) cohort were of younger age (24.9–41.8 years), whereas participants from other cohorts were older (49.5–63.1 years). The mean BMI ranged from 23.9 kg/m² in SUN to 29.3 kg/m² in EPIC-InterAct-Spain. During follow-up, 29,386 clinically incident cases of T2D were recorded for the primary outcome and 36,527 incident cases for the secondary outcome.

The dietary intake of harmonized FGs showed marked differences between the cohorts (Excel Supplemental Table). For example, reported median fruit intake was highest in MTC (321.7 g/day) and about three times higher than median intake in the cohorts with lowest fruit intake like CARDIA (94.9 g/day) and EPIC-InterAct-Germany (91.4 g/day). Particularly high intakes compared to other cohorts were observed for vegetables in SUN Study (391 g/day), legumes and soy (but mostly beans) in Brazilian Longitudinal Study of Adult Health (ELSA-Brasil) (151.0 g /day), refined grains in Golestan (365.0 g/day), whole grains in COSM (127.0 g/day) and EPIC-InterAct-Germany (120.3 g/day), and sugary drinks in CARDIA (244.9 g/day).

Healthy dietary patterns and risk of T2D

None of the HDPs (Table 3, Figs. 1, 2, Supplemental Table 6, Supplemental Figs. 2–5) were robustly associated with a reduced risk of T2D. This was the case for the two outcome definitions and for the two versions of each HDP constructed using different cut-offs of FL to define component FGs. HDP1 was significantly associated with a higher T2D risk (primary outcome: pooled IRR per SD = 1.057, 95% CI 1.027–1.088; secondary outcome: IRR per SD = 1.042, 95% CI 1.018–1.065, Table 3). This DP contains vegetables, fruits, margarine, nuts, poultry, eggs, fish, red meat, whole milk, high fat dairy and low-medium fat dairy. However, this association was absent in sensitivity analysis, when only FGs with published absolute FL ≥ 0.4 (vegetables and fruits, Table 2) were used to construct the HDP1 (Supplemental Table 6). HDP3, composed of fruits and dairy products, was also not significantly associated with T2D risk (pooled IRR per SD = 0.976, 95% CI 0.948–1.005, Table 3), when using the secondary outcome definition. For the remaining HDPs (2, 4–6) the pooled risk estimators did not indicate associations with T2D risk (Table 3). Overall, there was moderate to substantial heterogeneity (I² = 58–83%, Table 3) for the HDP–T2D associations. For HDP1, none of the characteristics (age, BMI, follow-up time and region) explained the observed heterogeneity (I² = 66%) in meta-regressions (data not shown).

Table 3 Pooled findings of federated random effect meta-analyses to test for the association between the simplified healthy and unhealthy dietary pattern variables (per one standard deviation) (cut-off factor loadings > 0.2) and incident type 2 diabetes across InterConnect cohorts

Full size table

Unhealthy dietary patterns and risk of T2D

Five of the seven UDPs (UDP3-7) were associated with a higher T2D risk in pooled analyses across all cohorts (Table 3, Figs. 1, 2, Supplemental Table 6, Supplemental Figs. 2–5). The UDP 3–7 included mostly meat products, French fries and refined grains (Table 2). Only UDP 6 differed from these DPs, as meat products were not included, but soft drinks and the components whole grains, vegetables, fruits and legumes (including soy) with negative weightings were included. UDP 3 showed the strongest association with incident T2D (primary outcome: pooled IRR per 1 SD = 1.104, 95% CI 1.059–1.151; secondary outcome: pooled IRR per 1 SD = 1.094, 95% CI 1.056–1.133 for UDP3 based on FL ≥ 0.2). However, heterogeneity was substantial across studies (I² = 85% and 84%). The region partly explained heterogeneity for UDP3 (16%) in meta-regression. When UDP3 was constructed using FGs with FL ≥ 0.4, only red meat remained as component and associations were considerably weaker, although still statistically significant (Supplemental Table 6). Most cohort-specific IRRs indicated that UDP3 was associated with a higher T2D risk or a trend towards an association (Figs. 1, 2). Similar findings, although weaker, were observed for UDPs 4–7, where heterogeneity ranged from moderate (I² = 49% for UDP 4) to substantial (I² = 81% for UDP 6). Here, region explained a considerable proportion of the heterogeneity for UDP6 (29%) and UDP7 (25%), while follow-up time explained 30% for UDP5 and 24% for UDP6 of the overall heterogeneity. No association with T2D risk was found for UDP 1 and UDP 2, neither for the two outcome definitions nor for the two FL cut-offs (Table 3, Supplemental Table 6).

Dietary patterns with “mainly healthy” and “mainly unhealthy” food groups and T2D risk

We evaluated the two DPs reflecting previously published DPs with overlapping FG components irrespective of whether they have been described to be associated with T2D previously or not [1]. The DP consisting of “mainly healthy” FGs, i.e. fruits, vegetables, legumes, poultry and fish, was not associated with T2D risk across the included cohorts (primary outcome: pooled IRR per 1 SD = 1.033, 95% CI 0.998–1.071; secondary outcome: pooled IRR per 1 SD = 1.000, 95% CI 0.975–1.026) (Fig. 3, Supplemental Fig. 6). The heterogeneity across studies was substantial (primary outcome: I² = 84%, secondary outcome: I² = 76%). Hence, the forest plots show the cohorts arranged by region. In contrast, the DP consisting mainly of “mainly unhealthy” FGs, i.e. refined grains, French fries, red meat, processed meat, high-fat dairy products and eggs, was significantly associated with a higher T2D risk (primary outcome: pooled IRR per 1 SD = 1.079, 95% CI 1.051–1.108; secondary outcome: pooled IRR per 1 SD = 1.067, 95% CI 1.037–1.098) (Fig. 3, Supplemental Fig. 6). The heterogeneity was moderate for the primary outcome (I² = 58%), but substantial for the secondary outcome (I² = 74%). Most study-specific IRRs indicated a higher risk of this DP, except for the Golestan Cohort Study, which pointed towards an inverse association.

Sensitivity analysis of UDP 3

UDP3 was composed of the FGs red meat, processed meat, poultry, eggs, fish, French fries, refined grain products, and rice. To assess the contribution of these individual FGs to the T2D risk of UDP3, a sensitivity analysis was carried out by excluding individual FGs (Supplemental Table 7). The exclusion of refined grains resulted in the highest reduction of the IRR estimate (from 1.094–1.047, − 4.74%), followed by processed meat (− 1.66%) and eggs (− 1.10%).

Discussion

This study investigated associations between exploratory DPs and T2D risk in a large number of prospective cohort studies in a worldwide context, using harmonized data analyses across all studies and federated meta-analyses of individual studies. No robust inverse associations were observed between HDPs and risk of T2D. HDP1 was associated with a higher T2D risk in primary analysis, but this unexpected finding was not confirmed in sensitivity analyses. We observed more consistent findings for UDPs with five of the seven UDPs being associated with higher T2D risk in our meta-analysis of included studies. We investigated two DPs which reflect commonly shared FGs of exploratory DPs identified in previous studies on DP and T2D. The DP with “mainly healthy” FGs, characterized by higher intakes of vegetables, legumes, fruits, poultry and fish, was not associated with T2D risk, but the DP with “mainly unhealthy” FGs, characterized by red meat, processed meat, high-fat dairy products, eggs, refined grains and French fries, was associated with a higher T2D risk. The effect size for all the significant associations was relatively modest with IRRs being 1.10 per 1 SD increased DP score or less.

Previous studies have shown differences in risk associations between DPs and T2D in U.S. cohorts and the European EPIC-InterAct study, although this was restricted to a priori DPs like the Dietary Approaches to Stop Hypertension (DASH) diet, the Alternative Healthy Eating Index (AHEI) or reduced rank regression-derived DPs [1, 43]. Given the strong heterogeneity in the composition of exploratory DPs already in the European context, this underlines the importance of investigating if population-specific DP–T2D associations can be replicated across diverse populations, where even higher heterogeneity is expected. To our knowledge, this is the first study to investigate if associations of exploratory DPs with T2D risk can be replicated across cohorts from multiple regions across the world.

We have previously investigated the generalizability of exploratory DPs associations with T2D in EPIC-InterAct, a European-wide cohort study [13]. In this analysis, three DPs identified in country-specific analyses were associated with T2D. However, only one DP was consistently associated with T2D risk across the included European cohorts (pooled IRR per 1 SD: 1.12, 95% CI 1.04–1.20). This DP was characterized by high intakes of processed meat, potatoes (including French fries), vegetable oils, sugar, cake and cookies, and tea. Besides the EPIC-InterAct study, we are not aware of any further systematic replication of associations of exploratory DPs and T2D. Also, the EPIC-InterAct study did not attempt to replicate T2D-associated DPs identified in other cohorts than EPIC-InterAct, which has been our current major aim.

We were able to replicate associations with higher T2D risk for five of seven investigated UDPs. These five UDPs (UDP3-7) share red meat, processed meat, French fries and refined grains (comprising refined grain bread and refined grain breakfast cereals) as component FGs. Also eggs and high-fat dairy products were component FGs of three out of these five DPs. These FGs are identical to those which we used to construct one DP based on commonly shared “mainly unhealthy” FGs of published DPs [1]. Consequently, this pattern was also associated with a higher T2D risk in our meta-analysis: we observed a pooled IRR of 1.08 per 1 SD, 95% CI 1.05–1.11 for the primary outcome definition, being slightly stronger than the risk estimates for most of the UDPs, which ranged between pooled IRRs of 1.04 for the UDP5 by Yu et al. [7] and for UDP7 by Schoenaker et al. [9] to 1.07 for the UDP4 identified by Erber et al. [6]. However, an even higher risk estimate was found for UDP3 (IRR of 1.10 per 1 SD, 95% CI 1.06–1.15), which had been observed in the Melbourne Collaborative Cohort Study to be associated with higher risk of T2D [5]. This DP was not only characterized by red and processed meat, eggs, French fries, refined grains, but also by fish, poultry and rice. We noted that the DPs associated with higher risk in our meta-analyses had only potatoes (including French fries) and processed meat in common with the DP identified to be associated in the EPIC-InterAct study [13]. To gain insight into the role of individual FGs for pattern associations, we conducted a sensitivity analysis on the UDP3-T2D association by excluding individual FGs one at a time. Particularly the exclusion of refined grains led to an attenuation of the risk estimate from IRR of 1.10 to 1.05 for the primary outcome. Still, other components seemed to contribute to the associations and we interpret the synergy of these component FGs in this pattern as driving the association with T2D. The UDPs which were identified as being associated with a higher risk of T2D did not only show overlaps but also differences in component FGs. For example, butter (UDP4), sugar and confectionary and offals (UDP5) or pizza (UDP6, UDP7) were pattern-specific components besides the commonly shared FGs. Two of the UDPs (UDP5, UDP6) additionally shared the FG sugar-sweetened beverages. This food group was also a component in 4 out of 5 previously identified reduced rank regression-patterns, which were associated with higher T2D risk [14, 44,45,46] and evidence from a systematic literature review suggests 13% risk increase for T2D per one serving (250 mL/day), even after adjustment for BMI [47]. The UDP6 was furthermore characterized by the negatively weighted FGs cakes & cookies, legumes, vegetables, fruits and whole grains. However, after exclusion of these FGs due to the use of the cut-off FL ≥ 0.4, the IRR was only marginally changed.

None of the HDPs, either individual DPs described by single studies or the DP defined by commonly shared “mainly healthy” FGs of investigated patterns, were inversely associated with T2D risk in our meta-analyses. This is generally in line with evidence for single FGs being components of such DPs. For instance, vegetables, fruits, legumes, poultry and fish have not been clearly identified to relate to lower T2D risk in cohort studies [48]. In contrast to the original observation from the Finnish Mobile Clinic Health Examination Survey [4], we observed the HDP1 being associated with a higher risk of T2D. Red meat and eggs—frequent components of UDPs—were also contributing components of this pattern; thus, the direction of association in our analysis could potentially be driven by these two components. While a higher T2D risk of red meat is well documented [48], the role of egg consumption remains unclear [49]. Differences how specific foods are prepared and/or consumed together across populations may explain their association with healthy or unhealthy patterns. Furthermore, if a food group like fish is the main animal protein source in a population, detrimental components like methylmercury could play a more important role leading to health detrimental effects than in a population, where these components play a minor role due to less intake [50].

Besides the components of the investigated DPs, it is relevant to discuss overall methodological limitations. To enable the meta-analytical investigation of the DPs across so many different cohorts in the first place, we harmonized the cohort specific food items into a number of food groups. This inherits the problem of summarizing different numbers of food items into one food group, depending on the original dietary assessment. Hence, the difference in median intake of certain food groups between the cohorts could be due to real dietary intake differences in the populations or due to a higher extent of inquired food items. Furthermore, the condensing of food items into food groups led to a lack of granularity. Hence, potential differences in the association with T2D of specific food items, e.g. green leafy vegetables [51], could not be distinguished from other food items within this food group. Another methodological limitation could be the lack of detail about preparation methods, e.g. frying, in the dietary assessment of most of the participating cohorts. Hence, this may have led to an underestimation of the association for the UDP3, which related to each of fried fish, poultry and rice in the original study by Hodge et al. [5], while we could only consider overall intake of fish, poultry and rice in our study. A distinction between French fries and potatoes (non-fried) was also not possible in all participating cohorts. However, a recent meta-analysis investigated the association of potatoes with T2D risk and distinguished between French fries and boiled/baked/mashed potatoes and both types of potato culinary preparations were associated with a higher T2D risk, although to a higher extent for 150 g/day intake of French fries (RR of 1.66, 95% CI 1.43–1.94) compared to 150 g/day intake of boiled potatoes (RR of 1.09, 95% CI 1.01–1.18) [52]. Hence, we would still expect the risk estimates to point to a similar direction. Besides the food items, a common set of important and well-established confounders had to be harmonized across the cohorts. The set was selected based on those confounders, which were reported in the original publications of DPs and based on the availability of confounders in the participating InterConnect cohorts. Clearly, due to the harmonization approach and the technical setup for federated data analysis, it was not possible to account for all potential confounders, either being generally important (e.g. family history of diabetes) or being relevant for some specific study populations (e.g. ethnicity). Still, the consideration of a harmonized confounder set could be seen as strength of this study. Alongside the exposure and covariates, the outcome definitions needed also harmonization attempts. Due to different definitions of T2D as outcome in the participating cohorts, we have applied two different outcome definitions (primary, secondary). To assess if large differences in the number of T2D cases in some cohorts due to the definitions affect the associations, we conducted a sensitivity analysis. We compared the IRR for subgroup analyses of cohorts with a large (> 40%) to small (≤ 40%) difference and did observe slightly attenuated associations for all UDPs (data not shown). This indicated that a stricter outcome definition (“primary outcome”) resulted in slightly stronger associations.

Furthermore, the DPs were replicated in the different cohorts by using a simplification process which restricts the DP score calculations to those FGs with high FL and ignores differences in FL between FGs [17]. However, many original DPs contained only very few FGs with relative high FL (≥ 0.4). So, for instance, the simplified UDP3 resulted in red meat as the only FG and hence lost the complex pattern structure. Therefore, we decided to use FGs in the simplified pattern with FL ≥ 0.2 as the main analysis. The simplification ignores relative differences in contributions of FGs to DPs (reflected by differences in FLs), however, it supports interpretation of DPs in terms of FG intake [17]. While the approach has been successfully applied to replicate other data-driven pattern associations [14, 43], we cannot rule out that the relative loss in precision in DP score calculation has influenced the success of pattern-T2D association replications in our study.

We observed moderate to strong heterogeneity of associations across cohorts, with I² values ranging from 49% (UDP4) to 85% (UDP3). Heterogeneity between studies may have different explanations. The condensation of foods into harmonized FGs in the cohorts may have led to the inclusion of heterogeneous food items due to strong culinary differences between populations, but also due to different extent of inquired food items depending on the dietary assessment instrument. Another explanation for heterogeneity could have been the inclusion of cohorts with a short follow-up time, introducing the bias of reverse causation. Especially for HDPs, participants with a high risk at developing T2D could have changed their dietary habits by eating more health promoting food groups, but still developed the disease. However, this could not be confirmed by the results of our meta-regression on several characteristics of the cohorts (region, follow-up time, age, BMI). Here, the follow-up time explained only a considerable proportion of heterogeneity for two UDPs (UDP5, UDP6). Overall, the magnitude of the pooled risk estimates was much smaller compared to the original studies. However, comparability is constrained, since the risk estimates are given per 1 SD increase and SD is highly dependent on the population distribution of the respective DPs. Nevertheless, we were restricted to the calculation of analyses assuming a linear association between the DPs and T2D, due to the federated approach and the solutions, which could be realised with DataSHIELD. Hence, generalizable conclusions based solely on the magnitude of risk estimates from the meta-analyses should be done with caution and no quantitative recommendations can be deduced for public health guidance. Therefore, we mainly base our conclusions on the consistency of direction of associations: in the meta-analyses with significant pooled risk estimates, the majority of included cohorts pointed also towards a higher risk. Another limitation was the standardization of FGs for DP score calculation based on the distribution of FG intake in the respective cohorts. This could be a problem, if food intake distributions differ extensively between those cohorts compared to the study population where a DP had previously been reported from and hence, may jeopardize attempts to replicate associations of DPs with disease risk. However, two main reasons were pivotal for this approach. On the one hand, the information on the intake distribution was not provided in most original publications, but rather the correlation structure as a basis for the exploratory derivation of DPs. On the other hand, even if this information would be provided by the original publications, this would result in more limitations: In most studies, non- or semi-quantitative dietary assessment instrument were applied and hence, the reported intake distributions did not provide a valid estimation of absolute intakes. Furthermore, dietary assessment instruments per se differed between the cohorts and nothing is known about their comparability in estimating food intake. Another limitation of this study was the high exclusion rate of 46.9%. Hence, a potential selection bias due to missing follow-up time, covariates or food intake data could not be ruled out.

Conclusion

To our knowledge, this is the first study replicating population-specific associations of exploratory DPs with T2D risk across a large number of cohort studies from different continents. Our meta-analyses of harmonized individual-level data from various cohorts revealed a higher T2D risk for several DPs characterized by higher intake of red meat, processed meat, French fries and refined grains (comprising refined grain bread and refined grain breakfast cereals). These results confirm former study-specific results in a generalizable context and therefore enrich evidence for DPs related to higher T2D risk. However, none of the inverse associations of investigated HDPs could be confirmed across different cohorts.

Availability of data and material

Due to the federated and collaborated design of this InterConnect study, data and material cannot be made accessible. Individual study meta-data may be available upon request from the individual study PI’s.

Code availability

The analysis code can be provided on request.

References

Jannasch F, Kröger J, Schulze MB (2017) Dietary patterns and type 2 diabetes: a systematic literature review and meta-analysis of prospective studies. J Nutr 147(6):1174–1182. https://doi.org/10.3945/jn.116.242552
Article CAS PubMed Google Scholar
Esposito K, Maiorino MI, Bellastella G, Chiodini P, Panagiotakos D, Giugliano D (2015) A journey into a Mediterranean diet and type 2 diabetes: a systematic review with meta-analyses. BMJ Open 5(8):e008222. https://doi.org/10.1136/bmjopen-2015-008222
Article PubMed PubMed Central Google Scholar
van Dam RM, Rimm EB, Willett WC, Stampfer MJ, Hu FB (2002) Dietary patterns and risk for type 2 diabetes mellitus in US men. Ann Intern Med 136(3):201–209. https://doi.org/10.7326/0003-4819-136-3-200202050-00008
Article PubMed Google Scholar
Montonen J, Knekt P, Härkänen T, Järvinen R, Heliövaara M, Aromaa A, Reunanen A (2005) Dietary patterns and the incidence of type 2 diabetes. Am J Epidemiol 161(3):219–227. https://doi.org/10.1093/aje/kwi039
Article PubMed Google Scholar
Hodge AM, English DR, O’Dea K, Giles GG (2007) Dietary patterns and diabetes incidence in the Melbourne Collaborative Cohort Study. Am J Epidemiol 165(6):603–610. https://doi.org/10.1093/aje/kwk061
Article PubMed Google Scholar
Erber E, Hopping BN, Grandinetti A, Park SY, Kolonel LN, Maskarinec G (2010) Dietary patterns and risk for diabetes: the multiethnic cohort. Diabetes Care 33(3):532–538. https://doi.org/10.2337/dc09-1621
Article PubMed Google Scholar
Yu R, Woo J, Chan R, Sham A, Ho S, Tso A, Cheung B, Lam TH, Lam K (2011) Relationship between dietary intake and the development of type 2 diabetes in a Chinese population: the Hong Kong Dietary Survey. Public Health Nutr 14(7):1133–1141. https://doi.org/10.1017/s136898001100053x
Article PubMed Google Scholar
Bauer F, Beulens JW, van der Daphne A, Wijmenga C, Grobbee DE, Spijkerman AM, van der Schouw YT, Onland-Moret NC (2013) Dietary patterns and the risk of type 2 diabetes in overweight and obese individuals. Eur J Nutr 52(3):1127–1134. https://doi.org/10.1007/s00394-012-0423-4
Article CAS PubMed Google Scholar
Schoenaker DA, Dobson AJ, Soedamah-Muthu SS, Mishra GD (2013) Factor analysis is more appropriate to identify overall dietary patterns associated with diabetes when compared with Treelet transform analysis. J Nutr 143(3):392–398. https://doi.org/10.3945/jn.112.169011
Article CAS PubMed Google Scholar
Fung TT, Schulze M, Manson JE, Willett WC, Hu FB (2004) Dietary patterns, meat intake, and the risk of type 2 diabetes in women. Arch Intern Med 164(20):2235–2240. https://doi.org/10.1001/archinte.164.20.2235
Article PubMed Google Scholar
Odegaard AO, Koh WP, Butler LM, Duval S, Gross MD, Yu MC, Yuan JM, Pereira MA (2011) Dietary patterns and incident type 2 diabetes in Chinese men and women: the Singapore Chinese health study. Diabetes Care 34(4):880–885. https://doi.org/10.2337/dc10-2350
Article PubMed PubMed Central Google Scholar
McEvoy CT, Cardwell CR, Woodside JV, Young IS, Hunter SJ, McKinley MC (2014) A posteriori dietary patterns are related to risk of type 2 diabetes: findings from a systematic review and meta-analysis. J Acad Nutr Diet 114(11):1759–1775. https://doi.org/10.1016/j.jand.2014.05.001
Article PubMed Google Scholar
Jannasch F, Kroger J, Agnoli C, Barricarte A, Boeing H, Cayssials V, Colorado-Yohar S, Dahm CC, Dow C, Fagherazzi G, Franks PW, Freisling H, Gunter MJ, Kerrison ND, Key TJ, Khaw KT, Kuhn T, Kyro C, Mancini FR, Mokoroa O, Nilsson P, Overvad K, Palli D, Panico S, Garcia JRQ, Rolandsson O, Sacerdote C, Sanchez MJ, Sahrai MS, Schubel R, Sluijs I, Spijkerman AMW, Tjonneland A, Tong TYN, Tumino R, Riboli E, Langenberg C, Sharp SJ, Forouhi NG, Schulze MB, Wareham NJ (2019) Generalizability of a diabetes-associated country-specific exploratory dietary pattern is feasible across European populations. J Nutr 149(6):1047–1055. https://doi.org/10.1093/jn/nxz031
Article PubMed PubMed Central Google Scholar
Schulze MB, Hoffmann K, Manson JE, Willett WC, Meigs JB, Weikert C, Heidemann C, Colditz GA, Hu FB (2005) Dietary pattern, inflammation, and incidence of type 2 diabetes in women. Am J Clin Nutr 82(3):675–684. https://doi.org/10.1093/ajcn.82.3.675 (quiz 714–675)
Article CAS PubMed Google Scholar
Imamura F, Lichtenstein AH, Dallal GE, Meigs JB, Jacques PF (2009) Generalizability of dietary patterns associated with incidence of type 2 diabetes mellitus. Am J Clin Nutr 90(4):1075–1083. https://doi.org/10.3945/ajcn.2009.28009
Article CAS PubMed PubMed Central Google Scholar
Adherence to predefined dietary patterns and incident type 2 diabetes in European populations: EPIC-InterAct Study (2014). Diabetologia 57 (2):321–333. https://doi.org/10.1007/s00125-013-3092-9
Schulze MB, Hoffmann K, Kroke A, Boeing H (2003) An approach to construct simplified measures of dietary patterns from exploratory factor analysis. Br J Nutr 89(3):409–419. https://doi.org/10.1079/bjn2002778
Article CAS PubMed Google Scholar
The ARIC investigators (1989) The atherosclerosis risk in communities (ARIC) study: design and objectives. Am J Epidemiol 129(4):687–702
Article Google Scholar
Friedman GD, Cutter GR, Donahue RP, Hughes GH, Hulley SB, Jacobs DR Jr, Liu K, Savage PJ (1988) CARDIA: study design, recruitment, and some characteristics of the examined subjects. J Clin Epidemiol 41(11):1105–1116. https://doi.org/10.1016/0895-4356(88)90080-7
Article CAS PubMed Google Scholar
Aquino EM, Barreto SM, Bensenor IM, Carvalho MS, Chor D, Duncan BB, Lotufo PA, Mill JG, Molina Mdel C, Mota EL, Passos VM, Schmidt MI, Szklo M (2012) Brazilian Longitudinal Study of Adult Health (ELSA-Brasil): objectives and design. Am J Epidemiol 175(4):315–324. https://doi.org/10.1093/aje/kwr294
Article PubMed Google Scholar
Lajous M, Ortiz-Panozo E, Monge A, Santoyo-Vistrain R, García-Anaya A, Yunes-Díaz E, Rice MS, Blanco M, Hernández-Ávila M, Willett WC, Romieu I, López-Ridaura R (2017) Cohort Profile: the Mexican Teachers’ Cohort (MTC). Int J Epidemiol 46(2):e10. https://doi.org/10.1093/ije/dyv123
Article PubMed Google Scholar
Garcia-Palmieri MR, Sorlie P, Tillotson J, Costas R Jr, Cordero E, Rodriguez M (1980) Relationship of dietary intake to subsequent coronary heart disease incidence: the Puerto Rico Heart Health Program. Am J Clin Nutr 33(8):1818–1827. https://doi.org/10.1093/ajcn/33.8.1818
Article CAS PubMed Google Scholar
The Women’s Health Initiative Study Group (1998) Design of the Women’s Health Initiative clinical trial and observational study. Control Clin Trials 19(1):61–109. https://doi.org/10.1016/s0197-2456(97)00078-0
Article Google Scholar
Pourshams A, Khademi H, Malekshah AF, Islami F, Nouraei M, Sadjadi AR, Jafari E, Rakhshani N, Salahi R, Semnani S, Kamangar F, Abnet CC, Ponder B, Day N, Dawsey SM, Boffetta P, Malekzadeh R (2010) Cohort profile: the Golestan cohort study—a prospective study of oesophageal cancer in northern Iran. Int J Epidemiol 39(1):52–59. https://doi.org/10.1093/ije/dyp161
Article PubMed Google Scholar
Firmann M, Mayor V, Vidal PM, Bochud M, Pécoud A, Hayoz D, Paccaud F, Preisig M, Song KS, Yuan X, Danoff TM, Stirnadel HA, Waterworth D, Mooser V, Waeber G, Vollenweider P (2008) The CoLaus study: a population-based study to investigate the epidemiology and genetic determinants of cardiovascular risk factors and metabolic syndrome. BMC Cardiovasc Disord 8:6. https://doi.org/10.1186/1471-2261-8-6
Article CAS PubMed PubMed Central Google Scholar
Harris H, Hakansson N, Olofsson C, Julin B, Akesson A, Wolk A (2013) The Swedish mammography cohort and the cohort of Swedish men: study design and characteristics of 2 population-based longitudinal cohorts. OA Epidemiology. https://doi.org/10.13172/2053-079X-1-2-943
Article Google Scholar
Forouhi NG, Wareham NJ (2014) The EPIC-InterAct study: a study of the interplay between genetic and lifestyle behavioral factors on the risk of type 2 diabetes in european populations. Curr Nutr Rep 3(4):355–363. https://doi.org/10.1007/s13668-014-0098-y
Article CAS PubMed PubMed Central Google Scholar
Martínez-González MA (2006) The SUN cohort study (Seguimiento University of Navarra). Public Health Nutr 9(1a):127–131. https://doi.org/10.1079/phn2005935
Article PubMed Google Scholar
Marmot M, Brunner E (2005) Cohort profile: the Whitehall II study. Int J Epidemiol 34(2):251–256. https://doi.org/10.1093/ije/dyh372
Article PubMed Google Scholar
Dunstan DW, Zimmet PZ, Welborn TA, Cameron AJ, Shaw J, de Courten M, Jolley D, McCarty DJ (2002) The Australian Diabetes, obesity and lifestyle study (AusDiab)–methods and response rates. Diabetes Res Clin Pract 57(2):119–129. https://doi.org/10.1016/s0168-8227(02)00025-6
Article PubMed Google Scholar
Kim Y, Han BG (2017) Cohort Profile: The Korean Genome and Epidemiology Study (KoGES) Consortium. Int J Epidemiol 46(4):1350. https://doi.org/10.1093/ije/dyx105
Article CAS PubMed PubMed Central Google Scholar
Kolonel LN, Henderson BE, Hankin JH, Nomura AM, Wilkens LR, Pike MC, Stram DO, Monroe KR, Earle ME, Nagamine FS (2000) A multiethnic cohort in Hawaii and Los Angeles: baseline characteristics. Am J Epidemiol 151(4):346–357. https://doi.org/10.1093/oxfordjournals.aje.a010213
Article CAS PubMed Google Scholar
Bild DE, Bluemke DA, Burke GL, Detrano R, Diez Roux AV, Folsom AR, Greenland P, Jacob DR Jr, Kronmal R, Liu K, Nelson JC, O’Leary D, Saad MF, Shea S, Szklo M, Tracy RP (2002) Multi-Ethnic Study of Atherosclerosis: objectives and design. Am J Epidemiol 156(9):871–881. https://doi.org/10.1093/aje/kwf113
Article PubMed Google Scholar
Pastorino S, Bishop T, Crozier SR, Granström C, Kordas K, Küpers LK, O’Brien EC, Polanska K, Sauder KA, Zafarmand MH, Wilson RC, Agyemang C, Burton PR, Cooper C, Corpeleijn E, Dabelea D, Hanke W, Inskip HM, McAuliffe FM, Olsen SF, Vrijkotte TG, Brage S, Kennedy A, O’Gorman D, Scherer P, Wijndaele K, Wareham NJ, Desoye G, Ong KK (2019) Associations between maternal physical activity in early and late pregnancy and offspring birth size: remote federated individual level meta-analysis from eight cohort studies. BJOG 126(4):459–470. https://doi.org/10.1111/1471-0528.15476
Article CAS PubMed Google Scholar
Gaye A, Marcon Y, Isaeva J, LaFlamme P, Turner A, Jones EM, Minion J, Boyd AW, Newby CJ, Nuotio ML, Wilson R, Butters O, Murtagh B, Demir I, Doiron D, Giepmans L, Wallace SE, Budin-Ljøsne I, Oliver Schmidt C, Boffetta P, Boniol M, Bota M, Carter KW, deKlerk N, Dibben C, Francis RW, Hiekkalinna T, Hveem K, Kvaløy K, Millar S, Perry IJ, Peters A, Phillips CM, Popham F, Raab G, Reischl E, Sheehan N, Waldenberger M, Perola M, van den Heuvel E, Macleod J, Knoppers BM, Stolk RP, Fortier I, Harris JR, Woffenbuttel BH, Murtagh MJ, Ferretti V, Burton PR (2014) DataSHIELD: taking the analysis to the data, not the data to the analysis. Int J Epidemiol 43(6):1929–1944. https://doi.org/10.1093/ije/dyu188
Article PubMed PubMed Central Google Scholar
Pearce M, Fanidi A, Bishop TRP, Sharp SJ, Imamura F, Dietrich S, Akbaraly T, Bes-Rastrollo M, Beulens JWJ, Byberg L, Canhada S, Molina MdCB, Chen Z, Cortes-Valencia A, Du H, Duncan BB, Härkänen T, Hashemian M, Kim J, Kim MK, Kim Y, Knekt P, Kromhout D, Lassale C, Ridaura RL, Magliano DJ, Malekzadeh R, Marques-Vidal P, Martínez-González MÁ, O’Donoghue G, O’Gorman D, Shaw JE, Soedamah-Muthu SS, Stern D, Wolk A, Woo HW, Consortium EP-I, Wareham NJ, Forouhi NG (2021) Associations of total legume, pulse, and soy consumption with incident type 2 diabetes: federated meta-analysis of 27 studies from diverse world regions. J Nutr. https://doi.org/10.1093/jn/nxaa447
Article PubMed PubMed Central Google Scholar
Buijsse B, Feskens EJ, Kok FJ, Kromhout D (2006) Cocoa intake, blood pressure, and cardiovascular mortality: the Zutphen Elderly Study. Arch Intern Med 166(4):411–417. https://doi.org/10.1001/archinte.166.4.411
Article PubMed Google Scholar
Nettleton JA, Steffen LM, Ni H, Liu K, Jacobs DR Jr (2008) Dietary patterns and risk of incident type 2 diabetes in the Multi-Ethnic Study of Atherosclerosis (MESA). Diabetes Care 31(9):1777–1782. https://doi.org/10.2337/dc08-0760
Article PubMed PubMed Central Google Scholar
Hartley L, Igbinedion E, Holmes J, Flowers N, Thorogood M, Clarke A, Stranges S, Hooper L, Rees K (2013) Increased consumption of fruit and vegetables for the primary prevention of cardiovascular diseases. Cochrane Database Syst Rev 6:CD009874. https://doi.org/10.1002/14651858.CD009874.pub2
Article Google Scholar
Pearce M, Fanidi A, Bishop TRP, Sharp SJ, Imamura F, Dietrich S, Akbaraly T, Bes-Rastrollo M, Beulens JWJ, Byberg L, Canhada S, Molina M, Chen Z, Cortes-Valencia A, Du H, Duncan BB, Harkanen T, Hashemian M, Kim J, Kim MK, Kim Y, Knekt P, Kromhout D, Lassale C, Ridaura RL, Magliano DJ, Malekzadeh R, Marques-Vidal P, Martinez-Gonzalez MA, O’Donoghue G, O’Gorman D, Shaw JE, Soedamah-Muthu SS, Stern D, Wolk A, Woo HW, Consortium EP-I, Wareham NJ, Forouhi NG (2021) Associations of total legume, pulse, and soy consumption with incident type 2 diabetes: federated meta-analysis of 27 studies from diverse world regions. J Nutr. https://doi.org/10.1093/jn/nxaa447
Article PubMed PubMed Central Google Scholar
Selmer R (1990) A comparison of Poisson regression models fitted to multiway summary tables and Cox’s survival model using data from a blood pressure screening in the city of Bergen Norway. Stat Med 9(10):1157–1165. https://doi.org/10.1002/sim.4780091005
Article CAS PubMed Google Scholar
Onland-Moret NC, van der Daphne A, van der Schouw YT, Buschers W, Elias SG, van Gils CH, Koerselman J, Roest M, Grobbee DE, Peeters PH (2007) Analysis of case-cohort data: a comparison of different methods. J Clin Epidemiol 60(4):350–355. https://doi.org/10.1016/j.jclinepi.2006.06.022
Article PubMed Google Scholar
InterAct C (2014) Adherence to predefined dietary patterns and incident type 2 diabetes in European populations: EPIC-InterAct Study. Diabetologia 57(2):321–333. https://doi.org/10.1007/s00125-013-3092-9
Article CAS Google Scholar
Heidemann C, Hoffmann K, Spranger J, Klipstein-Grobusch K, Mohlig M, Pfeiffer AF, Boeing H, European Prospective Investigation into C, Nutrition–Potsdam Study C (2005) A dietary pattern protective against type 2 diabetes in the European Prospective Investigation into Cancer and Nutrition (EPIC)–Potsdam Study cohort. Diabetologia 48(6):1126–1134. https://doi.org/10.1007/s00125-005-1743-1
Article CAS PubMed Google Scholar
Hoffmann K, Schulze MB, Schienkiewitz A, Nothlings U, Boeing H (2004) Application of a new statistical method to derive dietary patterns in nutritional epidemiology. Am J Epidemiol 159(10):935–944. https://doi.org/10.1093/aje/kwh134
Article PubMed Google Scholar
McNaughton SA, Mishra GD, Brunner EJ (2008) Dietary patterns, insulin resistance, and incidence of type 2 diabetes in the Whitehall II Study. Diabetes Care 31(7):1343–1348. https://doi.org/10.2337/dc07-1946
Article PubMed PubMed Central Google Scholar
Imamura F, O’Connor L, Ye Z, Mursu J, Hayashino Y, Bhupathiraju SN, Forouhi NG (2016) Consumption of sugar sweetened beverages, artificially sweetened beverages, and fruit juice and incidence of type 2 diabetes: systematic review, meta-analysis, and estimation of population attributable fraction. Br J Sports Med 50(8):496–504. https://doi.org/10.1136/bjsports-2016-h3576rep
Article PubMed Google Scholar
Schulze MB, Martinez-Gonzalez MA, Fung TT, Lichtenstein AH, Forouhi NG (2018) Food based dietary patterns and chronic disease prevention. BMJ 361:k2396. https://doi.org/10.1136/bmj.k2396
Article PubMed PubMed Central Google Scholar
Geiker NRW, Larsen ML, Dyerberg J, Stender S, Astrup A (2018) Egg consumption, cardiovascular diseases and type 2 diabetes. Eur J Clin Nutr 72(1):44–56. https://doi.org/10.1038/ejcn.2017.153
Article CAS PubMed Google Scholar
Mozaffarian D, Rimm EB (2006) Fish intake, contaminants, and human health: evaluating the risks and the benefits. JAMA 296(15):1885–1899. https://doi.org/10.1001/jama.296.15.1885
Article CAS PubMed Google Scholar
Neuenschwander M, Ballon A, Weber KS, Norat T, Aune D, Schwingshackl L, Schlesinger S (2019) Role of diet in type 2 diabetes incidence: umbrella review of meta-analyses of prospective observational studies. BMJ 366:l2368. https://doi.org/10.1136/bmj.l2368
Article PubMed PubMed Central Google Scholar
Schwingshackl L, Schwedhelm C, Hoffmann G, Boeing H (2019) Potatoes and risk of chronic disease: a systematic review and dose-response meta-analysis. Eur J Nutr 58(6):2243–2251. https://doi.org/10.1007/s00394-018-1774-2
Article PubMed Google Scholar
Morimoto A, Ohno Y, Tatsumi Y, Mizuno S, Watanabe S (2012) Effects of healthy dietary pattern and other lifestyle factors on incidence of diabetes in a rural Japanese population. Asia Pac J Clin Nutr 21(4):601–608
CAS PubMed Google Scholar

Download references

Acknowledgements

We would like to thank the participants, principal investigators, and study teams of the individual cohorts included in this collaboration. The work presented herein was made possible using the OBiBa suite (http://www.obiba.org), a software suite developed by Maelstrom Research (www.maelstrom-research.org), and DataSHIELD (http://www.datashield.ac.uk), a software suite developed by the Data to Knowledge (D2K) Research Group. We thank EPIC-InterAct collaborators and Nicola Kerrison at the MRC Epidemiology Unit for assistance relating to the EPIC-InterAct dataset. We also thank the AusDiab Steering Committee for providing data from the AusDiab study. Moreover, we thank the NIH Biologic Specimen and Data Repository Information Coordinating Center and the ARIC, CARDIA, MESA, PRHHP and WHI OS study for providing data for this study.

Funding

Open Access funding enabled and organized by Projekt DEAL. The InterConnect project is funded by the European Union’s Seventh Framework Programme (grant number 602068). FJ and MBS acknowledge funding from the German Federal Ministry of Education and Research and the State of Brandenburg to the German Center for Diabetes Research (DZD) (82DZD00302). Furthermore, this work was supported by the NutriAct – Competence Cluster Nutrition Research Berlin-Potsdam funded by the German Federal Ministry of Education and Research (FKZ: 01EA1806A). This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) - 491394008. This work was done as part of the NFDI4Health Consortium (www.nfdi4health.de). We gratefully acknowledge the financial support of the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – NFDI 13/1. NJW and NGF acknowledge funding from the Medical Research Council Epidemiology Unit (MC_UU_00006/1and MC_UU_00006/3) and NIHR Biomedical Research Centre Cambridge: Nutrition, Diet, and Lifestyle Research Theme (IS-BRC-1215-20014). NGF is an NIHR Senior Investigator (G111539). TB acknowledges funding from EUCAN-Connect under the European Union’s Horizon 2020 research and innovation programme (grant agreement number 824989). The InterAct project was funded by the EU FP6 programme (grant number LSHM_CT_2006_037197). MBR and MAMG acknowledge that the SUN Project has received funding from the Spanish Government-Instituto de Salud Carlos III, and the European Regional Development Fund (FEDER) (RD 06/0045, CIBER-OBN, Grants PI10/02658, PI10/02293, PI13/00615, PI14/01668, PI14/01798, PI14/01764, PI17/01795, PI20/00564 and G03/140), PNSD-2020/021, the Navarra Regional Government (27/2011, 45/2011, 122/2014), PNSD 2020/2021, and the University of Navarra. LB and AW acknowledge that the COSM and SMC are part of the Swedish Infrastructure for Medical Population-Based Life-Course and Environmental Research, SIMPLER. SIMPLER receives funding through the Swedish Research Council (grant no 2017–00644). RM acknowledges funding from the Tehran University of Medical Sciences (grant number: 81/15), Cancer Research UK (grant number: C20/A5860), the Intramural Research Program of the U.S. National Cancer Institute, NIH, and the International Agency for Research on Cancer. SMAM and VCL acknowledge that ELSA-Brasil was supported by the Brazilian Ministry of Health (Department of Science and Technology) and Ministry of Science, Technology and Innovation (Financiadora de Estudos e Projetos (FINEP); grant numbers 01 06 0010.00, 01 06 0212.00, 01 06 0300.00, 01 06 0278.00, 01 06 0115.00 and 01 06 0071.00) and the National Council for Scientific and Technological Development (CNPq). JK acknowledges funding from the Korea Centers for Disease Control and Prevention, Ministry for Health and Welfare, Republic of Korea (4845-301 and 4851-302), and the Collaborative Genome Program for Fostering New Post-Genome Industry of the National Research Foundation (NRF) funded by the Ministry of Science and ICT (NRF-2017M3C9A6047623). DS and ML acknowledge that this work of MTC was supported by the American Institute for Cancer Research (grant number 05B047) and the Consejo Nacional de Ciencia y Tecnología (CONACyT) (grant number S0008-2009-1: 000000000115312). LLM acknowledges funding from the US National Institutes of Health grant U01 CA164973; SSSM received the Wiebe Visser International Dairy Nutrition Prize and has received recent research funding (2019) for epidemiological studies on dairy products and cardiometabolic diseases from the Dutch Dairy Association and the Danish Dairy Research Foundation. PMV and PV acknowledge funding from GlaxoSmithKline, the Faculty of Biology and Medicine of Lausanne, and the Swiss National Science Foundation (grants 33CSCO-122661, 33CS30-139468, 33CS30-148401 and 33CS30_177535/1). MK and the Whitehall II study were supported by the UK Medical Research Council (MRCMR/R024227/1), the Wellcome Trust (221854/Z/20/Z) and the US National Institutes of Health (NIH, RF1AG062553, R01AG056477), during the conduct of the study. The funding sources did not participate in the design or conduct of the study; collection, management, analysis, or interpretation of the data; or preparation, review, or approval of the manuscript.

Author information

Authors and Affiliations

Department of Molecular Epidemiology, German Institute of Human Nutrition Potsdam-Rehbruecke, Nuthetal, Germany
Franziska Jannasch, Stefan Dietrich & Matthias B. Schulze
NutriAct Competence Cluster Nutrition Research Potsdam-Berlin, Nuthetal, Germany
Franziska Jannasch & Matthias B. Schulze
German Center for Diabetes Research, Munich-Neuherberg, Germany
Franziska Jannasch & Matthias B. Schulze
Department of Food Safety, German Federal Institute for Risk Assessment (BfR), Berlin, Germany
Stefan Dietrich
MRC Epidemiology Unit, School of Clinical Medicine, Institute of Metabolic Science, University of Cambridge, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK
Tom R. P. Bishop, Matthew Pearce, Anouar Fanidi, Nicholas J. Wareham & Nita G. Forouhi
School of Public Health, Physiotherapy and Sports Science, University College Dublin, Dublin, Ireland
Gráinne O’Donoghue
School of Health and Human Performance, Dublin City University, Dublin, Ireland
Donal O’Gorman
Department of Medicine, Internal Medicine, Lausanne University Hospital and University of Lausanne, Office BH10-642, Rue du Bugnon 46, 1011, Lausanne, Switzerland
Pedro Marques-Vidal & Peter Vollenweider
Department of Preventive Medicine and Public Health, University of Navarra, Pamplona, Spain
Maira Bes-Rastrollo & Miguel Ángel Martínez-González
CIBERobn, Instituto de Salud Carlos III, Madrid, Spain
Maira Bes-Rastrollo & Miguel Ángel Martínez-González
Navarra Institute for Health Research (IdiSNA), Pamplona, Spain
Maira Bes-Rastrollo & Miguel Ángel Martínez-González
Department of Surgical Sciences, Medical Epidemiology, Uppsala University, Uppsala, Sweden
Liisa Byberg & Alicja Wolk
Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
Alicja Wolk
Digestive Disease Research Center, Digestive Disease Research Institute, Tehran University of Medical Sciences, Tehran, Iran
Maryam Hashemian & Reza Malekzadeh
Biology Department, School of Arts and Sciences, Utica College, Utica, NY, USA
Maryam Hashemian
Liver and Pancreatobiliary Diseases Research Center, Digestive Diseases Research Institute, Tehran University of Medical Sciences, Tehran, Iran
Hossein Poustchi
Faculdade de Medicina, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
Vivian C. Luft
Institute of Collective Health, Federal University of Bahia, Salvador, Bahia, Brazil
Sheila M. Alvim de Matos
Department of Preventive Medicine, College of Medicine, Hanyang University, Seoul, South Korea
Jihye Kim & Mi Kyung Kim
Division of Health and Nutrition Survey and Analysis, Korea Disease Control Prevention Agency, Seoul, South Korea
Yeonjung Kim
CONACyT-Center for Research on Population Health, National Institute of Public Health, Cuernavaca, Morelos, Mexico
Dalia Stern & Martin Lajous
Baker Heart and Diabetes Institute, 75 Commercial Road, Melbourne, VIC, 3004, Australia
Dianna J. Magliano & Jonathan E. Shaw
Inserm U 1018, Université Paris-Saclay, UVSQ, Villejuif, Maison des Sciences de l’Homme – SUD, Montpellier, France
Tasnime Akbaraly
Department of Epidemiology and Public Health, University College London, London, UK
Tasnime Akbaraly & Mika Kivimaki
University of Hawaii Cancer Center, Honolulu, HI, USA
Gertraud Maskarinec & Loïc Le Marchand
Department of Nutrition, Harvard T.H. Chan School of Public Health, 665 Huntington Avenue, Boston, MA, USA
Miguel Ángel Martínez-González
Center of Research On Psychological and Somatic Disorders (CORPS), Department of Medical and Clinical Psychology, Tilburg University, PO Box 90153, 5000 LE, Tilburg, The Netherlands
Sabita S. Soedamah-Muthu
Institute for Food, Nutrition and Health, University of Reading, Reading, RG6 6AR, UK
Sabita S. Soedamah-Muthu

Authors

Franziska Jannasch
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Dietrich
View author publications
You can also search for this author in PubMed Google Scholar
Tom R. P. Bishop
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Pearce
View author publications
You can also search for this author in PubMed Google Scholar
Anouar Fanidi
View author publications
You can also search for this author in PubMed Google Scholar
Gráinne O’Donoghue
View author publications
You can also search for this author in PubMed Google Scholar
Donal O’Gorman
View author publications
You can also search for this author in PubMed Google Scholar
Pedro Marques-Vidal
View author publications
You can also search for this author in PubMed Google Scholar
Peter Vollenweider
View author publications
You can also search for this author in PubMed Google Scholar
Maira Bes-Rastrollo
View author publications
You can also search for this author in PubMed Google Scholar
Liisa Byberg
View author publications
You can also search for this author in PubMed Google Scholar
Alicja Wolk
View author publications
You can also search for this author in PubMed Google Scholar
Maryam Hashemian
View author publications
You can also search for this author in PubMed Google Scholar
Reza Malekzadeh
View author publications
You can also search for this author in PubMed Google Scholar
Hossein Poustchi
View author publications
You can also search for this author in PubMed Google Scholar
Vivian C. Luft
View author publications
You can also search for this author in PubMed Google Scholar
Sheila M. Alvim de Matos
View author publications
You can also search for this author in PubMed Google Scholar
Jihye Kim
View author publications
You can also search for this author in PubMed Google Scholar
Mi Kyung Kim
View author publications
You can also search for this author in PubMed Google Scholar
Yeonjung Kim
View author publications
You can also search for this author in PubMed Google Scholar
Dalia Stern
View author publications
You can also search for this author in PubMed Google Scholar
Martin Lajous
View author publications
You can also search for this author in PubMed Google Scholar
Dianna J. Magliano
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan E. Shaw
View author publications
You can also search for this author in PubMed Google Scholar
Tasnime Akbaraly
View author publications
You can also search for this author in PubMed Google Scholar
Mika Kivimaki
View author publications
You can also search for this author in PubMed Google Scholar
Gertraud Maskarinec
View author publications
You can also search for this author in PubMed Google Scholar
Loïc Le Marchand
View author publications
You can also search for this author in PubMed Google Scholar
Miguel Ángel Martínez-González
View author publications
You can also search for this author in PubMed Google Scholar
Sabita S. Soedamah-Muthu
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas J. Wareham
View author publications
You can also search for this author in PubMed Google Scholar
Nita G. Forouhi
View author publications
You can also search for this author in PubMed Google Scholar
Matthias B. Schulze
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

EPIC-InterAct Consortium

Contributions

The author’s responsibilities were as follows: MBS, NJW and NGF: designed the research; SD and AF: evaluated the meta-data, SD, AF, TRPB, MP and GOD: harmonized the InterConnect data; SD: analyzed data; SD, FJ and MBS: wrote the manuscript and have primary responsibility for final content; and all authors: interpreted the results and critically revised the article for important intellectual content, and read and approved the final manuscript. The corresponding authors attest that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted.

Corresponding author

Correspondence to Franziska Jannasch.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Ethics approval

All cohorts obtained ethical review board approval at the host institution and written informed consent from participants.

Consent to participate

All participants in the individual cohorts gave their signed informed consent at recruitment.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 7877 KB)

Supplementary file2 (XLSX 19 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jannasch, F., Dietrich, S., Bishop, T.R.P. et al. Associations between exploratory dietary patterns and incident type 2 diabetes: a federated meta-analysis of individual participant data from 25 cohort studies. Eur J Nutr 61, 3649–3667 (2022). https://doi.org/10.1007/s00394-022-02909-9

Download citation

Received: 04 August 2021
Accepted: 09 May 2022
Published: 01 June 2022
Issue Date: October 2022
DOI: https://doi.org/10.1007/s00394-022-02909-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Associations between exploratory dietary patterns and incident type 2 diabetes: a federated meta-analysis of individual participant data from 25 cohort studies

Abstract

Purpose

Methods

Results

Conclusion

Similar content being viewed by others

Metabolic syndrome and dietary patterns: a systematic review and meta-analysis of observational studies

Link of dietary patterns with metabolic syndrome: analysis of the National Health and Nutrition Examination Survey

Association between dietary patterns and prediabetes, undetected diabetes or clinically diagnosed diabetes: results from the KORA FF4 study

Introduction

Methods

Study populations

Dietary assessment and construction of dietary patterns

Ascertainment of incident T2D

Assessment of covariates

Statistical analysis

Results

Healthy dietary patterns and risk of T2D

Unhealthy dietary patterns and risk of T2D

Dietary patterns with “mainly healthy” and “mainly unhealthy” food groups and T2D risk

Sensitivity analysis of UDP 3

Discussion

Conclusion

Availability of data and material

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Consortia

EPIC-InterAct Consortium

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethics approval

Consent to participate

Supplementary Information

Supplementary file1 (DOCX 7877 KB)

Supplementary file2 (XLSX 19 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation