Background

Excessive intake of carbohydrates favoring higher glucose levels has been extensively linked to clinical outcomes including cardiometabolic diseases [1] and neurological disorders [2]. Indeed, increasing evidence in population studies supports a strong relationship between quality and quantity of carbohydrates ingested with foods and increased risk of certain types of cancers [3, 4], diabetes [5, 6] and cardiovascular diseases [7,8,9]. This is likely explained through carbohydrates’ direct implications in regulating blood glucose levels, as well as in changing postprandial hormonal and metabolic responses in humans [10]. For these reasons, increasing attention has been given in the last decades to control the quantity and quality of carbohydrates’ dietary intake in clinical practice, through specific indices.

Among these, glycaemic index (GI) and glycaemic load (GL) represent two common dietary indices increasingly used to measure and control carbohydrate intake in people affected by cardiometabolic disorders, like diabetes [11]. GI [12], generally calculated from International GI tables [13, 14], is a measure of carbohydrate quality and refers to the postprandial blood glucose increase in response to a given carbohydrate intake, when compared with a reference food (either glucose or white bread). GL represents instead an indicator of both quality and quantity of carbohydrates and is calculated by multiplying the GI of a food item with the available carbohydrate content [11]. While both higher GI and higher GL have been associated with increased disease risk [3,4,5,6,7,8,9], their relationship with potential molecular mechanisms underlying cardiometabolic dysfunction remains largely neglected.

Altered DNA methylation patterns, particularly the level of methylation (5mC) and hydroxymethylation (5hmC) in the genome, have been often associated with type 2 diabetes (T2D) and with a general oxidative stress status [15,16,17,18].

Investigations focusing on the relationship between DNA methylation patterns and dietary GI and GL mainly consist of interventional studies on maternal dietary habits, aimed at detecting methylation changes in the offspring [19]. Geraghty et al. [20] analyzed DNA methylation levels at 771,484 CpGs sites across the genome in free DNA from cord blood serum in 60 newborns involved in the ROLO study. The methylome of low GI intervention newborns was found to be significantly lower than in controls [20]. By comparing two groups of pregnant women following a reduced vs. an increased carbohydrate diet, Yan et al. observed specific placenta DNA methylation changes at genes involved in insulin regulation, namely PLIN1, CPT1B, SSTR4 and CIDEA [21]. In a recent observational study, Alick et al. [22] found out that a maternal periconceptional diet characterized by a high glycaemic loading was associated with poorer neurodevelopmental status of children, in particular anxiety-related behavior, and with an increased mean methylation level of the imprint control region of SGCE/PEG10.

In spite of these suggestive independent lines of evidence, we are not aware of any study so far investigating the relationship between global methylation patterns and glycaemic index and load of diet, assessing both measures in the same subjects. Such a design would help in (1) building a closer relationship between nutritional and genome methylation patterns and (2) identifying the potential implications of dietary habits, which are of remarkable importance for cardiometabolic patients, on novel epigenetic measures.

In the present study, we aimed at investigating the associations between both GI and GL and global DNA methylation within an Italian population cohort enrolled in the Moli-sani study. We already reported a fine-grained analysis of global DNA methylation patterns at different nutritional levels in the same cohort, identifying a direct relationship between daily intake of zinc and global DNA methylation [23]. Here, we focused on dietary glycaemic indices to build a bridge with altered epigenetic patterns potentially underlying metabolic conditions.

Results

The characteristics of the analyzed sub-cohort (N = 1080 with at least an epigenetic measure available) are summarized elsewhere [23] and in Table 1. Compared to the whole Moli-sani cohort, the population under study showed similar sex ratio (48.0% vs 48.1% men) but slightly lower age (mean (SD) age 54.9 (11.5) year vs 55.8 (12.0) years, p < 0.0001), due to the removal of prevalent CVD cases. Similarly, in the analyzed sub-cohort there was a lower prevalence of diabetes (3.6% vs 5.0%, p = 0.02) and hyperlipidemia (4.2% vs 7.9%), as well as a higher energy intake (2210.19 kcal/d vs 2079.01 kcal/d, p < 0.0001). Overall, there was no systematic difference between the analyzed sub-cohort and the whole Moli-sani population, except those due to removal of individuals with history of CVD.

Table 1 Baseline characteristics of the analyzed sub-cohort (N = 1080) compared to the whole Moli-sani cohort (N = 24,325)

Association analyses of glycaemic parameters in the analyzed sub-cohort (Table 2) revealed significant negative associations of GI with both 5mC (standardized β (Standard Error) = − 0.073 (0.027), p = 0.007) and 5hmC (β (SE) = − 0.084 (0.030), p = 0.006) measured on buffy coat samples from the studied subjects. A significant negative association was also observed between GL and 5mC (β (SE) = − 0.146 (0.060), p = 0.015), but not with 5hmC. These associations remained significant after additional adjustments for other potential confounding factors like use of metformin (Additional file 1: Table S1).

Table 2 Association of dietary glycaemic parameters with methylation (5mC) and hydroxymethylation (5hmC) measures

In the mediation analysis of different circulating biomarkers—including C-reactive protein (CRP), glucose, C-peptide, insulin, total cholesterol, LDL, HDL—no significant proportion of the above- mentioned associations was explained by any of the circulating markers tested (Table 3).

Table 3 Mediation analysis of metabolic parameters on the association between dietary glycaemic parameters and methylation/hydroxymethylation measures

Gender interaction analyses revealed a significant association of the interaction term between gender and glycaemic load with 5mC levels (interaction β (SE) = − 0.16 (0.06), p = 0.005), with men showing an inverse association more than three times as large as in women (Table 4). No other significant interactive associations were detected.

Table 4 Gender interaction associations of dietary glycaemic parameters with 5mC and 5hmC

Discussion

Here we report, for the first time, a concordant negative association between two measurements reflecting carbohydrates’ quality and quantity (GI and GL), and global methylation levels measured within the same subjects from a general population. These associations resisted correction for several factors influencing hypomethylation, including use of metformin [24, 25] or self-reported diabetic status.

Nutritional factors, among all the environmental stimuli, can affect epigenetics both transiently and chronically [26,27,28]. The specific epigenetic changes caused by sustained hyperglycaemia are the basis for the establishment of the so-called metabolic memory [29] and are the means by which exposure to high glucose exerts its long-lasting detrimental effects on human health in the context of cancer [30], diabetes [31,32,33,34] and CVD [35,36,37]. Global hypomethylation is a generally accepted hallmark of cancer [38]. More controversial are the published studies that consider global DNA methylation levels and CVD or diabetes. However, lower global DNA methylation (5mC), evaluated with a similar technique to the one used in this study, has been generally associated with clinical and subclinical CVD phenotypes including hypertension, atherosclerosis, coronary artery disease and increased CVD risk in postmenopausal women [38,39,40,41,42]. Very few studies have investigated global DNA methylation in the context of diabetes and metabolic syndrome, generally reporting lower 5mC and 5hmC levels [16, 17] although not always consistently [18].

Consuming high GL foods is known to cause an increase in blood glucose and insulin levels with consequent increase in plasma-free fatty acids [43]. All together these factors contribute to lower insulin sensitivity and to the development of dyslipidemia [10, 43, 44]. Though some controversial data have been published on the topic, several observational studies have supported these findings by identifying a direct association between dietary GL and glucose metabolism parameters [45,46,47,48,49,50,51,52]. Based on this, GI and GL have been extensively considered and validated as risk factors for chronic diseases [3,4,5,6,7,8,9]. Therefore, identifying a possible molecular mechanism linking these variables to health outcomes could bare important opportunities to identify a novel marker for clinical risk assessment.

Higher GI and GL in the studied Moli-sani sub-cohort were associated with lower global methylation and hydroxymethylation levels in the genome. Although to our knowledge there are no such comparable studies in the field, we evaluated our findings in relation to previous studies on the link between T2D and genomic methylation patterns. Indeed, our observations are partly concordant with previous epidemiological evidence of lower hydroxymethylation levels found in diabetic patients compared to controls and with the functional evidence that glucose treatment increases 5hmC levels in specific cell lines like PBMCs, HUVECs and TF-1 [16]. This mechanism is mediated by downregulation of TET2 (ten-eleven translocation 2 protein) levels, an enzyme involved in the conversion of 5-methylcytosine into 5-hydroxymethylcytosine in the genome. This effect is counterbalanced by metformin treatment, which increases TET2 stability and 5hmC levels [16]. Our findings are also in line with previous reports of decreased 5mC levels in T2D patients compared to controls, also after adjustment for use of metformin. Conversely, contrasting evidence of higher 5mC and 5hmC levels in peripheral blood cells of poorly controlled compared to well-controlled diabetic patients and healthy controls has been reported [18]. Moreover, the lack of evidence of mediation by glucose, insulin and C-peptide in the link between dietary glycaemic indices and methylation patterns suggests that other biomarkers should be investigated to explain the significant associations observed here. Therefore, further studies aimed at disentangling the link between dietary glycaemic parameters and altered genomic methylation are needed to clarify the mechanisms linking nutrition, methylation patterns and diabetes-related traits.

Another interesting finding in our study was that some of the significant associations observed were not concordant between genders. Prominently, the decrease in methylation levels per SD increase of glycaemic load was more than three times as negative in men compared to women. If supported by independent studies, these findings may open a gender-based perspective on the investigation of potential effects of dietary glycaemic quantity, with immediate translational implications for the control of carbohydrate intake in patients also based on their gender. In view of the known link between metabolic and cardiovascular diseases [7,8,9], 5hmC could represent a specific novel marker for cardiometabolic risk prediction in women.

Limitations and implications for future studies

Although this study has the merit to provide a contribution to the molecular epidemiology of the relationship between DNA methylation and glucose-related dietary patterns, an aspect very scarcely investigated yet in the field of nutrigenomics, we need to acknowledge some limitations of the present work. The FFQ used in this study was not specifically designed to evaluate dietary GI and GL, but to provide estimates of total carbohydrate and total energy intake. Furthermore, GI and GL estimates derived from FFQs may not take into account several factors that can influence the postprandial glycaemic response, such as varying meal frequency, varying cooking methods or chewing habits. Also, dietary data were self-reported and this may lead to recall bias. Similarly, the cross-sectional setting does not allow to establish clear directionality of effect between glycaemic parameters and epigenetic modifications, nor does it give any precise information linking glycaemic nutritional parameters, altered methylation patterns and chronic disease risk. Also, the possibility of residual confounding by unmeasured factors cannot be fully excluded. This—along with the low number of T2D cases (< 40) in the analyzed sub-cohort—did not allow to test potential mediation effects of methylation patterns in the link between dietary parameters and diabetes, which was instead used as a covariate. Finally, our global measure of DNA methylation/hydroxymethylation does not allow to further explore potential mechanisms linking a specific gene or pathway to quality and quantity of carbohydrate intake, which makes it difficult to understand the functional meaning of these associations.

Conclusions

This study represents, to the best of our knowledge, the first attempt to investigate the relationship between methylation patterns in the genome and dietary glycaemic parameters in the same individual from an adult population cohort. Our findings suggest that global DNA methylation and hydroxymethylation patterns can be used as biomarkers of carbohydrate intake. Further approaches are necessary to better understand the gender-based potential effects of dietary GI and GL.

Methods

Study population

The study population was already described in Noro et al. [23] and composed of a randomly selected sub-cohort of 1,160 participants of the Moli-sani study (N = 24,325; 49.20% men; age ≥ 35 years, recruited between 2005 and 2010) [53, 54]. Subjects with incomplete dietary questionnaires or with missing values in the studied variables were excluded from the analysis to a final number of 1080 subjects.

DNA extraction and global DNA methylation assessment

We used a silica matrix-based method to extract buffy coat DNA as described in [55]. Out of the original 1160 DNA samples, 1140 were selected based on their DNA quality to be further used in the methylation study (see below). We used the MethylFlash Global DNA Methylation (5mC) ELISA Easy Kit (colorimetric) and the MethylFlash Hydroxymethylated DNA 5hmC Quantification Kit (colorimetric) (EpiGentek), according to the manufacturer’s instructions, to assess global levels of 5-methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC), respectively. DNA methylation quality control and statistical analyses were performed using R (The R Project, 2020; https://www.r-project.org/).

We overall measured 5mC and 5hmC global levels for 1214 samples (including 1,140 original and 74 duplicate samples). Of these, samples with absorbance optical density (OD) values below the mean of negative controls plus 2 standard deviations (SDs) for both global methylation measurements were set to missing as described [23]. We additionally excluded: (1) 17 and 2 outlier samples for 5mC and 5hmC, respectively, defined as samples with absolute values of standardized methylation levels above 3 standard deviations; and (2) all prevalent CVD cases (56 and 58 samples for 5mC and 5hmC, respectively) to exclude reverse causality of CVD on methylation levels [56]. Finally, 1067 samples for 5mC and 1075 samples for 5hmC were used in the following statistical analyses. 5mC and 5hmC showed a modest but significant inverse correlation (Pearson’s r = − 0.21, p = 1.1 × 10–11). For both 5mC and 5hmC, the study population distributions were approaching normality (Additional file 1: Figure S1a, b).

Dietary assessment and calculation of dietary glycaemic index and load

Food intake during the year before enrollment was assessed by the Italian version of the semiquantitative EPIC food frequency questionnaire (FFQ) [57]. The FFQ contains 14 sections (i.e., pasta/rice, soup, meat (excluding salami and other cured meats), fish, raw vegetables, cooked vegetables, eggs, sandwiches, salami and other cured meats, cheese, fruit, bread/wine, milk/coffee/cakes and herbs/spices) with 248 questions concerning 188 different food items.

Frequencies and quantities of each food were linked to Italian Food Tables [58] using a specifically designed software in order to obtain estimates of daily intake of macro- and micronutrients plus energy.

The average dietary GI for each volunteer was calculated as the sum of the GIs of each food item consumed, multiplied by the average daily amount consumed and the percentage of carbohydrate content, all divided by the total daily carbohydrate intake. The GL was calculated similarly except that there was no division by total carbohydrate intake.

Generally, the more digestible a carbohydrate is, the higher its GI will be. Some carbohydrates are absorbed quickly and lead to rapid rise in blood glucose (high GI), while others release glucose more slowly (low GI).

The glycaemic load (GL), instead, was calculated by multiplying the GI of each specific food for its total carbohydrate content (g), then dividing by 100 [59]. GL is therefore meant to represent the actual increase of blood glucose caused by ingesting a given quantity of carbohydrates contained in a portion of food.

Dietary GI and GL for each study participant were calculated as the sum of the GIs and GLs of all foods consumed in the diet [60]. GI and GL showed a modest positive correlation (r = 0.13, p = 1.3 × 10–5) and were their distributions approached normality in the population under study (Additional file 1: Figure S2a, b).

Covariates assessment and selection

Several covariates were considered or tested as potential confounders of the relationship between glycaemic index (GI) and load (GL) and global DNA methylation levels (5mC/5hmC).

Among them, sex and age were selected by default, since they are typical confounding factors which influence both methylation measures [61, 62] and nutritional patterns [63]. Similarly, energy intake (kcal/day) was included because also the amount of eaten food might influence participant methylation levels [64], as was educational attainment (defined as completed school level: primary, lower, upper secondary and post-secondary), which is associated with both methylation [65] and nutritional patterns [66]. White blood cell (granulocytes, monocytes, lymphocytes) fractions were also included by default to account for their heterogeneity, since global DNA methylation was measured on DNA extracted from these cells.

Other variables—including smoking habits, leisure time physical activity, abdominal obesity, alcohol intake, diabetes, hyperlipidaemia and cancer (as defined previously)—were added to the models since they showed a univariate trend of association with both the nutritional exposure and the methylation outcome (p < 0.2). The definition of covariates is reported in Additional file 1.

Statistical analyses

Statistical analyses were carried out in R (https://www.r-project.org/). The association between GI and GL of diet (exposures) and standardized global methylation levels (outcome) was analyzed through linear regressions (lm() function in R), separately for 5mC and 5hmC, adjusting for different potential confounders of these relationship, which included age, sex and educational attainment, white blood cell fractions, smoking, leisure time physical activity, abdominal obesity, alcohol intake, prevalent diabetes, hyperlipidaemia and cancer (see Supplementary Methods for definitions and details on selection).

Sensitivity analyses were carried out to disentangle significant associations detected, through further adjusting them for use of metformin—an antidiabetic drug with known altering effects on DNA methylation (24)—so as to ensure that the detected associations were independent from this factor.

Moreover, we performed mediation analyses through the mediate function of the mediation package (https://cran.r-project.org/web/packages/mediation/), to estimate the proportion of association between dietary glycaemic indices (GI and GL) and methylation measures (5mC/5hmC) explained by different circulating biomarkers, including high sensitivity C-reactive protein (CRP, tagging circulating inflammation), glucose, insulin and C-peptide levels (tagging glucose homeostasis) and total, LDL and HDL cholesterol.

Similarly, we performed gender interaction association analyses for both exposures and both outcomes mentioned above, to detect potential gender-specific associations.