Background

Large-scale genome-wide association studies (GWAS) have identified numerous loci associated with fasting blood lipids and other cardiovascular diseases (CVDs) [1]. Epigenetic analysis has gained attention in the past few years as an alternative perspective on the etiology of complex diseases. Epigenetic adaptations alter gene expressions and are heritable through many cell divisions, even across generations, while they do not alter the primary DNA sequence. To advance blood lipids and CVD research, it is important to apply epigenome-wide association study (EWAS) to detect the epigenetic risk factors. The study of molecular mechanisms underlying epigenetic inheritance, such as DNA methylation, will provide insights in advancing and shaping ideas of the role that epigenetic phenomena play in high blood lipids and CVD. In this paper, we used data from the Genetics of Lipid-lowering Drugs and Diet Network (GOLDN) study provided by the GAW20 to examine the methylation levels of lipid-lowering treatment on the fasting blood triglycerides [2].

Methods

Data

The GAW20 data sets are drawn from the GOLDN study with a total number of 1105 participants [2]. The data sets include GWAS and EWAS data before and after the fenofibrate (blood lipid-lowering drug) intervention. The EWAS data set contains 2 triglyceride (TG) measurements and methylation levels of 463,995 cytosine-phosphate-guanine (CpG) sites for 995 pretreatment individuals and 530 posttreatment individuals, respectively. The log-transformed mean pre- and posttreatment TG levels were used as the outcome variable in our model. Control variables include age, gender, study center, and family pedigree.

EWAS model

We applied mixed-effects models for two repeated measures of log TG levels with fixed effects of time (0 = pre, 1 = post), methylation level, and their interactions, adjusting for age (18 years of age to approximately 87 years of age), sex, study site, and top 4 methylation principal components. Pedigree and subject IDs are controlled as nested random effects. These fixed effects of time, methylation levels, and the interaction term, measure the associations for both pre- and posttreatment periods, and the treatment modification effects, respectively.

Let Yijk denote the log TG measurements at kth time (0 = pre, 1 = post) for the ith individual in the jth pedigree; Xijk denote the methylation level; and tk denote treatment while t0= 0 and t1=1. The model equation can be written as:

$$ {\displaystyle \begin{array}{l}{Y}_{ij k}={\beta}_0+{\beta}_1{X}_{ij k}+\gamma {t}_k+\delta \left({X}_{ij k}{t}_k\right)+{\beta}_2{Age}_i+{\beta}_3{Site}_i\\ {}+{\beta}_4 PC{1}_{ik}+{\beta}_5 PC{2}_{ik}+{\beta}_6 PC{3}_{ik}+{\beta}_7 PC{4}_{ik}\\ {}+{S}_{ij}+{\varepsilon}_{ij k}\end{array}} $$

where the main effect β1is the pretreatment methylation effect on log TG; γ is the main treatment effect; δ is the interaction effect between methylation and treatment (i.e., the treatment modification effect); and Sij is the random effect of the individual nested within the pedigree. The general linear hypothesis tests were applied to calculate postmethylation effect (β1 + δ), the standard errors, and the p values. We examined each CpG site on the whole genome (463,995 sites). Mixed-effects models for repeated measures enable us to examine the individual patterns of change by excluding between-individual variability and provide more efficient estimators of treatment effects. The main effects and interactions work together to identify the epigenetic risk factors of TG levels for pretreatment, posttreatment, and potential gene–drug interactions simultaneously [3, 4]. Compared to cross-sectional study, the repeated measure analysis has the advantage of making reliable inferences by capturing the systemic changes within individuals, thereby achieving more sensitive tests and higher statistical power for a fixed number of individuals [5, 6]. Statistical software R (version 3.2.3) was used for the entire analyses, with R package nlme for mixed-effects modeling [7], car for linear hypothesis tests [8], and qqman for Manhattan plots [9].We applied a relatively loose significance threshold (p value <1E-5) for modification effects and posttreatment associations because of the exploratory nature of proposed method and the moderate sample size (N = 536 posttreatment measures). A less-stringent threshold might imply potential drug modification effects, as empirical evaluation suggests a possible relaxation in the current GWAS threshold for replication studies [10].

Pathway-enrichment analysis

After EWAS analyses of CpG sites for pretreatment, posttreatment, and interaction effects, we mapped them to corresponding genes. To provide a functional insight of the results, we applied a gene set enrichment analysis (GSEA) [11] preranked test to each of 3 gene lists with log-transformed p values. To compute the empirical p values and false discovery rates (FDRs) for pathways, we performed 1000 permutations. Pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database [12] were used in our analysis.

GSEA is a robust technique that searches for pathways (gene sets) that contain abundant highly significant genes (CpG sites) based on a Kolmogorov-Smirnov test [11] to reveal biological insights of genome/epigenome data.

Results

Table 1 lists selected CpG sites that are associated with pre- and post-log TGs, and modified by treatment, and Fig. 1 shows the corresponding Manhattan plots. The methylation level of 3 CpG sites (cg00574958, cg17058475, and cg01082498) on CPT1A gene, chromosome 11, and 2 other CpG sites (cg03725309 on gene SARS, chromosome 1, and cg06500161 on ABCG1, chromosome 21) were found to be associated with both pre- and posttreatment TG levels (p values <1E-5). Moreover, the methylation levels of 2 sites on chromosome 11 are associated with pretreatment log TG but not with posttreatment log TG levels (cg12556569 on gene APOA5 and cg11376147 on gene SLC43A1).

Table 1 Selected EWAS CpG sites for pretreatment, posttreatment, and gene–drug interactions (p value <1E-5 is considered statistical signifiance and italicized)
Fig. 1
figure 1

Manhattan plots of EWAS analyses for (a) pretreatment, (b) posttreatment, (c) and drug–gene interactions (i.e., drug modification effect)

In addition, the associations between methylation level and log TG levels were modified by the fenofibrate therapy on the following four CpG sites with p value < 1E-5:cg20015535 (gene EGLN1, chromosome 1), cg24870738 (gene RNF220, chromosome 1), cg06891775 (gene LOC283050, chromosome 10), and cg00607630 (gene USP7, chromosome 16).

The GSEA results are recorded in Table 2, which shows FDR q-values for the top 15 pathways across pre- and posttreatment associations, and the treatment-modifying effects. Several cancer-related pathways (KEGG_ENDOMETRIAL_CANCER; KEGG_PATHWAYS_IN_CANCER; KEGG_CHRONIC_MYELOID_LEUKEMIA; KEGG_BASAL_CELL_CARCINOMA; and KEGG_NON_SMALL_CELL_LUNG_CANCER) show consistent significant results. In addition, metabolism related pathways were also observed (KEGG_TYPE_II_DIABETES_MELLITUS and KEGG_ADIPOCYTOKINE_SIGNALING_PATHWAY).

Table 2 FDR q-values for the top 15 pathways across different results from GSEA

Discussion

The proposed mixed-effects model examines the methylation sites that are associated with blood TG levels before and after the fenofibrate therapy, and the potential gene–drug interactions. Using the linear hypothesis test, we identified 7 CpG sites that are associated with pretreatment TG levels (p value <1E-7) and 5 sites for posttreatment (p value <1E-5). All 5 posttreatment CpG sites are overlapped with pretreatment CpG sites. Among these CpG sites, 3 are located in gene CPT1A, which encodes a key enzyme in the carnitine-dependent transport of long-chain fatty acids across the mitochondria membrane whose deficiency will result in downregulation of fatty acid β-oxidation [13].The consistent findings suggest a strong association between blood TGs and DNA methylation of CPT1A regardless of the interference of lipid-lowering drug. In addition, we also observed 4 potential drug-interacted CpG sites from our results that belong to genes EGLN1, LOC283050, USP7, and RNF220. Previous study shows that the inhibition of EGLN1 improves the glucose and lipid metabolism, and protects against obesity and metabolic dysfunction [14]. Less-significant results were observed for drug modification effects, which were in part a result of the moderate sample size (536 posttreatment measures). But our results provide initial evidence of gene–drug interaction and warrant replication studies [10].

To provide further biological insight to 4 EWAS results, GSEA was applied to examine the associated biological pathways using KEGG database [12]. It is worth noting that 5 cancer-related pathways were enriched by TG-associated CpG sites. We observed potential associations between blood TG levels and cancer risk from an epigenetic point of view. Although obesity was recognized as a risk factor for several different cancers, for example, endometrial cancer [15], further mechanism research is necessary to determine whether there is any association between methylation level and cancer risk. In addition to these cancer-related pathways, 2 metabolism-related pathways were also observed. For Type II diabetes, elevated TG levels are common dyslipidemic features [16] and could be identified as an independent risk factor [17]. Another significant pathway is the KEGG_ADIPOCYTOKINE_SIGNALING_PATHWAY. In addition to the fatty acid metabolism and β-oxidation, this pathway is also associated with glucose uptake and insulin resistance.

Conclusions

In summary, we used linear mixed models with interaction terms to study pre- and posttreatment associations between blood TGs and CpG methylation levels and drug–gene interactions simultaneously across the whole-genome. We found several CpG sites that were consistently associated with blood TG levels in both pre- and posttreatment. In addition, by testing on the interaction term, we found potential treatment modification effects on certain CpG sites. Our pathway-enrichment analysis revealed a number of cancer-related biological pathways that were significantly enriched by TG-associated CpG sties. The results suggest connections between TG levels and cancer risk through an epigenetic point of view. However, because only 1 cohort with a limited sample size was studied in our analyses, further research on independent cohorts and experimental biology validations are needed for convincing conclusions.