Background

Genome wide association studies (GWAS) performed in large international consortia have demonstrated that variation at chromosome 1p13 is associated with the risk of coronary artery disease (CAD) mainly through its association with LDL and cholesterol serum levels [16]. Two leading SNPs mapping at this locus rs646776T/C and rs599839A/G explain 1% of the genetic variation in circulating LDL-cholesterol levels and the rare alleles are associated with reduced LDL-cholesterol levels [5]. Chromosome 1p13 maps in close proximity to the cadherin EGF LAG seven-pass G-type receptor (CELSR2) and the proline/serine-rich coiled-coil protein 1 (PSRC1) genes, involved in the regulation of cell adhesion, proliferation and intracellular trafficking, and in proximity to the gene coding sortilin (SORT1) a cell surface receptor involved in the glucose and lipid uptake. Functional studies have shown that the genetic variants at this locus modulate cholesterol metabolism through the regulation of sortilin expression and LDL uptake in hepatocytes and influence the diameter of the circulating LDL particles [7, 8].

The estimated risk [expressed as odds ratio (OR) and 95% confidence interval (95%CI)] of CAD in individuals carrying the allele associated with high LDL-cholesterol levels ranges from 1.20 (1.1-1.3) [6] to 1.29 (1.2-1.4) [9] and 1.19 (1.13-1.26) for the early onset myocardial infarction (MI) [10]. Consistently, the rs599839 G allele, associated with low LDL-cholesterol levels, was associated with a 13% 90%CI (10-17) reduction in the risk of CAD [7].

The actual effect of a genetic variant on the risk of complex diseases can vary across different studies [11] and populations depending on the genetic architecture, the outcome of the study and the exposure to different risk factors [1214]. To overcome these limitations and fully explain the risk of cardiovascular diseases associated with these newly discovered genetic variants, different approaches have been proposed and applied. In particular, fine mapping of the region of interest [15], the analysis of the association with more specific traits and the analysis of gene and environment interactions [14] have been recently proposed to fill in the so called “missing heritability” gap.

Here we investigate if an interaction between variants at chromosome 1p13 and serum lipid levels was associated with the risk of non-fatal MI. We performed the present study in the Stockholm Heart Epidemiology Program, SHEEP, a large case control population recruited in the Stockholm area specifically designed to investigate the role of genetic and environmental factors in the occurrence of MI in men and women.

Methods

Study population

SHEEP [16] was designed as a population based case control study to dissect both genetic and environmental factors underlying the occurrence of MI and to compare the effects of the different risk factors in men and women. Cases were identified during the period 1992 to 1994. The sources were the coronary and intensive care units, the discharge charts from the hospitals in the Stockholm County area and the death certificates from the Swedish National Causes of Death Register. The criteria for myocardial infarction included changes in the CK and LDH blood levels, presence of specified ECG changes and/or the autopsy finding of a myocardial necrosis whose age was compatible with the time of disease onset. Only patients who survived at least 28 days after the MI event were included in the present study (n = 1213, men = 852; women = 361). One control per case was randomly selected from the Stockholm County population registry after stratification for age (with a 5-years interval), sex and residential area. In addition other 5 controls were selected at the same time to replace eventual non-responders. When the initial control replied late, both the initial and the already enrolled substitute control have been included in the study. This resulted in the inclusion of more controls (n = 1561, men = 1054; women = 507) than cases.

Anthropometric measures were recorded at physical examination and blood samples were collected about three months after the MI [16]. Biochemical measurements were done as previously reported [17]. Family history of CAD was defined as having at least one close relative affected before the age of 65.

Ethics

The Ethical Committee at Karolinska Institutet approved the SHEEP study design in 1991 (Protocol Number 1991, 91:259). All the study participants gave their informed oral consent to be enrolled in the study, since at the time the study was initiated (1992) no forms for the written consent were available or in current use. The Ethical Committee at Karolinska Institutet has then approved molecular genetic analyses to be performed on the SHEEP material in 2001 (Protocol Number 2001, 01-097).

Single nucleotide polymorphism (SNP) genotyping

Three SNPs showing the strongest association in the published GWAs studies [6, 10] with LDL-serum levels were genotyped and analysed in the present study: two intergenic SNPs, rs599839 and rs646776, and rs12740374 that maps at the 3´UTR of the CELSR2 gene. Rs599839 was genotyped by Taqman and rs12740374 and rs646776 through the Sequenom iPLEX MassARRAY platforms. Random DNA samples were genotyped twice to check for concordance of genotyping. The call rates were 0.98 (rs599839) and 0.99 (rs12740374 and rs646776).

Statistical analysis

Continuous traits were expressed as median ± interquartile range (IQTR) and the differences in the distribution of quantitative traits and categorical variables calculated by Kruskal-Wallis and χ2 test, respectively. Kolmogorov-Smirnov test was used to test the normality of the distribution of the lipid serum levels as well as of dependant biomarkers. Pairwise linkage disequilibrium (LD) was estimated by calculation of the r2 metric using the software Plink [18]. Concordance to the Hardy-Weinberg equilibrium was tested in cases and controls by the χ2 test with 1DF and threshold p-value of 0.05.

Serum lipid levels were not normally distributed in the SHEEP. To test the effect of the SNPs under investigation on lipid serum levels, a weighted least squares regression, a linear regression analysis that does not assume constant variance for the regression residuals, was used to estimate the regression-coefficient (b) and standard error (SE) under the hypothesis of an additive model, i.e. change in serum levels according to the number of risk alleles (i.e. 00 vs 01 vs 11). To test the association with MI, a logistic regression analysis was performed and odds ratios (OR) with 95% confidence interval (95%CI) were estimated under the assumption of an additive (i.e. 00 vs 01 vs 11), dominant (00 vs 01 + 11) and recessive (00 + 01 vs 11) model of inheritance. The crude ORs (95%CI) were adjusted by age, sex and residential area. Further adjustments including BMI, smoking, hypertension, hypercholesterolemia, hypertriglyceridemia and diabetes mellitus were performed in the adjusted analysis.

The interaction between genotypes and the serum lipid parameters (total-, LDL-cholesterol and ApoB serum levels) was calculated using the biological approach [19]. The biological interaction estimates the difference in the risk, expressed as OR (95%CI), associated with the exposure to only one factor (e.g. ApoB or genotype) and the risk associated with the exposure to both factors as compared to the risk observed in the absence of exposure to both factors. The ratio between the risk observed in the presence of both factors and the risk observed in the reference group can be used to derive the Synergy index (S) [20]. A S > 1 indicates the presence of a synergism while a S < 1 indicates the presence of an antagonism between the two interaction terms [20, 21]. In the interaction analysis we have defined the exposure to high serum levels as exposure to serum levels higher or equal to the 75th percentile of total-cholesterol ≥6.6 mmol/L, LDL-cholesterol ≥4.6 mmol/L and ApoB ≥1.7 g/L; the exposure to the genotype as presence of the minor allele versus absence of the minor allele (e.g. AG + GG vs AA). For the purpose of interaction analysis ORs (95%CI) were only adjusted by age, sex and residential area.

Calculations were carried out by SAS (vers 9.1, SAS Institute Inc. Cary, NC).

Results

Table1 summarizes the demographic characteristics, serum lipids and biomarkers in the SHEEP study. Men were aged 60 (53-65) and women 61 (54-66). Cardiovascular risk factors were more often observed in cases than in controls. In particular, cases had a higher proportion of hypercholesterolemia than controls (42% vs 30%, p < 0.0001).

Table 1 Study population: age, serum lipids, biomarkers and cardiovascular risk factor distribution in cases and controls

Rs12740374 and rs646776 showed a high degree of pairwise LD (r2 = 0.99), while rs599839 was in moderate LD with rs12740374 and rs646776 (both r2 = 0.51) therefore only rs646776 and rs599839 were analysed for association.

Genotype and allele frequencies were concordant with those predicted by the Hardy-Weinberg proportions in both cases (rs599839 p = 0.85 and rs646776 p = 0.91) and controls (rs599839 p = 0.30 and rs646776 p = 0.24).

We tested the association of genotypes at rs599839 and rs646776 with lipid serum levels (Table2). In the presence of the genotype GG at rs599839 and CC at rs646776 lower levels of ApoB, serum total - and LDL-cholesterol were observed. This observation is consistent with published data [1, 46]where the presence of the G at rs599839 and of the C allele rs646776 were associated with LDL-cholesterol serum levels about 0.2 mmol/L (6-7 mg/dl) lower than the alternate allele and with lower total-cholesterol serum levels [about 0.5 mmol/L (19 mg/dl)]. The effect of each SNP on lipid serum levels is reported in Table2 and indicates a progressive reduction in total-, LDL-cholesterol and ApoB serum levels associated with the G and the C alleles.

Table 2 Distribution across the genotype strata of total-, LDL-cholesterol and ApoB serum levels and effect of rs599839 and rs646776 on total-, LDL-cholesterol and ApoB serum levels in the SHEEP study

When the analysis was performed in men and women separately, only men consistently showed reduced serum levels of ApoB, total-cholesterol and LDL-cholesterol serum levels (Additional file 1: Table S1).

No significant association of the G at rs599839 as well as of the C allele at rs646776 allele with serum levels of triglycerides, HDL-cholesterol, ApoA1 was observed in men or in women (Additional file 1: Table S2).

No significant differences were observed in genotype and allele frequencies at these two SNPs between MI cases and controls and no association with the risk of MI was observed in this population. Table3 shows the genotype and allele frequencies of the two SNPs and the analysis of association with the risk of MI under the three different models of inheritance. Allele G frequency at rs599839 was 0.18 in cases and 0.17 in controls, while the allele C frequency at rs646776 was 0.23 in both cases and controls. No association of any of the two SNPs with the risk of MI was observed at the univariate analysis [OR(95%CI)] using three different analytical models, additive [rs599839 1.08(0.93-1.26); rs646776 0.97 (0.85-1.11)], dominant, [rs599839 1.07(0.91-1.27); rs646776 0.95 (0.82-1.11)], and recessive [rs599839 1.36(0.79-2.13); rs646776 1.07 (0.75-1.53)]. Adjustment for the other covariates did not substantial change the risk estimates as shown in Table3.

Table 3 Genotype, allele frequencies and risk of MI associated with rs599839 and rs646776 in the SHEEP

We have then tested the hypothesis that, in the SHEEP, the interaction between the genetic variants at 1p13 and serum lipid levels was an important player in explaining the lack of association between the 1p13 genetic variants and the MI risk. Given the causal association between serum lipid levels and MI, we analysed the interaction between serum lipid levels and genotypes using the biological approach. As reported in Table4 and Figure1 (top panel), presence of the allele G at rs599839 antagonizes the risk associated with the exposure to high (≥75th percentile) serum levels of ApoB. The calculated S of 0.47 (0.24-0.90) indicates antagonism between the two interaction terms. When the analysis was performed in men and women the protective effect of the rare allele was observed only in men with a S of 0.26 (0.07-0.91) (Figure1, top panel, middle bar graph). The actual ORs (95%CI) for the gender specific interaction analysis are reported in the Additional file 1: Table S3. A trend toward a reduction in MI risk in men carrying the G allele and exposed to increased total- and LDL cholesterol levels was also observed with a S lower than 1, however the results fell short of statistical significance (Table4).

Table 4 Effect of interaction between serum lipid levels and genetic variants at 1p13 on the risk of MI expressed as Odds Ratio (OR) and 95%CI in the SHEEP
Figure 1
figure 1

Top panel (A). Biological interaction (left to right striped bars) between the exposure to high (≥75th percentile) ApoB serum levels (white bars) and the presence of the rare allele at rs599839 (gray bars) in all SHEEP participants (left), in men (middle) and in women (right). Bottom panel (B). Biological interaction (right to left striped bars) between the exposure to high (≥75thpercentile) ApoB serum levels (white bars) and the presence of the rare allele and at rs646776 (black bars), in all SHEEP participants (left), in men (middle) and in women (right). The reference group is represented by the individuals not exposed to ApoB serum levels nor to the G or C alleles. Error bars indicate the 95%CI; S: Sinergy Index.

The analysis of the interaction between high ApoB serum levels and the C allele at rs646776 also suggested the presence of an antagonistic effect with and S index of 0.62 (0.34-1.12) (Table4 and Figure1, bottom panel) that was also confirmed in men with a S 0.40 (0.14-1.09), but with a larger 95%CI (Additional file 1: Table S2 and Figure1 bottom panel).

Discussion

The intergenic SNPs rs599839 and rs646776 have been identified through GWAs as novel genetic markers for two complex and related traits, serum lipid levels and CAD. In the present study, performed in the SHEEP, a large Swedish population, we confirmed the association of these two genetic variants with serum lipid levels; however we have not observed a direct association between these two genetic variants and the risk of non-fatal MI. We have therefore tested the hypothesis that the interaction between these two SNPs and serum lipid levels contributed to the risk of non-fatal MI in the SHEEP.

The analysis of the association of genetic variants with complex phenotypes may largely vary among different populations. Genes do not have large effect on complex traits and differences in the definition of the trait under investigation as well as the genetic structure of the loci may create large differences in the association results. Although the lack of association in the SHEEP population might partly reflect a reduced power in the association analysis as compared to the analysis of genetic association in large international consortia, several other factors should be taken into account. In the populations formerly investigated different criteria have been used to identify to cases, the phenotype under investigation was either CAD [6] or early myocardial infarction in patients with at least one first degree relative with premature CAD [1] and the matching criteria for the controls were sometimes incompletely described [6, 9]. In the current study, only MI patients who survived at least 28 days after the MI event have been included and the referent population has been matched according to age, sex and residential area. Therefore lack of direct association of 1p13 variants with MI in the SHEEP might be related to the differences in the definition of cases as well as to the controls selection. In addition, differences in the genetic structure of the populations under investigation may hamper the replication of genetic associations. With regard to the chromosome 1p13 locus we have observed that in the SHEEP the pairwise LD value between rs599839 and rs646776 is different from the one currently reported in the Hapmap Consortium (http://www.hapmap.org) for the European population (r2 = 0.51 observed vs 0.87 reported). These data speak in favor of a different genetic structure of this locus in the Swedish population and are consistent with the hypothesis raised by evolutionary geneticists stating that European populations have a composite genetic structure due to recent gene selection events (10 000-20 000 years ago) that might have changed pairwise LD values [22]. In addition, the G allele frequency at rs599839 in the SHEEP (17%) is lower than previously reported in former studies (23%) [2, 4, 5] and in the European panel of the HapMap (33%). Such findings underscore the importance of the knowledge of the locus structure when analysing the effect of genetic variants on a phenotype even within populations of the same ethnicity [17, 2325] and may well explain discrepancies in the genetic effect of even truly genetic susceptibility variants [12, 26].

In the SHEEP, the risk of MI was increased in the presence of high ApoB serum levels and presence of the rare allele at rs599839, and to a lesser extent of the rare allele at rs647767, was found to antagonize the increased risk due to the exposure to high ApoB serum levels, as shown by the results of the interaction analysis. Although we cannot provide proof of a biological mechanism, this interpretation is in line with the results of the original GWAs studies, where the protective effect of 1p13 was observed in populations where the proportion of cases with dyslipidemia ranged from 76 to 80% [4, 6] and was therefore higher than the proportion reported in the SHEEP that is about 40%.

The analysis of interaction represents a powerful tool to integrate genetic association data into the complexity of multifactorial traits [19]. In the present study we have utilized the biological method to analyze the effect of the interaction between genetic variants at 1p13 and serum lipid levels because they participate in the same causal mechanism that leads to MI. The elucidation of the mechanisms underlying interactions between genetic variants and environmental factors or, as in the present study exposure that may be modulated by pharmacological interventions, might have important implication in the assessment of the individual cardiovascular risk as well as in the clinical practice. Exposure to a specific agent may in fact have more or less detrimental effect in different genotype groups if an interaction between the genotype and the exposure exists [27]. In this perspective gene environment interaction analyses hold the promise to contribute to a better understanding of the effect of genetic variants on the risk of cardiovascular diseases.

The association with reduced serum lipid levels was evident only in men. A gender specific association of genetic variants with MI and intermediate phenotypes has already been reported in the SHEEP [17, 28] and may reflect the selective effect of risk factors in men and women [29].

Several limitations of the present study should be acknowledged. The choice of the SNPs to be investigated in the present studies relies on published data and does not include the other two tagSNPs, rs4970834 and rs611917, at chromosome1p13. The interaction analyses may be hard to interpreter and require large study population to achieve a sufficient power, therefore the replication of our findings in a larger and independent population is warranted.

Conclusions

In conclusion, our results demonstrate that genetic variants at chromosome 1p13 reduce the MI risk in this Swedish population mainly through the interaction with ApoB serum levels, thus supporting the evidence for a causal role of this locus in the occurrence of MI.