Background

Stroke is the leading neurological cause of death worldwide [1]. It is also called brain attack, typically characterized by neurological dysfunction due to acute focal injury of the central nervous system [2]. At a given age, the incidence and mortality rates of stroke are higher in men, however, women are more likely to have stroke because of longer life, and stroke events increase rapidly with aging [3]. Moreover, stroke is already becoming a global issue and it is threatening the health of human beings. Additionally, epidemiologic studies reported that the incidence of IS in China is significantly higher than that in the developed countries [4].

Hence, it is urgent to clarify the etiology of stroke. The majority of stroke are ischemic (approximately 80%), whereas 20% is due to primary hemorrhage [5]. Several studies demonstrated that some risk factors are involved in the development of ischemic stroke (IS), such as a family history of cardiovascular disease, older age, sex [6, 7], hypertension [8], diabetes [9]. However, a large proportion of IS remains unclear. Recently, the role of genetic factors in development of IS was determined in large-scale, collaborative, genome-wide association studies (GWAS) [10]. Some susceptibility genes of IS have been found, including PITX2 [11], ABO [12], and HABP2 [13]. It demonstrated that genetic factors may regulate the pathophysiological process of IS.

Cytochrome P450 (CYP450) plays a vital role in the metabolism of exogenous and endogenous compounds and is widely distributed in human [14]. Cytochrome P450 family 4 subfamily F member 2 (CYP4F2) as a member of CYP450 superfamily, is located on human chromosome 19p13.11 [15]. Previous studies have demonstrated CYP4F2 gene polymorphisms are associated with the risk of IS. For example, Nakamura et al. [16] conducted a case-control study based on the Japanese population to identify whether single nucleotide polymorphisms (SNPs) in CYP4F2 gene associated with the susceptibility to IS, they found ‘G’ allele of rs2108622 was correlated with increased risk of IS in male patients. A study in Sweden [17] reported the V433 M mutation in the CYP4F2 was correlated with cerebral infarction in male patients. It provided strong evidence on the association of CYP4F2 gene and IS. In the present study, we selected five candidate SNPs (rs3093203, rs3093193, rs12459936, rs3093144, and rs3093110) in CYP4F2 and performed a case-control study in Han Chinese population.

Methods

Subjects

The 477 cases were recruited from the First and Second Affiliated Hospital of Xi’an Jiaotong University between January 2016 and October 2018. In our study, all cases were consistent with the World Health Organization’s diagnostic criteria for IS. IS was confirmed by at least two independent neurologists using computed tomography (CT) scans and/or magnetic resonance imaging (MRI). IS patients with history of transient ischemic attack, coronary artery disease, autoimmune disease, systemic inflammatory disease, malignant tumor and other chronic diseases were excluded from the study. Healthy individuals (495 controls) were identified through the annual health assessment and were randomly recruited from the same hospital physical examination center between April 2016 and October 2018. All members were Han Chinese population living in the Shaanxi Province of China and were unrelated in at least three generations.

SNP selection and genotyping

In total, we successfully select five variants (rs3093203, rs3093193, rs12459936, rs3093144, and rs3093110) in the global population of the 1000 genome (http://www.internationalgenome.org/) with minor allele frequencies (MAFs) > 5%. Blood samples were collected in the test tubes containing ethylenediamine tetraacetic acid (EDTA) for anticoagulation and stored at − 80 °C after centrifugation at 2000 rpm for 10 min. Genomic DNA was extracted from peripheral blood samples of the 477 cases and the 495 controls using genomic DNA purification kit (GoldMag, Xi’an, China). And we used NanoDrop 2000 (Thermo Scientific, Waltham, MA) to measure the DNA concentration and purity. Agena MassARRAY Assay Design 3.0 Software [18] was used to design the primers for amplification and extension reactions. The information of primers was shown in Table 1. We used Agena MassARRAY RS1000 to perform the SNP genotyping according to the manufacturer’s instruction and used Agena Typer 4.0 software to manage and analyze the data [18, 19].

Table 1 Primers used for this study

Statistical analysis

Excel (Microsoft Corp., Redmond, WA, United States) and SPSS statistical software (version 17.0, SPSS, Chicago, IL) were used for statistical analyses. All statistical tests were two-sided, and p < 0.05 was considered to be statistical significance. SNP genotype frequencies among cases and controls were calculated by Chi- squared test, and the Hardy–Weinberg equilibrium (HWE) was used to compare the actual and expected frequencies of the genotypes in the controls. In order to explore the effects of the CYP4F2 variants on the risk of IS based on the various genetic models (co- dominant, dominant, recessive, and log- additive), logistic regression analysis was used to calculate the odds ratios (ORs) and 95% CIs on PLINK software (version 1.07) [20]. Finally, the Haploview software package (version 4.2) and SNPStats (https://www.snpstats.net/start.htm?q=snpstats/start.htm) were used to estimate pairwise linkage disequilibrium (LD), haplotype construction, and genetic association at polymorphism loci [21, 22]. HaploReg v4.1 (https://pubs.broadinstitute.org/mammals/haploreg/haploreg.php) was used to predict the potential functions of variants in CYP4F2 gene.

Results

Study participants

The basic characteristics of all study participants were shown in Table 2. Four hundred seventy-seven cases (316 males and 161 females) and 495 healthy controls (326 males and 169 females) were recruited in this case-control study. The mean ages of the patients and the controls were 60.05 ± 6.56 and 64.13 ± 10.82 years old, respectively. There was no statistically significant difference (p = 0.898) on the gender distribution, while the age distribution was statistically significant difference (p = 0.000). As expected, the prevalence of the most common risk factor for IS were significantly different between cases and controls, such as HDL cholesterol and LDL cholesterol (p < 0.001).

Table 2 Basic characteristics of controls and cases

The association between CYP4F2 gene and the risk of IS

Table 3 summarized the basic information of the SNPs including gene, SNP ID, chromosomal position, minor allele frequency (MAF) of cases and controls and HWE test results. Five SNPs in the CYP4F2 gene were successfully genotyped for further analysis. The genotype distribution of all SNPs conformed to HWE in control subjects (p > 0.05). The differences in allele frequency distributions between patients and controls were identified by two-sided Pearson chi-squared tests. The rs12459936 was associated with decreased risk of IS (OR = 0.82, 95% CI = 0.68–0.98, p = 0.028). And the rs3093144 was associated with 1.27-fold (95% CI = 1.01–1.60, p = 0.043) increase in the risk of IS. Further, multiple inheritance models were applied for analyzing the association between CYP4F2 SNPs and IS risk by unconditional logistic regression analysis adjusted for age and gender (Table 4).

Table 3 Basic information of candidate SNPs in CYP4F2 and associations with stroke
Table 4 Association of SNPs with risk of stroke based on logistic tests adjusted by gender and age

Stratification analysis by age and gender

Next, we conducted stratified analysis by age. The results showed that three SNPs were associated with the risk of IS in patients (age > 61 years old), as displayed in Table 5. Rs3093193 in CYP4F2 gene increased the susceptibility of IS in allele model (G vs. C, OR = 1.34, 95% CI = 1.02–1.77, p = 0.038), dominant model (GC/GG vs. CC, OR = 1.61, 95% CI = 1.07–2.42, p = 0.022) and log-additive model (OR = 1.44, 95% CI = 1.05–1.98, p = 0.024). Similarly, rs3093144 was observed to be associated with the higher risk of IS in allele model (T vs. C, OR = 1.53, 95% CI = 1.09–2.14, p = 0.013), dominant model (TC/TT vs. CC, OR = 1.62, 95% CI = 1.05–2.49, p = 0.029) and log-additive model (OR = 1.56, 95% CI = 1.07–2.27, p = 0.020). Conversely, rs12459936 was observed to be associated with the decreased risk of IS in allele model (T vs. C, OR = 0.73, 95% CI = 0.56–0.94, p = 0.016), co-dominant model (TT vs. CC, OR = 0.48, 95% CI = 0.26–0.88, p = 0.018), dominant model (TC/TT vs. CC, OR = 0.63, 95% CI = 0.41–0.94, p = 0.036) and log-additive model (OR = 0.69, 95% CI = 0.52–0.93, p = 0.015). In addition, these was no other significant association between SNPs and IS risk (age ≤ 61 years).

Table 5 Stratification analysis of the association of CYP4F2 polymorphisms with stroke under genetic models

Moreover, we conducted stratified analysis by gender and found three loci associated with the risk of IS in male. Two of these SNPs were correlated with the increased risk of IS in allele model (rs3093193, G vs. C, OR = 1.35, 95% CI = 1.06–1.71, p = 0.016; rs3093144, T vs. C, OR = 1.36, 95% CI = 1.01–1.82, p = 0.040), co-dominant model (rs3093193, GG vs. CC, OR = 2.29, 95% CI = 1.21–4.32, p = 0.011; rs3093144, TT vs. CC, OR = 4.5, 95% CI = 1.22–16.64, p = 0.024), recessive model (rs3093193, GG vs. CC/GC, OR = 2.19, 95% CI = 1.18–4.05, p = 0.013; rs3093144, TT vs. CC/TC, OR = 4.32, 95% CI = 1.17–15.89, p = 0.028) and log-additive model (rs3093193, OR = 1.31, 95% CI = 1.02–1.69, p = 0.036). Conversely, rs12459936 was associated with decreased risk of IS in allele model (T vs. C, OR = 0.78, 95% CI = 0.63–0.98, p = 0.031).

Associations between haplotype analyses and IS risk

The LD and haplotype analyses of the CYP4F2 polymorphisms in the cases and controls were further studied. All SNPs in CYP4F2 gene were found to exist in LD block (Fig. 1). The distributions of different haplotypes in two groups are presented in Table 6. Although the five SNPs in the CYP4F2 gene had strong linkage, haplotype analysis did not show the significant association (p > 0.05). Then, we conducted haplotype analysis of three positive SNPs (rs3093193, rs12459936 and rs3093144) and IS risk (Table 7). The result demonstrated that the haplotype “GCT” was linked to an increased risk of IS (OR = 1.31, 95% CI = 1.01–1.70, p = 0.043).

Fig. 1
figure 1

Linkage disequilibrium (LD) analysis of five SNPs in CYP4F2 Gene

Table 6 Haplotype frequencies of CYP4F2 SNPs and the association with stroke
Table 7 Haplotype frequencies of CYP4F2 positive SNPs and the association with stroke

In silico analysis

In silico analysis, HaploReg v4.1 was used to assess the function of the selected variants in CYP4F2 gene (Additional file 1: Table S1). The results suggested rs3093203, rs3093193, and rs12459936 were associated with Motifs changed, selected eQTL hits. And rs3093144 was associated with enhancer histone marks, motifs changed, selected eQTL hits. Rs3093110 was related to promoter histone marks, motifs changed, selected eQTL, GRASP QTL hits.

Discussion

In the present study, allele, genotype and haplotype frequencies of five SNPs in the CYP4F2 gene between IS patients and healthy controls were compared and stratified analyses by age and gender were conducted. We found that three SNPs (rs3093193, rs3093144 and rs12459936) in the CYP4F2 were associated with IS risk in a Chinese Han population. Rs3093144 and rs12459936 were associated with IS risk in the overall. For the individuals older than 61 years old, rs3093193 and rs3093144 were related to the increased risk of IS, whereas rs12459936 was related to the decreased risk of IS. In addition, we observed that three SNPs (rs3093193, rs3093144 and rs12459936) were associated with the susceptibility to IS in the male. And five SNPs in the CYP4F2 gene showed strong linkage. These findings suggested that genetic polymorphisms in CYP4F2 may play an important role in the etiology of IS.

Genetic variation is the molecular basis of human genetic diversity. IS is a complex polygenic disease resulting from the genetic factors and environmental risk factors [23]. Approximately 80% of stroke is IS, and the incidence of IS increases with aging. As a member of the CYP450 superfamily, CYP4F2 plays a pivotal role in the metabolism of exogenous and endogenous compounds [24]. It is expressed in the liver, heart, lungs, and kidneys, and is involved in leukotriene B4 (LTB4) and 20--hydroxy eicosane arachidonic acid (20-HETE) metabolism [25]. 20-HETE in vivo can depolarize vascular smooth muscle by blocking Ca2+ activation and K+ channels, resulting in strong vasoconstriction [26]. In hypertensive rat models, increased 20-HETE can lead to oxidative stress and endothelial cell damage, thus increasing the incidence of IS [27]. CYP4F2 is the main synthetase that catalyzes the generation of 20-HETE from arachidonic acid. Recent studies revealed that CYP4F2 was associated with the IS risk. H.-Q. Yan et al. [28] conducted a case-control study to identify the association between the selected SNPs in CYP4F2 gene and the risk of IS. The results showed that the ‘GG’ genotype of CYP4F2 rs2108622 was correlated with an increased risk of IS. In addition, Colàs-Campàs L et al. [29] found rs2108622 ‘AA’ genotype in CYP4F2 gene was significantly associated with a risk of early IS in non-valvular atrial fibrillation patients under vitamin K antagonists treatment. However, study based on this CYP4F2 gene is rare. Hence, our study discussed the relationship between CYP4F2 and IS risk. We found that three SNPs (rs3093193, rs3093144 and rs12459936) in the CYP4F2 were associated with IS risk in a Chinese Han population, suggesting CYP4F2 gene may play an important function in affecting IS.

In addition, it is worth noting that in our results, there were two SNPs (rs3093193 and rs3093144) increased the IS risk and one (rs12459936) decreased the IS risk. However, the haplotype analysis among the three SNPs mentioned above found it was significantly associated with an increased risk of IS (Table 7). For rs12459936, the minor allele ‘T’ when compared to the wild-type allele ‘C’ was protective factor. In turn, the wild-type allele ‘C’ was associated with an increased risk of stroke when compared with the minor allele ‘T’ as reference allele. Thus, based on the positive loci, the haplotype “GCT” was related to elevating risk of IS.

Furthermore, we have compared results with other stroke studies performed in the same population. Lee et al. [30] conducted that a genome-wide association study links small-vessel ischemic stroke to autophagy. And their study focused on Han Chinese in Taiwan. In addition, the study was replicated in an independent Han Chinese population. Imputation analysis also supported the association between three SNPs (rs2594966, rs2594973, rs4684776) in ATG7 gene and stroke-small-vessel occlusion (SVO). When compared to their study, our sample size was not large enough because of strict recruitment criteria. And the study failed to replicate the results within an independent sample. So that’s a part of what we’re going to do next.

Our study also has some potential limitations. The ethnicity of study subjects was limited to the Han Chinese population. Hence, whether our results could apply to other ethnicities is unclear. Furthermore, our current research is fundamental, functional studies are required to understand function of genetic variants and mechanisms underlying this association.

Conclusions

To sum up, we firstly provide new evidence for the association between CYP4F2 variants and IS risk in Han Chinese population, which may support for screening of IS in Han Chinese population and shed light on the mechanism of IS.