Introduction

Stroke is the second leading cause of death worldwide and the highest mortality and disability rate in China [1]. Ischemic stroke (IS) is a severely disabling cerebrovascular disease, caused by blockage of an artery in the brain. Being one of the most common types of cerebrovascular events in China, IS accounts for 70% of all strokes [2]. Stroke can happen at any time but is more likely to occur at an older age. Prevalence rates of stroke increase with age [3]. Racial and ethnic differences also influence the progression of stroke [4]. Age-specific stroke rates are higher in men than in women [5]. Hypertension, diabetes, dyslipidaemia, smoking and alcohol consumption are the most common modifiable risk factors for IS in China [6]. Additionally, increasing evidence suggests that genetic factors may play an important role in the occurrence and etiology of IS. Genome-wide association studies (GWAS) have identified several genetic variants associated with IS risk, including GPX7, LBH, ZCCHC10, DENND2A, and NUDT14 [7,8,9]. However, many variants remain to be discovered.

Cytochrome P450 family 4 (CYP4) enzymes are related to fatty acids metabolism, which is responsible for eliminating excess free fatty acids from the body [10]. CYP4 expression is involved in angiogenesis through the production of 20-HETE [11]. Several reports have displayed that CYP4 family genes are associated with multiple cardiovascular diseases by producing 20-HETE or interfering with fatty acid metabolism [12, 13]. CYP4 subfamily V member 2 (CYP4V2) is located on chromosome 4q35 and encodes a protein belonging to the P450 heme-thiolate protein superfamily [14]. CYP4V2 was first discovered in inflammatory macrophages [15], and has been reported to be involved in fatty acid and corticosteroid metabolism [16]. A meta-analysis has demonstrated that CYP4V2 genetic variants are the risk factors for venous thromboembolism [17]. Most stroke-related deaths are caused by thrombotic occlusion of cerebral vessels [18]. However, there are no studies to our knowledge of the effect of CYP4V2 polymorphisms on IS susceptibility.

Here, five polymorphisms in CYP4V2 were selected and genotyped to evaluate their impact on the susceptibility to IS in a Chinese Han population. Moreover, heterogeneity for genetic association defined by age, gender, smoking, drinking status and hypertension was investigated, and the relationship between these selected polymorphisms and clinical index was assessed.

Subjects and methods

Study population

All subjects were recruited from the Second Affiliated Hospital of Hainan Medical University from January 2020 to March 2021, including 575 IS cases (347 males and 228 females, 63.87 ± 10.44 years) and 575 healthy controls (344 males and 231 females, 63.09 ± 7.44 years). IS patients were diagnosed and confirmed through neurological examinations including brain computed tomography (CT) and magnetic resonance imaging (MRI) scans by two independent neurologists. Patients with brain trauma, embolic brain infarction, subarachnoid hemorrhage, other brain disease, cardiovascular diseases, tumors or serious chronic diseases were excluded. Controls were age-, gender-, race-, and geographical- matched and were from the health checkup center of the same hospital during the same period. Controls with a history of stroke, other neurological diseases, cerebrovascular, arterial vascular, and cardiovascular diseases, or inflammatory disorders were excluded. The research protocol complied with the Declaration of Helsinki and was approved by the ethics committee of the Second Affiliated Hospital of Hainan Medical University (NO. LW2021056). All subjects provided written informed consent. Demographic (age, gender, smoking and alcohol consumption) and blood biochemical indicators [total protein, serum uric acid, triglyceride total cholesterol, high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), leukocyte, red blood cell (RBC), hemoglobin, and platelet] were collected from standardized questionnaires and medical records of each subject by trained research staff, respectively. For smoking, participants were classified as nonsmokers (never) and smokers (ever or current). Subjects who smoked one cigarette a day were regarded as current smokers. For drinking, participants were classified as nondrinkers (never) and drinkers (ever or current). Subjects who drank at least 100 g of alcohol a week were considered drinkers. Hypertension was determined as systolic blood pressure (SBP) ≥ 140 mmHg and/or diastolic blood pressure (DBP) ≥ 90 mmHg, or currently receiving anti-hypertensive medication.

Laboratory investigation

Peripheral blood samples (5 mL) were collected from all participants after 12 h of fasting to perform biochemical assays on the second day of hospitalization. Plasma was obtained by centrifugation at 3000× g for 10 min. Total protein, serum uric acid, triglyceride total cholesterol, HDL-C, LDL-C, and hemoglobin were measured by a fully automatic biochemical analyzer (Hitachi 7600). Leukocyte, RBC, and platelet were detected by automatic blood cell analyzer (Beckman DxH800).

Genotyping of CYP4V2 polymorphisms

Hardy–Weinberg equilibrium (HWE) is a general and far-reaching principle in population genetics, which has a wide range of applications. HWE is commonly used to detect genotyping errors in genetic association studies [19]. The genotypes of all single nucleotide polymorphisms (SNPs) were consistent with HWE (p > 0.05), which indicated the subjects had a representative of population. We selected tagSNPs based on HWE > 0.05, minor allele frequency (MAF) > 0.1, min genotype > 75%, and r2 > 0.8 using the e!GRCh37 database. Combined MassARRAY primer design software, MAF > 0.5 and the call rate > 95% among our study population, five functional SNPs in CYP4V2 including rs1398007, rs13146272, rs3736455, rs1053094 and rs56413992 were selected. Bioinformatics tools including SNPinfo Web Server (https://snpinfo.niehs.nih.gov/snpinfo/index.html) and HaploReg v4.1 (https://pubs.broadinstitute.org/mammals/haploreg/haploreg.php) were used to identify the potential functional SNPs in the human CYP4V2 gene. Blood samples (5 mL) were collected in EDTA-containing tubes, and genomic DNA was purified using commercially available DNA extraction kits (GoldMag Co. Ltd, Xi′an, China).

The MassARRAY platform is based on MALDI-TOF (matrix-assisted laser desorption/ionization—time of flight) mass spectrometry in a high-throughput and cost-effective manner [20, 21]. The general principle of the MassARRAY platform is to resolve differences in primer masses due to changes in sequence such as the binding of different terminator nucleotides at the 3’-end of the primer bound adjacent to a variation site. The analytical accuracy of MALDI-TOF MS is quite high and 0.1–0.01% of the determined mass. Genotyping of CYP4V2 polymorphisms was performed using the Agena MassARRAY platform (Agena, San Diego, CA, USA). Primer design (Additional file 1: Table S1) and data management were performed based on corresponding supporting software, including MassARRAY Nano dispenser and Agena Bioscience Typer 4.0 software. Genotyping technology included PCR multiplex assay, shrimp alkaline phosphatase treatment, single base extension, and MALDI-TOF mass. In addition, this study also set double wells for each sample to ensure the accuracy of the results. About 10% of subjects were randomly selected for re-genotyping to verify genotyping quality, and the results were consistent.

Statistical analysis

In order to ensure the accuracy and credibility of the research results, G*power 3.1.9.7 software (https://stats.idre.ucla.edu/other/gpower/) was used to estimate the sample size prior to the study. The specific parameters were set as follows: effect size d = 0.2; α error probability = 0.05; and power (1-β err prob) = 90%. This calculation generated a sample of at least 429 cases and 429 controls. In our study, we recruited 575 cases and 575 controls, which was larger than the total sample size recommended by G*power.

Categorical variables and continuous variables between IS patients and healthy controls were compared using the chi-square and Student’s t tests, respectively. HWE for genotype frequencies was tested by χ2 test. Logistic regression analysis with adjustment for age, sex, smoking, and drinking was used to calculate odds ratios (ORs) and 95% confidence intervals (CIs) to evaluate the relationship of CYP4V2 variants to IS susceptibility. False-positive report probability (FPRP) analysis was applied to assess the noteworthy relationship of the significant findings. The threshold of FPRP was set at 0.2 for the significant relationship under investigation. Multifactor dimensionality reduction (MDR) was used to assess the optimal interaction effect of CYP4V2 polymorphisms on IS susceptibility. Analysis of variance (ANOVA) was performed to analyze the correlation of CYP4V2 variants with clinical features of IS patients and healthy controls. Data analysis was performed using IBM SPSS v18.0 software (Chicago, IL, USA). A two-tailed p value < 0.05 indicated statistical significance, and a Bonferroni-corrected p < 0.05/5 was considered significant.

Results

Characteristics of participants

Basic features of subjects were shown in Table 1. The participants consisted of 575 IS cases and 575 controls. No statistically significant differences in the distribution of age (p = 0.146), gender (p = 0.857), smoking (p = 0.813), and alcohol consumption (p = 1.000) were found. Nevertheless, differences in clinical biochemical indexes including total protein, serum uric acid, total cholesterol, HDL-C, LDL-C, leukocyte, RBC, hemoglobin, and platelet were observed between IS cases and controls (p < 0.001).

Table 1 Characteristics of IS patients and controls

Association of CYP4V2 SNPs with IS susceptibility

Five SNPs (rs1398007, rs13146272, rs3736455, rs1053094 and rs56413992) in CYP4V2 were selected and were consistent with HWE in the control group (p > 0.05, Additional file 1: Table S2). The MAFs of all SNPs in patients and controls were > 5%, suggesting that the subjects had a representative of population. The results of genotyping displayed that genotyping success rate of each SNP was > 99.5%. The allele frequencies of these CYP4V2 SNPs were not significantly different between cases and controls (p > 0.05).

Additional file 1: Table S3 showed the potential functions and the MAFs of these polymorphisms in different populations. Through HaploReg annotation, these variants were found to be related to the regulation of promoter and/or enhancer histone marks, DNase, proteins bound, motifs changed, GRASP quantitative trait locus (QTL) hits and/or selected eQTL hits. The SNPinfo web server database displayed that rs1398007 might be a transcription factor binding sites (TFBS), and rs13146272 and rs3736455 might be associated with splicing. Moreover, rs1053094 and rs56413992 located in the 3’-UTR region of the CYP4V2 gene might be related to the binding of miRNA.

The association of CYP4V2 SNPs with IS susceptibility was assessed (Table 2). CYP4V2 rs1398007 might be a risk factor for IS under the codominant (OR = 1.34, 95% CI 1.05–1.71, p = 0.009) model. The significant association between CYP4V2 rs1398007 and IS risk still existed after Bonferroni correction (p < 0.05/5).

Table 2 Association between CYP4V2 polymorphisms and IS risk

Stratification analysis for the genetic correlation

The contribution of confounding factors (age, gender, smoking and drinking status) to the genetic association between CYP4V2 polymorphisms and IS susceptibility was evaluated (Table 3). Stratified analysis by age, comparable risk effect against IS was observed for rs1398007 under the codominant (OR = 1.64, 95% CI 1.06–2.56, p = 0.028) model in subjects aged ≤ 60 years.

Table 3 Association between CYP4V2 polymorphisms and IS risk according to the stratification analysis

In the stratified analysis by gender, rs1398007 (OR = 1.37, 95% CI 1.01–1.88, p = 0.040), rs1053094 (codominant: OR = 1.80, 95% CI 1.08–3.00, p = 0.029; dominant OR = 1.40, 95% CI 1.03–1.91, p = 0.030; log-additive: OR = 1.34, 95% CI 1.06–1.68, p = 0.014) and rs56413992 (log-additive: OR = 1.33, 95% CI 1.01–1.75, p = 0.045) were observed to be significant genetic risk variants for IS in males. In addition, rs1398007 was also associated with the development of IS (OR = 1.50, 95% CI 1.03–2.17, p = 0.033) among females.

In the stratified analysis by smoking and drinking status, we found that rs1398007 was associated with increased susceptibility to IS among non-smokers (OR = 1.51, 95% CI 1.06–2.14, p = 0.010) or non-drinkers (codominant: OR = 1.66, 95% CI 1.17–2.37, p = 0.004; dominant: OR = 1.45, 95% CI 1.03–2.03, p = 0.026), respectively. However, no significant correlation of these polymorphisms with IS risk in smokers and drinkers was found.

Association between CYP4V2 rs1398007 and IS patients with hypertension

The relationship between CYP4V2 SNPs and IS risk in IS patients with or without hypertension compared with healthy controls was also assessed (Table 4). Compared with healthy controls, CYP4V2 rs1398007 was associated with an increased risk for IS patients with hypertension (OR = 1.50, 95% CI 1.14–1.96, p = 0.008; OR = 1.40, 95% CI 1.08–1.82, p = 0.010), while it exerted a protective effect on IS patients without hypertension (OR = 0.37, 95% CI 0.15–0.88, p = 0.031). Genotype frequencies between IS patients with hypertension and IS patients without hypertension were compared, CYP4V2 rs1398007 (codominant: OR = 2.63, 95% CI 1.05–6.61, p = 0.035; dominant OR = 1.49, 95% CI 1.04–2.13, p = 0.030; log-additive: OR = 1.48, 95% CI 1.09–2.02, p = 0.011) was observed to be a genetic risk variant for IS patients with hypertension.

Table 4 Association between CYP4V2 rs1398007 and IS patients with and without hypertension

FPRP analysis for the association of CYP4V2 SNPs with IS susceptibility

FPRP analysis was performed to interrogate whether the important findings were worthy of attention (Table 5). At a prior probability level of 0.1, the significant relationship between rs1398007 and IS susceptibility (FPRP = 0.170) was still noteworthy in the overall analysis. The significant finding for rs1053094 (FPRP = 0.108) remained noteworthy among males. Moreover, the relationship between rs1398007 and IS risk in non-smokers (FPRP = 0.164) or non-drinkers (FPRP = 0.053) was also significant at the prior probability level of 0.1. This significant relationship between rs1398007 and the risk of IS patients with hypertension was still noteworthy (vs. controls: FPRP = 0.026 and FPRP = 0.134; vs. patients without hypertension: FPRP = 0.111).

Table 5 False-positive report probability values for the associations between CYP4V2 polymorphisms and IS susceptibility

MDR analysis for SNP-SNP interaction

The interactions of these SNPs in CYP4V2 were assessed using MDR analysis, and the results were demonstrated in Table 6. CYP4V2 rs1398007 was the best single-locus model for predicting IS susceptibility (testing accuracy, 0.5417; p = 0.0046; cross-validation consistency: 10/10). The best multi-locus model was the two-locus model, the combination of rs13146272 and rs3736455, with the highest testing accuracy (0.5139). As shown in Fig. 1, the dendrogram and Fruchterman-Reingold plot displayed the interactions between these SNPs. The entropy patterns summarized the main and/or interaction effect of each paired attribute combination. The strongest interaction was the interaction between rs13146272 and rs3736455 with the information gain value of 0.40%.

Table 6 MDR analysis for SNP–SNP interaction in CYP4V2 with IS susceptibility
Fig. 1
figure 1

Dendrogram (left) and the Fruchterman-Reingold (right) for the interactions between these SNPs. Positive percent entropy indicates synergy whereas the negative percent indicates redundancy. Orange line indicated positive interaction, green and blue color indicated weak interactions

The relationship of CYP4V2 SNPs with clinical characteristics among IS patients/healthy controls

The relationship of CYP4V2 SNPs with clinical characteristics among IS patients/healthy controls was assessed (Table 7). The genotypes of rs1398007 (p = 0.006) and rs1053094 (p = 0.044) were associated with the levels of HDL-C among healthy controls. Moreover, the genotypes of rs13146272 was related with the levels of RBC in IS patients.

Table 7 Association of clinical characteristics with different genotypes of CYP4V2 polymorphisms among IS patients

Discussion

In the present study, CYP4V2 rs1398007 might be a risk factor for IS after Bonferroni correction. Specially, confounding factors (age, gender, smoking, and drinking status) might affect the association of rs1398007 with IS susceptibility. Moreover, rs1053094 and rs56413992 were also observed to be significant genetic risk variants for IS occurrence among males. MDR analysis showed that the best multi- locus model was the two-locus model, the combination of rs13146272 and rs3736455 with the strongest interaction effect (information gain values of 0.40%). Furthermore, genotypes of rs1398007 (p = 0.006) and rs1053094 (p = 0.044) were related to the levels of HDL-C among healthy controls. Our results first revealed the relationship between CYP4V2 variants and IS occurrence in the Chinese Han population.

In the study, five SNPs in CYP4V2 were selected based on MAF > 0.5, HWE > 0.5 and call rate > 95%. The Hapmap project takes SNPs with MAF > 0.05 as the primary research target [22]. MAF is widely used in GWAS of complex diseases. In association studies, a smaller MAF will reduce statistical power, resulting in false negative results. HWE test is commonly used to detect genotyping errors in genetic association studies [19]. p > 0.05 indicates that the study population has reached a genetic balance, that is, the data of this population investigation is credible. The success rate of genotyping often reflects the quality of DNA samples and the applicability of genotyping technology, which affects the reliability of data analysis results to a certain extent. We chose SNPs with call rate greater than 95% to eliminate low-quality SNPs. In this study, SNP selection based on these SNP criteria is helpful to improve the reliability of the analysis results and to reduce the false negative/positive rate.

IS is a complicated heterogeneous multifactorial and multi-gene disease [23]. Several researches have displayed that genetic polymorphisms in CYP4 family genes such as CYP4A11 and CYP4F2 are related to the susceptibility to cardiovascular diseases [24, 25]. Variants in CYP4V2 are related to the occurrence of deep vein thrombosis and Bietti corneoretinal crystalline dystrophy [26, 27]. However, there are no reports on the association of CYP4V2 variants with IS susceptibility. Our results first displayed that CYP4V2 rs1398007 might be a risk factor for IS after Bonferroni correction in the Chinese Han population. The mutant allele frequency of rs1398007 was quite different between different ethnic groups. The MAF of rs1398007 in Asian populations (0.29) was similar to that in African populations (0.23) and lower than that in American (0.44) and European (0.53) populations, according to the HaploReg database. These results suggested that the association of rs1398007 with IS susceptibility might be related to racial differences, and our findings need to be confirmed in various populations. In addition, there is no report on the functional mechanism of rs1398007. Based on the HaploReg database, rs1398007 might be involved in the regulation of promoter histone marks, DNAse, proteins binding, changed motifs and selected eQTL hits. Further functional assays are needed to investigate the underlying function and mechanism of rs1398007 in IS progression.

Age and sex are key factors in IS pathology [28]. The incidence of ischemic vascular disease increases with age. Compared with younger patients, elderly patients with stroke have a higher mortality rate and a worse quality of life [29, 30]. Gender also affects the prevalence and outcomes of stroke, and males have a higher incidence of stroke than females throughout most of the lifespan [31, 32]. In the stratification analysis by age, rs1398007 was correlated with increased IS risk in subjects aged ≤ 60 years, but not in subjects aged > 60 years. Stratified by sex, rs1398007, rs1053094 and rs56413992 were associated with higher IS susceptibility among males, but not among females. These results demonstrated that the effect of CYP4V2 genetic variants on IS occurrence was age- and gender-specific.

Studies have found that cigarette smoking may increase IS risk through the content of oxides of nitrogen, free radicals and other toxic substances [33]. Additionally, a previous study has reported that light and moderate drinking is only inversely related to IS risk, while heavy alcohol consumption increases the occurrence of all stroke types [34]. Therefore, we explored the heterogeneity of smoking and drinking on the relationship between CYP4V2 variants and IS risk. When stratified by smoking and drinking status, we found that rs1398007 was associated with increased susceptibility to IS among non-smokers or non-drinkers. Hypertension is positively associated with the risk of IS [35]. In our study, rs1398007 was also found to be associated with increased risk of IS patients with hypertension. Due to the moderate sample size after stratification, it is necessary to further verify our results in a larger sample size.

HDL-C is also considered to be a risk factor for IS [36]. We found that the genotypes of rs1398007 and rs1053094 were associated with the levels of HDL-C among healthy controls, but not in IS patients, which might be caused by abnormal HDL-C levels in IS patients. The results revealed that CYP4V2 polymorphisms might have a potential impact on serum HDL-C concentration. Nonetheless, more functional studies are necessary.

Several potential limitations are unavoidable in our study. First, the study population was a Chinese Han population from the same hospital. Therefore, the inherent selection bias was inevitable, and our results need to be further evaluated in other ethnic groups. Second, we analyzed only five SNPs in the CYP4V2 gene, which may not represent the whole gene, and the functional roles of CYP4V2 polymorphisms in the pathogenesis of IS were not assessed. Third, a limitation is insufficient information about subtypes of stroke. It will be interesting to study whether CYP4V2 rs1398007 is related to other stroke subtypes.

These findings may deepen our understanding of CYP4V2 in the occurrence and development of IS. Our finding increased our knowledge regarding the effect of CYP4V2 gene on the process of IS, and also provided some data for further exploring the relationship between CYP4V2 and IS risk in different populations, which will help to establish new warning and treatment methods for IS in future studies. In subsequent studies, we will further explore the functions of these SNPs based on this study, in order to provide new theoretical basis and targets for the diagnosis and treatment of IS.

Conclusion

In conclusion, our results first provided evidence that CYP4V2 rs1398007 might be a risk factor for IS after Bonferroni correction in the Chinese Han population, which provides instructive clues for CYP4V2 polymorphisms in the pathogenesis of IS. However, the potential contribution in other populations remains to be determined.