Introduction

Coronavirus Disease 2019 (COVID-19) is a worldwide pandemic with a high mortality rate. It is caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) [1, 2]. Patients infected with SARS-CoV-2 exhibit a broad variety of symptoms and disease progressions that are not solely due to their age or other health issues. Because of the wide spectrum of intensity in clinical manifestations, host genetics are likely to have a significant effect on COVID-19 vulnerability and severity [3,4,5,6].

Extensive research has been conducted on the role of host genetic factors in COVID-19 development, and these investigations have identified several gene susceptibility variations, although with various degrees of evidence [7]. The most studied candidate gene polymorphisms for the impact of COVID-19 are interferon-induced transmembrane 3 (IFITM3) and interleukin-6 (IL6) [8,9,10].The IFITM3 protein is essential for adaptive and innate immunity [10]. IFITM3 is expressed in the membranes of the lysosome and endosome and inhibits the enveloped virus from fusing with the membranes of the host. Prior studies on IFITM3 single nucleotide polymorphisms (SNPs) have shown that some alleles may be risk factors that affect the severity of viral infections, such as SARS-CoV-2 [11, 12].

Interleukin 6 (IL6) is a multifaceted cytokine, with both pro-inflammatory and anti-inflammatory properties. It is highly involved in the immune system’s cytokine cascade, and plays a vital role in mounting an immunological response [13]. The IL6 gene is located on chromosome 7 in the 7p21 region. The expression of IL6 is regulated mainly by the 5-prime region of the gene. This region is known to contain multiple single nucleotide polymorphisms (SNPs) that affect the promoter and transcription regions of the IL6 [14]. The most extensively studied SNP was rs1800795 (-147 G/C), which affects IL6 protein levels. The candidate SNP may influence the intensity of inflammatory, autoimmune, and infectious illnesses [15, 16]. Several studies tried to investigate the effect of these SNPs on the IL6 and IFITM3 genes, but the findings were inconclusive [10, 17, 18]. This inconsistency is attributable to ethnic variations and group genetic diversity [19].

Kurds are the world’s largest nation without an independent state, and they inhabit the region of Kurdistan, which is partitioned among four countries [14]. Sulaimani Province in Kurdistan Region of Iraq is home to a large Kurdish Sorani population and a smaller Kurdish Hawrami population. During the COVID-19 pandemic, there were some non-official indications and beliefs among medical staff that the proportion of mortality and COVID-19 severity were lower among the Hawrami population. Therefore, we hypothesised that the Hawrami population might have a different polymorphism in the IFITM3 and IL6 genes compared to the Kurdish Sorani population; thus, this study was conducted on a group from both populations to determine the potential cause of the mild outcome of COVID-19 in the Hawrami Kurdish population.

Materials and methods

Study participants and sampling

In this research, nasopharyngeal swab samples were taken from 98 Kurdish Sorani population members in Sulaimani province, KRG, Iraq. Another 48 samples were taken from the Kurdish Hawrami population. The donors were all unvaccinated, and their ages ranged from 20 to 60. The nasopharyngeal swabs were stored in viral transport medium (VTM) at -70 °C in the COVID-19 molecular diagnostic laboratory of Shahid Tahir.

Amplification of IFITM3 and IL6 gene

The VTM samples were subjected to DNA extraction using an addbio genomic DNA extraction kit according to the manufacturer’s instructions (addbio, South Korea). The PCR amplification reaction was done according to the manufacturer’s instructions using the Add Star Taq master mix PCR kit (addbio, Korea). In brief, the PCR reaction was done by mixing 10 µL of master mix, 5.0 µL of DNA sample, 3 µL of DEPC-H2O, and 1 µL of 10 pmol from each of the forward primer and reverse primer (Table S1). The thermocycler (Bio-Rad, USA) was set up for an initial denaturation phase of 5 min at 95 °C, followed by 40 cycles of denaturation at 94 °C for 30 s, annealing at 59 °C for 30 s, extension at 72 °C for 45 s, and a final extension phase of 3 min at 72 °C. Loading 4.0 L of PCR product on a 1% agarose gel in 1 TBE buffer was used to examine the PCR products. The gel was stained with 7 µL safe gel dye (addbio, Korea). Electrophoresis was run at 120 volts for an hour on the electrophoresis system. The 100 bp DNA ladder is used to show the migration pattern of the PCR amplicons.

Sequencing and genotyping

The PCR products were purified according to the manufacturer’s instructions using the addPrep PCR Purification Kit (addbio, South Korea). The samples were subsequently sequenced using the Sanger technique at a South Korean Macrogen sequencing facility. Using forward and reverse primers, the identification of each nucleotide sequence was validated (Table S1). FinchTV was used to evaluate and analyze the DNA sequence quality. High single-base peaks with high Phred quality scores were identified as homozygote SNPs, whereas overlapping peaks of two nucleotide bases with lower Phred quality scores were identified as heterozygote SNPs. The DNA plus strand was used to identify all SNPs in this investigation (cDNA strands). For the rs1800795 SNP, for instance, guanine (G) is the major frequency allele and cytosine (C) is the minor frequency allele, and so on for other SNPs.

Statistical examination

Hardy-Weinberg equilibrium was computed using the database at wpcalc.com (https://wpcalc.com/en/equilibrium-hardy-weinberg/). Allele frequency, genotype distribution, and haplotype analyses were performed using the SHEsis online database (http://analysis.bio-x.cn/myAnalysis.php) [20]. Version 25.0 of the Statistical Package for the Social Sciences was used for statistical analysis (SPSS, Chicago, IL). The chi-square test was performed to analyze the association between SNP, genotype, and allele in the Sorani and Hawrami groups. The odds ratios (OR) and 95% confidence intervals (CI) were analyzed using logistic regression to assess the role, degree of relationship, and risk of the variations in both groups.

Results

This study was conducted to compare the IFITM3 and IL6 polymorphisms of the Sorani and Hawrami Kurdish groups. Twelve SNPs were genotyped, including two synonymous SNPs (rs12252 c.42 A > G (S14S) and rs11553885, c.165G > A (P55P) and one SNP from the IFITM3 core promoter region near the transcription start point, c. -23 C > T (rs34481144). (Fig. 1A). As well as two missense SNPs in IL6, rs2069830 c.94 C > T (P32S), and rs142759801, c.91 C > A (P30T). These two SNPs also overlapped with the intron of IL6-AS1, an antisense long non-coding RNA (lncRNA) (Fig. 1B). The rs2069857 c.-48 variant is also found in the intron of the IL6-AS1 overlap sequence and the prime untranslated region of IL6. The remaining six SNPs were in the IL6 promoter region near the transcription start point and overlapped with the intron of IL6-AS1. These include the following variants: -190 C > T (rs2069829), -176 G > C (rs2234683), -174 G > C (rs1800795), -111 G > T (rs13447446), and c. -85 T > C (rs527770772), -63 C > G (rs13447445). All SNPs were found to be in Hardy-Weinberg equilibrium. (Table 1) provides a summary of the SNP genotyping results.

Table 1 The genotyping results of IFITM3 and IL6 SNPs between Sorani and Hawrami populations
Fig. 1
figure 1

SNPs representation of the IFTIM3 and IL6 gene (A) A cartoon representation of the seven SNPs in the IFTIM3 gene (B) A cartoon representation of the seven SNPs in the IL6 gene

The frequency of the AA genotype for the rs12252 SNP of IFITM3 was greater in the Sorani group than in the Hawrami group. Nevertheless, the chi-square test revealed no statistically significant association between the Sorani and Hawrami groups (chi2 = 1.346, P = 0.245). The statistical analysis of logistic regression was used to figure out the role and strength of the association between the variants. It showed that there was no significant difference between the Sorani and Hawrami groups for rs12252, with P = 0.310 and odds ratios of 0.671. (Fig. 2A). The genotype combinations of the (AG + GG vs. AA) dominant model were also evaluated using MedCalc biostatistical analysis, which demonstrated a statistically non-significant difference (P = 0.000, OR = 5.212) between the two groups. In this research, the frequency of alleles was also analyzed, and the miner (G) allele was shown to be non-significantly more abundant in the Sorani group (P = 0.341, OR = 1.304). (Table 1). In IFITM3 rs34481144, the Hawrami group had a greater frequency of the CC genotype, while the Sorani group had a higher frequency of the TT genotype, but there was no statistical significance (Fig. 2B).

Fig. 2
figure 2

Overview of genotypes of IFITM3 and IL6 SNPs between the Sorani Kurdish group and Hawrami Kurdish group. (A) ?rs12252 SNP graph shows non-significant association of AA genotype with Hawrami population. (B) rs34481144 SNP graph, the percentage of the CC homozygote is more prevalent in Hawrami population. (C) Graph of Rs1800795 genotype analysis shows a non-significant association with the Hawrami and Sorani groups, with higher percentage of GG genotype in Hawrami population

The Hawrami group exhibited a greater proportion of GG genotypes for the rs1800795 SNP on the IL6 gene, while the Sorani group had a higher proportion of CC genotypes (Fig. 1C). Chi-square analysis showed that rs1800795 didn’t have a statistically significant link to either group (Table 1). To investigate the role and degree of association between the variants, a logistic regression analysis was done. According to the data, the odds ratios were not significant. MedCalc biostatistical analysis was also performed on the genotype combinations of the dominant model (CG + CC vs. GG); the findings were (P = 0.25, OR = 0.718). This indicates that, compared to the GC and CC genotypes, the dominant model of the GG genotype was 0.718 times more common in the Hawrami population. Statistically, this increase in frequency is insignificant.

All patients in both groups were homozygous for the following main alleles: rs11553885 (G), rs2069830 (C), rs142759801 (C), rs2069857 (C), rs2069829 (G), rs2234683 (G), rs13447446 (T), rs527770772 (C), and rs13447445 (C). Because there was no polymorphism in these nucleotides in both groups of the Kurdish population, statistical analysis was not applicable (Table 1).

In this study, haplotype analyses were conducted for all SNPs to identify possible significant haplotypes. As a result, the SHEsis database demonstrated five haplotypes (Table 2S2). No haplotypes had a statistically significant prevalence in either group.

Discussions

IFITM3 has been shown to inhibit many pathogenic viruses from the orthomyxovirus, flavivirus, filovirus, and coronavirus families [21,22,23]. Spike proteins from various coronaviruses, including SARS-Co-2, use distinct methods to infiltrate cells. Polymorphisms in the IFITM3 genes are thought to affect the positioning of IFTIMs in the plasma membrane, which impacts vulnerability to viral infection [19, 21, 22].

The IL6 gene encodes interleukin-6, a protein with both anti- and pro-inflammatory characteristics. According to epidemiological and DNA polymorphism studies, chronic obstructive pulmonary disease and asthma have different degrees of symptoms across individuals [24]. Certain viral diseases have been related to IL6 polymorphisms, including hepatitis B and hepatitis C virus (HCV), as well as influenza and coronaviruses [25,26,27,28,29]. Several investigations have linked polymorphisms in the IL6 gene promoter area to inflammation [15, 30]. Because IL6 is important in the regulation of CD4-T cells, studying SNPs in IL6 may provide insight into the genesis and development of COVID-19 [31].

This study looked at three SNPs in the IFITM3 gene and nine SNPs in the IL6 gene to see if there was a link between these genetic differences and the Sorani and Hawrami groups of Kurds. The goal was to find out why the Hawrami Kurds have a milder reaction to COVID-19.

We hypothesized that the Hawrami population would have a distinct allele as compared to the Kurdish Sorani population in the IFITM3 and IL6 gene because of the reduction in mortality and intensive care unit admissions that we observed in the Hawrami population in relation to COVID-19.

Both the A and G alleles can be found at the IFITM3 rs12252 locus, but the G allele is a minor variant found with much greater frequency among East Asians. This suggests that the consequences of the G allele may be more prominent in the East Asian population (Table S3) [32]. Initial research conducted by Zhang et al. on Chinese patients indicated that those who had the G allele were up to 6.37 times more likely to have severe COVID-19 [33]. A study on Arab ethnicity in Saudi Arabia found that COVID-19 patients with the G allele of rs12252 in IFITM3 had a higher risk of hospitalization and death [11]. However, in our study, there was no discernible difference in the frequency of the G allele in either Kurdish group, and there were also no homozygous GG genotypes. Research on the Iranian population reveals that the AA genotype of rs12252 IFITM3 could aid in the recovery of Iranian patients with COVID-19 [34]. In the current study, the percentage of AA genotype was higher in the Sorani group, which is not consistent with our hypothesis. Despite that, German cohort research could not verify the association between rs12252 IFITM3 polymorphisms and COVID-19 severity [9].

One SNP in the IFITM3 promoter is rs34481144 (Fig. 1A). Reduction of IFITM3 expression, caused by the substitution of the minor T allele for the dominant C allele in rs34481144, which subsequently associates with a decrease in antiviral CD8 + T cells in influenza-infected lung tissue [35]. In this research, a higher rate of risk T alleles was observed in Sorani groups (Table 1). This association, however, was not statistically significant. A meta-analysis of the research demonstrated that there was no statistical association between the rs34481144 gene polymorphism and COVID-19 susceptibility or severity in allele, genotype, and dominant genetic model [36]. However, other research has revealed a link between rs34481144 and a higher likelihood of SARS-CoV-2 hospitalization [10, 18].

Extensive research has been conducted on the rs1800795 SNP of the IL6 promoter gene for a several illnesses across a wide range of ethnic groups and populations in the world; however, the association of the SNP remains contentious due to differences in the pathological condition states and genetic variability among the population. Based on what we found, the GG genotype of the rs1800795 SNP was more common in the Hawrami group, but this was not a statistically significant difference (Fig. 1C). Our results are in line with research in the Kurdish population in Iran, where a correlation between the GG genotype and mild COVID-19 was shown to be statistically insignificant [8]. Furthermore, the GG genotype in the Egyptian community was found to be protective against acute respiratory failure and ICU admittance [37]. Researchers found no statistically significant correlation between the C minor allele of rs1800795 and mortality in the COVID-19 population [38], the later study again support the hypothesis of current research; in which there was no CC genotype identified in the Hawrami population. On the other hand, in the Turkish population, the IL6 rs1800795 polymorphism, GG genotype¸ was found to be significantly linked to the risk of severe COVID-19, which is different from what we found in the Hawrami population [39].

In this investigation, all nine additional SNPs were monomorphic in either Kurdish population; all possessed the dominant allele: rs2069830 (C), rs142759801 (C), rs2069857 (C), rs2069829 (G), rs2234683 (G), rs13447446 (T), rs527770772 (C), and rs13447445 (C). Similar findings were seen in cohort studies on IL6 polymorphism, where alleles for rs13447445, rs2069829, and rs13447446 were monomorphic in Caucasian Americans [40]. One of the limitations of this study is the small sample sizes in both the Hawrami and Sorani Kurdish groups. More research is needed to fully characterize the majority of Iraq’s populations and ethnic groups, as well as to look into a broader range of host genetic polymorphisms that may affect the severity of COVID-19.

Conclusion

The current study was carried out in response to earlier work that recommended doing host genetic analysis on IFITM3 and IL6 promoter polymorphisms in various countries and ethnicities to investigate the influence of the SNPs on the COVID-19 outcome. This study demonstrated that the studied SNPs of IFITM3 and IL6 were statistically not associated with the Hawrami and Sorani populations at the genotype level. However, a lower frequency of miner-risk alleles in rs34481144 of IFITM3 and rs1800795 of IL6 in the Hawrami population may support to some extent the hypothesis of the study.