Background

Rheumatoid arthritis (RA) is a chronic inflammatory disorder, it mostly affects the joints, but it can also reach and affect other organ systems such as skin and the lungs as well. RA is an autoimmune process that is typically triggered by an interaction between a genetic factor and an environmental factor. A person, with a certain gene for an immune protein—like human leukocyte antigen or HLA-DR1 and HLA-DR4, might develop RA [1] after getting exposed to an environmental factor like cigarette smoke or a specific pathogen. RA is known to occur with higher chances in females rather than males with a 3:1 chance. The average age in females for the disease to appear is in the 30 s and 40 s wherein males it is discovered further delayed than females [2].

This work aims to observe the genetic association between six different single-nucleotide polymorphisms (SNPs) and the RA disease. The study applied four statistical genetic models for measuring the association between the SNPs and the disease, which are multiplicative, dominant, recessive and co-dominant. Each model works on a different exposed group and unexposed group in cases and controls, where the data represented in two by two contingency table. The multiplicative model works on the alleles level. The dominant model compares major allele homozygote versus heterozygote and minor allele homozygote, while the recessive compares major allele homozygote and heterozygote versus minor allele homozygote. The co-dominant model compares the major homozygote versus minor allele homozygote or the heterozygote.

Some studies found that micro-ribonucleic acid (miRNA) changes T cell selection and receptor sensitivity [3, 4]; accordingly, miRNA may be associated with autoimmune diseases; also there are studies showed that miRNAs play a role in the progress of RA [5, 6].

This study worked on examining 6 SNPs that are located within six different genes and on different chromosomes. SNP rs2910164 is located on the microRNA146a (MIR146A) gene and on chromosome 5. SNP rs3746444 is located between two genes that are microRNA 499 (MIR499) and microRNA 499a (MIR499A); mir-499 can affect the production of C-reactive protein (CRP) and inflammation in RA [7]. This SNP is located on chromosome 20. The rs12537 is located on myotubularin related protein 3 (MTMR3) and chromosome 22. SNP rs767649 is located on MIR155 host gene (MIR155HG) and on chromosome 21. SNP rs3027898 is located on interleukin 1 receptor-associated kinase 1 (IRAK1) and belongs to the X chromosome. And finally, SNP rs1748033 is located on peptidyl arginine deiminase type 4 (PADI4) that is located on chromosome 1. (PADI4) gene encodes PADI4 enzyme; PADI4 enzyme produces autoantigen citrullinated peptides recognized by anti-cyclic citrullinated peptide (anti-CCP) [8] that increases the risk of being diagnosed of RA disease.

Many studies included these SNPS to explore the linkage between them and rheumatoid arthritis disease such as rs767649 [9], rs2910164 [10,11,12], rs3746444 [7, 10], rs12537 [13], rs3027898 [14, 15], rs1748033 [16], and this research is following the footsteps of these by studying these SNPS as well.

Methods

Molecular genetic methods

DNA was extracted from whole blood using Qia-amplification DNA extraction kit (Qiagen, USA). 200 ul EDTA whole blood was used for extraction of DNA by DNA extraction kit according to the protocol provided with the kit. The extracted DNA samples were subjected to DNA quantitation and purity assessment using the NanoDrop® (ND)-1000 spectrophotometer (NanoDrop Technologies, Inc. Wilmington, USA). Genotyping was performed using real-time PCR with TaqMan allelic discrimination assay (Applied Biosystems, USA). The DNA used is (100 μg); this study is consistent with the declaration of Helsinki. Each participant gave a written consent before experiment. A predesigned primer/probe set for the two genotypes was used (Applied Biosystems, USA). Probes were synthesized with reporter dye FAM or VIC covalently linked at the 5/end and a quencher dye MGB linked to the 3/end of the probe (Applied Biosystems, USA) (rs2910164 (C/G), rs3746444 (T/C), rs12537(C/T), rs767649 (A/T), rs3027898 (A/C) and rs1748033 (C/T)). Real-time PCR was performed using a Rotor gene Q Real-Time PCR System (Qiagen, Valencia, CA, USA) with the following conditions: after a denaturation time of 10 min at 95 °C, 45 cycles at 92 °C for 15 s then 60 °C for 90 s for annealing and extension were carried out and fluorescence was measured at the end of every cycle and at the endpoint.

Statistical methods

The Chi-square test (χ2-test) was used to analyze genotype and allele distributions of cases and controls. Odds ratio (OR) and 95% confidence interval (95% CI) were calculated as well. Hardy–Weinberg equilibrium (HWE) test was applied on the controls. If the HWE p value was less than 0.001 [17, 18], this indicates a deviation in the population from HWE and the corresponding SNP was excluded from the study. The ratio of the males versus females between the cases and controls was analyzed by proportion test using STATA software, and it showed that the ratio is not statistically significant as the p value was equals to 0.068 which passes the defined 0.05 threshold.

The Chi-square test is a formal statistical test used to analyze categorical data to verify the statistical significance of the results. Generally, the lower the χ2 value, the greater the likelihood that there is no significant difference between cases and controls. To be sure that the Chi-square result gives a real statistically significant difference, the p value should be looked up. A low p value indicates a low expectation of finding these results by coincidence, while a high p value means a high probability of finding these results by chance.

In case of p value of 1, it means that the two groups were not different at all. Odds ratio is one of the most popular measures of the strength of association between a disease severity and a biomarker SNP. OR is used to determine the probability of disease severity presence versus the disease severity absence in exposed and unexposed individuals.

Confidence interval (CI) is a formula that shows how to use a sample data to calculate an interval that estimates a point estimate (OR). A large CI marks a low level of precision of the OR, while a narrower CI indicates a reliable OR. When the two values of CI are less than 1, this indicates a protective association. When the two values of CI are greater than 1, this indicates a susceptible association.

The used equations for calculating OR, CI, and Chi-square test are as follows:

For the allelic odds ratio:

$${\text{OR}}_{A} = \frac{{m_{12} m_{21} }}{{m_{11} m_{22} }}$$
(1)

where m11 and m21 indicate allele a in cases and controls, respectively, and m12 and m22 refer to allele A in cases and controls, and the equation is to calculate the OR for allele A.

The following formulas are used for a 95% confidence interval (CI):

$${\text{Upper}}\,{\text{95\% }}\,{\text{CI}} = e^{{\left[ {\ln \left( {{\text{OR}}} \right) + 1.96 \times \sqrt {\left( {\frac{1}{{m_{11} }} + \frac{1}{{m_{12} }} + \frac{1}{{m_{21} }} + \frac{1}{{m_{22} }}} \right)} } \right]}}$$
(2)
$${\text{Lower}}\,{\text{95\% }}\,{\text{CI}} = e^{{\left[ {\ln \left( {{\text{OR}}} \right) - 1.96 \times \sqrt {\left( {\frac{1}{{m_{11} }} + \frac{1}{{m_{12} }} + \frac{1}{{m_{21} }} + \frac{1}{{m_{22} }}} \right)} } \right]}}$$
(3)

Chi-square test formula:

$$\chi^{2} = \mathop \sum \limits_{i} \frac{{\left( {O_{i} - E_{i} } \right)^{2} }}{{E_{i} }}$$
(4)

where Oi represents the observed frequency and Ei represents the expected frequency.

Study population

RA patients were diagnosed by physician investigators and followed the 1987 American College of Rheumatology (ACR) criteria. DAS28 (Disease Activity Score in 28 Joints), which is a validated score for established RA, was used as a measure for disease activity.

The observed controls had no signs of RA, including morning joint stiffness, citrulline antibody, positive rheumatoid factor (RF), or the findings of rheumatoid nodules. Furthermore, the patients with other inflammatory disorders or autoimmune diseases unrelated to RA were not included.

Subjects

The study consists of 52 RA patients, 85% of them are females, and 49 controls that 67% of whom are females. In cases, the mean age of females’ ± standard deviation (SD) was 40 ± 9.6 years. The mean age of males was 37 ± 16.64 years. In the controls, the mean age of females was 40 ± 9.92 years, while the mean age of males was 37.2 ± 10.7 years. The average disease duration of RA in females was 6.63 ± 4.29 years, while the duration of RA in males was 9.06 ± 8.25.

The study was approved by the Ethical Committee of Faculty of Medicine, Cairo University, and an oral and written informed consent was obtained from all participants.

Results

In this study, HWE, minor allele frequency (MAF), and the percentage of individual genotyped for each SNP have been conducted. The minimum accepted genotype percentage was 80%. The markers that were significantly deviated from HWE, where the HWE p value was less than 0.001, were excluded.

The minimum MAF accepted is at 0.05. Table 1 showcases the results of the marker checks. All the SNPs for all individuals were fully genotyped, and just three SNPs passed the marker checks. Table 1 also provides ID, physical position, chromosome number, major allele, and minor allele for each SNP. From Table 1, it can be observed that rs2910164, rs3746444, and rs12537 passed all the marker checks, and accordingly, they are included in the association study. The other excluded SNPs’ results are shown as reference for future studies.

Table 1 Marker checks of the studied SNPs

Table 2 presents the association between the SNPs that passed HWE test and the RA disease; the highlighted cells with the green color present the RA protective association of the examined SNPs, while the purple color indicates the susceptible association of the SNPs to the RA.

Table 2 Chosen RA SNPs analysis

Figure 1 shows a graphical representation of the association results for the included SNPs, while Fig. 2 demonstrates the genotype distribution in RA cases and controls. It is deduced from Table 2 and Fig. 2 that both MIR146A and MTMR3 are significantly associated with RA, while MIR499, MIR499A showed no associativity with the disease. As for the excluded SNPs MIR155HG, IRAK1, and PADI4, Figs. 3 and 4 represent the genotype distribution between patients and control groups and the association analysis, respectively. Table 3 demonstrates the excluded SNPs analysis with different models, with same color scheme meaning used in Table 2.

Fig. 1
figure 1

Association analysis for examined SNPs with RA disease. The horizontal line in each model represents the significance level of the p value (0.05). a Multiplicative model, b dominant model, c recessive model, d co-dominant with heterozygote model, e co-dominant with minor homozygote model

Fig. 2
figure 2

Genotype distribution in RA cases and controls for the included SNPs

Fig. 3
figure 3

Genotype distribution in RA cases and controls for the excluded SNPs

Fig. 4
figure 4

Association analysis for the excluded SNPs. The horizontal line in each model represents the significance level of the p value (0.050). a Multiplicative model, b dominant model, c recessive model, d co-dominant with heterozygote model, e co-dominant with minor homozygote model

Table 3 SNP analysis of the excluded SNPs

Figure 1 shows the associativity of each studied SNP with RA according to all models (multiplicative, dominant, recessive, co-dominant).

As for the excluded SNPs, MIR155HG had a protective association in the recessive and co-dominant minor models, while the SNP IRAK1 had a susceptible association in the dominant and the co-dominant heterozygote, and protective association in the recessive model. Finally, PADI4 had only a susceptible association in the dominant model.

Discussion

This study examined the association with the RA susceptibility in the Egyptian population with 52 cases and 49 controls. There are six SNPs that had been investigated with their distribution of genotypes and alleles.

As shown in Table 2 and Fig. 2, MTMR3 and MIR146A are associated with RA. Firstly, for MIR146A rs2910164 (C:G), the major allele C is protective in the multiplicative model, wherein the dominant model the major homozygote (CC) is also protectively associated. Both co-dominant models, heterozygote and minor homozygote, also verify the protective association. The recessive model, however, didn’t indicate any association. As for MIR499, MIR499A rs3746444 (T:C), the results presented no association with RA in any of the tested models.

Furthermore, MTMR3 rs12537 (C:T) indicates no association with RA in both the multiplicative and co-dominant minor homozygote; however, the major homozygote is susceptible in both the dominant and co-dominant heterozygote models while the recessive model indicated that the minor homozygote is susceptible. At the same time, [13] concluded that rs12537TT was positively associated with disease severity in RA. They identify a novel role of rs12537 in the susceptibility and progression of RA through increased autophagy.

Table 4 demonstrates a summary of previous studies along with this study of the 6 genes included and their associations with RA. This study shows a protective association to RA on MIR146A; on the other hand, other studies showed no association to RA. The other studies were conducted on Greek, Iranian, Polish, Asian, Chinese, and Egyptian populations; they all concluded that there is no association with the disease [24]. This variation might be a result of the difference in sample sizes between the studied populations as shown in Table 4, while both this study and the one in [23] studied Egyptian population. This study included both genders males and females while [23] included females only.

Table 4 Previous studies on RA association

[30] concluded that miRNA-146a (rs2710164) was significantly associated with RA susceptibility. The C allele of miRNA-146a (rs2710164) can be considered to be protective.

As for MIR499, MIR499A [26, 27], these studies were applied on two different Chinese population, and their conclusion was that there is no association with the disease. In a further study on the same SNP rs3746444 conducted on Han Chinese [22], there was no association when tested; however, the finding indicated that CT genotype increases the level of anti-CCP antibody which increases the risk of RA. Other studies showed a variation in associations; for example, in [21] Zhou et. al. studied Caucasian and Asian populations. Their conclusion was that the gene is susceptible in Caucasian, while there was no association in the Asian population. The gene was susceptible to RA in an Egyptian population study [7]. Another study of the Egyptian population, but limited to females only, indicated the association of the gene with RA [23].

miRNA-146a (rs2710164) polymorphism, the GG genotype was significantly higher in RA patients compared to the control subjects and the frequency of the G allele of the miRNA-146a (rs2910164) polymorphism was significantly higher in the RA patients than in the control subjects. The C allele of miRNA-146a (rs2710164) can be considered to be protective, as patients with the CC genotype had decreased deformity frequency and DD [30].

Additionally, [13] results suggest that rs12537 could increase the risk and progression of RA by reducing MTMR3 levels leading to increased autophagy level.

This study has some limitations, including the small sample size and the number of SNPS. The study could be applied on more patients or another bigger population for verification.

Conclusion

In summary, this study examined six SNPs in the Egyptian population; three of them were included in the association study, the rest were excluded on an HWE basis, but its results are shown for references. The MIR146A and MTMR3 polymorphisms showed association with RA. However, it has been demonstrated that MIR499/MIR499A has no role in the disease. Further work needs to be done on a larger sample size on the Egyptian population to confirm the results.