Introduction

Rheumatoid arthritis (RA) is a chronic inflammatory joint disease of unknown etiology, which can result in cartilage and skeletal damage, as well as disability. Although primarily affecting the joints, RA should be regarded as a syndrome encompassing extra-articular manifestations, such as rheumatoid nodules, pulmonary involvement, gastrointestinal diseases, and systemic complications (JS et al. 2016). Inflammatory bowel disease (IBD) is a chronic nonspecific inflammatory disorder primarily affecting the gastrointestinal tract, encompassing Crohn's disease (CD) and ulcerative colitis (UC). Crohn’s disease is a recurrent transmural inflammatory disorder of the gastrointestinal mucosa, which involves nearly any part of the gastrointestinal tract in a non-continuous manner, with the terminal ileum being commonly affected. It is often accompanied by complications such as strictures, abscesses, or fistulas. In contrast, ulcerative colitis is limited to the colon and its surrounding areas (Baumgart and Sandborn 2007). Histologically, UC exhibits superficial inflammation limited to the mucosa and submucosa, with cryptitis and crypt abscesses. Microscopic features of CD include thickening of the submucosal layer, transmural inflammation, fissuring ulcers, and non-caseating granulomas (Khor et al. 2011).

Epidemiological studies have indicated an increased risk of autoimmune and inflammatory diseases in patients with IBD, both in individuals with CD and UC, with a higher susceptibility to RA (Cohen et al. 2008; Halling et al. 2017; Yang et al. 2018; Park et al. 2019). Simultaneously, an experimental study about RA has also demonstrated that RA is prone to inducing alterations in the gastrointestinal microbiota, particularly in the early stages of the disease (Zaiss et al. 2021). However, the relationship between RA and IBD remains contentious due to the absence of large-scale randomized controlled trials. A study conducted in a European population revealed a positive correlation between the genetic risk of RA and an increased risk of overall IBD, including CD and UC (Meisinger and Freuer 2022). Nevertheless, whether this causal association exists in East Asian populations remains unknown.

In this study, we conducted a two-sample Mendelian randomization (MR) to evaluate the potential causal relationship between RA and IBD, including CD and UC, in East Asian populations. The results showed a potential increased susceptibility to CD and a decreased susceptibility to UC in patients with RA, and SNP rs2071475 may play pivotal role in this causal relationship.

Methods

Study design and methods selection

Mendelian randomization (MR) provides a novel approach for causal inference by utilizing genetic variations strongly associated with the exposure factor as instrumental variables (IVs) to infer the causal effects of the exposure on a specific outcome (Davies et al. 2018). Due to the Mendelian inheritance law, which states that alleles are randomly allocated from parents to offspring during gamete formation, genetic variations are not influenced by conventional confounding factors such as environmental influences, socioeconomic factors, or individual behaviors. The temporal relationship between genetic variations and outcomes is therefore plausible. Thus, MR can minimize confounding and reverse causation biases that are commonly encountered in observational studies, offering stronger evidence than observational research (Skrivankova et al. 2021; Davey Smith and Ebrahim 2005). The study design flow chart is shown in Fig. 1.

Fig. 1
figure 1

Directed acyclic graph of the MR framework investigating the causal relationship between RA and IBD. SNPs single nucleotide polymorphisms; RA rheumatoid arthritis; IBD inflammatory bowel disease; CD Crohn's disease; UC ulcerative colitis; MR Mendelian randomization

Study population selection

This study utilized summary-level data from published studies and databases. Ethical approval and the requirement for informed consent from relevant patients was waived. All genetic data were obtained through a meta-analysis of genome-wide association studies (GWAS) and can be accessible at https://gwas.mrcieu.ac.uk/. The dataset for RA included 19,190 samples (3636 cases and 15,554 controls) from East Asian populations (Okada et al. 2014). The genetic data for IBD were obtained from the International IBD Genetics Consortium (IIBDGC) within GWAS, which included 6543 samples (2824 cases and 3719 controls) from the East Asian population (Liu et al. 2015). In addition, the data of CD with 5409 samples (1690 cases and 3719 controls) and UC with 4853 samples (1134 cases and 3719 controls) were also obtained. Additionally, genetic data from another East Asian population were used to validate the applicability of the results for RA. This validation dataset comprised 22,515 samples (4873 cases and 17,642 controls). Detailed information regarding the GWAS summary data from GWAS for RA and IBD, including CD and UC, can be found in Supplementary Table S1.

Genetic instrumental variable selection

The single nucleotide polymorphisms (SNPs) serving as instrumental variables (IVs) for RA were obtained from the Japanese Biobank. RA-associated SNPs were selected based on genome-wide significance threshold (P < 5 × 10−8), indicating substantial correlation with both the SNPs and RA, as well as screening criteria (the linkage disequilibrium R2 < 0.01 and the length between adjacent SNPs < 1000 kb). Palindromic IVs were excluded after data harmonization, as palindromic SNPs have intermediate allele frequencies. Additionally, an F-statistic threshold exceeding 10 was used to exclude genetic variations as potential IVs (Skrivankova et al. 2021; Burgess and Thompson 2011).

Statistical analysis

To a valid MR analysis, three assumptions should be met: (1) The genetic variations considered as IVs should be closely associated with the exposure, (2) the genetic variations designated as IVs should not be associated with any confounding factors, and (3) the genetic variations used as IVs should only influence the risk of the outcome through the exposure (Hemani et al. 2018a).

We primarily employed the inverse variance-weighted (IVW) method to assess the impact of exposure (RA) on IBD as well as CD and UC (Bowden et al. 2018). Additionally, we utilized four supplementary analyses to confirm the results, including weighted mode, weighted median, simple mode, and MR-Egger regression (Bowden et al. 2016, 2015; Burgess et al. 2017). To assess whether the MR analysis adheres to the aforementioned three assumptions, we employed Cochran's Q test to assess heterogeneity among SNPs in the IVW and MR-Egger methods (Pereira et al. 2010). MR-Egger regression was used to evaluate the pleiotropy of the IVs (Bowden et al. 2015). MR-PRESSO test was applied to detect and correct for horizontal pleiotropy and leave-one-out testing was performed to examine the influence of specific SNPs (Verbanck et al. 2018). Odds ratios (ORs) with 95% confidence intervals (CIs) were calculated to quantify the relationship between RA and IBD. We drew scatter plots, funnel plots, etc., to clearly visualize the SNP-related RA and IBD risk.

Statistical analysis was conducted using the “TwoSampleMR” (Hemani et al. 2018b) and “MR-PRESSO” (Verbanck et al. 2018) packages in R version 4.2.2, and p-value < 0.05 was considered statistically significant.

Results

Selection of instrumental variables

We extracted 59 SNPs as IVs from the RA dataset (from the bbj-a-72 project), with a significance level of P < 5 × 10−8. Additionally, we calculated the F-statistic for each SNP, which ranged from 31.61 to 634.53 and all exceeded 10. This indicates that the IVs are unlikely to be influenced by instrumental bias and are in accordance with the first hypothesis. The details including p-values, beta coefficients, standard errors (SEs) and effect allele for the association between SNPs and RA of the selected IVs are provided in Supplementary Table S2. Finally, for different outcome events, including IBD, CD, and UC, we selected 11/11/10 SNPs, respectively, as genetic instruments for the MR analysis. The information regarding the RA-related genetic variants and their effects on IBD, CD, and UC can be found in Tables 1, 2, 3.

Table 1 Characteristic of the RA-related genetic variants and their effects on IBD (11 SNPs)
Table 2 Characteristic of the RA-related genetic variants and their effects on CD (11 SNPs)
Table 3 Characteristic of the RA-related genetic variants and their effects on UC (10 SNPs)

The effect of RA on IBD, CD and UC

According to the IVW analysis, there is no significant causal effect (OR = 1.028; 95% CI: 0.935–1.129; P = 0.567) between genetic liability to RA and IBD in the East Asian population (Table 4). However, in the subsequent analysis focusing on the two subtypes, CD and UC, we observed distinct causal associations. Specifically, RA showed a positive correlation with CD (OR = 1.268; 95% CI: 1.108–1.451; P < 0.001), while a negative correlation (OR = 0.839; 95% CI: 0.710–0.993; P = 0.041) was found between RA and UC (Table 4), and these was visualized in the scatter plot (Fig. 2). This suggests that the presence of RA may serve as a risk factor for CD occurrence in the East Asian population, while acting as a protective factor for UC development. The results obtained from the weighted median method also support these findings.

Table 4 Effect estimates of the associations between RA and IBD in East Asian populations
Fig. 2
figure 2

Scatter plots showing the causal effect of SNPs on RA (bbj–a–72) against the effects on CD(A) and UC(B). SNP single nucleotide polymorphisms; MR Mendelian randomization

Sensitivity analysis

For the stability of the results, the Cochran's Q test indicated no significant heterogeneity under the influence of SNPs for both CD and UC (CD: Q = 14.912, P = 0.135; UC: Q = 13.838, P = 0.128), and were shown in the funnel plot (Fig. 3). The intercept p-values obtained from the MR-Egger method were 0.969 and 0.399 for CD and UC, respectively, both greater than 0.05, indicating the absence of horizontal pleiotropy in the IVs. The results of MR-PRESSO method also supported this conclusion (Table 4). Additionally, the leave-one-out sensitivity analysis was conducted to assess the impact of each SNP on the overall causal estimate. No significant changes in the estimated causal effects were observed when individual SNPs were excluded (Fig. 4).

Fig. 3
figure 3

Funnel plots showing no significant heterogeneity among the SNPs of CD(A) and UC(B). SE standard error

Fig. 4
figure 4

The Forest plot of leave-one-out sensitivity analysis showing the impact of each SNP on the overall causal estimate to CD(A) and UC(B)

To validate our conclusions, we employed genetic data related to RA from another East Asian population (ieu-a-831) for verification. A total of 53 RA-associated SNPs were included as IVs, and the F-statistic was also examined, the detail information can be found in Supplementary Table S3. Due to palindromic with intermediate allele frequencies, one SNP (rs210180) was removed from the relevant MR studies after data harmonization. Information regarding RA-associated genetic variants and their effects on IBD, CD, and UC can be found in Supplementary Table S4–S6. The final IVW analysis yielded similar results, indicating no significant association between RA and overall IBD (OR = 1.102; 95% CI: 0.980–1.239; P = 0.104), a positive correlation (OR = 1.202; 95% CI: 1.047–1.381; P = 0.009) between RA and CD, and a negative correlation (OR = 0.852; 95% CI: 0.741–0.980; P = 0.025) between RA and UC (Table 5). The scatter plot can be found in Fig. S1. Sensitivity analysis also indicated the absence of heterogeneity and horizontal pleiotropy for these SNPs (Figs. S2, S3).

Table 5 Verification of the associations between RA and IBD in East Asian populations

Discussion

As far as we know, this is the first two-sample MR study to comprehensively evaluate the causal relationship between genetic susceptibility to RA on the risk of developing IBD in the East Asian population. By selecting reliable SNPs as instrumental variables (IVs), our findings suggest that genetically predicted RA is significantly associated with an increased incidence of CD and a decreased incidence of UC in the East Asian population.

Previous observational studies have suggested a relationship between RA and IBD, with some prospective studies reporting a higher incidence of RA in patients with IBD (Halling et al. 2017; Yang et al. 2018). Furthermore, one study indicated that RA may potentially induce the occurrence of gut dysbiosis and increase the likelihood of developing IBD (Zaiss et al. 2021). However, these studies have certain limitations. Due to the inherent limitations of observational study designs, current epidemiological research has not adequately assessed the influence of potential confounding variables, such as study duration, variations in different screening programs, environmental exposures, and data collection methods, which could potentially distort the underlying association between RA and IBD. Therefore, drawing a definitive conclusion may be insufficient at this stage.

Mendelian randomization (MR) is a novel approach that utilizes genetic variants as instrumental variables to determine the impact of certain exposures on outcomes (Davies et al. 2018; Skrivankova et al. 2021). Firstly, genetic influences are relatively stable and largely unaffected by environmental factors. Additionally, MR employs stringent quality control criteria and analytical methods, utilizing various models to examine causal effects. Therefore, MR has the potential to overcome limitations associated with traditional observational studies and generate reliable research findings (Davies et al. 2018; Davey Smith and Hemani 2014; Smith and Ebrahim 2003).

Our study found no association between RA and the overall incidence of IBD in the East Asian population. However, interestingly, we observed a higher incidence of CD and a lower incidence of UC among RA patients. It is well known that CD and UC are two subtypes of IBD, both characterized by non-specific inflammation, but they also exhibit some differences in terms of pathology and histology. Further analysis of the genetic variants associated with RA revealed a key SNP, rs2071475. While this SNP showed a strong association with RA (β = 0.35326, P = 5.10152E-38), it exhibited completely opposite effects in CD (SNP-CD: β = 0.156929, P < 0.001) and UC (SNP-UC: β = − 0.191489, P < 0.001), the data can be found in Tables 2 and 3. The reference SNP report for rs2071475 indicates a close association with HLA-DOB (accessible at https://www.ncbi.nlm.nih.gov/snp/rs2071475). A large-scale meta-analysis has identified that HLA-DOB is significantly upregulated in RA (Afroz et al. 2017). Previous researches have indicated that human leukocyte antigen HLA-DOB, which can influence several alleles involved in antigen presentation, plays a crucial role in viral infections (Denzin et al. 2017; Han et al. 2020). Simultaneously, a study involving transcriptome analysis of the colon found that HLA-DOB was significantly upregulated only in patients with CD, while there was no significant change in patients with UC (Yang et al. 2019). And this study suggests that virus-induced autoimmunity may represent a hypothesis for IBD, particularly CD. This provides a plausible explanation for the differential impact of RA on the incidence of CD and UC observed in our study, possibly due to variations in HLA-DOB expression.

Additionally, some studies have found that Toll-like receptor 4 (TLR4) and its associated innate immune pathways, play a more significant role in UC (Yang et al. 2019; Pastille et al. 2021). Furthermore, another case–control study revealed a significant association between TLR4 polymorphisms and the risk of IBD in individuals of Caucasians (Wang et al. 2019). This also explains why in another study conducted on a European population, patients with rheumatoid arthritis exhibited an increased risk of overall IBD, including both CD and UC (Meisinger and Freuer 2022).

One advantage of our study is that, for the first time, we utilized a two-sample MR approach to assess the impact of RA on IBD, including CD and UC in the East Asian population. Additionally, we identified a significant SNP, rs2071475, that plays a crucial role in CD and UC, offering novel insights into their pathogenesis, diagnosis, and treatment, which hold strong clinical significance. Furthermore, compared to traditional observational study designs, the MR approach is less susceptible to confounding factors, providing more reliable evidence. The consistent and robust causal estimates were further supported by sensitivity analyses. One limitation of our study is the lack of specific clinical data, such as age, gender, duration, activity levels, and treatment modalities, related to RA patients, which hinders further subgroup analyses. Therefore, further research is needed to validate the precise causal relationship between RA and IBD.

Conclusion

Overall, this MR study reveals a causal association between RA and IBD, specifically an increased risk of CD and a decreased risk of UC among RA patients in the East Asian population. These findings pave the way for further exploration of the underlying molecular mechanisms, clinical implications, strengthened epidemiological surveillance, and informed public health decision-making.