Background

The etiology of the inflammatory bowel diseases (IBD) has been extensively studied [1], however, the environmental determinants of disease pathogenesis are not fully understood [2, 3]. IBD is widely believed to be associated with industrialization of nations. This hypothesis is supported by the significant geographic variation in IBD with the highest incidence rate of IBD in North America and Europe [4]. Migrant studies have demonstrated that immigrants, and particularly their offspring, from low prevalent regions acquire a similar risk of IBD as the local population [5]. Furthermore, the incidence of IBD is steadily rising in several developing nations as they have become industrialized [6, 7]. Within a country the incidence of IBD has been proposed to be higher in urban versus rural areas [8, 9].

Although numerous studies have investigated the association between urban environment and IBD, findings remain inconsistent. An urban association has been considered as far back as 1963, when Acheson and Nefzger suggested a positive relationship between urban areas and ulcerative colitis (UC) [10]. Many observational studies have subsequently shown an increase in UC and Crohn's disease (CD) incidence in more densely populated areas [8, 1115]. Numerous studies, however, have failed to find an association between urban exposure and IBD [4, 16], while others have shown an inverse association [17]. Establishing whether the risk of IBD is greater in urban environments is important because environmental exposures in urban societies are significantly different than those in rural areas. This information may direct research initiatives on specific environmental risk factors of IBD and establish distribution of IBD burden in society.

A systematic analysis of the association between living in an urban environment and IBD has yet to be conducted. Thus, we performed a systematic review and meta-analysis of all case–control and cohort studies that explored the association between residing in an urban region and IBD in order to determine whether the risk of CD and/or UC was increased in urban as compared to rural areas.

Methods

Search strategy

We conducted a systematic literature search using a predetermined protocol and in accordance with the quality of reporting meta-analyses of observational studies (MOOSE) [18]. We searched two computer-stored databases, Medline (1950-present) and Embase (Excerpta Medica Database; 1980-present) for studies describing the association between urban environment and IBD as of October 2009. The search strategy for Medline and Embase was conducted based on three themes. The first theme, the outcome measure, combined the exploded version of Medical Subject Headings (MESH) “inflammatory bowel disease” or “Colitis, Ulcerative” or “Crohn Disease”. The second theme, the exposure, combined the exploded version of MESH headings “urban population” or “urban health” or “rural population” or “rural health” or “geography”. The third theme combined exploded versions of MESH headings “risk factors” or “risk assessment” or “epidemiology” or “demography”. All the keywords were used to search the titles and abstracts. The search was not limited by language or human subjects to ensure capture of all appropriate papers. Abstracts from the American College of Gastroenterology for 2006, 2007, and 2008, and the American Gastroenterological Association for 2006 and 2007 were reviewed. The reference lists of relevant articles were also reviewed.

Selection criteria

Two reviewers (N.M. and I.S.) identified articles eligible for further review by performing an initial screen of identified abstracts and titles. Articles were eliminated in this initial screen if they were not observational or did not either investigate environmental risk factors for IBD in a case–control study or investigate epidemiology of IBD using incident cases. Studies that did not report original data and duplicated publications were also excluded. Full-text of the remaining articles were retrieved and systematically reviewed. Articles were considered for inclusion in the second screening if they reported a measure of association between urban environment and UC and/or CD. In both study designs, UC and CD were required to be reported separately for inclusion into the systematic review. Studies that did not report adequate information to calculate incidence rate ratio (IRR) or odds ratio (OR) with 95% confidence interval (CI) were excluded. Disagreement between reviewers was resolved by consensus with third party experts (DR and GK).

Data extraction

The outcome variable of interest was defined as the presence of UC and/or CD. The exposure variable of interest was residing in an urban versus rural environment. Urban and rural populations were not consistently defined across all studies. Several studies (n = 10 for UC; n = 11 for CD) did not define an urban and/or rural environment [12, 13, 16, 17, 1931]. The studies that defined this exposure used definitions that varied and some papers stratified by multiple levels of exposure (e.g. urban, semi-urban, and rural) [24, 3236]. A priori we identified studies that defined urban as a population greater than 10,000 [37, 38]. Secondary variables extracted from the manuscripts included: study design (i.e. case–control or cohort); country of origin; publication year; timing of exposure; source of controls for case–control studies; and information on key indicators of study quality, using MOOSE [18].

We extracted reported OR and IRR with 95% (CIs) or data enabling the calculation of these association measures. Both adjusted and unadjusted values were extracted; though, when available, the adjusted estimates were used.

Statistical analysis

Meta-analyses were initially conducted by combining cohort and case–control studies using IRRs, defined as the incidence rate of IBD in urban versus rural populations. Given the low prevalence of IBD, the OR would approximate the IRR under the rare disease assumption. Case–control and cohort studies were then analyzed separately using different measures of association. The IRR was used as measure of association for cohort studies and OR was used for case–control studies. The Test of Heterogeneity was performed using a Q statistic (5% level) and random effects models were used because of the presence of heterogeneity between studies. Stratified analyses were performed to explore factors that may explain heterogeneity between studies. Publication year was a priori stratified into three year categories: 1962–1988; 1989–1998; and 1999–2009. Similarly, stratified analyses and meta-regression were performed based on study design (i.e. case–control or cohort study), region of publication, timing of exposure, and source of controls in case–control studies. Sensitivity analyses were performed to exclude studies that did not include a definition of urban/rural region. Further sensitivity analysis was conducted on studies that used >10,000 people as a definition of urban environment [37, 38]. Papers that stratified by multiple levels of exposure (e.g. urban, semi-urban, and rural) were analyzed as urban versus rural. The possibility of publication bias was assessed using the Begg tests.

Results

Literature search

The search strategy retrieved 6940 unique citations: 2964 from Medline and 3976 from Embase. Of these, 6434 citations were excluded after the first screening based on titles and abstracts, leaving 506 articles for full-text review (Figure 1). The observed agreement between reviewers for eligibility of articles was 97%, corresponding to a kappa statistic of 0.77. Upon full text review of 506 articles, 466 were excluded (see Figure 1 for rationale of exclusions), leaving 40 studies for final inclusion in the systematic review and meta-analysis. Of these 40 studies, 25 investigated the relationship between urban environment and UC and 30 investigated this relationship with CD. Included in our analysis were 7 case–control UC studies, 9 case–control CD studies, 18 cohort UC studies, and 21 cohort CD studies.

Figure 1
figure 1

Literature search results.

Demographic and study quality characteristics

Characteristics of the 18 cohort studies for UC are shown in Table 1[1921, 23, 24, 3235, 3945]. The reported incidence rate ranged from 0.97 to 15.6 per 100,000/year. A definition for urban and/or rural environment was provided for 13 cohort studies. Characteristics of the 7 selected case–control studies for UC are shown in Table 2[8, 12, 17, 2628, 46]. Two of the studies had definitions for urban and/or rural environment. Characteristics of the 21 selected cohort studies for CD are shown in Tables 1[14, 15, 22, 23, 25, 30, 31, 36, 3943, 45, 4752]. The reported incidence rate of CD ranged from 0.51 to 15.6 per 100,000/year. A definition for urban environment was reported in 16 studies. Characteristics of the nine selected case–control studies for CD are shown in Table 2[8, 13, 16, 17, 27, 29, 46, 53]. The studies were based in North America, Australia, Asia and Europe. The association between urban environment and IBD was not the primary outcome of interest in any of the cohort or case–control studies.

Table 1 Characteristics of cohort studies for ulcerative colitis and Crohn’s disease studies
Table 2 Characteristics of case–control studies for ulcerative colitis and Crohn’s disease studies

Urban/rural environment and IBD

The pooled crude IRR for the 25 UC studies was 1.17 (1.03, 1.32) (Figure 2A). For the 18 cohort studies, the pooled crude IRR was 1.19 (1.03, 1.36) (Figure 2B). In 13 of the studies, a positive association between UC and urban environment was found, 7 of which were statistically significant. Heterogeneity across studies was observed (Q statistic, 95.14; P < .001). For the UC case–control studies, the pooled OR for the association between urban environment and UC was 1.06 (0.78, 1.45) (Figure 2C). Three out of seven studies found a positive association, and two were statistically significant. Heterogeneity was observed across studies (Q statistic, 19.32; P = .004). The pooled IRR for the 30 CD studies was 1.42 (1.26, 1.60) (Figure 3A). For the 21 cohort studies, the pooled crude IRR was 1.50 (1.30, 1.72) (Figure 3B). In 20 of the studies, a positive association between CD and urban environment was found, 10 of which were statistically significant. Heterogeneity across cohort studies was observed (Q statistic, 108.56; P < .001). For the 9 CD case–control studies, the pooled OR for the association between urban environment and CD was 1.26 (1.03, 1.53) (Figure 3C). Six out of the seven studies found a positive association, but only three were statistically significant. Heterogeneity was not observed across studies (Q statistic, 11.88; P = .157).

Figure 2
figure 2

Forest plot of the summary effect estimate with 95% CI studies that explored the relationship between urban environment and ulcerative colitis for both cohort and case–control studies (A); cohort studies (B); and case–control studies (C).

Figure 3
figure 3

Forest plot of the summary effect estimate with 95% CI studies that explored the relationship between urban environment and Crohn’s disease for both cohort and case–control studies (A); cohort studies (B); and case–control studies (C).

Sensitivity and stratified analyses

After excluding the UC cohort studies that did not define urban and/or rural environment the pooled IRR was 1.18 (1.09, 1.27), which did not differ from the crude pooled IRR (Table 3). The IRR lost significance 1.16 (0.9, 1.5) when we included only studies that defined urban as >10,000 people. Only cohort studies published after 1999 demonstrated an association between urban environment and UC (1.30; 95% CI: 1.10, 1.54). A stratified analysis was similarly performed for region in which the study was based, divided into European and non-European countries. The IRR for Non-European studies was statistically significant with pooled estimate of 1.13 (1.06, 1.22). Meta-regression was not statistically significant for year of publication, region of publication and European countries.

Table 3 Sensitivity and stratified analyses of pooled relative risk for cohort studies

For the UC case–control studies, we failed to find a statistically significant association when stratifying by timing of exposure (Table 4). Meta-regression was performed for both timing of exposure and source of controls and no statistically significant associations were found.

Table 4 Sensitivity and stratified analyses of pooled odds ratio for case–control studies

After excluding CD cohort studies that did not define urban and/or rural environment, the pooled IRR was 1.28 (1.14, 1.43), which was similar to the crude IRR (Table 3). Restricting the urban definition to >10,000 people resulted in a loss of significance with an IRR of 1.38 (0.97, 1.97). The pooled IRRs were statistically significant across all 3 categories of publication year (Table 3). A stratified analysis was similarly performed for region in which the study was based, divided into European and non-European countries. The IRRs for both European and non-European studies were statistically significant with pooled estimates of 1.55 (1.28, 1.88) and 1.39 (1.18, 1.62), respectively. In meta-regression analysis only the European countries variable was statistically significant (Table 3).

For the CD case–control studies, we failed to find a statistically significant association when stratified by timing of exposure (Table 4). Meta-regression was performed for both time of exposure and source of controls and no statistically significant associations were found. The OR for population-based controls was 1.40 (1.13, 1.73) and for hospital/clinic-based controls was 1.09 (0.79, 1.51). Meta-regression was performed for both variables and no statistically significant result was found.

Publication bias

No publication bias was found; the Begg tests were not statistically significant for UC (z = −0.42, P = .675) or CD studies (z = 0.70, P = .487).

Discussion

Urbanization of society is an important risk factor for the development of IBD. This meta-analysis suggests that living in urban environments may increase the risk of developing CD and UC. Though, the strengths of association varied among the 40 studies due to heterogeneity between studies. The association between CD and urban environment persisted across a number of stratified analyses that explored clinical and study quality factors. In contrast, the association between UC and urban environment was weaker and less consistent upon sensitivity analyses. Publication bias was not observed suggesting that the association were likely not an artifact of unpublished studies.

Several theories may explain the increased incidence of IBD in urban societies. The Hygiene Hypothesis proposes that the lack of early childhood exposure to enteric pathogens with improved sanitation in urban cities increases the incidence of IBD [5, 55]. The lack of exposure to enteric pathogens may lead to a greater susceptibility to develop an inappropriate immunologic response upon exposure to new antigens (e.g. gastrointestinal infection) later in life [56]. Other environmental risk factors of IBD that are more predominant in urban societies include smoking, lack of helminths exposure, and antibiotic use [5759]. Additionally, a recent study demonstrated that air pollution in urban cities was associated with IBD in children [60]. Urban occupations such as driving and manufacturing have also been reported as risk factors for IBD [61]. While a cohesive hypothesis that explains the environmental determinants of IBD has not been proven, this meta-analysis supports further research exploring the environmental risk factors of urbanization and IBD.

Heterogeneity was observed in both the CD and UC cohort studies, whereas statistically significant heterogeneity was not observed for the UC and CD case–control studies. This inconsistency was likely due to the large overall study population in the cohort studies, leading to an overpowered heterogeneity statistical test. In contrast, the case–control studies were small in total study population. Alternatively, differences in heterogeneity between studies may be explained by the difference in the study design and by the intrinsic biases associated with cohort and case–control studies that explore environmental risk factors of IBD.

When studies were stratified by region the association between UC and urban environment persisted only in non-European studies, whereas the association between CD and urban environment remained significant in most regions. Regional differences may be due to the fact that studies originated from countries that differed by ethnicity, prevalence of IBD, and rates of IBD susceptibility genes [62, 63]. Alternatively, the small number of studies in each region made appropriate and meaningful inferences challenging. Furthermore, data was lacking for low prevalent regions (i.e. developing world) and thus we could not explore the urban/rural effect in these regions.

We explored whether the year of publication contributed to the heterogeneity observed between studies. For UC, only publications after 1999 were significant after pooling. This time stratified finding may be due to study design differences in more recent publications or the changing pattern of UC diagnosis. In contrast, the urban/rural relationship persisted across all time strata for CD, which suggested a greater strength of association for CD over UC.

The definition of an urban environment was a source of heterogeneity across studies. The studies that clearly defined this exposure included population estimates of the urban and/or rural areas; however, the inconsistency or lack of definitions made study comparisons challenging. We a priori selected the definition of urban as more than 10,000 people because this definition is consistent with census of populations in many Countries (e.g. United Kingdom [37] and Canada [38]). When considering only studies that defined urban as >10,000, the association was no longer significant, which was likely due to the small number of studies with this definition (n = 2 for UC and n = 4 for CD). However, this finding may also reflect that the risk associated with urban society may be driven by greater population size (e.g. living in a metropolis) and/or by population density. Ideally, we would have investigated the potential for a dose response effect to explore whether the risk of IBD increased with increasing population sizes. However, few papers stratified their results by multiple levels of exposure; for example, Tsianos [24] and Ladas [34] stratified their analysis into urban, semi-rural, and rural. Future studies of the urban–rural association should explore the importance of a dose response effect, a threshold value of population size, and population density.

Heterogeneity between studies may have occurred due to the biases associated with cohort and case–control studies. For the case–control studies, a selection bias may have influenced the association if cases were selected by a different mechanism than controls. The controls were grouped into population- and hospital/clinic-based selection categories. For CD, only population-based controls demonstrated a significant association with urban environment; in contrast, the urban–rural association was insignificant in both control populations for UC. Additionally, inconsistency in timing of exposure (e.g. defining urban–rural status at time of diagnosis versus childhood) may have contributed to the heterogeneity between case–control studies. People in rural settings may have less access to health care leading to under diagnosis of IBD. Although the majority of the case–control studies were matched by age and gender, other potential confounders including socioeconomic status and smoking were often not considered. Finally, studies that used administrative databases to identify IBD patients likely introduced misclassification errors. Interpreting the results of meta-analysis should be cautious because pooling data does not address the intrinsic biases of observational studies.

Limitations in our systematic review should be considered. First, the number of studies included in the stratified analyses was small and may have been underpowered. Second, the quality of studies was not always optimal as was shown with the inconsistent definitions of urban/rural and time of exposure. Third, the included studies used population estimates as indicators for urban environment; however, other factors that contribute to an urban setting, such as socioeconomic characteristics, were not considered. Finally, due to the nature of observational studies, a temporal relationship could not be determined. Thus, urban environment cannot be conclusively established as a causal factor in IBD.

Conclusions

In spite of differences in study design and population characteristics, the meta-analysis demonstrated that living in an urban society was positively associated with the development of IBD; though, the consistency and strength of the association was greatest for CD. Additionally, the meta-analysis identified important study limitations and thus, future studies should be properly designed using a standard definition of urban/rural. Furthermore, additional studies are need to evaluate whether the following factors affect the risk of developing IBD: increasing population size (i.e. a dose response effect); a threshold value of population size; the number of people living per unit area (i.e. population density); and the duration and timing of exposure. Finally, the meta-analysis highlights that researchers should continue to explore for environmental differences between urban and rural societies.

Author contributions

G K. Dr. Kaplan participated in conceiving the study idea, developing the study design, interpreting results, and writing of the manuscript. Dr. Kaplan confirms that he has had full access to all the data in the study and had final responsibility for the decision to submit for publication. Dr. Kaplan has seen and approved the final version of the manuscript and has no relevant conflicts of interest. I S S. Dr. Soon participated in conceiving the study and design, acquisition of data, analysis, interpretation of data, drafting manuscript, and statistical analysis. Dr. Soon has seen and approved the final version of the manuscript and has no relevant conflicts of interest. N M. Ms Molodecky participated in conceiving the study concept and design, acquisition of data, analysis and interpretation of data, drafting manuscript, and statistical analysis. Ms Molodecky has seen and approved the final version of the manuscript and has no conflicts of interest. D R. Dr. Rabi participated in conceiving the study study concept and design, analysis and interpretation of data, and critical revision of the manuscript for important intellectual content. Dr. Rabi has seen and approved the final version of the manuscript. Dr. Rabi has no conflicts of interest. W G. Dr. Ghali participated in analysis and interpretation of data, and critical revision of the manuscript for important intellectual content. Dr. Ghali has seen and approved the final version of the manuscriptand has no conflicts of interest. H B. Dr. Barkema participated in analysis and interpretation of data, and critical revision of the manuscript for important intellectual content. Dr. Barkema has seen and approved the final version of the manuscript, and has no conflicts of interest.

Source of funding

This research is supported by The Alberta IBD Consortium, which is funded by an AHFMR Interdisciplinary Team Grant. AHFMR is now Alberta Innovates - Health Solutions.