Background

External factors that modify tissue homeostasis, such as microorganism infection, tissue injury and exposure to contaminants may induce inflammatory processes [1]. Chronic inflammation is associated with the appearance of malignant cells and these cells’ proliferation, invasion and metastasis processes [2].

The inflammatory response may be modulated through gene expression variations due to structural genetic polymorphisms or regulatory regions. Therefore, an individual genetic composition may influence not only the inflammatory response but also susceptibility to cancer development [3, 4].

Recently, many studies have tested the association between different forms of cancer and functional variants of genes responsible for the inflammatory process [58], such as interleukin 1A (IL1A), interleukin 4 (IL4), nuclear factor kappa B1 (NFKB1) and protease-activated receptor 1 (PAR1) [912].

The gene for the pro-inflammatory cytokine IL1A presents an insertion/deletion (INDEL) polymorphism (3′UTR indel TTCA, rs3783553) in a binding site for the microRNAs miR-122 and miR-378 that modifies the connection between these microRNAs. The insertion (Ins) allele is related to higher gene expression [13]. Different investigations have demonstrated that individuals homozygous for the insertion (Ins/Ins) are less susceptible to developing gastric cancer, nasopharyngeal cancer, hepatocellular carcinoma and cervical cancer [1316].

Interleukin-4 (IL4), which is an anti-inflammatory cytokine gene, presents a VNTR polymorphism (intron 3 VNTR—70 bp, rs79071878) located on the third intron. Different alleles for this polymorphism modulate gene expression. The allele with two repeats (A2) is related to higher gene expression, while the allele with three repeats (A3) is related to lower expression [17, 18]. A homozygous genotype for higher IL4 expression (A2/A2) is associated with a worse prognosis in bladder cancer patients and with oral and pharyngeal cancers [19, 20].

Transcription factor NFKB1 plays an important role in the inflammatory process, and it can influence cancer development and aggressiveness, increasing tumour angiogenesis and repressing the immune response [21]. The NFKB1 gene carries an INDEL polymorphism in the promoter region (−94 indel ATTG, rs28362491) that exerts a regulatory effect on gene expression. The insertion allele is associated with an increase in promoter activity and protein synthesis [22].

Several studies on the association between rs28362491 and the risk of developing cancer show conflicting results [9, 2325]. In a recent meta-analysis of 21 case–control studies, Yang et al. [21] showed that the insertion allele for this polymorphism is significantly associated with a risk of developing oral, prostate and ovary cancers. The analyses also reveal similar results in Asian populations, but not in European populations, which suggests ethnic variations in predisposition to different types of cancer.

The receptor PAR1 is a member of the superfamily of G protein-coupled membrane receptors [26]. These receptors regulate processes ranging from vascular integrity to systemic inflammation. Activation of the PAR1 receptor in epithelial cells, macrophages and endothelial cells promotes the release of pro-inflammatory mediators, such as TNF, IL1β, IL2, IL6, CXCL8 and CCL2 [27]. The PAR1 gene features an INDEL polymorphism (−506 indel—13 bp, rs11267092) located in the promoter region [28]. The deletion allele (Del) is related to a better prognosis in breast, stomach and oesophagus cancers [2931].

The polymorphisms described above exhibit common features. (1) They are functional polymorphisms that alter the expression of genes that participate in and for metabolic pathways associated with carcinogenesis. (2) Such genes are associated with different types of cancer with high incidence in the Brazilian population, especially prostate, stomach, oesophagus, breast and cervical cancers [32]. (3) The genes vary in populations with different ethnic and geographic origins [33]. (4) Information on allele distributions in Native Americans and admixed populations, such as the Brazilian population, is limited. (5) From a technical perspective, all of the investigated polymorphisms are small INDELs; therefore, genotyping may be performed using a single PCR step followed by capillary electrophoresis, which is an accessible and low-cost laboratory technique.

The Brazilian population is one of the most heterogeneous populations worldwide that is formed by an admixture of Native Americans, Europeans and Africans. This heterogeneity has been well-documented in several genetic investigations [3439]. The admixture process occurred through different means between the Brazilian geographic regions. Therefore, the Native American contribution is more pronounced in Northern Brazil; the African contribution is more elevated in Northeast Brazil; and the South features a European predominance with little Native American and African influence [39].

We must know the distribution of allelic frequencies of these polymorphisms [IL-1A (rs3783553), IL4 (rs79071878), NFKB1 (rs28362491) and PAR1 (rs11267092)] for association studies on cancer, data for which is limited on Brazilian populations. Thus, the aim of this study is to characterize allelic distributions in a representative sample of the Brazilian population from all geographic regions of the country.

Results

The allele and genotype frequencies of the investigated polymorphisms (rs3783553, rs79071878, rs28362491 and rs11267092) in samples from the five Brazilian geographic regions and in the parental populations are presented in Table 1. The genotypic distributions of the markers exhibited a Hardy–Weinberg equilibrium in the investigated populations, except for the NFKB1 gene marker in the European sample (p = 0.003).

Table 1 Distribution of the polymorphisms in IL-1A, IL4, NFKB1 and PAR1 genes in Brazilian regions and parental populations

When we compared allelic distributions in parental populations, all markers exhibited a significantly different allele frequency between the three groups, and the Native American populations exhibited more differences from the continental populations (Table 2). Among the four markers, the average frequency difference (value δ) between the Native Americans and Africans was 34 %; between the Native Americans and Europeans was 37 %; and between the Europeans and Africans was 20 %. When we considered each marker individually, a higher average frequency difference was observed between the continents for IL1A (40 %) followed by IL4 (34 %), PAR1 (30 %) and NFKB1 (17 %).

Table 2 Comparison of the genotypic distribution of polymorphisms in IL1A, IL4, NFKB1 and PAR1 genes between the studied populations

For the geographic region comparisons (Table 2), rs11267092 (PAR1 gene) showed no significant difference between the Brazilian regions. The distributions of the other three polymorphisms (rs3783553, rs79071878 and rs28362491) were statistically similar between the northeast, south and southeast regions.

The analyses showed statistically significant differences in the rs3783553, rs79071878 and rs28362491 polymorphism distributions between the northern population and the populations in the other regions, except the rs3783553 polymorphism in the midwest region (p = 0.543). This polymorphism exhibited a significantly different distribution in the midwest population compared with the northeast, south and southeast regions.

All polymorphisms investigated here have been previously described as associated with a predisposition to some form of cancer [14, 15, 21, 24, 25], as well as compared with the prognosis of the disease [19, 2931]. We proposed a different approach to analysing the population data. Under this new approach, the proportion of individuals who are allele carriers are considered at risk for cancer or worse prognosis for cancer (referred to as potentially deleterious alleles [40]), including the deletion allele for rs3783553, allele A2 for rs79071878, the insertion allele for rs28362491, and the insertion allele for rs11267092.

Based on this analysis, the proportion of potentially deleterious alleles is higher in the Native American population (50 %) than the African (44 %) and European (35 %) populations. Among Brazilians, the proportion of potentially deleterious alleles is 40 %. Likewise, the proportion of individuals who are carriers of six or more alleles associated with cancer is greater among the Native Americans (8.6 %) than the Africans (5.2 %) and Europeans (1.9 %), and the proportion of Brazilians (4.2 %) is intermediate among the ancestral population values. The proportion of carriers is not evenly distributed between the geographic regions. The proportion is smaller in the Northeast Brazil population (2.7 %), intermediate in Midwest and South Brazil populations (mean = 3.4 %) and more elevated in the North (5.0 %) and Southeast Brazil populations (6.5 %).

Discussion

This is the most comprehensive study on the rs3783553, rs79071878, rs28362491 and rs11267092 polymorphism variations in Brazil and among its ancestral populations. This is also the first study to describe the variability of these markers in Native American populations.

We evaluated the distribution of allele and genotype frequencies for these four markers among the pairs of ancestral and Brazilian populations using the Chi-square test with the due statistical corrections (false discovery rate) to avoid spurious correlations. This test is adequate to compare two or more populations for a qualitative variable [41]. Furthermore, our sample size is representative of the populations studied here (928 Brazilians, 222 Native Americans, 211 Africans and 268 Europeans) and adequate for using INDEL-type polymorphisms, which present a low mutation rate compared with other types of polymorphisms [42].

The data from the ancestral populations investigated here reveal considerable heterogeneity between continental populations. All investigated markers present δ values greater than 30 %, except for the NFKB1 gene polymorphism. In this set of markers, the Native American populations were the most differentiated among the investigated continental populations.

Given the differences in the observed allelic distribution between the parental populations, we tested the hypothesis that these differences were due to population sampling. Therefore, we used data published in the 1000 Genomes project (Table 3). We verified that the cited frequency for the IL1A, NFKB1 and PAR1 markers (the three markers for which data were available) in the African and European populations are similar to the frequency observed herein. Thus, we discard the hypothesis and assume that our samples are representative of the European, African and Native American populations.

Table 3 Frequencies of the deletion allele in the polymorphisms of the IL1A, IL4, NFKB1 and PAR1 genes in Brazil and in other world populations

Despite the frequency differences between the parental populations and the intense and heterogeneous process of admixture that formed the current Brazilian population, the allelic distribution is relatively homogeneous throughout most of the country. In general, only the northern population exhibits an allele distribution that significantly differs from the other geographic regions for three of the four investigated markers.

We understand that these differences may be explained by a greater contribution of Native Americans to the northern populations. Previous estimates using different types of genetic markers show that the greatest Native American genetic contribution among Brazilians occurs in North populations [36, 38, 39, 43]. Moreover, data generated in the present work show that Native Americans form the most differentiated group of all the continental populations that form the Brazilian population. Therefore, we believe that, conjunctively, these two factors may explain the observed differentiation in the Northern Brazil population.

Our analyses involving potentially deleterious alleles demonstrate that the Native American population presents a higher proportion (50 %) of these alleles compared with the other parental populations (44 and 35 % for African and European, respectively). This analysis holds when considering the proportion of carriers with six or more (among eight possible) potentially deleterious alleles.

However, we cannot evaluate how this genetic composition may be associated with a higher incidence of cancer in this population because Native American populations form traditional communities that reside in outlying areas of urban centres. Epidemiological data are not available on the incidence of different types of cancer among these populations.

Despite the absence of epidemiological data, recent studies demonstrate that different forms of cancer are associated with a higher (or lower) contribution from Native American ancestry in admixed populations from South America. The investigations were associated with acute lymphoblastic leukaemia [44], breast cancer [45] and gastric cancer [4547].

Conclusion

In summary, our study shows that the allelic distribution of the IL-1A (rs3783553), IL4 (rs79071878), NFKB1 (rs28362491) and PAR1 (rs11267092) gene polymorphisms differs between European, African and Native American populations. Further, the same heterogeneity is not observed between regional populations in Brazil, except for the northern region, which significantly differs from the other four Brazilian regions.

Moreover, the results show that the Native American population includes a greater proportion of carriers with six or more alleles associated with cancer, which suggests that this population may have a higher risk of developing (or worse prognosis for) diseases associated with these alleles. The presented genetic data may be useful for future studies on the association between these polymorphisms and cancer in these populations.

Methods

Study population

The study population consists of 928 non-related and cancer-free adult individuals, recruited in ten Brazilian Federal Units, between the years of 2009 and 2010. In the present work, 180 individuals (90 male and 90 female) residing in the states of Pará (60), Amazonas (60) and Rondônia (60) represented the North region; 187 individuals (107 male and 80 female) residing in the states of Ceará (135) and Pernambuco (52) represented the Northeast region; 186 individuals (95 male and 91 female) residing in the states of Goiás (101), Mato Grosso do Sul (49) and Distrito Federal (36) represented the Midwest region; 184 individuals (90 male and 94 female) residing in the state of São Paulo represented the Southeast region; and 191 individuals (96 male and 95 female) residing in the state of Rio Grande do Sul represented the South region. Additional details may be found in a previous study [48].

In addition, we investigated a sample of 701 individuals representative of the main ethnic groups that originated the Brazilian population: 222 Native Americans from nine tribes of the Brazilian Amazon (Tiriyó, Waiãpi, Zoé, Urubu-Kaapor, Awa-Guajá, Parakanã, Wai Wai, Gavião, Zoró) [49]; 211 Africans (Angola, Mozambique, Congo Republic, Cameroon, Ivory Coast) [50]; and 268 Europeans (all from Portugal and Spain) that have been commonly studied in previous investigations on Brazilian ancestry [37, 39].

Informed consent for DNA analysis was obtained from healthy individuals for research purposes. Ethics approval was obtained from the local committee of Instituto de Ciências da Saúde, Universidade Federal do Pará.

Genotyping of investigated polymorphisms

Four polymorphisms were genotyped by a single multiplex reaction with Master Mix QIAGEN® Multiplex PCR kit (Qiagen, Hilden, Germany) and the primers described in Table 4.

Table 4 Characteristics of the investigated markers

Multiplex PCR products were separated and analyzed by capillary electrophoresis on the ABI 3130 Genetic Analyzer instrument (Applied Biosystems), using GS-500 LIX as pattern of molecular weight (Applied Biosystems), G5 virtual filter matrix and POP7 (Applied Biosystems). After data collect, samples were analyzed in GeneMapper®3.7 software (Applied Biosystems).

Statistical analyses

Allelic and genotypic frequencies were obtained by direct counting and δ value (delta value) was determined by substracting values of allele frequency in the studied parental populations, as described by Santos et al. [39]. Hardy–Weinberg equilibrium deviations were tested in Arlequin 3.1 software [51]. Differences in genotypic frequencies between Brazilian regions and parental populations were measured by Chi-square test (χ2 test, df = 2) in BioEstat software [52]. FDR (False Discovery Rate) method was used to correct multiple analyses [53]. These analyses were performed in the statistical package R Calculation. P value was considered significant if lower than 0.05.