Introduction

Systemic lupus erythematosus (SLE) is a complex disease characterized by autoantibody production and involvement of multiple organs, including kidneys. Both genetic and environmental factors contribute to the development of SLE [1]. Until now, several genes have been reported to be associated with SLE, of which interferon regulatory factor-5 (IRF5) has been identified as a susceptibility gene common to multiple populations [26]. Recently, association of STAT4 (signal transducers and activators of transcription-4) haplotype tagged by rs7574865T with SLE was demonstrated in Caucasians [7]. Subsequently, two genome-wide association studies [8, 9], a study focused on the STAT4 region in Caucasians [10], and replication studies in Colombians [11] and a Japanese population [12] have confirmed the association. In addition, an association of STAT4 with SLE phenotypes such as anti-double-stranded DNA (anti-dsDNA) autoantibodies, renal disorder, and age at diagnosis was reported [10, 13]. An association of rs7574865 with other autoimmune diseases such as rheumatoid arthritis and primary Sjögren syndrome has also been demonstrated [7, 11, 12, 14]. The STAT4 gene encodes a transcription factor belonging to the STAT family expressed in lymphocytes, macrophages, and dendritic cells. STAT4 is essential for interleukin (IL)-12 signaling and induces interferon-gamma (IFNγ) production and Th1 differentiation [15]. STAT4 is also activated by type I IFNs (IFNα/β) [16]. Moreover, the requirement of STAT4 in IL-23-induced IL-17 production has been suggested [17]. Two isoforms of STAT4, STAT4α and STAT4β, are known [18]. Expression of STAT4β, lacking the transactivation domain, did not appear to be affected by the STAT4 single nucleotide polymorphisms (SNPs) [13]. STAT1, another member of the STAT family, is activated by type I IFNs and IFNγ and plays an important role in immune responses [19]. STAT1 has been reported to be upregulated in peripheral blood mononuclear cells from SLE patients and in kidneys of lupus mice with nephritis [20, 21], suggesting that STAT1 may play a role in the pathogenesis of SLE. A possible role of SNPs in the STAT1-STAT4 region other than the haplotype tagged by rs7574865T has recently been excluded in Caucasians [10]. However, in view of substantial differences in disease-associated alleles among populations [2], such analysis should be performed in each population. In this study, we carried out a comprehensive association analysis of the STAT1-STAT4 region with SLE in a Japanese population by scanning 52 tag SNPs of the region encompassing STAT1 and STAT4.

Materials and methods

Patients and healthy controls

Patients and controls were recruited at Juntendo University, the University of Tsukuba, and the University of Tokyo. All patients and healthy controls were unrelated Japanese persons living in the central part of Japan. Three hundred eight SLE patients (18 males and 290 females, average age 41.4 ± 13.5 years) and 306 healthy individuals (119 males and 187 females, average age 32.6 ± 9.8 years) were studied. Diagnosis of SLE and classification of the patients into clinical subsets were carried out according to the American College of Rheumatology criteria for SLE [22]. There was no overlap in cases or controls between this study and the recently reported study in a Japanese population [12]. These studies were reviewed and approved by the research ethics committees of the University of Tsukuba, the University of Tokyo, and Juntendo University. Informed consent was obtained from all study participants.

Association study

Fifty-two tag SNPs in the STAT1-STAT4 region were selected with an r2 threshold of 0.9 based on the HapMap Phase II JPT (Japanese in Tokyo, Japan) data. These tag SNPs captured 127 SNPs with a minor allele frequency of greater than or equal to 0.05. First screening was performed in 105 Japanese female SLE patients and 102 female healthy controls using the GoldenGate SNP genotyping assay (Illumina, Inc., San Diego, CA, USA). For the three SNPs that exhibited significant association (P < 0.01), additional samples were genotyped using the TaqMan SNP Genotyping Assay (Applied Biosystems, Foster City, CA, USA), and association was examined in 308 SLE patients and 306 healthy individuals.

Statistical analysis

Association of each SNP was analyzed by chi-square test. Because of the replicative nature of this study, correction for multiple testing was not performed, and unadjusted P values are shown. Haplotype frequency estimation and association analysis using the permutation test were performed with Haploview version 4.0 software (Broad Institute of MIT and Harvard, Cambridge, MA, USA). In the haplotype analysis, the genotype data for rs10168266, rs11889341, and rs7574865 were used and these SNPs were assumed to compose a single haplotype block. In the permutation test, only frequencies of haplotypes in this block were compared (that is, the 'Haplotypes in Blocks Only' option was used). Ten million permutations were performed. To test the significance of each SNP conditional on the genotypes of other SNPs, logistic regression analysis was performed under the additive model for the minor allele. Assuming a polymorphic site with two alleles A and a, genotypes were encoded as 0 = aa, 1 = Aa, and 2 = AA. Population attributable risk percentage (PAR%) for the risk genotype (rs7574865T/T and T/G) was estimated by the formula

PAR% = Pe (RR - 1)/(Pe [RR - 1] + 1),

where Pe represents the risk genotype frequency in the population and RR represents relative risk of the risk genotype [23]. Given the low prevalence of SLE, Pe can be estimated based on the genotype frequencies in healthy controls and RR can be approximated by odds ratio (OR) for the risk genotypes.

Results and Discussion

The STAT4 gene is located on 2q32.2-q32.3 adjacently to STAT1 gene, and the region encompassing STAT1 and STAT4 spans approximately 180 kilobase pairs. In the first screening, 52 tag SNPs in the STAT1-STAT4 region, selected with an r2 threshold of 0.9 based on the HapMap Phase II JPT data, were genotyped in 105 Japanese female SLE patients and 102 female healthy controls, and allele frequencies were compared between SLE patients and controls. A linkage disequilibrium (LD) plot and the results of the association study in the STAT1-STAT4 region are shown in Figure 1. Pairwise r2 values between 52 tag SNPs were calculated using genotyping data from 102 healthy individuals.

Figure 1
figure 1

Linkage disequilibrium plot of the STAT1-STAT4 region in a Japanese population and first screening of 52 tag single nucleotide polymorphisms (SNPs). In the upper panel, P values for differences in allele frequencies were calculated by chi-square test using two-by-two contingency tables. The -log P value for each SNP is shown. In the lower panel, r2 values calculated using Haploview version 4.0 software based on data from 102 healthy individuals are shown. The location and direction of transcription of STAT1 and STAT4 are indicated by arrows. SNPs rs10168266, rs11889341, and rs7574865 belong to the same haplotype block.

Among the tag SNPs, rs10168266C>T, rs11889341C>T, and rs7574865G>T were most significantly associated with SLE in the first screening (P < 0.01). Allele frequencies of rs10168266T, rs11889341T, and rs7574865T were significantly increased in SLE compared with healthy controls (Table 1 and Figure 1). These SNPs were located in the introns of STAT4 and in LD with each other. In contrast, significant association was not detected for SNPs in the STAT1 region (P > 0.05).

Table 1 Minor allele frequencies and P values for 52 tag single nucleotide polymorphisms in the STAT1-STAT4 region in the first screening

To confirm the association detected in the first screening, additional patients and controls were genotyped for the three SNPs using the TaqMan SNP Genotyping Assay, and association was examined in 308 SLE patients and 306 healthy controls in total (Table 2). Significant deviation from Hardy-Weinberg equilibrium was not detected in healthy controls (P > 0.05). The rs7574865T allele, previously shown to be associated with SLE in Caucasians, was significantly increased in SLE patients (46.3%) compared with controls (33.5%, P = 4.9 × 10-6, OR 1.71). The association was compatible with the dominant model, under which the OR was 2.19 (T/T + G/T versus G/G).

Table 2 Association of STAT4 single nucleotide polymorphisms rs10168266, rs11889341, and rs7574865 with systemic lupus erythematosus

The SNPs rs11889341 and rs10168266 were in LD with rs7574865 (r2: 0.57 to 0.78, D': 0.91 to 0.97) and were also significantly associated with SLE (allele frequency: P = 6.6 × 10-6 and P = 6.3 × 10-6, respectively). Haplotype analysis revealed that the haplotype carrying rs10168266T, rs11889341T, and rs7574865T was significantly increased (SLE: 36.8%, control: 24.3%, P = 1.5 × 10-6) whereas the haplotype carrying 10168266C, rs11889341C, and rs7574865G was significantly decreased in SLE (SLE: 52.7%, control: 65.0%, P = 1.0 × 10-5). Logistic regression analysis demonstrated that the association of each SNP lost statistical significance when adjusted for genotype of the other SNPs (Table 3). Thus, due to the strong LD, it was impossible to identify a single causative SNP among the three.

Table 3 Logistic regression analysis of the systemic lupus erythematosus-associated single nucleotide polymorphisms in STAT4

We next tested whether STAT4 rs7574865 was associated with phenotypes of SLE such as presence of nephritis, anti-dsDNA antibodies, and early age of onset (less than 20 years) as STAT4 genotype has been shown to be more strongly associated with subgroups of SLE with these phenotypes [10] (Table 4). Association of rs7574865 was observed both in SLE patients with nephritis (P = 1.0 × 10-5, OR = 1.85) and in those without nephritis (P = 0.0031, OR = 1.55). The association was stronger in SLE patients with nephritis, although the difference between SLE with and without nephritis (case-only analysis) did not reach statistical significance. Similarly, rs7574865T was significantly increased in SLE patients with anti-dsDNA antibodies compared with healthy controls, whereas association was not detected in SLE patients without anti-dsDNA antibodies. The frequency of rs7574865T was slightly higher in the patients with an age of onset of less than 20 years as compared with greater than or equal to 20 years, although the difference was not statistically significant. These tendencies are consistent with those reported in Caucasians [10]. These interpretations were not affected when the significance level was corrected for the number of comparisons (three phenotypes).

Table 4 Association of STAT4 rs7574865 with characteristics of systemic lupus erythematosus such as nephritis, age of onset, and anti-double-stranded-DNA antibodies

To evaluate the epidemiological significance of STAT4 polymorphism in the genetic background of SLE in the Japanese population, we estimated the PAR% in Japanese persons and Caucasians using our present data and previously reported data [8, 11, 12] (Table 5). Because the frequency and OR of the risk genotype of rs7574865 were greater in the Japanese population than those of North Americans of European descent [8], PAR% in the Japanese population (40.2%) was much higher than that of the latter (19.5%). A similarly high PAR% was observed in two of the three Japanese case-control series reported by Kobayashi and colleagues [12] and in Colombians [11]. Because PAR% may be affected by the difference in the method of ascertainment of each study, this comparison may not be completely valid. Nevertheless, these observations suggested that the contribution of STAT4 for SLE is greater in the Japanese population as compared with the Americans of European descent.

Table 5 Population attributable risk percentage of STAT4 rs7574865 under the dominant model

At this point, molecular mechanisms that account for the association of STAT4 intron SNPs with SLE remain unclear. Studies with lupus model mice lacking Stat4 showed conflicting results. Stat4 deficiency reduced nephritis and autoantibody production in B6.NZM.Sle1.Sle2.Sle3 mice [24]. In contrast, Stat4-deficient NZM (New Zealand mixed) mice developed accelerated nephritis and increased mortality in the absence of high levels of autoantibodies [25]. STAT4 has been shown to be involved in the induction of IFNγ, differentiation of Th1 and Th17 cells, and signal transduction from type I IFN receptors [15]. Th1 cytokines, especially IFNγ, have been shown to play a role in the pathogenesis of lupus nephritis [26]. Recently, T cells from SLE patients were shown to produce excessive amounts of IFNγ upon stimulation [27]. These observations may implicate the role of STAT4 SNPs in IFNγ production.

The role of type I IFNs in SLE has been established [1]. Elevated serum type I IFN levels and expression of IFN-inducible genes in peripheral mononuclear cells were reported in SLE [28, 29]. The association of IRF5, which induces type I IFNs, with SLE has been established [26]. STAT4 is activated by type I IFN as well as IL-12 signals and produces IFNγ [15]. Thus, STAT4 may also contribute to SLE as a component of the type I IFN signal pathway. Furthermore, STAT4 has been reported to transduce IL-12 signals to induce IFNγ production in B cells [30].

It is interesting to note that significant association of STAT4 was not observed in SLE patients without anti-dsDNA antibodies (Table 4). It would have been interesting to examine the effect of the genotype on the levels, rather than presence or absence, of anti-dsDNA antibody However, because the antibody levels fluctuate in association with disease activity and treatment, association with the genotype should be examined using the lifetime highest anti-dsDNA antibody level of each patient. Such data were not available for this study, and we hope that we can address this issue in the future.

Most of these observations imply that STAT4 risk genotype may be associated with an elevated expression level and/or function of STAT4 protein. A recent study reported that the STAT4 risk allele was associated with overexpression of STAT4 in osteoblasts but not in B cells [13]. To address the significance of such findings, it will be necessary to examine the effect of this genotype on the expression levels and splicing isoforms in T and B cells.

Conclusion

Through comprehensive association analysis of the STAT1-STAT4 region with SLE in the Japanese population, we demonstrated that the same STAT4 risk allele in Caucasians was strongly associated with susceptibility to SLE in the Japanese population. In contrast, evidence for an association of STAT1 SNPs was not observed. The contribution of STAT4 SNPs to the genetic background of SLE may be greater in the Japanese population than in Americans of European descent.