Background

Human C4 is one of the most striking examples of genetic diversity, due to the great variation in number and size of genes between subjects, as well as some intrinsic polymorphisms. C4 is the only component of complement that displays two different isotypes encoded by two different genes: C4A and C4B. These genes differ in only five nucleotides, but the proteins encoded by them have different functions. While C4A protein is more reactive against targets containing amino groups and has greater ability to covalently bind immune complexes, C4B has greater affinity for hydroxyl groups and exhibit a hemolytic potential at least four times higher than C4A [1].

Genes encoding the C4 protein are located in the MHC class III region, in the short arm of chromosome 6, and form, together with three neighboring genes (serine-threonine kinase, steroid 21-hydroxylase and tenascin-X), a genetic unit called RCCX module [2]. Theoretically, each chromosome 6 can harbor zero to four copies of these modules, determining wide gene copy number variation of C4.

Around 80% of the Caucasians from United States and Europe have three or four C4 copies, 20% have five or six, and less than 2% have only two copies in a diploid genome [3]. Complete deficiency of components C4A and C4B is extremely rare in healthy controls [3]. C4A or C4B homozygous deficiency occurs in 0.5–1% Caucasians, while the presence of a single C4A or C4B gene occurs in approximately 20% [4, 5]. C4A deficiency has been linked to systemic autoimmune diseases, especially systemic lupus erythematosus (SLE) [6,7,8,9]; the complete absence of C4A appears in 10–15% of patients [10, 11] and the heterozygous deficiency appears in 40–60% of SLE patients in several ethnic groups [12,13,14]. The association with certain clinical manifestations, however, is still not clear as different studies show different associations [7, 15,16,17].

The significant association of C4A deficiency with SLE in different ethnic groups suggests that C4A deficiency is a risk factor for this disease. There are several potential mechanisms for this association, such as the role of C4 in humoral immune response [18] and in the clearance of immune complexes and apoptotic cells [19, 20]. Interestingly, patients with SLE in Spain have an increased frequency of C4B deficiency and not C4A, which highlights the importance of both C4A and C4B in autoimmunity control in other ethnic groups [21].

Most epidemiologic studies of C4 in SLE, however, do not consider Copy Number Variation (CNV) of both C4A and C4B. In fact, there are few studies demonstrating that a low copy number of C4A and C4B increases the susceptibility to SLE [7,8,9, 17]. In the present study, we originally investigated the association of C4, C4A and C4B GCN variation with SLE in a large cohort of Brazilian adult patients and healthy controls, as well as a possible association with the disease phenotype and severity.

Methods

Study subjects

This observational cross-sectional study included 427 patients with SLE and 301 healthy controls. SLE patients were recruited consecutively from the Autoimmune Rheumatic Diseases Out-patient Clinic of the University Hospital, and fulfilled four or more classification criteria for SLE from the American College of Rheumatology [22]. Patients underwent a detailed clinical evaluation, with emphasis on SLE clinical manifestations, recurrent infections, current and previous medications, age at SLE onset, presence of other autoimmune diseases, and determination of Systemic Lupus International Collaborating Clinics – Damage Index (SLICC-DI) [23]. All subjects were at least 18 years old and signed the informed consent. This study was approved by the hospital ethics committee (protocol CEP#0330/09). The healthy volunteers answered a questionnaire, emphasizing signs and symptoms of autoimmune diseases and recurrent infections, which could be related to deficiency of some component of Complement. In this study, recurrent infections was defined according to the criteria suggested by Modell et al. [24], i.e., two or more upper respiratory infections (otitis or sinusitis, in the absence of the allergy) or pneumonias in one year (for more than one year), recurrent viral infections (herpes, warts), persistent fungal infections or tuberculosis. All data collected were used to analyze possible associations with the configuration of the complex gene C4A/C4B. Subjects with changes detected in any of the items in the questionnaire or a family history of SLE or other autoimmune rheumatic disease were excluded from the control group.

Determination of GCN of C4A, C4B and total C4

Genomic DNA was extracted from peripheral blood using the FlexiGene DNA purification kit (Qiagen). GCN was determined by real time PCR (qPCR) using TaqMan probes conjugated with a minor groove binder (MGB), as described in the study of Szilagyi et al. [5]. In both reactions to determine C4A and C4B GCN, we used 250 nM of forward C4 primer (5′-GCAGGAGACATCTAACTGGCTTCT-3′) and 500 nM of reverse (5′-CCGCACCTGCATGCTCCT-3′), 250 nM of the TaqMan probe specific for C4A (5′-FAM-ACCCCTGTCCAGTGTTAG-MGB-3′) or C4B (5’FAM-ACCTCTCTCCAGTGATAC-MGB-3′), 1x TaqMan Universal Master Mix with UNG (Applied Biosystems, Foster City, CA) and 30 ng of genomic DNA in a total volume of 25 μL. Reactions targeting RNase P, which was our single-copy reference gene, contained the mix of primers and TaqMan probe labeled with VIC for this gene (Applied Biosystems Foster City, CA), 30 ng of genomic DNA and 1x TaqMan Universal Master Mix with UNG (Applied Biosystems),

Triplicates of each reaction were amplified and read on the Rotor Gene 3000 (Corbett, Sidney) using the following conditions (50 °C for 2 min, 95 °C for 10 min, 45 cycles of 95 °C for 15 s and 60 °C for 1 min). The results were analyzed by Rotor Gene 3000 software. For the relative quantification of the number of alleles C4A and C4B, we used control DNA samples with previously defined C4A and C4B GCN, kindly provided by Szilagyi et al. [5]. The RNase P constitutive gene was used as a control for the amplification reaction. The calculation of GCN was made using the ΔΔCT method [25].

Statistical analysis

Continuous variables with normal distribution were analyzed with the Student’s t test and those with non-parametric distribution were analyzed with the Mann-Whitney test. Qualitative parameters were analyzed by the Chi-square test and the Fisher’s exact test when appropriate. Multiparametric analyses were performed with the one-way ANOVA test with Bonferroni’s test as a post-hoc test when appropriate. Correlation analysis was performed by the Spearman’s correlation method. To calculate the risk of developing disease (OR + 95% CI), we performed a binary logistic regression analysis. Statistical analysis was performed with the Statistical Package for the Social Sciences (SPSS) version 17.0 (SPSS Inc., 2008). Statistical inference level was set at 0.05.

Results

Study subjects

The female to male ratio was 286:15 in the healthy control group and 406 to 21 in the SLE group, with ages varying from 18 to 61 years old (35.1 ± 11.1) and 17 to 77 years old (39.9 ± 12.2), respectively. Patients and controls did not differ regarding gender distribution, although patients were significantly older (p < 0.001). Patients presented more recurrent infections (n = 60; 14.05%) compared to healthy controls (n = 10; 3.33%; p < 0.001). The clinical characteristics of SLE patients are depicted in Table 1.

Table 1 Clinical characteristics of systemic lupus erythematosus patients (n = 427)

Gene copy number of total C4, C 4A and C 4B in SLE patients and healthy controls

Total C4 GCN ranged from two to eight in controls and from one to eight in SLE patients. C4A GCN varied from zero to six in patients and in controls, while C4B GCN varied from zero to five in both groups. SLE patients presented lower mean GCN for total C4 and C4A compared to healthy controls (Table 2).

Table 2 Gene copy number of total C4, C4A and C4B in SLE and controls

Accordingly, patients had a higher frequency of low GCN for total C4 and C4A compared to healthy controls (Fig. 1a-b). Comparing to healthy controls, the SLE group showed higher frequency of subjects with two (SLE = 7% vs controls = 1%, p < 0.001) and three copies of total C4 (SLE = 22% vs controls = 12.6%, p = 0.001). On the other hand, a higher number of controls with four copies of total C4 was observed compared to patients (SLE = 52.5% vs controls = 63.1%, p < 0.01) (Fig. 1a; Table 3).

Fig. 1
figure 1

Distributions of gene copy number (GCN) for total C4, C4A and C4B in healthy controls and systemic lupus erythematosus patients. GCN for total C4 (a), C4A (b) and C4B (c) was determined in 427 patients with systemic lupus erythematosus (SLE) and 301 healthy controls by quantitative real time PCR

Table 3 Distribution of SLE and controls according to the gene copy number for total C4, C4A and C4B

Regarding C4A GCN, there was a higher frequency of SLE patients with only one copy (SLE = 18.3% vs controls = 6.3%, p < 0.001), while there was a greater proportion of controls with two copies of C4A (SLE = 58.3% vs controls = 69.4%, p < 0.01) (Table 3). There was a trend toward higher frequency of total absence of C4A genes (C4A = 0) in SLE patients (2.1%) compared to controls (0.3%; p = 0.053) (Fig. 1b). Although there was no significant difference of the C4B GCN between the two studied groups (p = 0.192), the distribution of subjects according to the GCN resembled the distribution curve for total C4 and C4A (Fig. 1c). Interestingly, a lower percentage of patients with two copies of C4B compared to controls was observed (SLE = 57.4% vs controls = 65.8%; p = 0.025) (Table 3).

There was a higher frequency of SLE patients in the group of subjects with low GCN (total C4 < 4, C4A or C4B < 2) than in the group of subjects with normal or high GCN (total C4 ≥ 4, C4A or C4B ≥ 2) (Table 4). The risk of developing SLE was 2.62 times higher in subjects with low total C4 GCN and 3.59 times higher in subjects with low C4A GCN compared to those with normal or high C4 and C4A GCN. The same was observed regarding C4B, albeit to a lesser degree (OR = 1.46).

Table 4 Low gene copy number (GCN) for total C4, C4A and C4B genes is associated with systemic lupus erythematosus (SLE)

Total C4, C 4A and C 4B GCN impacts in SLE progression and clinical phenotype

The clinical phenotype analysis showed no association between low GCN for C4, C4A or C4B and a specific clinical manifestation. On the contrary, a higher frequency of articular involvement was seen in SLE patients with high C4B GCN compared to those with low GCN (p = 0.022; Table 5). Furthermore, there was higher frequency of serositis in patients with high C4A copy number compared to those with low copy number (p = 0.019).

Table 5 SLE clinical manifestations according to total C4, C4A and C4B gene copy number (GCN)

In addition, we investigated if the GCN for C4, C4A and C4B correlates with the progression of the disease in 335 SLE patients, using for that, the permanent damage index (DI) of the disease as measured by SLICC [23]. The Spearman correlation coefficient showed a mild but significant negative correlation between SLICC-DI and the GCN for total C4 (rho = − 0.133, p = 0.015) and C4A (rho = − 0.136, p = 0.012), but not C4B (rho = 0.223, p = 0.675). Moreover, patients with low C4A GCN had higher permanent damage index (median = 1.5; 95% CI: 1.2–1.9) than patients with normal or high C4A GCN (median = 1.0; 95% CI: 0.8–1.1; p = 0.004), regardless of age of onset and number of years of disease. This association was not observed for low total C4 [1.3 (1.0–1.6) vs 1.0 (0.8–1.2); p = 0.087] or low C4B [1.0 (0.8–1.3) vs 1.0 (0.9–1.2); p = 0.916]. There was no correlation between the GCN for C4, C4A or C4B and age onset of SLE.

Discussion

The present study confirmed, also in the peculiar blended Brazilian ethnicity, the association between low total C4 and C4A GCN and adult SLE previously reported in other ethnicities [7,8,9], and expanded this concept to low C4B GCN. This result corroborates a previous study with 90 patients with juvenile SLE and 170 patients with adult-onset SLE in Brazilian population, which showed a strong association with low number of copies of C4, C4A and C4B with SLE, remarkably in juvenile-onset lupus [26]. In addition, the present study originally demonstrated the impact of a lower C4A GCN on the progression of the disease. The distribution of total C4, C4A and C4B GCN in healthy controls was similar to that observed in other studies, with the majority of subjects presenting four copies of total C4 (63.1%), two copies of C4A (69.4%) and two copies of C4B (65.8%). Interestingly, 20% of healthy subjects had less than two copies of C4B, whereas only 6% had less than two copies of C4A.

Although most SLE patients presented four copies of total C4, there was a higher percentage of subjects with less than four copies compared to controls, as well as less than two copies of C4A. Moreover, the risk of developing SLE was 2.6 and 3.6 times higher in subjects with lower total C4 GCN and lower C4A GCN, respectively. These data are consistent with those previously observed in European, American and Asian samples, albeit with minor differences [7, 9, 27]. Only one study in the literature, performed with Japanese subjects, failed to demonstrate this association [28]. These genetic background differences had already been demonstrated, since C4A gene deletion has been more frequently reported in SLE patients in African American and Caucasian [11, 29, 30] than in Asian patients [12, 31, 32]. Moreover, the 2 bp insertion in exon 29, which leads to a C4A-null allele, was reported in Caucasian and African American SLE patients, but not in Asians [7, 28].

Although with a lower level of statistical significance, there was an association between low C4B GCN and SLE in our cohort. Previous studies have shown correlation between C4B deficiency and SLE in Spanish population [21, 27], in contrast to the UK cohort that revealed an association between high C4B GCN and SLE. Other studies failed to find association between C4B GCN and SLE [7, 9, 17]. Thus, the association between C4B GCN and SLE seems to be strongly influenced by ethnicity and further studies are needed to pinpoint the underlying genomic basis.

Despite the association of SLE with low GCN for C4, C4A and C4B, there were some SLE patients with normal or even high GCN for these genes. Conversely, some normal individuals had low GCN for these genes. This indicates that the SLE phenotype may be favored by low GCN of the C4 gene system, but this is a relative factor that is integrated with a host of modulating genetic and environmental factors that ultimately define the fate of the phenotype.

While the association between low C4 GCN and SLE has been consistently demonstrated, the impact of C4 CNV on the disease phenotypes remains controversial. In our previous study in juvenile-onset SLE, the low GCN for total C4 and C4A was associated with pericarditis [26]. A study in Chinese Han patients reported an association between low C4A deficiency and arthritis [7], while a more recent study reported serositis as the only clinical manifestation associated with low C4A GCN [17]. The current study adds some degree of complexity to this issue by showing a “protective” effect of low C4B GCN regarding the occurrence of arthritis, as well as between high C4A GCN and serositis. This apparent paradox of the “protective” effect of low C4 GCN in the development of some SLE manifestations may reflect the dual role of the Complement system in autoimmune diseases. If C4 deficiency may favor autoimmune diseases onset, due to the impairment in the clearance of autoantigens and negative selection of auto-reactive B cells, on the other hand it may reduce inflammation, because the classical cascade of the Complement system will be also partially hampered in the context of low C4A GCN [33].

Recent studies have suggested that low C4A GCN may also be linked to increased susceptibility to infections [34]. However, this is also far from unanimous, since other authors found no association between low levels of C4A and C4B proteins or different haplotypes of C4 with recurrent infections [35, 36]. In our series, there was no association between C4 CNV and infectious diseases, and only 14% of SLE patients had recurrent infections. The differences in the results of the several studies may be secondary to the criteria used to define recurrent infections. In our study, we used the criteria suggested by Modell et al. for the definition of recurrent infections [24]. It is important to consider, however, that such information was collected by questionnaire or during the retrospective analysis of medical records. Although this bias could have affected equally the groups of lupus patients and healthy controls, it is reasonable to hypothesize that this could underestimate the prevalence of infections in the present study.

An interesting and original finding of the present study was the association between low C4A GCN and permanent damage measured by SLICC-DI, suggesting that patients with low C4A GCN may have worse prognosis than those with normal or high GCN. The SLICC-DI measures permanent damage caused by SLE, by therapeutics and comorbidities. Actually, there are several studies associating C4 CNV and obesity [4] diabetes mellitus [35] and cardiovascular disease [37]. It is, therefore, possible that non-SLE conditions related to low C4 GCN may contribute to the observed increase in permanent damage in the present cohort.

Since C4 deficiency increases the activation of auto-reactive B cells and autoantibody production in animal models [38], we might expect an association between low GCN, especially of C4A, and the presence of events related to immune complex deposition, such as glomerulonephritis. However, such association was not observed either in the present study or in the one reported by Lv et al. [7], analyzing 924 Chinese SLE patients. It is possible that several genetic and adaptive factors interact, promoting or protecting against the disease and certain manifestations, what result in a wide and heterogeneous clinical spectrum.

Recently, several studies showed the association between lupus susceptibility or phenotypic features and polymorphism of different genes, such as PDCD1, BLK, TNIP1, TNFAIP3, SLC15A4, ETS1, and RasGRP3 IKZF1 [39,40,41]. The knowledge of the interplay between susceptibility genes and disease phenotypes can help us understand the clinical heterogeneity of this disease and its pathophysiology. Ultimately, this knowledge may contribute to the development of individualized approach for each patient.

In summary, the present study determined the distribution of C4 GCN in a sample of the Brazilian SLE patients and healthy controls. We confirmed the association between low C4A and low total C4 GCN and SLE, as previously described, and additionally showed similar findings for low C4B GCN. Furthermore, we originally documented a higher cumulative damage in patients with low C4A GCN as well as lower frequency of serositis in low C4A GCN patients. Finally, we observed an increased frequency of arthritis in patients with normal or high C4B GCN. Therefore, as demonstrated in this and in previous studies, low C4 GCN may be a genetic risk factor for the development of SLE and may be related to other factors, culminating in a worse disease outcome.

Conclusions

Low gene copy number for C4 genes, especially the C4A isoform, is a risk factor for development of systemic lupus erythematosus in the Brazilian population. In addition, low C4A gene copy number favors the occurrence of serositis and a more severe disease, while normal to high number of C4B genes favors the occurrence of arthritis in lupus patients. Therefore, the determination of C4 gene copy number may be useful in sub-phenotyping and managing SLE patients.