Background

Total serum immunoglobulin E (IgE) is a risk factor for both the development of [1] and disease severity in asthma [2]. The production of IgE is controlled by a complex regulatory process that ultimately involves isotype class switching by mononuclear B lymphocytes, [3] a CD4+ T cell dependent process [3]. However, to our knowledge, there has been no analysis of the correlation between genome-wide CD4+ T cell gene expression and the variability in serum IgE among asthmatics.

Sex is a critical determinant of IgE level, with males having a stronger tendency towards higher total IgE than females [4]. Sex-related differences in IgE are present early in life [5] and persist into adulthood [6]. Furthermore, there is growing evidence that the underlying molecular underpinnings of IgE levels differ between males and females. For example, we have previously demonstrated striking differences by sex in IgE heritability estimates, with 83% of the variability in IgE levels attributable to genetic factors in males, as compared to 63% in females [7]. Moreover, evidence for sexually-distinct genetic determinants of IgE has been demonstrated in genome-wide linkage analyses of IgE in some, [7] but not all studies [8].

Based on these findings we hypothesized that genome-wide CD4+ T cell microarray analyses would provide unique insights into genes that correlate with the variability in IgE level, and that sex would be an important modifier of these correlations. To test these hypotheses we explored the relationship between total serum IgE levels and genome-wide gene expression transcript abundance in a cohort of 223 young adults with asthma, both in all subjects and within each sex specifically.

Methods

Study population and sample collection

CAMP was a multicenter clinical trial of the effects of anti-inflammatory medications in children with mild to moderate asthma. All participants had asthma defined by symptoms greater than twice per week, use of an inhaled bronchodilator at least twice weekly or use of daily medication for asthma, and increased airway responsiveness [9]. Participant data relating to atopy was obtained from the original CAMP dataset. The completion of the trial was followed by two four-year observation studies - CAMP Continuation Study (CAMPCS) 1 and 2. During Years 3 and 4 of CAMPCS/2, 299 children at four of eight CAMP study centers (Baltimore, Boston, Denver, St. Louis) participated in an ancillary study of gene expression in asthma. The study visit included questionnaire assessments of asthma severity, measurements of pulmonary function, and collection of peripheral blood for the simultaneous measurement of cell count, total serum IgE, and CD4+ T cell gene expression. Samples for RNA extraction were collected with BD Vacutainer CPT tubes, (BD Diagnostics, Franklin Lakes, New Jersey) placed on ice, centrifuged within 1 hour (20 minutes at 1700RCF) followed by mononuclear cell layer isolation and suspension in 10 ml of PBS. We isolated CD4+ T cells using anti-CD4+ microbeads by column separation (Miltenyi Biotec, Auburn, CA), [10] using 20 ul anti-CD4+ Micro beads per 106 total cells. Pilot studies confirmed CD4+ cell yields of ~4 × 106 at ≥ 95% purity per collection. Total RNA was extracted using the RNeasy Mini Protocol (QIAGEN, Valencia, CA) [11] and stored at -80°C. 2100 Bioanayzer (Agilent Technologies, Santa Clara, CA) analysis confirmed average total RNA yields of 2 ug per collection, with minimal evidence of RNA degradation and 28S:16S ratios approaching 2.0. Written informed consent was obtained from participants. CAMP was approved by the Institutional Review Boards of Brigham and Women's Hospital and the other participating centers.

Microarray hybridization and sample preprocessing

Expression profiles were generated with Illumina HumanRef8 v2 BeadChip arrays (Illumina, San Diego CA) using 100 ng of CD4+ total RNA from each sample and the Illumina BeadStation 500G according to protocol [12]. Arrays were read using the BeadArray scanner (Illumina) and analyzed using BeadStudio (version 3.1.7) without background correction. Raw expression intensities were processed using the lumi package [13] with background adjustment with RMA convolution [14] and log2 transformation of each array. The combined set of arrays was subsequently quantile normalized. The Gene Expression Omnibus number for this dataset is GSE22324.

Quantitative Real-Time PCR

For validation of our microarray results using real-time PCR (RT-PCR) was performed on a subset of subjects (n = 10) with the Taqman 7900 gene expression assay using primers and probes from Applied Biosystems. Quantitative RT-PCR analyses were performed in triplicate with appropriate positive and negative controls. The mRNA values were normalized with TATA binding protein (TBP) serving as the internal control. Relative mRNA expression levels were calculated using the delta cycle time (Ct) method.

Serum Total IgE

Total serum IgE was measured by radioimmunosorbent assays from blood samples collected during the CAMPCS/2 visits. Serum IgE was transformed to the log10 scale for analysis.

Pulmonary Function and Methacholine challenge testing

Pre- and post-bronchodilator spirometry was performed according to American Thoracic Society recommendations with a volume-displaced spirometer, and airway responsiveness was assessed by methacholine challenge with the Wright nebulizer tidal breathing technique [15].

Statistical analysis

All expression microarray analyses were performed in R 2.9/Bioconductor [16, 17]. Variance-based non-specific filtering was used to remove uninformative probes specifying a minimum interquartile range of 1 on the log2 scale. We used the Limma package [18] to analyze the linear correlation between gene expression level and log10IgE measurements both unadjusted for covariates and adjusted for age, sex (where appropriate), ethnicity, and center of sample collection. Joint and sex-stratified analyses were performed to assess the influence of sex. To control for multiple testing, we used the FDR procedure as proposed by Benjamini and Hochberg [19] with a threshold of 0.05 to define statistical significance. To generate multivariate linear models accounting for the large-p (expression transcript levels) small-n (number of subjects) problem, and to accommodate correlations among predictor variables, we used the least absolute shrinkage and selection operator (LASSO) [20] estimation. LASSO is a coefficient shrinkage and variable selection method that achieves model sparsity by setting coefficients of many irrelevant transcripts to 0. The LASSO analysis was performed using the least angle regression (LARS) algorithm, utilizing 5-fold cross-validation, as implemented in the R package lars. [21] We performed Canonical Pathway Analyses using Ingenuity Pathway Analysis ([IPA], Ingenuity Systems®, http://www.ingenuity.com) software (utilizing gene transcripts attaining nominal levels of significance [p < 0.05] in all subjects and within males and females specifically).

Results

Expression data from peripheral blood CD4+ T cells was measured in 299 asthmatics from CAMP; 223 of these (75%) also had a simultaneous measurement of total serum IgE. The baseline characteristics of the 223 CAMP subjects included in this analysis are presented in Table 1. There were no significant differences in any clinical variable between subjects with and without IgE measurement. We tested for evidence of linear association between gene transcript abundance in peripheral blood CD4+ T cells and total serum IgE. After non-specific gene filtering, 13,310 (64.5%) transcripts demonstrating variable expression across samples were considered for testing.

Table 1 Baseline characteristics of subjects from the Childhood Asthma Management Program (CAMP)

All Subjects

Single Gene Regressive Strategies

Significant evidence for association between transcript abundance and total serum IgE was identified for a single gene, interleukin 17 receptor B (IL17RB), explaining 12% of the variance (r2) in total serum IgE measurement (p value = 7 × 10-7, 9 × 10-3 after adjustment for multiple testing, see Figure 1A). Similar results were noted in analyses restricted to the 168 white subjects (IL17RB, p value = 2 × 10-6, 2 × 10-2 after adjustment for multiple testing), and technical validation by RT-PCR confirmed the association between total serum IgE levels and IL17RB transcript abundance (a positive correlation between delta CT and total serum IgE [r2 0.37, p value 0.03]). We re-analyzed our data with the inclusion of a quadratic term for gene expression level but found no significant findings of association under this non-linear modeling assumption. A list of the top 50 transcripts is presented in Additional File 1. No differences were noted in models additionally adjusting for measures of severity including methacholine PC20 (a measurement of airway responsiveness) or pre-bronchodilator FEV1 (a measurement of pulmonary function). Of note, exploratory analyses of the relationship between IL17RB transcript abundance and other intermediate asthma phenotypes revealed evidence of association with both total serum eosinophil count (p value = 6 × 10-6) and methacholine PC20 (p value = 4 × 10-4).

Figure 1
figure 1

The Correlation Between CD4+ Lymphocyte Measurement of IL17RB and Serum IgE. Figure 1A and 1B represent plots of log2-transformed gene expression measurement of IL17RB (on the x-axis) vs. log10-transformed total serum IgE measurement (on the y-axis), with regression lines, for all subjects without (1A) and with (1B) consideration of male (in blue) and female (pink) sex. Figure 1C represents a plot of the (-) delta cycle time from the real time-polymerase chain reaction of measurement of IL17RB (on the x-axis) vs. log10-transformed total serum IgE measurement (on the y-axis) in all subjects.

Due to the well-documented sexual dimorphism of IgE levels, we assessed whether sex would modify the association between gene expression and total serum IgE by repeating the analysis for males (n = 134, 60%) and females (n = 89, 40%) separately. These analyses revealed that the correlation between IL17RB and total serum IgE was restricted to males only (r2 = 0.19, p value = 8 × 10-8, 1 × 10-3 after adjustment for multiple testing), and was not observed in females (r2 = .03, p value = 0.13, 1 after adjustment for multiple testing). Formal testing confirmed the observed differences in correlation of IL17RB and IgE in males and females was statistically significant (test for sex-specific interaction p < 0.05, see Figure 1B). It is important to note that the differences in correlation between IL17RB and IgE were not due to baseline differences in IL17RB levels between males and females (mean log2 IL17RB level 7.5 in both males and females, t-test p value 0.5 for the difference). No other transcript was associated with total serum IgE among either males or females after correction for multiple testing. Plots of the linear correlation between IL17RB level and total serum IgE in males and females specifically, are presented in Figure 1B. The gene transcript with the strongest correlation with total serum IgE among CAMP females (gamma-aminobutyric B receptor 1 [GABBR1], unadjusted p-value 9 × 10-4) did not remain significant after adjustment for multiple testing (p = 0.9998). Lists of the top 50 transcripts for both males and females are presented in Additional Files 2, 3.

Next we examined the degree of overlap in the correlations between gene transcript measurement and total serum IgE between males and females. At an arbitrary cutoff (p value < 0.05), there were 872 and 154 transcripts demonstrating correlation with total serum IgE among males and females, respectively. Only 16 of these 1010 unique transcripts (1.6%) were significantly correlated with total serum IgE in both males and females; and this result falls within the 95% confidence interval ([95% CI] 5-17) of the number of transcripts that should overlap by chance alone. At a more liberal threshold for inclusion of the results from females (p < 0.1), only 38 of 1205 unique transcripts (3.1%) overlap. At a more conservative threshold (p < 0.01) there were no transcripts common to both males and females.

We considered whether our findings of association in males but not females could be attributable to differences in sample size. Though power was somewhat lower in females due to the smaller number of subjects (89 females vs. 139 males), power was very high (>91%) to detect correlations half as strong as those observed among males (power 91.4% to detect an r2 of 0.09 in the 89 CAMP females).

LASSO

Next we evaluated whether an alternate multi-gene modeling approach utilizing 5-fold cross validation (the LASSO method) could both confirm our findings of association from single gene regressive strategies, and could identify additional determinants of total serum IgE levels. Among all subjects, the optimal multivariate model included 17 transcripts (including IL17RB), of which 16 map to unique HUGO gene identifiers (see Additional File 4). Consistent with our single gene regressive strategy the best fitting model includes IL17RB, the transcript with the largest estimated regression coefficient. Among males, the optimal multivariate model included 30 transcripts, including 6 that overlap with the best fitting regression model for all subjects: ASB9, H2A histone family, member J (H2AFJ), hexose-6-phosphate dehydrogenase (H6PD), IL17RB, poly(A) binding protein interacting protein 1 (PAIP1), and PVALB (see Additional File 5). Again, IL17RB was the transcript with the largest estimated regression coefficient among males. For females, LASSO selected the null model as its best prediction model, which suggests that none of the genes were informative in predicting IgE level.

Ingenuity Canonical Pathway Analysis

To better understand the combinatorial effect of multiple gene transcripts on IgE regulation we performed canonical pathway analyses. Of the 763 transcripts demonstrating nominally significant (p < 0.05) correlation with total serum IgE, 652 (85%) map to a unique HUGO gene id and were used for IPA canonical pathway analysis. Among all subjects, enrichment for six pathways was observed (Fisher's exact p < 0.05) including the antigen presentation pathway, propanoate metabolism, interferon signaling, primary immunodeficiency signaling, the coagulation system, and the pantothenate and CoA biosynthesis pathway (see Figure 2A). Similar to the regression models, sex-stratified pathway analysis revealed different patterns of pathway enrichment between males and females. Among males, nominal evidence for enrichment was observed for 13 pathways (Figure 2B). Among females, enrichment was observed for 6 pathways (Figure 2C), including strong evidence of enrichment for the interferon signaling pathway in females (p = 3.0 × 10-6). Notably, there were no significant canonical pathways in common between males and females. Additional information about the canonical pathway analyses, including lists of differentially expressed genes within the pathways are included in Additional Files 6, 7, 8. Of note, IL17RB is not currently a constituent of an Ingenuity canonical pathway.

Figure 2
figure 2

Canonical Pathway Analysis of Serum IgE in CD4+ Lymphocytes. Figures A-C represent the results of Canonical Pathway Analyses using Ingenuity Pathway Analysis ([IPA], Ingenuity Systems®, http://www.ingenuity.com) software in all subjects, males, and females respectively. The significance of the association between the our data set and identified pathways from the Ingenuity Pathway Analysis library of canonical pathways was measured as a ratio of the number of genes from our dataset that map to the pathway divided by the total number of molecules that exist within the canonical pathway (presented as a line plot). Fischer's exact tests were used to calculate p-values (presented as -log10 p values in a bar graph). In all subjects 763 transcripts demonstrated nominally significant (p < 0.05) correlation with total serum IgE. Of these 763 transcripts, 652 (85%) mapped to a unique HUGO gene id and were used for IPA canonical pathway analysis. Six pathways were nominally significant using a Fisher's exact test and are listed in Figure 2A. Among males, 900 transcripts were correlated with total serum IgE, 873 (97%) mapped to a unique HUGO gene id, and thirteen pathways were significant (Figure 2B). Among females, 158 transcripts were correlated with total serum IgE, 154 (97%) mapped to unique HUGO gene ids, and six pathways were significant (Figure 2C).

Discussion

This is the first study to correlate genome-wide CD4+ T cell transcript abundance with IgE level in humans and is, to our knowledge, the largest gene-expression study of as asthma-related quantitative phenotype published to date. Our results indicate that IL17RB may be the only gene expressed in CD4+ T cells whose transcript measurement is correlated with the variation in IgE level in asthmatics. Given that IgE level is strongly influenced by environmental factors (such as allergen exposures) it is surprising that 12% of the variation in IgE can be explained by IL17RB transcript measurement, a correlation that is substantially greater than that explained genetic determinants from three significant genomic regions (r2 = 1.9%) identified in a recent genome-wide association study of IgE [22].

While it is often not possible within the confines of a cross-sectional epidemiologic study to determine the cause and effect relationship between two correlated variables, abundant molecular biologic evidence suggests that elevation of IL17RB should occur upstream from increases in IgE. IL17RB, located on chromosome 3p21.1, is the receptor for interleukin 17 B (IL17B) but has much higher binding avidity to interleukin 17 E ([IL17E] also known as IL25, IL-17Rh1, and Evi27 in the mouse) [23]. While structural homology places the IL17E-IL17RB ligand-receptor pathway within the IL17 family, as opposed to IL17A and IL17F which are cytokines characteristic of Th17 cells, [24] several groups have demonstrated that the IL17E-IL17RB pathway is a strong inducer of Th2 responses [25]. In humans, IL17RB expression has been demonstrated on the surface of lung fibroblasts, [26] prominently on activated CD4+ Th2 memory cells, [27] and recently in basophils [28]. Comparable to our findings, IL17RB has recently been shown to be the most differentially expressed gene in allergen challenged peripheral blood mononuclear cells from patients seasonal allergic rhinitis compared to healthy controls [28]. Thymic stromal lymphopoietin activated DCs appear to provide an important activating signal allowing IL17RB+ Th2 memory cells to respond to IL17E [27]. Elevated IL17E expression results in elevated Th2 cytokine production, [29] asthma severity, and elevations in IgE [30].

Another finding of our analyses is that the magnitude of the relationship between patterns of CD4+ lymphocyte gene expression and total serum IgE levels is substantially different between males and females. Several lines of evidence are provided, including: strong association of IL17RB transcript abundance with IgE levels in males (r2 = 19%), but not in females (r2 = 3%); evidence for a multivariate gene signature of total IgE levels in males, but not in females; and discrete, non-overlapping canonical gene pathway enrichment in each sex. Notably, we found limited statistical evidence to suggest that there is significant overlap between CD4+ T cell genes whose transcript measurements correlate with IgE level in both male and female asthmatics. Our work stands with a body of literature now implicating an important role for sex in the heritability, genome-wide linkage analyses, [7] and genome-wide gene expression analyses (this report) of IgE, and strongly support the notion that genetic and genomic studies of IgE should consider the role of sexual dimorphism.

Our study has several limitations. While our study is among the largest gene expression studies of asthma and allergy to date it is important to note that sample size may still be an important limitation of our analyses. The imbalance in sample size between males and females in this study, and the inherently lower mean values of IgE levels in females can further reduce statistical power. Though our power calculations suggest ample power (>90%) to detect the male-specific associations in our female sample, replication in larger sample sizes could help to determine if the lack of overlap we note between males and females is primarily an issue of the magnitude of the correlations or of true differences in the genomic regulation of IgE. In addition, our gene expression analyses include gene transcript measurements from unstimulated peripheral blood CD4+ lymphocytes in a population of adolescents and young adults recruited on the basis of childhood asthma. We urge caution in extrapolating our findings to other populations and cell types. Finally, while an assessment of IL17RB protein levels would provide additional supportive information, we do not have intact CD4+ lymphocytes enabling suitable cell surface protein quantification.

While the precise role that sex plays in the genomic regulation of IgE is unknown, sex differences in the immune environments experienced by CD4+ T cells are well described. In animals, females are more likely to develop a Th1 response after antigen challenge, except during pregnancy when a Th2 environment dominates [31]. Although little is known about the role sex could play in IL17RB pathway regulation, some evidence suggests that IL17RB level may influence the response to sex hormones [32]. We recently demonstrated sex-specific effects of TSLP polymorphisms on the regulation of specific, and total serum IgE in children with asthma [33]. This is intriguing given that TSLP activated dendritic cells (TSLP DCs) appear to be one of the most potent stimuli of IL17RB expression [27].

Though we have limited evidence that single genes, or genetic models strongly correlate with IgE level among females, it is intriguing that the canonical pathway most significantly correlated with IgE in females was the interferon signaling pathway (p = 3 × 10-6), particularly in light of prior studies suggesting the importance of interferon-γ (INFG) in the female (more than male) allergic response [34]. In mice, females demonstrate increased T-cell proliferative responses, [34] and consistently produce increased INFγ [35] in response to antigen challenge compared with their male counterparts. In humans, females have been shown to increase INFγ levels in response to a variety of stimuli [36]. Thus, our observation of differential down-regulation of the IFN signaling in females (but not in males) with higher levels of IgE is consistent with these prior studies and suggests an important regulatory role for interferon signaling in the female allergic response.

Conclusions

Our results indicate that IL17RB may be the only gene expressed in CD4+ T cells whose transcript measurement is correlated with the variation in IgE level in asthmatics. We have demonstrated that sex is a critical modifier of the correlation between the CD4+ T cell transcriptome and IgE, and suggest that such sex-specific modification should be consideration in future studies of IgE.