Background

Asthma is a disease characterised by recurrent respiratory symptoms, reversible variable airway obstruction, airway inflammation and increased bronchial hyper-responsiveness [1]. Estimates suggest that 100–150 million people worldwide have asthma. Atopy is a predisposition towards the development of immediate hypersensitivity against common environmental antigens. Atopy and asthma are closely related, however they are not interchangeable. Most asthmatic individuals are atopic but atopic individuals may not have asthmatic symptoms. Asthma and atopic disease show strong familial aggregation and heritability estimates vary between 36–79% [2]. A greater understanding of the genetic basis of asthma and atopy holds great promise for the identification of novel therapeutic targets.

Linkage analysis using short tandem repeats or microsatellites to follow the transfer of genetic information between generations has been used to identify chromosomal regions that potentially contain asthma and atopy susceptibility genes. Commonly sub-phenotypes of clinical relevance are used including; elevated total Immunoglobulin E (IgE) levels, atopy defined by positive skin prick test to one or more allergen or elevated specific IgE and bronchial hyper-responsiveness (BHR) [3]. These studies have identified linkage on multiple chromosomal regions e.g. 2q22-33, 5q31.1-33, 6p21.3, 11q13, 12q14.3-24.1, 13q14, 14q11.2-13 and 19q13; however replication of linkage findings has been limited [3]. Low statistical power and the potential for type I and type II errors may explain these findings. Combining data has the potential to provide inferences about the consistency of results across studies and to identify regions that contain asthma and atopy susceptibility genes.

The aim of the current study was to complete the first meta-analysis of available genome wide linkage data for asthma and related traits (asthma per se, BHR, total IgE, allergen skin prick test response (SPT)) in the Caucasian population using the Genome Scan Meta Analysis (GSMA) method [4]. GSMA is a non parametric, rank based approach and has been used extensively in other disorders e.g. schizophrenia [5].

Methods

Systematic Literature Search

To identify published studies for inclusion in the GSMA of asthma and related phenotypes we completed a systematic literature review in September 2006. We used PubMed and the following search string (Asthma OR BHR OR bronchial hyper responsiveness OR bronchial hyperreactivity OR AHR OR airway hyper responsiveness OR respiratory hypersensitivity OR histamine OR slope OR methacholine OR atopy OR atopic OR dermatitis, atopic OR IgE OR immunoglobulin E OR SPT OR skin prick tests OR skin tests) AND linkage AND genome-scan OR scan OR genome OR genomewide OR genome-wide OR LOD OR microsatellite). Limits were set on the search including; published in English, human studies, published 1996–2006 and the exclusion of reviews. This initial search identified 516 matches of which 488 were discarded as not containing genome-wide linkage data. A further eight studies were discarded as they were in non-Caucasian populations and we wished to avoid any population stratification issues leaving 20 potential Caucasian studies for inclusion. Genome-wide linkage analyses for asthma related traits in the Hutterite Founder population [6] was not included in the current analyses as limited data was available and the focus of the present study was Caucasian out-bred populations.

Of the 20 manuscripts identified a further nine were removed from the analyses for a combination of the following reasons; the study was superseded by another including the families from the original, LOD score plots in the manuscript were not labelled and/or unreadable, no genome-wide data was presented e.g. in the manuscript describing the positional cloning of ADAM33, linkage analyses in 460 families for asthma, IgE and BHR phenotypes were performed but has never been published in full [7] or the phenotypes studied did not meet our criteria. All authors were contacted and invited to provide complete datasets.

Phenotype definition and study inclusion/exclusion

There was a large degree of heterogeneity in phenotype definitions and so these were standardised for inclusion. Asthma was defined using doctor diagnosis and/or currently taking asthma medication, however we did include data from the Dutch population which used an algorithm based on asthma symptoms, the presence of BHR, reversibility to β2-adrenergic receptor agonist and smoking history to define asthma [8]. Analyses were completed with and without the Dutch families. Total IgE levels were analysed in the genome scans using quantitative data generated by Pharmacia CAP system [9], Pharmacia IgE EIA [10], Phadebas PRIST [11] and ELISA techniques [12, 13] which have shown good inter assay correlation [14, 15], therefore all studies were included. Positive skin prick response to one or more allergen was used as a marker of atopy and for inclusion in the GSMA. However, allergens used in each study varied; Dermatphagoides pteronyssinus, mixed grass pollen [9], Dermatphagoides pteronyssinus, Cladosporium herbarum, Alternaria tenuis, timothy grass, olive, birch, Parieteria judaica, ragweed, Aspergillus, Blatella Germancia [11], mixed grass and tree pollens, mixed weeds, Dermatphagoides pteronyssinus, dog, cat, a mixture of guinea pig and rabbit, horse, Aspergillus fumigatus, Alternaria alternate [10], house dust mite [12], Dermatphagoides pteronyssinus and 10 others [13] and Dermatphagoides pteronyssinus, D. farinae, dog, cat, grass mix, pollen and alternaria [16]. BHR was measured in multiple ways using different provocation stimuli e.g. histamine or methacholine providing categorical and/or quantitative analyses. These provocation stimuli have shown a significant correlation (r = 0.95) in the responses induced [17], however it is worth noting that this has not been reproduced in all studies. Studies with BHR data were included in the GSMA irrespective of the criteria used in the original manuscript.

Genome Scan Meta Analysis (GSMA)

GSMA was used as it is able to combine linkage data from studies with different marker sets and analysed by different methods including permutated p-values. GSMA was implemented using GSMA software [4, 5, 18]. Briefly, the genome was divided into 120 bins of approximately 30 cM, for each study the maximum evidence for linkage e.g. LOD score or p-value was identified for each bin and these bins were then ranked relative to their evidence for linkage in that study. These ranks were summed across studies and the summed rank (SR) forms the basis of the test statistic [4]. An ordered rank (OR) statistic was also generated which gives a genome wide interpretation of significance by comparing the n-th highest summed rank with the distribution of the n-th highest summed ranks obtained through simulation [5]. We completed an unweighted and weighted analyses using information content (√(no. pedigrees × no. markers)) as a weighting factor.

Statistical Significance

Simulation studies have shown that any bin with a p(SR) < 0.05 and a p(OR) < 0.05 has a high probability of containing a true susceptibility gene [5]. Applying Bonferroni correction a p < 0.000417 provides evidence for genome wide significance for linkage and a p < 0.0083 provides suggestive evidence for linkage [5].

Results

Data included

The selection criteria and data requests provided eleven studies of nine Caucasian asthma populations for our analyses (Table 1) including data from 1,267 pedigrees (n = 5,832) for asthma (80.2% of pedigrees available in the public domain or following request, missing 249 [19] and 65 pedigrees [20]), 989 pedigrees (n = 4,294) for BHR (79.9% of available, missing 249 pedigrees [19]), 1,093 pedigrees (n = 4,746) for SPT (81.5% of available, missing 249 pedigrees [19]) and 729 pedigrees (n = 3,224) for total IgE (65.9% of available, missing 249 [19], and 129 pedigrees [21]).

Table 1 Characteristics of genome-wide linkage studies included in the GSMA

Asthma

The weighted asthma analyses did not identify any chromosomal region with a p(SR) and p(OR) < 0.05 (Figure 1 and Table 2). No bin p(SR) met genome wide significance (p < 0.000417) or suggestive evidence for linkage (p < 0.0083) in these analyses however three regions demonstrated a p(SR) < 0.05; 6p22.3-p21.1, 10p14-q11.21 and 12q24.31-qter (Table 2). Eight regions met suggestive linkage criteria in the ordered rank analyses; 1p31.1-p13.3, 2p12-q22.1, 4p14-q13.3, 7q34-qter, 12pter-p12,1, 12p12.1-p11.21, 14q32.12-qter, 17pter-p12 and 20pter-p12.3 (Table 2). Analyses of the asthma phenotype using unweighted GSMA generated similar findings to the weighted analyses (Figure 2). To confirm that the inclusion of the Dutch linkage data for the asthma phenotype (defined by algorithm) had not confounded the analyses we completed GSMA without these data focusing on doctor diagnosed asthma only (1,067 pedigrees). Again, no chromosomal region with a p(SR) and p(OR) < 0.05 was identified (data not shown).

Figure 1
figure 1_669

p(SR) & p(OR) in weighted GSMA.A. asthma. B. bronchial hyper-responsiveness. C. total serum IgE. D. skin prick test response. A p(SR) of p < 0.000417 = significant linkage, p < 0.0083 = suggestive linkage. p(SR) & p(OR) data were transformed using f(x) = 0.05/x and plotted on a log10 scale to improve clarity.

Table 2 Weighted GSMA for Asthma (1,267 pedigrees, n = 5,832)
Figure 2
figure 2_669

p(SR) & p(OR) in unweighted GSMA.A. asthma. B. bronchial hyper-responsiveness. C. total serum IgE. D. skin prick test response. A p(SR) of p < 0.000417 = significant linkage, p < 0.0083 = suggestive linkage. p(SR) & p(OR) data were transformed using f(x) = 0.05/x and plotted on a log10 scale to improve clarity.

Bronchial Hyper-responsiveness

The weighted BHR analyses strongly suggested that 6p22.3-p21.1 contains BHR susceptibility gene(s) as a p(SR) and p(OR) < 0.05 was observed (p = 0.007, p = 0.049 respectively (Figure 1 and Table 3)). Two other regions showed suggestive evidence (p < 0.0083) for linkage to the BHR phenotype; 2p12-q22.1 (p(SR) = 0.006) and 11q24.1-qter (p(SR) = 0.005). In the unweighted analyses three regions showed evidence for linkage (p(SR) and p(OR) ≤ 0.05), i.e. 2q22.1-q23.3, 7q12.11-q31.1 and 5q23.2-q34 (Figure 2).

Table 3 Weighted GSMA for bronchial hyper-responsiveness (989 pedigrees, n = 4,294)

Positive allergen skin prick test

Weighted analyses of the SPT phenotype identified two regions that had a p(SR) and p(OR) ≤ 0.05 strongly suggesting these regions are susceptibility loci (Figure 1 and Table 4). These regions are 17p12-q24.3 (two adjacent bins, 17p12-q21.33 p(SR) = 0.00043, p(OR) = 0.050 and 17q21.33-q24.3 p(SR) = 0.047, p(OR) = 0.038) and region 3p22.1-q22.1 (three adjacent bins, 3p22.1-p14.1 p(SR) = 0.045, p(OR) = 0.063, 3p14.1-q12.3 p(SR) = 0.003, p(OR) = 0.0045 and 3q12.3-q22.1 p(SR) = 0.00084, p(OR) = 0.00406). The analyses of the unweighted SPT datasets identified chromosomes 3 and 17 as containing the major determinants (Figure 2).

Table 4 Weighted GSMA for skin prick test response (1,093 pedigrees, n = 4,746)

Total Immunoglobulin E

Weighted analyses of the IgE phenotype strongly suggested 5q11.2-q14.3 (p(SR) = 0.031, p(OR) = 0.060) and 6pter-p22.3 (p(SR) = 0.033, p(OR) = 0.026) contain genes that influence IgE levels (Figure 1 and Table 5). The region adjacent to 6pter-p22.3, i.e. 6p22.3-p21.1 has a p(SR) = 0.00999 approaching suggestive linkage providing further evidence for this region. Analyses of the IgE phenotype using the unweighted GSMA showed similar overall findings (Figure 2).

Table 5 Weighted GSMA for total serum IgE (729 pedigrees, n = 3,224)

Multiple phenotype analyses

In addition to the primary phenotype analyses we investigated overlapping chromosomal regions containing genetic determinants of asthma and asthma related traits consistent with gene(s) having pleiotrophic effects in asthma and allergy (Table 6). Interestingly, region 6p22.3-p21.1 which contains the HLA region showed a p(SR) < 0.05 in the asthma, BHR and IgE analyses potentially as expected due to the role of HLA restriction in many immunological mechanisms. Several other regions also showed overlapping concordance, in particular regions; 3p14.1-q12.3 (asthma, SPT), 5q23.2-q34 (asthma, BHR, IgE) and 7p21.1-14.1 (asthma, BHR, IgE).

Table 6 Overlapping chromosomal regions in weighted GSMA for asthma, and the three intermediate phenotypes

Discussion

This study represents the first meta-analysis of asthma and related trait linkage data using the majority of the data available for the Caucasian asthma cohorts in the public domain. This analysis combines data from 10 years of asthma and atopy genetics and is extremely timely providing a definitive analysis of available linkage data to complement the highly anticipated whole genome association findings. Analysis of asthma and atopy quantitative traits identified significant evidence for relatively few chromosomal regions as containing susceptibility gene(s) using the most stringent genome-wide criteria i.e. BHR (6p22.3-p21.1), total IgE (5q11.2-q14.3 and 6pter-p22.3) and positive allergen skin prick test (3p22.1-q22.1, 17p12-q24.3). Significantly no chromosomal region met stringent genome-wide criteria in the asthma phenotype analyses. This study did provide supporting evidence for regions containing previously identified asthma susceptibility genes.

Linkage analyses has proven to be highly successful in single gene disorders e.g. cystic fibrosis but has been problematic in asthma and atopy mainly due to the complex genetic basis of these phenotypes and the use of inadequate samples sizes leading to both type I and type II errors. In the current study we aimed to combine all available linkage data for asthma and related trait phenotypes (BHR, total IgE, positive allergen skin prick test) and provide inferences about the consistency of results across studies, ultimately providing a focus for future gene discovery. The analyses of the quantitative traits provided the most significant findings and may be consistent with the observation that using objective markers of disease adds to the homogeneity of the data and may improve results. In addition the number of genes regulating these phenotypes may be smaller than "asthma" and power to find these may be increased.

This study strongly suggested that regions 17p12-q21.33 and 3p14.1q22.1 contain gene(s) that influence allergen skin prick responses and by inference atopy. Both of these regions are large spanning 28.5 and 70.3 Mbp respectively. The 3p21 region has been identified as containing genetic determinants of specific allergen responses in the Hutterite founder asthma cohort [22] and has been identified as an atopic dermatitis locus (3p24-22) [23]. Linkage to chromosome 17 and specific allergen responses has been described in the Hutterite population however these linkages map to 17q25 in asthma [22]. Linkage to atopic dermatitis on 17q23.1 has been reported [23]. Also of significance is the fact that the chromosome 17 locus (17p12-q21.33) identified in the current analyses contains the recently identified ORMDL3 gene [24]. Using whole genome association variants in the ORMDL3 gene were shown to be associated with childhood onset asthma [24]. Overall our data suggest that the major genes influencing allergen skin prick responses are found on chromosomes 3 and 17.

In the total IgE analyses there was strong evidence for the presence of genes(s) regulating IgE production in the 5q11.2-q14.3 and 6pter-p21.1 regions. This region on chromosome 6 contains the Human Leukocyte Antigen (HLA) locus and so may be predicted to contain determinants of immunological processes. The finding that 5q11.2-q14.3 may contain gene(s) that influence IgE production is novel and warrants further investigation. The IgE analyses also confirmed the potential contribution of genes within the 5q23.2-q34 and 11q13.3-q22.3 regions that have previously been suggested [25]. Interestingly, of the four positionally cloned genes identified using IgE as a key phenotype only the region encompassing the GPRA gene showed limited (non significant) linkage (7p21.1-p14.1, p(SR) = 0.027).

In the BHR analyses the 6p22.3-p21.1 region was identified as containing susceptibility gene(s). This region contains the HLA locus and the HLA-G gene within this region has previously been identified as a potential asthma and BHR susceptibility gene using four cohorts (including the Dutch cohort used in the current analyses) [26]. Our data confirms this region as a BHR locus and less significantly a potential asthma locus (p(SR) = 0.039). Interestingly, four of the six populations used for the BHR analyses ranked the chromosome 2p locus identified in the top 33% of bins including the Genetics of Asthma International Network (GAIN) study (data not shown). Further mapping of the 2p locus in the GAIN population using single nucleotide polymorphisms (SNP) spanning the region refined the linkage peak to ~70 cM with the greatest evidence being for the BHR phenotype (LOD score 4.58) [16]. Region 2p12-q22.1 contains the DPP10 gene that has previously been identified as an asthma and total IgE susceptibility gene [27] and the IL1RN gene identified using asthma as the primary phenotype [28]. The identification of 11q24.1-qter as a potential BHR susceptibility locus appears to be novel and therefore these data may provide a platform for novel BHR susceptibility gene identification. The BHR analyses also confirmed the potentially modest contribution of other loci in determining the BHR phenotype including e.g. 5q23.2-q34 and 19q12-q13.33. In the analyses of the BHR phenotype significant linkage was not driven by studies using a specific agonist i.e. studies using both methacholine and histamine provocation contributed to the signal at a specific locus (data not shown). These data suggest responsiveness to these agents share a common genetic basis and provide support for combining studies in the meta-analyses using these different provocation stimuli.

Significantly, using asthma as a phenotype we did not identify any chromosomal region as showing genome-wide significance in our analyses. In most studies, asthma was defined as a doctor's diagnosis. In the Dutch study, families were ascertained through a proband with a doctor's diagnosis of asthma. In the offspring of these asthma patients, an algorithm was used since a doctor's diagnosis per se underestimated the prevalence of asthma in the offspring [8]. To confirm that the inclusion of the Dutch linkage data for the asthma phenotype had not confounded the analyses we completed GSMA without these data and again no chromosomal region with a p(SR) and p(OR) < 0.05 was identified (data not shown). These data may reflect the true locus heterogeneity in asthma or reflect differences in phenotype definition when comparing asthma based on doctors' diagnosis over different cohorts. In addition the use if a binary trait i.e. affection will have the lowest intrinsic power compared to continual data e.g. IgE levels.

Interestingly, the recently published whole genome association study using 994 asthmatic children and 1,243 non asthmatic children identified only 16 SNPs (eight on chromosome 17) from a total of 317,447 SNPs tested showing a significant association with asthma per se (5% false discovery threshold, stratification corrected) [24]. This study complements our analysis using data from 1,257 families and taken together suggests that the use of asthma as a phenotype may be confounded due to locus heterogeneity in asthma and/or issues concerning phenotype definition/heterogeneity when combining cohorts. It is important to note that the family based studies included in this meta-analyses address the genetic basis of asthma defined in children i.e. the mean age of siblings in most studies is < 16 years.

Several regions showed suggestive evidence for linkage to the asthma phenotype mainly based on the p(OR) statistic, however caution should be taken interpreting p(OR) in isolation, especially in the presence of incomplete data sets [5]. Further evidence for the chromosome 12 and 20 loci comes from the finding that adjacent bins have a p(OR) < 0.05 suggesting the linkage is spanning the bin interval. 4/6 regions containing the positionally cloned asthma susceptibility genes i.e. ADAM33 (20p13[7]), PHF11 (13q14.3 [29]), DPP10 (2q14.1 [27]), HLAG (6p21.3 [26]), GPRA (7p14.3 [30]), IL1RN (2q13 [28]) and the recently reported gene ORMDL3 (17q12-q21[24]) were identified by the GSMA approach, ADAM33 (p(OR) = 0.005), DPP10 and IL1RN (p(OR) = 0.007) and less significantly HLA-G (p(SR) = 0.039) and GRPA (p(OR) = 0.031). In addition our data also suggests that further investigation of additional chromosomal regions may be productive e.g. 1p31.1-p13.3 and 14q32.12-qter. Recently, the 1p31 and 14q32 regions have been highlighted as potential asthma loci in a French cohort with data suggesting 1p31 may contain gene(s) of importance to asthma and atopic dermatitis co morbidity and the 14q32 locus may interact with smoking exposure and contain asthma susceptibility gene(s) [31, 32].

The analysis of overlap between chromosomal regions confirmed the importance of the HLA locus on chromosome 6 as being a key susceptibility locus in asthma and also highlighted other regions that may be of importance i.e. 5q23.2-q34 and 7p21.1-14.1. The 5q23.2-q34 region contains the cytokine gene cluster (IL4, Il13, IL5, IL12B) and has previously been suggested as an asthma/atopy susceptibility locus [3] and the 7p21.1-14.1 region contains the previously identified asthma susceptibility gene, GPRA (7p14.3) [30].

In conclusion, we present the first systematic meta-analyses of asthma and related trait linkage data in the Caucasian population. These data are based on the majority of the data available in the public domain (or through collaboration) therefore we do not consider that exclusion or missing data for other populations has biased our analyses. While the GSMA method has limitations e.g. only large chromosomal regions can be identified, these analyses have determined the role of several previously identified susceptibility loci and highlighted the significance of regions not previously implicated in asthma and atopy susceptibility. Importantly, this study also highlighted the limitations of using asthma as a phenotype in contrast to quantitative traits even with the increased power of 1,267 families composed of 5,832 individuals. Finally, these data will provide useful guidance for the interpretation of the anticipated genome wide association analyses in asthma and atopy.