Background

Chronic obstructive pulmonary disease (COPD) is characterized by progressive airflow limitation driven by an abnormal inflammatory response of the airways to inhaled particles and fumes [1]. The disease is predicted to become the third most common cause of death and the fifth most common cause of disability in the world by 2020 [2]. This disease remains under-recognized and under-diagnosed; the pathogenesis needs to be investigated.

Cigarette smoking is one of the most important risk factors for COPD, but the severity of the disease varies considerably, irrespective of the number of pack-years of smoking. Furthermore, only a minority of smokers (20%) develop the disease clinically, suggesting that in addition to smoking, COPD is partially genetically determined [3, 4]. COPD may be caused by a combination of genes and environmental influences. Genes have been associated with COPD in a family-based study, and some previous studies have demonstrated familial aggregation of COPD. The heritability of COPD is estimated to be 40–77% [5]. Some other twin studies have also indicated a genetic contribution to clinically relevant parameters of pulmonary function, such as forced expiratory volume in 1s (FEV1) and forced vital capacity (FVC) [6, 7].

Many studies of candidate genes for COPD and pulmonary function have been conducted over the past few years. Above all, genome-wide association studies (GWASs) have identified some loci associated with susceptibility to COPD [812] but with varying degrees of reproducibility. Conflicting results among these studies may be attributable to ethnic differences and sample sizes. In the past, candidate gene studies have focused on a single gene or on a few genes in combination; these genes were identified based on prior knowledge or suspected mechanisms of disease pathogenesis. Nonetheless, elucidating the genetics of respiratory disorders is severely hampered by genetic heterogeneity, the low penetrance of individual disease alleles, and the potential for gene–gene and gene–environment interactions. To date, the only proven genetic risk factor for COPD is the severe deficiency of alpha-1-antitrypsin (AAT), which is associated with a predisposition to early onset panacinar (panlobular) emphysema [13].

Furthermore, few gene studies performed on the Chinese population have focused on COPD. However, in China, the disease is increasingly prevalent. A 2007 survey of 20,245 participants in seven regions of China reported the occurrence of COPD in adults aged ≥40 years to be 8.2% [14]. Therefore, more gene-association studies are needed to identify genetic polymorphisms associated with the development of COPD in the Chinese population.

The aim of our current case–control study was to locate genes related to susceptibility to COPD in the Chinese Han population. We aimed to identify loci associated with COPD among 97 single-nucleotide polymorphisms (SNPs) of 46 genes (Table 1).

Table 1 Gene location and alleles of 97 single-nucleotide polymorphisms (SNPs)

Methods

Subjects

In total, 331 COPD patients were recruited: 256 from the Department of Respiratory Diseases of Shanghai Rui Jin Hospital, 60 from the Shanghai Jing-an Geriatric Hospital, and 15 from the Shanghai Gong Hui Hospital. COPD was diagnosed according to the criteria established by the NHLBI/WHO Global Initiative for COPD (GOLD) [15]. The diagnoses were based on certain patient parameters (e.g., age≥40 years and smoking history of ≥20 pack-years) and on the presence of relentless and progressive symptoms: cough, productive sputum, and breathlessness over many years; airflow limitation as indicated by FEV1/ FVC≤70%; FEV1 reversibility after the inhalation of salbutamol <12% of the pre-bronchodilator FEV1 (MS-Body Diffusion, Germany). Patients were excluded if they had a comorbid diagnosis such as asthma or lung cancer, or had radiographic abnormalities suggestive of other significant respiratory diseases and any hereditary diseases.

Control subjects (n = 213) were selected from a pool of healthy people who visited the general health checkup center of Shanghai Rui Jin Hospital during the same period. The enrollment criteria for the controls were as follows: age≥40 years, smoker, no known disease, no history of any disease. Lung function was measured at baseline following the American Thoracic Society/European Respiratory Society standard procedure to confirm no evidence of airflow obstruction. All of the COPD patients and control subjects were ethnic Han Chinese. The study protocol was approved by the medical ethics committee of Shanghai Rui Jin Hospital, Shanghai Jiaotong University School of Medicine, and all participants gave written informed consent.

DNA extraction and genotyping

We chose 97 candidate SNPs identified in previously published GWASs and by searching the dbSNP database of NCBI (references in Additional file 1: Table S1). Their minor allele frequencies (MAFs) were >0.05 in the Han Chinese population we studied.

A 4-ml peripheral blood sample was obtained from each participant for DNA analysis. Plasma was separated by centrifuge and stored at −80°C until further use. We extracted genomic DNA using a QuickGene DNA whole blood kit, (Life Sciences, FUJIFILM, Japan). Any sample with DNA concentration <10 ng/ul was excluded, and another sample was acquired.

For genotyping, we first performed multiplex PCR, a variant of PCR that enables simultaneous amplification of many targets of interest in one reaction by using more than one pair of primers [16]. Mass-ARRAY™ Assay Design 2.0 software was used to design multiplex primers for each SNP: 1st PCR primer, 2nd PCR primer, and UEP primer. The primers of 97 SNPs are shown in Additional file 1: Table S1. Genotyping was achieved using the Mass-Array™ Technology platform of Sequenom, Inc. (San Diego, CA, USA). For quality control, two independent readers interpreted the results, and a random selection of 10% of all samples was retested. No discrepancies were discovered in the replicate tests. All genotyping analyses were blinded with respect to the case/control status, and all samples were analyzed in the same lab and under the same conditions. The results were 100% concordant. Several SNP samples were finally excluded because ≥10% of the genotyping data were missing.

Data analysis

Data analyses were performed using the Statistical Package for the Social Science 20.0 (SPSS, Inc., Chicago, IL, USA). Continuous variables (age, smoking history, and pulmonary function) were calculated as means (± standard deviation). The two-sided Student’s t-test was used to determine significant differences in clinical data between the COPD cases and the control subjects. The significance level for t tests of clinical information was 0.05. The χ2 test and unconditional logistic method were applied to compare genotype and allele frequencies between the two groups, logistic analysis was adjusted for age, gender and smoking. Frequencies were compared, respectively, using a p cutoff of 0.05 (like previous studies) and the Bonferroni correction method for multiple testing. The relative risk associated was estimated as an odds ratio (OR) with a 95% confidence interval (95% CI). Each of the SNPs in the control group was analyzed for Hardy-Weinberg equilibrium (HWE) using chi-square test and exact test, SNPs were excluded from the analysis if they were out of HWE (p≤0.05). Haplotype frequencies and linkage disequilibrium (LD) analyses were evaluated using PHASE and Haploview software.

Results

Study population characteristics

The study population characteristics are described in Table 2. They did not significantly differ in sex, age, or smoking history. FEV1 predictive (FEV1%) and FEV1/FVC of the case group were significantly decreased compared with the control group (p<0.05).

Table 2 Demographics of COPD patients and control subjects

Association analysis of each genotype

Eight SNPs (rs361525, rs1042713, rs34829399, rs2853677, rs2571445, rs8192288, rs2066960, and rs2230054) that deviated from HWE in the controls were removed from the association analysis. Thirteen SNPs (rs1130866, rs56155294, rs10498230, rs2035901, rs3091244, rs511898, rs2869967, rs7583463, rs2276109, rs737693, rs9904270, rs4934, and rs6830970) were also eliminated from the analysis due to lack of genotyping data in ≥10% of the sample. Finally, 76 of the 97 SNPs were included in the association analysis. The allele frequencies (Table 3) and the genotype distributions for these SNPs were analyzed in samples from 331 COPD patients and 213 control subjects. Seven SNPs tended to be associated with COPD: rs2353397, rs1800629, rs2241712, rs1205, rs20541, rs2070600, and rs10947233. Among these seven SNPs, after Bonferroni correction, rs2353397 was most strongly associated with susceptibility to COPD. The C allele (rs2353397) of the human hedgehog interacting protein (HHIP) gene occurred more frequently in COPD patients (58%) than in the control subjects (29%) (OR = 2.16, 95% CI 1.66–2.81, p<0.0001, p(Bonferroni) <0.0001). The G allele (rs1800629) of the TNF-α gene was more frequently detected in COPD patients (95%) versus control subjects (90%) (OR=1.97, 95% CI 1.21–3.21, p=0.0060, p(Bonferroni)=0.4560). The frequency of the A allele (rs2241712) of the TGF-β1 gene was significantly higher in COPD patients (52%) than in healthy controls (45%) (OR=1.24, 95% CI 0.96–1.59, p=0.0460, p(Bonferroni)=3.7848). The C allele (rs1205) of the CRP gene occurred more frequently in COPD patients (47%) compared with control subjects (40%) (OR=1.48, 95% CI 1.14–1.91, p=0.0030, p(Bonferroni)=0.2280). More COPD patients (35%) carried the T allele (rs20541) of the IL-13 gene than control subjects (28%) (OR=1.36, 95% CI 1.04–1.80, p=0.0280, p(Bonferroni)=2.1280). The G allele (rs2070600) of the AGER gene was found more frequently in COPD patients (81%) than in healthy controls (73%) (OR=1.47, 95% CI 1.08–1.98, p=0.0130, p(Bonferroni)=0.9880). The G allele (rs10947233) of the PPT2 gene occurred more frequently in COPD patients (79%) than in control subjects (72%) (OR=1.51, 95% CI 1.12–2.03, p=0.0060, p(Bonferroni)=0.4560).

Table 3 Allele frequencies in COPD and control subjects for SNPs

For analysis of genotypic association of these seven SNPs under certain genotype models (Table 4), rs2353397 TT protected subjects from the disease; CC, CT carriers were more susceptible to COPD (OR=1.01, 95% CI 0.79–1.32, p<0.0001). The rs1800629 GG homozygous carriers exhibited an increased susceptibility to the disease compared with AA, GA carriers (OR=1.90, 95% CI 1.12–3.21, p=0.0170). The rs1205 CC, CT genotype increased risk for COPD compared with the TT homozygous genotype (OR=1.82, 95% CI 1.21–2.73, p=0.0040). Individuals carrying the rs20541 TT, CT genotype were at a significantly higher risk for COPD than were healthy subjects carrying the CC genotype (OR=1.47, 95% CI 1.01–2.13, p=0.0450). The rs2070600 GG homozygous carriers tended to develop COPD more frequently that AA, GA carriers (OR=1.55, 95% CI 1.06–2.26, p=0.0240). rs10947233 GG, GT carriers were associated with susceptibility to COPD compared with TT carriers (OR=3.30, 95% CI 1.47–7.44, p=0.0040).GG carriers versus GT, TT carriers (OR=1.56, 95% CI 1.07–2.27, p=0.0200).

Table 4 Analysis of Genotypic Association of SNPs identified under Genetic Models

The complete genotype distributions of the other SNPs are listed in Table 5. Frequencies under different genotypic models of each SNP were compared between the COPD group and the control group.

Table 5 Genotype frequencies of each SNP in COPD and control subjects for SNPs

Haplotype analysis

Using Haploview software, CRP gene polymorphisms were determined to be in linkage disequilibrium (D’=1.0). Using PHASE software, haplotype frequencies for the polymorphisms rs1205 and rs2808630 of CRP at chromosome 1 were compared with respect to frequency between COPD patients and healthy controls. These two SNPs formed three haplotypes: CC, CT, and TT (Table 6 and Figure 1). Among them, only the TT haplotype was more frequently detected in controls (61%) compared with COPD patients (52%) (OR=0.69, 95% CI 0.49–0.98, p=0.0377). TGF-β1 rs2241718 and CDC97 rs6957 were in linkage disequilibrium (D’=0.98); they formed two haplotypes, GC and GT. These two haplotypes were respectively more frequent in healthy controls compared with COPD patients (OR=0.33, 95% CI 0.23–0.48, p=1.88×10-9) (Table 6 and Figure 2). TGF-β1 rs1800469 and rs2241712 were also in linkage disequilibrium (D’=0.98); they formed TG and CA haplotypes. However, no significant differences between the two groups were detected (Table 6 and Figure 2).

Table 6 Haplotypes of the CRP and TGF-β1 gene
Figure 1
figure 1

Linkage disequilibrium of SNPs in the CRP gene using Haploview software. The red color indicates the higher linkage disequilibrium (D’=1.0) between rs2808630 and rs1205.

Figure 2
figure 2

Linkage disequilibrium of SNPs in the TGF-β1 and CDC97 genes using Haploview software. The red color indicates the higher linkage disequilibrium (D’=0.98) between the rs2241718 and rs6957; between the rs1800469 and rs2241712.

Discussion

In this study, we sought to determine which of 76 SNPs we chose were associated with the development of COPD. Our case–control study verified that the following SNPs were associated with COPD: rs2353397 C, rs1800629 G, rs2241712 A, rs1205 C, rs20541 T, rs2070600 G, and rs10947233 G. The rs2353397 C allele was most strongly associated with COPD.

rs2353397 CC, CT genotypes of the HHIP gene were associated with susceptibility to COPD in the Chinese Han population. The HHIP gene is located at chromosome 4q31.21–31.3, position 145517578; it encodes the HHIP protein [17]. This protein is a critical regulator of the hedgehog (Hh) signalling pathway, which has been implicated in cell development, cell repair, and cancer development in multiple tissues [18]. The idea that COPD could be associated with inappropriate growth or structural defects in small airways makes HHIP an attractive candidate developmental gene. Several GWASs have also demonstrated that the 4q31 locus, which contains the HHIP gene, is associated with COPD and lung function [912]. A GWA meta-analysis for pulmonary function in 20,890 participants of European white ancestry revealed eight genes associated with COPD and concluded that HHIP is associated with FEV1/FVC [10]. Few studies prior to our current study have reported the SNPs of the HHIP gene related to COPD in an Asian population. Polymorphisms may lead to changes in gene expression, resulting in functional alteration and, subsequently, to COPD. According to Zhou et al. (2012) [19], significant decreases in expression of the HHIP gene at the mRNA and protein levels were observed in COPD lungs compared with lungs of smokers with normal lung function. The risk-associated haplotype confers decreased activity on the HHIP promoter, indicating that lower HHIP expression may exacerbate smoking-induced COPD pathogenesis. Lemjabbar-Alaoui et al. (2006) [20] demonstrated that hedgehog signaling proteins are critical mediators of cigarette smoke-induced disease, such as lung cancer and chronic airway inflammatory disease, and the expression levels of hedgehog signaling proteins are modulated by HHIP. Based on our current study of HHIP polymorphisms and the Hh signaling pathway, we need to further our mechanistic research in the context of smoking.

Our results also demonstrated that rs1800629 GG carriers of the TNF-α gene were at several times the risk for COPD compared with GA, AA carriers. TNF-α is critical in the regulation of inflammation; it induces a cascade of other inflammatory cytokines, chemokines, and other growth factors; it is important in the pathogenesis of many diseases. Several gene studies have also determined that the promoter polymorphism of TNF-α is associated with chronic bronchitis or the extent of emphysematous changes. Two of these studies were performed with Caucasian subjects, two with Japanese subjects [2124]. The promoter polymorphism may have caused varied concentrations of serum TNF-α, which have been associated with induced sputum in bronchial biopsies and with bronchoalveolar lavage fluid in stable COPD patients and during exacerbations, compared with that of control subjects [25]. Some investigators have shown that TNF-α genotypes do influence the severity of infectious diseases, while others have concluded that polymorphisms of this gene promoter are of no functional consequence [26, 27].

A polymorphism of the TGF-β1 gene, rs2241712 A, tended to be a risk-associated allele in our study. TGF-β1 is one of the important cytokines involved in the inflammatory process of COPD. TGF-β1 expression is usually increased in the airways of patients. Su et al. (2005) [28] found that more carriers of the -800A allele, or fewer carriers of the -509T allele were detected among the COPD patients, but only 84 COPD cases and 97 healthy controls participated in their research. In addition, we used a Chinese Han population, while Su et al. recruited people in a general Chinese population. Van Diemen et al. (2010) [29] showed that the TGF-β1 rs6957 SNP haplotype with the major allele of rs6957 and minor alleles of rs1800469 and rs1982073 were associated with COPD. The differences in study populations may explain these dissimilarities between our studies. Various studies have indicated that certain SNPs of the TGF-β1 gene are functional and result in higher levels of circulating TGF-β1 [30, 31].

The SNP rs20541 at IL-13 gene exon 4 tended to be associated with COPD in our study. Genotype TT, CT carriers were at risk. IL-13 is a Th2 cytokine implicated in the recruitment of inflammatory cells from the blood to the lung, which may be involved in the pathogenesis of COPD. In experimental studies, the overexpression of IL-13 in the adult murine lung caused emphysema [32]. The number of IL-13+ cells was elevated in the bronchial submucosa of smokers with chronic bronchitis compared to asymptomatic smokers [33]. rs2066960, rs20541, and rs1295685 in the IL-13 gene were associated with COPD risk and lower baseline lung function in the study by Beghé et al. (2010) [34], which used a Caucasian study population. We chose the same SNPs located in the IL-13 gene, but our results showed that only rs20541 is of significance in susceptibility to COPD in the Chinese Han population. In addition, another study revealed the role of rs20541 in another chronic airway inflammatory disease (asthma), which indicates that the polymorphism in the coding region might contribute to airflow limitation [35].

The polymorphism of another inflammatory marker in the CRP gene, rs1205 C, was also a risk-associated allele according to our current study. Sunyer et al. (2008) [36] assessed the association between rs1205 and lung function; they demonstrated that the TT homozygous genotype in the CRP gene is associated with better lung function. Our result that the TT genotype protects people against COPD is similar to theirs because COPD is characterized by airflow limitation according to lung function. This polymorphism has been previously reported associated with varied levels of CRP in several studies [37]. Higher levels of CRP in peripheral blood may cause impaired lung function [38].

Two other SNPs, AGER rs2070600 and PPT2 rs10947233, tended to be associated with COPD in our current study. rs2070600 GG and rs10947233 GG, GT carriers tended to develop COPD. An occidental GWAS demonstrated a role for the chromosome 6p21 locus including the AGER and PPT2 genes in COPD development in smokers [39]. Repapi et al. (2010) [12] reported a meta-analysis of GWAS results from 20,288 participants and follow-up analyses in 54,276 participants; they identified five novel, genome-wide, significant loci for pulmonary function containing AGER rs2070600, but their analysis subject group did not include Asians. In addition, rs2070600 was associated with severe COPD in a study of Caucasian smokers from Poland [40]. The two SNPs of AGER and PPT2 were also identified associated with FEV1/FVC in the 2009 work of Hancock et al. [10].

In addition, our study revealed some haplotypes composed of rs1205 and rs 2808630 in the CRP gene, rs2241718 and rs 6957 at the TGF-β1 and CDC97 genes. In our future research, we will further analyze our data with regard to lung function.

Our research had some limitations. First, a larger sample size would have improved quality of the results. Second, although we selected 97 SNPs for the study and found some loci related to the disease, further GWASs of COPD are needed in the Chinese Han population to identify more associated polymorphisms. It is likely that more genetic risk factors than those identified in this study contribute to the development of COPD. In our future work, we will further our investigation on gene function of the genetic factors related to the development of COPD.

Conclusion

Our findings identified some genetic variants associated with COPD. This study has provided important information regarding the association of these polymorphisms to the susceptibility to COPD in the Chinese Han population. However, these findings need to be verified. These data emphasize the need for further research regarding gene function in COPD that will ultimately contribute to future gene therapies for this significant and costly disease in the Chinese Han population.