Background

Lung cancer is the worldwide leading cause of cancer-related death [1]. During the past decades, the rate of incidence and mortality of lung cancer in Korea have been increasing significantly and constantly[2]. Although lung cancer has been considered as a disease caused by smoking and environmental/occupational exposure, previous studies suggest that genetic factors may also contribute to the risk of lung cancer [3].

Single nucleotide polymorphisms (SNPs) are the most common form of human genetic variation, and they may contribute to an individual's susceptibility to cancer [4, 5]. Many previous studies have demonstrated that some polymorphisms of certain genes are associated with the risk of lung cancer, affecting either the gene expression or activities of enzymes [68].

The HER-2 (also known as erbB-2 or neu and a member of the epidermal growth factor receptor family), proto-oncogene is located at chromosome 17q21 and encodes a transmembrane glycoprotein (p185) with tyrosine kinase activity [9, 10]. Somatic mutations in the HER-2 gene have not been observed in humans, however amplification and overexpression of the HER-2 protein occur in lung cancer and contribute to poor prognosis [1113]. Recently, several polymorphisms of the HER-2 gene have been deposited in public databases http://www.ncbi.nlm.nih.gov/SNP. Previous studies have suggested functional implications of the polymorphisms by showing that the polymorphisms of the HER-2 gene might result in increased autophosporylation and tyrosine kinase activation [14, 15]. It is possible that the polymorphisms are associated with lung cancer, modulating the susceptibility to disease.

To test this hypothesis, we performed a case-control study to evaluate the association between the polymorphisms of the HER-2 gene and the risk of lung cancer in the Korean population.

Methods

Study subjects

All 814 study subjects were recruited between August 2001 and June 2005, including 407 lung cancer patients recruited from the patient pool at the Genomic Research Center for Lung and Breast/Ovarian Cancers (Seoul, Korea) and Inha University Medical Center (Incheon, Korea). The histological classification and staging of all lung cancer cases was performed by pathological evaluation at the time of diagnosis. 407 control subjects randomly selected from a pool of healthy volunteers who had visited the Cardiovascular Genome Center (Seoul, Korea), Genome Research Center for Allergy and Respiratory Disease (Seoul, Korea), and Keimyung University Dongsan Medical Center (Daegu, Korea). Detailed information on diet, smoking status, drinking status, lifestyle, and medical history were collected by a trained interviewer using a structured questionnaire. This study was also approved by the institutional review board of participating institutes, and a written informed consent was obtained from each participant.

Selection of the polymorphisms for the HER-2 gene

6 SNPs of HER-2 gene were chosen in the promoter regions (-3444 C>T, -3396 C>T, -3374 A>G, -1985 G>T) and 2 SNPs in the coding regions (I655A A>G and P1170A C>G) (Fig. 1a). -3444 C>T, -3396 C>T, -3374 A>G, and -1985 G>T polymorphism was found by the direct-sequencing for the region spanning ~4kb upstream from the translation initiation site (NM_004448) of HER-2 gene in a small set of 24 lung cancer cases. I655A A>G and P1170A C>G polymorphisms were selected by reviewing the previous reports in the Korean population [16, 17].

Figure 1
figure 1

Distribution of HER-2 polymorphism. (a) The locations of the six SNPs used in the HER-2 genotyping analysis [vertical arrows (numbered 1 to 6 from 5' to 3'): 1, rs2643194; 2, rs2517951; 3, rs2643195; 4, rs2934971; 5, rs1801200; 6, rs1058808]. The translation start site is marked with right-angled arrow at +1. Exons are dipicted as small blocks. (b) The pairwise measure of linkage disequilibrium (LD) between the six SNPs in (a) with D' statistics (values in the bottom left area of the table). The pairs exhibiting a strong linkage disequilibrium pattern are shaded. Especially, the three SNPs (numbered in 1, 2, 3) are in completed LD. r2 values are shown on the top right area of the table.

Genotyping

Genomic DNA was prepared from 814 peripheral blood samples using Puregene blood DNA kit (Gentra, Minneapolis, MN). Polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) and single-nucleotide polymorphism-identification technology (SNP-IT) assay were performed for genotyping of HER-2 polymorphisms.

Genotyping of -1985 G>T and I655V A>G SNPs was carried out by PCR-RFLP. The PCR primer sequences for -1985 SNP were: forward, 5'-ACC CCA GCA TAG TAT GTC AGA TG-3', and reverse, 5'-ATC CTA GGG AGT TGA GAA ACA GG-3'. The PCR products were digested by Xmn I and resolved by gel electrophoresis. The I655V SNP was analyzed as described previously [18].

- 3444 C>T and P1170 C>G SNPs genotyping was performed by SNP-IT assay using the SNP stream 25 K System (Orchid Biosciences, Princeton, NJ). Briefly, the genomic DNA region spanning the polymorphic site was PCR-amplified using one phosphothiolated primer and one regular PCR primers. The amplified PCR products were then digested with exonuclease. The phosphothioated strand of the PCR product was protected from exonuclease digestion, generating a single-stranded PCR template. The single-stranded PCR template was overlaid on a 384-well plate that contained a covalently attached SNP-IT extension primer which was designed to hybridize immediately adjacent to the polymorphic site. The SNP-IT primer was extended for a single base with DNA polymerase and a mixture of appropriate dideoxynucleotide tagged with either fluorescein isothiocyanate (FITC) or biotin. The identity of the incorporated nucleotide was determined by serial colorimetric reactions with anti-FITC-alkaline phosphatase and streptavidin-horse radish peroxidase, resulting blue and/or yellow color developments were analyzed with an ELISA reader and the final genotyping (allele) calls were made with the QCReview program.

Statistical analysis

Genotype frequencies and departures of genotype distributions from the Hardy-Weinberg equilibrium for each SNP were analyzed using the chi-square test or Fisher's exact test. Pairwise LD for calculating D' and r2 was evaluated as described previously [19]. Linkage disequilibrium (LD) values were estimated using the Haploview version 3.32 http://www.broad.mit.edu/mpg/haploview. A pair of SNPs is defined to have "strong linkage disequilibrium" if the one-sided upper 95% confidence interval (95% CI) boundary on D' is >0.98 and the lower boundary is >0.7. Conversely, SNP pairs are said to have "strong evidence of historical recombination" if the upper CI boundary on D' is <0.9. Genotype-specific risks were estimated as odds ratios with associated 95% confidence intervals by unconditional logistic regression (SAS Institute, Cary, NC) and adjusted for age and gender. The p-value of < 0.05 was considered statistically significant. Unconditional logistic regression analysis was used to calculate odds ratios (ORs) and 95% confidence intervals (CIs) for overall lung cancer, with adjustment gender and age. In addition to the overall association analysis, we performed a stratified analysis by age (median age, = 60 years/>60 years), gender, smoking status, drinking status and tumor histology to further explore the association between HER-2 genotypes and the risk of lung cancer in each stratum. The ORs and 95% CIs in the stratification analyses were calculated using unconditional logistic regression analysis, with adjustment for gender and age, when appropriate.

Results

We investigated the relationship between polymorphisms of HER-2 and the risk of lung cancer in a case-control study of 814 age-gender matched case and control subjects. The distributions of age, gender, smoking status and drinking status of every subject are summarized in Table 1. There were no differences in median age and gender between cases and controls. However, smoking status and drinking status were missing in some patients. These differences were controlled in the later logistic regression analysis with adjusted gender and age.

Table 1 Demographic characteristics of case-control study population

In this study, we have chosen 4 SNPs in the promoter regions (-3444 C>T, -3396 C>T, -3374 A>G, -1985 G>T) and 2 SNPs in the coding regions (I655A A>G and P1170A C>G) of the HER-2 gene (Figure 1a). Interestingly, the 3 SNPs (-3444 C>T, -3396 C>T and -3374 A>G) in the promoter regions showed a complete linkage disequilibrium (LD) (D' = 1)(Figure 1b). These results imply that the 3 polymorphisms are a SNP block. We representatively showed the data of -3444C>T for the 3 SNPs (-3444 C>T, -3396 C>T and -3374 A>G) in the following results. Therefore, we only performed genotyping for the 4 SNPs (-3444 C>T, -1985 G>T, I655A A>G and P1170A C>G). Therefore, we only performed genotyping for the 4 SNPs (-3444 C>T, -1985 G>T, I655A A>G and P1170A C>G) in all study subjects (407 case including the previous 24 case samples and 407 controls).

The frequencies of the 4 SNPs (-3444 C>T, -1985 G>T, I655A A>G and P1170A C>G) in the total lung cancer and healthy-control subjects are shown in Table 2. The distributions of the 4 SNPs in the cases and controls were in Hardy-Weinberg equilibrium. There were no significant differences between the 4 polymorphisms of the HER-2 and the risk of lung cancer in all subjects (Table 2). However, in the subgroup analysis, there were statistically significant differences between the genotype frequencies of the 3 SNPs (-3444 C>T, -1985 G>T and P1170A C>G) and the risk of lung cancer (Table 3, 4 and 5, Additional File 1). In females, -3444 TT and -1985 TT genotypes exhibited increased risk of lung cancer (Table 3). In non-smokers and non-drinkers, -3444 TT, -1985 TT and P1170A GG genotypes showed higher risk of lung cancer than other genotypes of the SNPs and the differences were more magnified after adjustment for age and gender (Table 4 and 5). Furthermore, we additionally performed the statistical analysis to demonstrate whether the polymorphisms are associated with tumor histology in the subgroups. We found that-3444 TT and -1985 TT genotypes exhibited the increased risk of lung cancer in non-smokers group with adenocarcinoma, adjusting with gender and age (Table 6). Hence, our results demonstrated that the 3 SNPs of HER-2 were associated with the risk of lung cancer in females, non-smokers, non-drinkers and tumor histology.

Table 2 Genotyping frequencies for HER-2 in cases and controls and their association with risk of overall lung cancer
Table 3 Comparative analysis between genotype frequencies and the risk of lung cancer in the subgroups of females
Table 4 Comparative analysis between genotype frequencies and the risk of lung cancer in the subgroups of non-smokers
Table 5 Comparative analysis between genotype frequencies and the risk of lung cancer in the subgroups of non-drinkers
Table 6 Comparative analysis between genotype frequencies and the risk of lung cancer in non-smoker groups in adenocarcinoma-controls

Discussion

In this study, we investigated whether the polymorphisms of the HER-2 gene were associated with the risk of lung cancer in the Korean population. Although the distribution of the polymorphisms of the HER-2 gene did not display statistically significant associations with all lung cancer and control subjects, the 3 polymorphisms (-3444 C>T, -1985 G>T and P1170A C>G) of HER-2 gene were found to be associated with the risk of lung cancer in female, non-smoker and non-drinker subgroups.

The previous studies on SNPs of the HER-2 gene have mainly been focused on breast cancer [14, 18, 20]. In the Korean population, Han et al. [16] showed that the polymorphisms of the HER-2 gene are associated with HER-2 protein expression and disease outcome in breast cancer. On the other hand, there are some reports to show that overexpression of HER-2 is an independent prognostic factor of survival and to be a combinatorial target in lung cancer, using an EGFR tyrosine kinase inhibitor together with HER-2 dimerization inhibitors [12, 13, 21]. Although cigarette smoking is the most important risk factor of lung cancer, previous studies have suggested differences in epidemiologic characteristics and environmental subtypes, suggesting an existence of non-tobacco related risk factors in the pathogenesis of lung cancer [2224]. Alcohol consumption is an established risk factor for cancers of the oral cavity, pharynx, larynx, esophagus, and liver [25]. The associations between alcohol drinking and lung cancer risk have also been reported [26, 27]. Moreover, somatic mutations of EGFR and HER-2 were preferentially identified in Oriental ethnicity compared with other ethnicities, female gender compared with male gender and never smokers compared with smokers, showing clinical responses to treatment [2830]. Among the total 407 lung cancer patients, 89 patients were female and non-smoker in this study. Interestingly, about 72% (64/89) of non-smoker, female lung cancer patients were diagnosed as adenocarcinoma. We showed significant association between the polymorphisms and adenocarcinoma in the subgroups. Our results, therefore, suggest that the polymorphisms of the HER-2 gene might contribute to the susceptibility of female, non-smoker and non-drinker subgroups in the Korean population to lung cancer as EGFR mutations. However, the significant difference of the polymorphisms in females disappeared after adjustment for age. These discrepant results might be caused by the small number of young women in our study subjects [31]. Thus, our results need to be confirmed in future with a larger sample size of young women.

I655V A>G and P1170A C>G of HER-2 gene are located in the coding region that can affect protein structure and enzyme activity. In our results, I655V A>G was not associated with the risk of lung cancer in the Korean population, in consistance with previous study [16]. On the other hand, P1170A C>G was related to the risk of lung cancer in non-smokers and non-drinkers. These results suggest that P1170A C>G polymorphism may influence the activation of HER-2 gene, affecting the binding activity of specific effector proteins in lung cancer. Indeed, a recent review article describes several HER-2 binding partners [32], and a previous report also showed that the substitution of amino acid residue in the functional domain of the proteins can destabilize the protein-protein interactions [33]. Further studies are needed to clearly elucidate the relationship between the polymorphism and HER-2 activation.

To strengthen the significance of the present findings, some further studies have to be performed. First, ethnic difference of the polymorphisms of HER-2 gene should more clearly be established, because some polymorphisms of genes including HER-2 have different distribution in multiethnic population [34, 35]. Second, the sample size of the subgroups of females, non-smokers and non-drinkers should be increased.

Conclusion

Taken together, we conclude that the polymorphisms of HER-2 gene do not play strong roles in lung cancer susceptibility in the majority of the Korean population. However, we first demonstrated that the polymorphisms of HER-2 gene are associated with increased risk of lung cancer within the subgroups of females, non-smokers and non-drinkers in the Korean population. Our results can contribute to understanding susceptibility of specific subgroups in lung cancer.