Background

Lung cancer is a major cause of cancer mortality worldwide, and more than a million people in the world die from the disease each year [1]. Adenocarcinoma accounts for about 25% to 40% of all lung cancers, and is now the most common form of lung cancer in women [2]. It is the most frequent subtype occurring in those who have never smoked. Five-year survival rate of lung cancer is at only 15% in the United States and even lower in China [3]. Genetic factors are considered to influence the treatment effectiveness of lung cancer [4], and thus affect the prognosis of patients. There are some molecular markers showing potential as therapeutic and prognostic indicators, but none could be used into clinical practice [5, 6]. Of these factors, DNA repair capacity (DRC) is an important one. Several studies have shown associations between inefficient DNA repair and lung cancer risk [7, 8]. It is also possible that individual DRC can affect the survival of lung cancer patients. It has been speculated that single nucleotide polymorphisms (SNPs) in DNA repair genes may change gene expression and activity, hence influence the effectiveness of cancer treatment and survival of patients [9]. To test above possibility, we assess the relationship between survival of lung adenocarcinoma patients and SNPs in three DNA repair genes, including excision repair cross-complementing group 1 (ERCC1) and group 2 (ERCC2), and X-ray repair cross-complementing group 1 (XRCC1). ERCC2 is located in chromosome 19q13.2-13.3 and codes for an evolutionarily conserved helicase, a subunit of TFIIH complex which is essential for transcription and nucleotide excision repair (NER). The common SNPs of ERCC2 gene is at codon 751 (A > C substitution at nucleotide position 35931, exon 23, Lys>Gln, rs13181) and codon 312 (G >A substitution at position 23951, exon 10, Asp>Asn, rs1799793). ERCC1 is also located in chromosome 19q13.2-13.3 and codes for a leading protein in NER, responsible for recognition of DNA damage and removal of the damaged nucleotides. The common SNP of ERCC1 gene is at codon 118 (C > T substitution at exon 4, without amino acid change--Asn/Asn, rs11615). The study showed that ERCC1 and ERCC2 mRNA levels were correlated with DRC [10]. XRCC1 protein plays a central role in base excision repair (BER) pathway by interacting with other DNA repair proteins. The most extensively studied SNP of XRCC1 gene is at codon 399 (G > A substitution at position 28152, exon 10, Arg>Gln, rs25487), which has been reported to be associated with an altered DNA repair activity [11, 12]. SNP analyzing involves little more than a blood sample and relatively simple and precise polymerase chain reaction (PCR)-based techniques, making it more practical in the clinical testing than many other studied prognostic markers. Therefore, in this study we prospectively assess the relationship between the four SNPs in DNA repair genes and survival of non-smoking female patients with lung adenocarcinoma.

Methods

Patient recruitment and follow-up

All patients were from the ongoing study of lung cancer in non-smoking females, which started from July, 1999 in Shenyang city, China. The human investigations were approved by the Institutional Review Board of China Medical University, and informed consent was obtained from each participant or their representatives if direct consent could not be obtained. All patients were unrelated ethnic Han Chinese. Individual with a total of 100 cigarettes in his lifetime was defined as a smoker, otherwise he was considered as a non-smoker. Each participant donated 10 ml venous blood and was interviewed to collect demographic data and clinical information. For this study, we identified 285 patients who were diagnosed with histologically confirmed lung adenocarcinoma (stage I-IV) between the years 1999 and 2004. The year 2004 was chosen as the last year of eligibility in order that all participants have adequate follow-up. About the histological subtype of 285 patients, 228 were diagnosed using surgical specimen and 57 using exfoliated cells of lung adenocarcinoma patients whose tumor stages were determined by thoracic imageology.

Dates of death were obtained using at least one of the four following methods: inpatient and outpatient medical records, the registry of causes of death in Shenyang Center for Disease Control and Prevention (CDC), registration and presumption of death in Shenyang Public Security Bureau, and telephone follow-up. Twenty eight patients were lost to follow up. However, there was no significant difference in the characteristics between the patients with and without follow-up information. Finally, 257 patients were included in this analysis.

DNA extraction and genotyping

Genomic DNA samples were extracted by guanidine hydrochloride (GuHCl) method. SNP was analyzed by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) method as described previously [13]. The PCR primers (Takara Biotechnology Dalian Co. Ltd., China) for amplifying DNA fragment containing the ERCC2 751, ERCC2 312, ERCC1 118 and XRCC1 399 sites were 751 F5'-GCC CGC TCT GGA TTA TAC G-3' and R5'-CTA TCA TCT CCT GGC CCC C-3', 312 F5'-CTG TTG GTG GGT GCC CGT ATC TGT TGG TCT-3' and R5'-TAA TAT CGG GGC TCA CCC TGC AGC ACT TCC T-3', 118 F5'-AGG ACC ACA GGA CAC GCA GA-3' and R5'-CAT AGA ACA GTC CAG AAC AC-3', 399 F5'-TTG TGC TTT CTC TGT GTC CA-3' and R5'-TCC TCC AGC CTT TTC TGA TA-3', respectively. The PCR products were digested with restriction enzyme (New England Biolabs, Beverly, MA) PstI (for ERCC2 751), StyI (for ERCC2 312), BsrdI (for ERCC1 118) and MspI (for XRCC1 399) to determine the genotypes. A 10% masked and random sample of patients was tested twice by different persons, and the results were found to be concordant for all of the masked duplicate sets.

Statistical analysis

The associations between overall survival and demographic characteristics, clinical features, and genetic SNPs were estimated using the Kaplan-Meier method and Log-rank test. Survival time was calculated from the date of cancer diagnosis to the date of death or last follow-up. In some analyses, we combined the heterozygous genotype with the homozygous rare genotype as the particular group in order to increase sample size. Univariate and multivariable Cox proportional hazards regression models were performed to estimate crude hazard ratio (HR) or adjusted HR and their 95% confidence intervals (CIs). The stepwise Cox regression model was also used to determine factors predictive of cancer prognosis, with a significant level of P < 0.05 for entering and P > 0.10 for removal of the variables. All the statistical analyses were performed using Statistical Product and Service Solutions (SPSS) v13.0.

Results

Patient characteristics

The mean age of patients was 51.16 ± 9.48 years (range 18-75 years). The distribution of characteristics and clinical features of 257 lung adenocarcinoma patients were shown in Table 1. There were 206 deaths. The overall median survival time (MST) was 13.07 months. As Table 1 showed, patients with advanced cancer or without surgical operation had significantly shorter MSTs (Log-rank P < 0.05). Univariate Cox regression analysis suggested that the risks of death of lung adenocarcinoma were increased in patients with stage II, III and IV compared with those with stage I (HRs were 2.48, 3.46 and 4.66, respectively). The result also showed that the patients undergoing surgical operation had a decreased risk of death (HR = 0.61).

Table 1 Patient characteristics and clinical features

Genotype frequencies of genetic polymorphisms

Table 2 showed the genotype frequencies of four SNPs in 257 patients. All genotype frequencies of these four polymorphisms were found to be in Hardy-Weinberg equilibrium. The variant allele frequencies were 13.0% for ERCC2 751, 5.6% for ERCC2 312, 27.2% for ERCC1 118 and 34.6% for XRCC1 399 polymorphism. No associations were found between genotypes and age, tumor stage, surgical operation, chemotherapy or radiotherapy (data not shown).

Table 2 Genetic polymorphisms in DNA repair genes and survival of patients

Genetic polymorphisms and survival of patients

The associations between genotypes of four SNPs and survival of non-smoking female patients with lung adenocarcinoma were suggested in Table 2. Patients with CT or TT genotype at ERCC1 Asn118Asn showed significantly shorter survival time than those with CC genotype (11.07, 6.20 months versus 17.23 months) (Log-rank test, P < 0.001). In terms of XRCC1 399 polymorphism, the difference in the MSTs among patients with GG (19.10 months), GA (10.40 months) and AA (9.23 months) was statistically significant (Log-rank test, P < 0.001). However no associations were found between two SNPs of ERCC2 gene and the overall survival of patients.

In the further analysis, lung adenocarcinoma patients were stratified by stage. There were no significant differences in survival times of stage I-II patients with variant genotypes of four SNPs. For patients with stage III or IV, Kaplan-Meier analyses proved that individuals with heterozygous variant or homozygous variant genotype at ERCC1 Asn118Asn or XRCC1 Arg399Gln had shorter MSTs than those with wild genotype (Figure 1). The MSTs of patients with CC, CT and TT genotype at ERCC1 Asn118Asn were 14.53, 9.57 and 5.03 months and the difference was significant (Log-rank P < 0.001). For XRCC1 399 polymorphism, patients with GA or AA genotype lived shorter than those with GG genotype and corresponding MSTs were 9.57, 4.27 and 14.20 months(Log-rank P < 0.001). There were no differences in MSTs according to genotypes of ERCC2 751 or 312 polymorphism.

Figure 1
figure 1

Kaplan-Meier curves for patients with stage III-IV by (A) ERCC1 118 genotypes and (B) XRCC1 399 genotypes.

In the Cox regression model, after adjusting for age, tumor stage, surgical operation and chemotherapy or radiotherapy, variant genotypes of ERCC1 118 or XRCC1 399 polymorphism were associated with higher risks of death for non-smoking female patients with lung adenocarcinoma (Table 2). With the CC genotype at ERCC1 Asn118Asn being the reference, the HR for CT genotype was 1.48 (P = 0.009) compared to 2.67 (P < 0.001) in the TT genotype. In terms of XRCC1 399 polymorphism, we found that compared with those carrying GG genotype, the HRs were 1.28 (P = 0.109) and 2.68 (P < 0.001) for individuals with GA and AA genotype, respectively.

In addition to evaluating the genetic polymorphisms separately, we studied the association between patients' survival and the total number of variant alleles of ERCC1 and XRCC1 polymorphisms (Table 3). In the double homozygous group (0 variant allele), the overall MST was 25.10 months. As the number of variant alleles increased, the MSTs decreased to 13.07, 9.27, 6.30 and 3.87 months. The Log-rank test was statistically significant (P < 0.001). The HRs for individuals with 1, 2, 3, 4 variant allele(s) were 1.52, 2.33, 2.98 and 9.24 compared with those carrying 0 variant allele. Besides, the effect was also conspicuous for the patients with stage III-IV (Log-rank P < 0.001) but not significant for those with stage I-II (P = 0.145).

Table 3 Genetic polymorphisms in combination of ERCC1 and XRCC1 and survival of patients

Furthermore we evaluated an interaction effect of the two SNPs on the survival of non-smoking female patients with lung adenocarcinoma. We found that individuals carrying both ERCC1 118 and XRCC1 399 variant alleles were at a higher death risk of lung adenocarcinoma than those with only one of them (adjusted HRs were 2.44, 1.79 and 1.64, respectively) (Table 4).

Table 4 Interaction of ERCC1 and XRCC1 polymorphisms on survival of lung adenocarcinoma

Stepwise Cox proportional hazard analysis was used to study the relationship between factors including demographic characteristics, clinical features and genetic SNPs and survival of non-smoking female patients with lung adenocarcinoma. Four variables (stage, chemotherapy or radiotherapy, ERCC1 118 and XRCC1 399 polymorphisms) were included in the Cox regression model (Table 5).

Table 5 Stepwise Cox regression analysis on survival of lung adenocarcinoma

Discussion

In this study, we explored the relationship between SNPs of three DNA repair genes and survival of non-smoking female patients with lung adenocarcinoma in China. In WHO Western Pacific Region, lung cancer was the commonest cancer cause of death in women and the proportion of adenocarcinoma in lung cancer has increased quickly in both genders over the last few decades [14]. So we focused our study on lung adenocarcinoma in females who were requested to be non-smokers in order to control the confounding influence of smoking. This study showed that being advanced stage, without chemotherapy or radiotherapy, carrying variant genotypes at ERCC1 Asn118Asn or XRCC1 Arg399Gln were proved to be unfavorable prognostic factors for lung adenocarcinoma in Chinese non-smoking women. However, the associations have not been found between ERCC2 751 or 312 polymorphism and survival of lung adenocarcinoma.

The XRCC1 protein is considered to play an important role in both base excision repair and single-strand break repair. XRCC1 Arg399Gln polymorphism was the commonest one among more than 60 validated SNPs in XRCC1 gene and showed no major variations by ethnicity [15]. This polymorphism has been suggested to be a risk factor for the development of lung cancer [1620]. In our previous study, a significant association between XRCC1 399Gln/Gln genotype and risk of lung cancer in Chinese non-smoking women was found [21]. In the present study, we found that non-smoking female lung adenocarcinoma patients with AA genotype at XRCC1 Arg399Gln had a shorter survival time (9.23 months vs. 19.10 months) and higher risk of death (adjusted HR = 2.68, 95%CI = 1.79-4.02) than those with GG genotype. It is consistent with other studies, although those results were obtained from the subjects of all genders, smoking status and histopathologic subtypes [2224].

As for ERCC1 Asn118Asn polymorphism, we observed that the patients with CT or TT genotype showed a significantly shorter survival time than those with CC genotype (11.07 and 6.20 versus 17.23 months). In the multivariable Cox regression, ERCC1 118 variant genotype (CT/TT) remained prognostic factors of lung adenocarcinoma (HR = 1.60, P value = 0.001). To date, only a few studies have examined the relationship between ERCC1 polymorphism and survival of lung cancer, and they didn't control the influence of gender, smoking status and histopathologic subtypes [2527]. In terms of chemotherapy response, a few studies suggested an association between ERCC1 Asn118Asn polymorphism and response to platinum-based treatment of lung cancer [25, 27, 28]. Although these studies have shown possible relationship between ERCC1 polymorphism and effectiveness of cancer treatment or survival of cancer patients, the biological effect of this synonymous SNP is unclear.

In our study, no associations were found between the overall survival and two SNPs of ERCC2 gene. This is consistent with other studies [22, 25, 29], but there are studies showing an opposite effect [24]. The explanation for these discordant results remains to be elucidated.

Many clinical features may play important roles in the survival of cancer patients. The multivariable Cox regression in this study showed that being advanced stage (III+IV) and without chemotherapy or radiotherapy treatment were independent unfavorable prognostic factor for lung adenocarcinoma in non-smoking female population. Surgical status wasn't included in the stepwise Cox model maybe because the surgical treatment is decided by the patients' conditions such as tumor stage and histopathologic subtypes, so the surgical status may be a dependent variable but not an independent prognostic factor.

Genetic polymorphisms as either prognostic or predictive biomarkers have many advantages, especially in the advanced cancer setting. First of all, the biological specimen for detecting SNP is easily to obtain. Second, the detecting method is simple, precise and practical. Finally, in the advanced cancer setting, diagnoses are made using specialized and mostly body-harmed method; otherwise SNP detecting can avoid these problems.

In conclusion, this study analyzed four SNPs in three DNA repair genes in relation to survival of non-smoking female patients with lung adenocarcinoma in China. The results suggested that besides clinical features such as tumor stage and chemotherapy or radiotherapy treatment, polymorphisms of ERCC1 Asn118Asn and XRCC1 Arg399Gln were associated with survival of patients. Because DNA repair is a complex system including many pathways and genes, larger studies with more genetic polymorphisms, even haplotypes, in different ethnic populations and the functional or biological relevance of these polymorphisms are needed to confirm our conclusions.

Conclusions

Genetic polymorphisms in ERCC1 and XRCC1 genes might be prognostic factors in non-smoking female patients with lung adenocarcinoma.