Introduction

Lung cancer is a complex and malignant cancer with the high morbidity. Both environmental and genetic factors were acknowledged to the most important pathogenic factors for lung cancer. Lung cancer is divided into non-small cell lung cancer (NSCLC) and small cell lung cancer. NSCLC includes four major types: adenocarcinoma, squamous cell carcinoma, large cell carcinoma, and adenosquamous carcinoma. Age, lifestyle [1], genetic , endocrine [2], occupational exposure [3, 4], and other environmental factors [5] play important roles in lung cancer. In the past years, studies had shown that smoking was the strongest environmental risk factor for the occurrence and development of lung cancer. However, it was also reported that 25% of lung cancer patients were non-smokers. It suggested that genetic risk factor might play an important role in lung cancer. At the same time, some molecular epidemiological studies had suggested that the interaction between genetic and environmental factors might affect the incidence and development of lung cancer. However, the exact etiology and pathogenesis of lung cancer remained obscure.

With the in-depth study of high-throughput technologies, many genome-wide association studies (GWASs) had found the correlation between long non-coding RNAs (lncRNAs) and multiple cancers. LncRNAs, a class of non-coding RNA, ranging from 200 to 100 kbp in length, included non-protein coding transcripts [6]. While the lncRNAs were regarded as the “transcribed noise” in the early years, increasing evidence had shown that lncRNAs played key roles in biological processes. As regulators in cellular processes, lncRNAs could inhibit cell growth [7] and influence cell differentiation [8] and cell metastasis [9]. LncRNAs were known to play critical roles in multiple cancers by regulating multiple molecular regulation [10]. CDKN2B-AS1 is an lncRNA (3.8 kb) transcribed from the INK4B-ARF-INK4A gene, locating on human chromosome 9p21 [11, 12]. GWAS had shown that SNPs in this region (9p21) were associated with many diseases, including cancers [13, 14]. Many studies had shown that the oncogenic properties of CDKN2B-AS1 was existed in multiple carcinomas (thyroid cancer [15], gastric cancer [16], lung cancer [17, 18], and so on). Compared with normal controls, the overexpression of CDKN2B-AS1 was obviously found in NSCLC tissues and serum samples [19]. The expression level of CDKN2B-AS1 was higher in human non-small cell lung cancer than normal diploid fetal lung fibroblasts. The proliferation of H1299 cells were regulated by lncRNA CDKN2B-AS1 [20]. The c-Myc was a closely-watched transcription factor which was overexpressed in NSCLC [21]. The c-Myc could directly transactivate CDKN2B-AS1, and CDKN2B-AS1 could promote the proliferation of NSCLC cells [22].

A meta-analysis of 16 GWASs identified a novel disease locus for squamous cell carcinoma at 9p21. The result also found that 9p21.3 variants in the region of CDKN2B-AS1 were associated with squamous cell lung cancer [23]. So the functional genetic variations in the lncRNA CDKN2B-AS1 might contribute to the susceptibility of lung cancer. rs2151280 was located within the lncRNA CDKN2B-AS1 at 9p21.3. Because rs2151280 might affect the expression of CDKN2B-AS1 [24], and CDKN2B-AS1 expression might affect the susceptibility of lung cancer, we suspected that rs2151280 might affect the susceptibility of lung cancer. Emerging evidence demonstrated that rs2151280 was related to various kinds of malignant tumors, which indirectly confirmed the importance of rs2151280 in cancer susceptibility. rs2151280 was initially discovered by a GWAS research on the correlation between rs2151280 and the risk of basal cell carcinoma, and the results showed that rs2151280 was related to the risk of basal cell carcinoma (BCC) [25]. The high expression of CDKN2B-AS1 and the low expression of p14ARF were associated with rs2151280 TT variant in peripheral blood mononuclear cells (PBMCs) [26]. Molecular epidemiological studies had shown that rs2151280 was associated with various cancer risks (BCC [25] and plexiform neurofibromas (PNF) [24]). The association between CDKN2B-AS1 rs2151280 and esophageal squamous cell carcinoma (ESCC) was not found in the study [27]. The relationship between CDKN2B-AS1 rs2151280 and the risk of lung cancer was still unclear in the world.

Because the association between CDKN2B-AS1 rs2151280 and lung cancer was undefined, our research had some innovation in investigating the association between CDKN2B-AS1 rs2151280 and lung cancer risk. Therefore, we decided to further explore this SNP by supporting the above evidence in 507 lung cancer patients and 542 cancer-free controls. And we also evaluated the interaction of rs2151280 with smoking status on the risk of lung cancer in the case-control study. As far as we know, this was the first time to evaluate the effect of rs2151280 in lncRNA CDKN2B-AS1 on lung cancer risk.

Materials and methods

Study subjects

Our study was an ongoing hospital-based case control study. Quanto1.2.4 statistical software was used to calculate the sample size. Since we only found the incidence of lung cancer, we calculated the sample size according to the incidence of lung cancer. Lung cancer is divided into non-small cell lung cancer and small cell lung cancer. NSCLC includes four major types: adenocarcinoma, squamous cell carcinoma, large cell carcinoma, and adenosquamous carcinoma. We wanted to further explore the relationship between the risk of each type of pathology and rs2151280. So we performed pathological type analysis based on the collected patient information. We recruited 507 Han Chinese lung cancer patients and 542 Han Chinese healthy individuals in Shenyang City, located in northeast China. For excluding population admixture effects, we investigated the ethnicity of the participants’ parents for three generations. We perform a power calculation. The power was more than 0.88. All the cases were (1) newly diagnosed as lung cancer patients, (2) never accepted treatments, and (3) without cancer history or metastatic cancer. The healthy individuals were (1) selected in the same period in the same hospitals and (2) matched age and gender during the epidemiological survey. All enrolled subjects signed the informed consent forms and the institutional review board of China Medical University approved this study. Ten-milliliter venous blood sample collected from all enrolled subjects to detect the SNP. Phenol-chloroform method was used to isolate genomic DNA sample from venous blood. The rs2151280 was genotyped by 7500 Fast Real-time PCR system. In each genotyping process, blank controls were needed. The investigators randomly selected 10% samples to genotype twice for quality control, and the results were concordant with the duplicate samples.

Statistical analysis

Student’s t test and χ2 test were calculated to analyze the difference in continuous and categorical variables between cases and healthy individuals, respectively. The goodness-of-fit χ2 test was used to examine the Hardy-Weinberg equilibrium (HWE) of rs2151280. We calculated the odds ratios (ORs) with their 95% confidence intervals (CIs) by unconditional logistic regression to evaluate the relationship between rs2151280 and lung cancer risk. The interaction of smoking status and rs2151280 on risk of lung cancer was evaluated by logistic regression on additive model and multiplicative model. The criterion of statistical significance was defined as P < 0.05, and all of the statistical tests were two-sided in this study. All the statistical analyses were calculated by SPSS software.

Results

The demographic of 507 lung cancer cases and 542 healthy individuals was shown in Table 1. In terms of age, cases and controls seemed to be exactly matched (P = 0.096). There were no statistically significant differences in the proportion of sex status between the lung cancer case group (50.3% males and 49.7% females) and the control group (48.3% males and 51.7% females). As expected, the distribution of smoking status was significantly different in lung cancer cases and healthy controls (P < 0.001). Lung cancer cases included 248 adenocarcinomas, 125 squamous cell carcinomas, 90 small cell carcinomas, and 44 other types. The observed genotype frequencies for rs2151280 were in agreement with Hardy–Weinberg equilibrium. We wanted to further explore the relationship between the risk of each type of pathology and rs2151280. So, we performed pathological type analysis based on the collected patient information.

Table 1 Distribution of demographic variables in lung cancer and controls

The association between rs2151280 and lung cancer risk was shown in Table 2. Compared with individuals carrying TT genotype, individuals with CC genotype of rs2151280 had a lower lung cancer risk (OR = 0.654, 95% CI = 0.443–0.965, P = 0.033). After adjusting for age, gender and smoking status risk factors, this association remained significant (OR = 0.640, 95% CI = 0.421–0.972, P = 0.036). In the recessive model, individuals carrying variant homozygote CC genotypes were associated with a significantly lower risk of lung cancer compared with individuals carrying heterozygote TC and TT genotype (adjusted OR = 0.648, 95%CI = 0.439–0.957, P = 0.029). In allele comparison, the C allele was associated with a lower risk of non-small-cell lung cancer (OR = 0.674, 95%CI = 0.560–0.812, P < 0.001) in Table 2. The associations of rs2151280 with lung adenocarcinoma (AD) and lung squamous cell carcinoma (SQ) were shown in Table 3. The statistically significant association was found in AD but not in SQ. Individuals with CC genotype had a 0.567-fold decreased risk of AD than those carrying TT genotype (95%CI = 0.333–0.965, P = 0.037). In the recessive model, individuals with CC genotype had a 0.543-fold decreased risk of AD than those carrying TT genotype or TC genotype (95%CI = 0.330–0.893, P = 0.016). However, there was no significant association between rs2151280 and the risk of SQ in the five models. We performed the stratification analyses to further estimate the association between rs2151280 and risk of lung cancer. In the subgroup analysis by gender, there was a significant correlation between rs2151280 and lung cancer risk in female (Table 4). Compared with TT genotype, variant CC genotype of 2151280 was associated with lower risk of lung cancer (adjusted OR = 0.533, 95 % CI = 0.310–0.916, P = 0.023). Compared with TT genotype and TC genotype, variant CC genotype was associated with lower risk of lung cancer and NSCLC (adjusted OR were 0.538 and 0.572, 95% CI were 0.326–0.888, and 0.335–0.975, P were 0.015 and 0.040, respectively).

Table 2 The association of the rs2151280 with lung cancer risk and non-small cell lung cancer
Table 3 The association of the rs2151280 with lung adenocarcinoma risk and lung squamous cell carcinoma risk
Table 4 The association of the rs2151280 with lung cancer risks and NSCLC in female and male population

However, in the subgroup of age, we failed to find any statistically significant associations between rs2151280 and the risk of lung cancer and NSCLC (Table 5).

Table 5 Stratified analyses of the rs2151280 with lung cancer risk and NSCLC by age

In the subgroup of smoking, we found the significant association among rs2151280 with the risk of lung cancer and NSCLC in the individuals who had never exposed to smoking (Table 6). Carriers of the rs2151280 CC genotype had a lower lung cancer risk than carriers with TT genotype (adjusted OR = 0.518, 95%CI = 0.316–0.851, P = 0.009). Individuals carrying variant homozygote CC genotype had decreased risk of lung cancer by 0.561-fold (adjusted OR = 0.561, 95%CI = 0.354–0.891, P = 0.014) compared individuals carrying with heterozygote TC and TT genotype. CDKN2B-AS1 rs2151280 was significantly associated with the risk of NSCLC (CC vs TT: OR = 0.572, 95%CI = 0.334–0.980, P = 0.042; CCvsTC+TT: OR = 0.587, 95%CI = 0.356–0.968, P = 0.037, respectively). Tables 7, 8, 9 had shown the interaction between rs2151280 and smoking exposure on the susceptibility of lung cancer, NSCLC, AD, and SQ on additive interaction. Compared to individuals with CC genotype and never exposure to smoking, individuals with TC/TT genotype and never smoking had the increased risk of lung cancer and lung adenocarcinoma (adjusted OR = 1.659, 95%CI = 1.056–2.606, P = 0.028; OR = 1.814, 95%CI = 1.042–3.156, P = 0.035, respectively). Compared to individuals with CC genotype and never exposure to smoking, individuals with TC/TT genotype and ever smoking had the increased risk of lung cancer and NSCLC. At the same time, the same results existed in AD and SQ (adjusted OR = 7.403, 95%CI = 3.815–14.366, OR = 12.139, 95%CI = 4.827–30.526, respectively). However, the results of quantitative analyses indicated that there were no significant (Table 9). There were three measures [relative excess risk due to interaction (RERI), attributable proportion due to interaction (AP), and synergy index (S)] with their 95% CI were used to show the relationship. The criterion of these three measures was just as our previous study [28]. Due to the P value was more than 0.05, the results of multiplicative interaction indicated that there were no association between rs2151280 risk genotypes with smoking exposure and lung cancer risk.

Table 6 Stratified analyses of the rs2151280 with lung cancer risks and NSCLC by smoking
Table 7 Relationship of interaction between rs2151280 and smoking with lung cancer risk and NSCLC
Table 8 Relationship of interaction between rs2151280 and smoking with lung adenocarcinoma risk and lung squamous cell carcinoma risk
Table 9 Interaction measures between rs2151280 in lncRNA and smoking exposure on lung cancer, NSCLC, lung adenocarcinoma, and lung squamous cell carcinoma

Discussion

Lung cancer is one of the most common malignancies, with higher morbidity and higher mortality in the world in the 21st century [29, 30]. According to reports, the crude incidence of lung cancer in developing countries is increasing rapidly. Data was recorded by the International Agency for Research on Cancer (IARC) from 2008 to 2012. The data had shown that lung cancer was the first common malignancies in Chinese male (52.7 cases per 100,000 men) and the second common malignancies in Chinese female (28.4 cases per 100,000 women is just less than breast cancer). The age-standardized incidence rate (ASR) world was 45.9 per 100,000 s and 38.6 per 100,000 s for men in the USA and UK, respectively. Due to the limitations of cancer detection, lung cancer was often too late to get effective treatment. LncRNAs, one of the most important classes of ncRNAs with a length of more than 200 bp, became a hotpot as potentially critical regulators in cellular processes. The association between lncRNA DGCR5 and lung cancer had been found [31]. Many studies had shown that lncRNAs are associated with lung cancer, such as H19 [32], ATB [33], CDKN2B-AS1 [22], and MALAT1 [34].

CDKN2B-AS1 is located within the CDKN2B-CDKN2A gene cluster at chromosome 9p21. CDKN2B-AS1 was transcribed and sewn into a complex pattern [35] and play a role in a variety of cell functions [36]. Compared with healthy individuals, the high expression of lncRNA CDKN2B-AS1 was significantly existed in NSCLC patients [19]. In addition, the area under the curve (AUC) of CDKN2B-AS1 was greater than tumor markers (CEA, CYFRA21-1, and SCCA). It was indicating that CDKN2B-AS1 had a significant efficacy in distinguishing NSCLC patients and controls [19]. AUC had widely recognized as an inherent accuracy index for the authenticity evaluation of diagnostic tests. When the value of AUC is 0.5, there is no diagnostic value. When the value of AUC is 1, the diagnostic value is high. A previous study by Nie.et al had reported that CDKN2B-AS1 could promote the proliferation of non-small cell lung cancer cells and inhibit cell apoptosis [17].

In this study, we demonstrated that the rs2151280 in lncRNA CDKN2B-AS1 was significantly associated with lung cancer. The minor allele frequency (MAF) of rs2151280 was 0.268 in Chinese population. This study was the first time to suggest that rs2151280 might be an important risk factor in lung cancer. Specifically, the results of our study suggested that rs2151280 CC genotype could be a protecting factor in lung cancer compared with TT genotype, especially among adenocarcinoma patients. Compared with TC/TT genotypes, CC genotype had a decreased risk of lung cancer including adenocarcinoma patients. The C allele had reduced the risk of NSCLC. Furthermore, in the subgroup of gender, individuals with CC genotype had a lower risk of lung cancer compared with individuals with TT genotype (adjusted P = 0.023) in female. In female, individuals with CC genotype also had a lower risk of lung cancer and NSCLC compared with individuals with /TCTT genotypes (adjusted P = 0.015 or P = 0.040, respectively). Moreover, in the subgroup of smoking status, the same results were existed in lung cancer in never smoking population. However, we failed to find the any statistically significant association between rs2151280 and SQ. In this study, the sample size might be too small to get the significant results in SQ. The interaction of gene–environment on the risk of cancer was studied by few studies. Therefore, in this case-control study, we investigated the interaction between smoking status exposure and the rs2151280 C/T on lung cancer risk. The crossover analysis suggested that the interaction between rs2151280 and smoking exposure on the susceptibility of lung cancer, NSCLC, AD, and SQ were existed in Chinese population. However, the quantitative analyses had shown that no any statistical significance existed on the interaction between the rs2151280 and smoking status exposure.

Admittedly, this present study had several limitations. Firstly, the cases and controls were selected from hospitals, which might lead to Berkson’s bias. As such, the cases and controls selected from several different hospitals to decrease this bias. Secondly, smoking exposure data were collected by interviewing participants, which easily lead to recall bias. Thirdly, the sample size was small in the subgroup analysis and interaction analysis of this study, and the research results had certain limitations. Finally, due to the small number of genes in this study, polygenic score was not studied. In future studies, we will consider calculating the polygenic score to make our study results more meaningful. Therefore, the relationship between the rs2151280 in lncRNA CDKN2B-AS1 and lung cancer risk need to be validated by further large size studies.

Conclusions

Our findings provided new insights into the roles of rs2151280 in lung cancer risk in northern Chinese Han population. This hospital-based case-control study suggested that CDKN2B-AS1 rs2151280 T>C was associated with the risk of lung cancer. However, the gene–environment interaction between rs2151280 and smoking exposure was not statistically significant in this study.