Association of PD-1 polymorphisms with the risk and prognosis of lung adenocarcinoma in the northeastern Chinese Han population
- 190 Downloads
Lung cancer is a leading cause of death from cancer worldwide, especially non-small cell lung cancer (NSCLC). The marker of progression in lung adenocarcinoma, the main type of NSCLC, has been rarely studied. Programmed death 1 (PD-1) is an effective drug target for the treatment of NSCLC. Our study aimed to examine the PD-1 role in the disease process. The study of the effect of polymorphisms on the progression of lung adenocarcinoma in the Han population of Northeast China may provide a valuable reference for the research and application of these drugs.
Chi-square test, Wilcoxon rank sum test, and classification efficiency assessment were used to test SNPs of PD-1 in 287 patients and combined with clinical information.
We successfully identified biomarkers (rs2227981, rs2227982, and rs3608432) that could distinguish between lung adenocarcinoma patients of early stages and late stages. Multiple clinical indicators showed significant differences among different SNPs and cancer stages. Furthermore, this gene was confirmed to effectively distinguish the stages of lung adenocarcinoma with RNA-seq data in TCGA.
Out study indicated that the PD-1 gene and the SNPs on it could be used as markers for distinguishing lung adenocarcinoma staging in the Northeast Han population. Our investigation into the link between PD-1 polymorphisms and lung adenocarcinoma would help to provide guidance for the treatment and prognosis of lung adenocarcinoma.
KeywordsLung adenocarcinoma staging PD-1 Polymorphism SNP
Neutrophil to Lymphocyte Ratio
Non-small cell lung cancer
Programmed death 1
Small cell lung cancer
Support Vector Machine
White blood cell
Lung adenocarcinoma is the most common type of non-small cell lung cancer (NSCLC) and the leading cause of cancer death worldwide. It may develop as a result of the interaction between environmental risk factors and individual genetic susceptibility . Therefore, genetic susceptibility to lung adenocarcinoma has become an area of interest in lung cancer research [9, 7]. Almost all lung adenocarcinoma patients were diagnosed in advanced stages (IIIB and IV) . Lung adenocarcinoma develops silently and has no specific symptoms. Therefore, early lung cancer (Stage I and Stage II) is difficult to find. So accurate pathological staging is an important factor in the treatment of lung adenocarcinoma and plays an important role in the selection of postoperative treatment and prognosis. The Tumor, Node, Metastasis (TNM) staging system for NSCLC is an internationally accepted system used to characterize the extent of disease. Misdiagnosis still exists for patients with early hematogenous metastasis using TNM when imaging is not applicable.
Programmed death 1 (PD-1) plays a very important role in tumor immunity. PD-1 is currently the drug target of NSCLC . Various studies have evaluated the relationship between PD-1 polymorphisms and various cancer risks. rs2227981 (PD-1.5) and rs36084323 (PD-1.1) were both related to NSCLC . Previous studies reported the association between rs2227982 (PD-1.9) and gastric cardia adenocarcinoma  as well as breast cancer . rs7421861 could increase the risk of colorectal cancer in Chinese . Our study aimed to examine the effect of PD-1 and these four PD-1 polymorphisms on disease progression of lung adenocarcinoma.
In order to combine the lung adenocarcinoma staging information and clinical information, four polymorphisms of the PD-1 gene, including rs2227981 (PD-1.5), rs2227982 (PD-1.9), rs36084323 (PD-1.1) and rs7421861, were selected, and the genotypes of 287 patients were determined by SNaPshot primer extension assays.
Clinical characteristics of case and control subjects
Age(yr), mean ± SD
62.0 ± 10.1 (27–89)
62.5 ± 10.7 (40–91)
Basic information of SNPs
p_value for Hardy-Weinberg equilibrium
synonymous codon, upstream variant 2 KB
missense, upstream variant 2 KB
upstream variant 2 KB
intron variant, nc transcript variant
Six clinical indicators, including carcino-embryonic antigen (CEA), neutrophilicgranulocyte (GRAN), lactate dehydrogenase (LDH), lymphocyte (LYM), Neutrophil to Lymphocyte Ratio (NLR) and white blood cell (WBC), were measured in all samples .
DNA extraction and genotyping
rs2227981: 5′-TCTCCTGAGGAAATGCGCTGAC-3′ (forward) and 5′-TGGTGTCCCCAGATCACACAGA-3′ (reverse);
rs2227982: 5′-TCTCCTGAGGAAATGCGCTGAC-3′ (forward) and 5′-TGGTGTCCCCAGATCACACAGA-3′ (reverse);
rs36084323: 5′-CTCCCATTCTGTCGGAGCCTCT-3′ (forward) and 5′-GAAGGGGAGGTCAGCCTCACAG-3′ (reverse);
rs7421861: 5′-CCCAGCTGGAATGTCATTGAGAA-3′ (forward) and 5′-TTACACTCCCCTGTGCCAGAGC-3′ (reverse).
PCR was performed with 1 μL of DNA sample, 3.0 mM Mg2+, 1× GC-I buffer (Tahara), 1 μL multiple PCR primers, 0.3 mM dNTP and 1 unit HotStarTaq polymerase (Qiagen, Inc.) in a total volume of 20 μL. The PCR cycling program was as follows: 95 °C for 2 min; followed by 11 cycles of 94 °C for 20 s, 65 °C (decreased 0.5 °C per cycle) for 40 s and 72 °C for 90 s; plus 24 cycles of 94 °C for 20 s, 59 °C for 30 s and 72 °C for 90 s; with a final extension at 72 °C for 2 min and 4 °C forever. Next, 5 units shrimp alkaline phosphatase and 2 units Exonuclease I was added to the PCR product, incubated at 37 °C for 1 h and inactivated at 75 °C for 15 min for purification. SNaPshot multiple single base extension reaction was performed using 5 μL SNaPshot Multiplex Kit (Applied Biosystems), 0.5 μL 5′ ligase primer mixture (1.2 μM), 0.5 μL 3′ ligase primer mixture (1.6 μM), 2 μL ddH2O and 2 μL purified PCR product in a final volume of 10 μL. The reactions were cycled as follows: 28 cycles of 96 °C for 10 s, 55 °C for 5 s and 60 °C for 30 s, with the products subsequently kept at 4 °C. Purified extension product 0.5 μL was then combined with 0.5 μL Liz120 Size Standard and 9 μL Hi-Di, inactivated at 95 °C for 5 min and then sequenced and analyzed using an ABI 3730XL DNA Analyzer and GeneMapper 4.1 (Applied Biosystems Co.Ltd.USA), and the nucleotide at each SNP site was identified and recorded.
The statistical test of Hardy-Weinberg equilibrium has been an important tool for detecting genotyping errors in the past and is still important in the quality control of next-generation sequence data. The chi-square test was used to test whether the 4 SNPs were in Hardy–Weinberg equilibrium with Excel.
Wilcoxon rank sum test
The Wilcoxon rank sum test is a method often used in statistical practice to compare position measurements where the underlying distribution is far from normal or not known in advance . We used the Wilcoxon rank sum test to analyze differences of the six indicators between different TNM stages. Then, for the early stage (TNM I and TNM II) and late stage (TNM III and TNM IV) of both- or single-gender samples, differences of six clinical indicators were also tested, separately.
Logistic regression analyses
Correlations between lung adenocarcinoma stages and SNP genotypes were analyzed by logistic regression model under dominant and recessive models. Logistic regression analyses were performed using IBM SPSS (Statistical Package for the Social Sciences) Statistics ver. 17.0.
Classification efficacy assessment
To evaluate the efficiency of classifying early and late stage samples of genes and clinical indicator-encoding genes, a Support Vector Machine (SVM) classifier was constructed. Leave-one-out cross-validation (LOOCV) was carried out to assess the performance. The receiver operating characteristic (ROC) curves were plotted and the areas under the curves (AUC) were computed. Those with better classification efficiency indicated their better potential to act as biomarkers.
Differences of six indicators between different TNM stages
Association of PD-1 SNPs with lung adenocarcinoma in different models
Association of single SNPs with lung adenocarcinoma
Frequency of early stage samples
Frequency of late stage samples
Association of haplotypes with lung adenocarcinoma
Frequency of cases
Frequency of controls
The difference of six clinical indicators in different stages and different SNP genotyping
It was found that rs7421861 was significantly correlated with LDH in both- or single-gender samples. Correlation between four SNPs and other clinical indicators varied in different gender groups. For all samples, there was a significant correlation between rs2227981 and three indicators: CEA, NLR, and GRAN. The genotyping of rs2227982 and rs36084323 were significantly correlated with NLR, GRAN, and WBC. For male samples alone, there was a significant correlation between rs2227981 and three indicators: CEA, LYM, and LDH. The genotyping of rs2227982 and rs36084323 were significantly correlated with LYM, GRAN, and WBC. For female samples alone, there was a significant correlation between rs2227981 and five indicators: CEA, NLR, GRAN, WBC, and LDH. The rs2227982 and rs36084323 were significantly correlated with CEA, NLR, and LYM.
The differences in gender between the correlations of these SNPs and clinical indicators may provide a reference for the clinical test results of patients.
Classification efficiency evaluation of genes and clinical indicators
The classification efficiency of PD-1, PD-L1, CEA-encoding and LDH-encoding genes for both or single gender
PD-1 and its ligand, PD-L1, were drug targets of lung adenocarcinoma . In-depth study of their SNPs in the staging of lung adenocarcinoma may be helpful for the diagnosis and treatment of patients. Therefore, we selected 111 normal and 287 patients for the DNA extraction and genotyping, and the corresponding clinical indicators and stage information were analyzed. There was a certain correlation between four SNPs and the six clinical indicators. The significance of the correlation between clinical indicators and staging of lung adenocarcinoma of the two stages was higher than that of four stages. For two stages, PD-1 gene polymorphisms at three investigated positions and their haplotypes were associated with the risk and prognosis of lung adenocarcinoma in three genetic models (allele model, dominant model, and recessive model).
Based on SVM and LOOCV, CEA, LDH, and NLR could distinguish stages among the six clinical indicators. CEA was reported to be correlated with TNM stage and tumor invasion, and has become the marker of prognosis for lung adenocarcinomas . The elevated NLR is an independent prognostic marker for poor survival at NSCLC, which has been used as a simple and useful tool to predict the prognosis of lung adenocarcinoma after radical pneumonectomy [11, 16]. LDH provided energy to tumor cells by converting acetone acid into lactic acid, and has been linked to prognosis of small cell lung cancer (SCLC) in a previous study . PD-1 and PD-L1 could distinguish stages significantly, and their classification efficiency was higher than that of CEA and LDH related genes.
In view of the high classification efficiency of PD-1 and PD-L1 for staging and the significant correlation of their SNPs and haplotypes on them with staging, the polymorphism of PD-1 may be used as a marker for the staging of lung adenocarcinoma. Nevertheless, the data may be limited, for clinical data and SNP genotyping were obtained by our experiments, and the RNA-seq data were downloaded from TCGA, further evidence from other studies across different populations incorporating with the stage of cancers is required in order to confirm or refute the findings of this study.
In summary, our study examined the PD-1 role in the disease process with patients collected prior to treatment, indicating that the PD-1 gene and the SNPs on it could be used as markers for distinguishing between early and late stages of lung adenocarcinoma in the Northeast Han population. Our investigation into the link between PD-1 polymorphisms and lung adenocarcinoma would help to provide guidance for the treatment and prognosis of lung adenocarcinoma.
EH and WL performed the analyses and reviewed the manuscript. LC, KH and XW conceived and supervised the research. JL, GD, JX, CY and XZ read and approved the final manuscript. All authors read and approved the final manuscript.
LC was supported by the National Natural Science Foundation of China (Grant No. 61272388), she conceived and supervised the research; WL was supported by the Innovative Scientific Research Funding Project of Harbin Medical University (Grant No. 2017JCZX46), she performed the analyses and reviewed the manuscript.
Ethics approval and consent to participate
The study was approved by the the Second Affiliated Hospital, Harbin Medical University. The patients and their families have read and signed the informed consent form.
Consent for publication
Written informed consent was obtained from all patients included in this report for the use of clinical-related materials for scientific research and publications.
The authors declare that they have no competing interests.
- 6.Huang Q, Zhang H, Hai J, Socinski MA, Lim E, Chen H, Stebbing J. Impact of PD-L1 expression, driver mutations and clinical characteristics on survival after anti-PD-1/PD-L1 immunotherapy versus chemotherapy in non-small-cell lung cancer: a meta-analysis of randomized trials. Oncoimmunology. 2018;7:e1396403. https://doi.org/10.1080/2162402X.2017.1396403.CrossRefPubMedPubMedCentralGoogle Scholar
- 16.Wang G, Xiong R, Wu H, Xu G, Li C, Sun X, Xie M. Prognostic value of neutrophil-to-lymphocyte ratio in patients with lung adenocarcinoma treated with radical dissection. Zhongguo Fei Ai Za Zhi. 2018;21:588–93. https://doi.org/10.3779/j.issn.1009-3419.2018.08.04.CrossRefPubMedGoogle Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.