Background

For the past few years, immune checkpoint inhibitors that inhibit the interaction between programmed cell death protein-1 (PD-1) and its tissue ligand programmed cell death ligand-1 (PD-L1) have shown active and enduring clinical efficacy in a variety of solid tumors, including non-small cell lung carcinoma (NSCLC) [1,2,3,4]. However, considering the potential immune-related adverse reactions and high cost, identifying patients who may respond to anti-PD-1/PD-L1 treatment is particularly important. According to recent data, high levels of tumor PD-L1 expression usually indicate an enhanced likelihood that a patient will benefit from treatment with PD-1/PD-L1 inhibitors, [2, 5, 6] nonetheless, a subset of patients negative for PD-L1 expression can still respond to treatment [1, 3]. Although tumor PD-L1 expression is not a perfect biomarker with regard to its ability to predict the efficacy of immunotherapy, PD-L1 detection by immunohistochemistry (IHC) is currently the most convenient and economical method, and it has commonly been used to identify patients who may be more likely to benefit from immunotherapy with PD-1/PD-L1 inhibitors.

As the adverse effects of immunotherapy such as hyperprogressive disease [7,8,9] are better understood, screening of potentially beneficial populations is becoming increasingly important. Unfortunately, many patients with NSCLC are diagnosed at an advanced stage and sufficient samples are not available for PD-L1 detection because these patients cannot benefit from surgery; sometimes, they cannot tolerate even small biopsies. Malignant pleural effusion (MPE) is a common complication of NSCLC, especially in adenocarcinoma (AC). MPE samples are easy to acquire, and this type of cell block sample is often used as an alternative or sometimes as the only adequate sample for various relevant tests, such as IHC and molecular testing related to targeted therapy [10, 11]. In the past few years, studies on the applicability of small biopsies and cytology samples for PD-L1 IHC detection have become very common [12,13,14,15,16,17,18]. To the best of our knowledge, only few data have been published on the reliability of MPE cell blocks for PD-L1 testing; these studies are limited because of the small sample size of the MPE cell blocks [19, 20] or the lack of matched experiments [21, 22]. Due to the heterogeneity of PD-L1 expression by tumor cells, assessing the reliability and comparability of PD-L1 testing on MPE cell blocks by a comparison study is necessary.

As mentioned in many studies, PD-L1 evaluation is indeed a challenge for pathologists because it is sometimes difficult to distinguish tumor cells from immune cells, and this challenge is even more pronounced in cytology samples [23, 24]. To our knowledge, it has not yet been reported whether IHC double staining for nuclear proteins such as thyroid transcription factor-1 (TTF-1) and PD-L1 can resolve this issue.

Herein, the specificity of PD-L1 expression on MPE tumor cells is explored by comparison with matched histology samples, and the relationship between clinicopathological features and tumor PD-L1 expression is also analyzed. Furthermore, IHC double staining for TTF-1 and PD-L1 was also performed in this study, and its application value for interpretation of PD-L1 tumor cell staining was evaluated.

Methods

Cases

We retrospectively searched the electronic database of the Department of Pathology, the First Affiliated Hospital of Zhejiang University, Hangzhou, China, for data on MPE cell block samples obtained between January 01, 2015, and December 31, 2017. In all, 124 pairs of MPE cell blocks with matching histology specimens from patients diagnosed with NSCLC were included in the study.

Specimen preparation

The histology specimens were obtained by surgical resection or small biopsy. After fixation in 10% neutral-buffered formalin (10% NBF), samples were transferred to a Leica ASP300S Fully Enclosed Tissue Processor (Leica Biosystems, Buffalo Grove, IL, USA) for paraffin embedding.

The MPE cell blocks were prepared once malignant cells were confirmed by cytodiagnosis. Briefly, 200–800 ml of fresh pleural effusion fluid was centrifuged at 1600 x g for 3 min. The target cells, which were sampled from the surface of the nucleated cell layer, were then transferred into a pointed centrifuge tube containing 10 ml 10% NBF and oscillated for at least 15 min. Next, the samples were centrifuged again to remove the fixative and were then resuspended in 75% alcohol. Subsequently, a third centrifugation was performed to replace 75% alcohol with 95% alcohol; the material was treated gently to keep the cell mass intact. Afterwards, the samples remained immobile for more than 3 h, until the cell mass hardened and contracted. Finally, well-formed cell pellets were transferred to a dehydrator for final processing with the same procedure used for the histology tissues. All the samples were cut consecutively into 3-μm-thick sections for hematoxylin and eosin (H&E) staining and IHC analysis.

IHC quantification of PD-L1 expression and evaluation

Quantification of PD-L1 expression was performed using a Ventana PD-L1 (SP263) Rabbit Monoclonal Primary Antibody assay according to the manufacturer’s instructions. The tumor proportion score (TPS) was evaluated by a qualified pathologist who was trained in scoring PD-L1 expression. Membrane staining (local/global) at any intensity greater than background staining was evaluated, and only viable tumor cells (VTCs) were scored. Cases with too few VTCs (<100) were considered inadequate and were not included. PD-L1 expression was finally divided into three categories according to the TPS: < 1% (negative), 1–49% (low expression) and ≥ 50% (high expression). The staining intensity score (SIS) of PD-L1 expression was also recorded, as follows: 3+ (strong), 2+ (moderate), and 1+ (weak). A subset of paired samples with discrepant results in terms of PD-L1 expression was repeatedly tested, and the results were interpreted by an additional pathologist. The expression pattern of TTF-1 from IHC double-stained sections was taken as reference if necessary.

IHC double staining with PD-L1 and TTF-1

Positive cases with inconsistent PD-L1 expression in the paired samples (excluding 2 MPE cell blocks and 4 biopsy samples with insufficient remaining material), as well as other difficult-to-interpreted samples, were sectioned again for IHC double staining with anti-TTF-1 (clone SPT24, OriGene, USA) and anti-PD-L1 (clone SP263). In all, 32 samples (20 MPE cell blocks and 12 histology samples) from 18 lung AC patients with positive PD-L1 expression were enrolled.

Automated Ventana IHC analysis for PD-L1 and TTF-1 was successively performed on the same slide according to the manufacturer’s instructions for each antibody. The PD-L1 detection procedure was first performed according to the previous single staining process, which was followed by a subsequent automated Ventana IHC analysis for TTF-1 (dilution 1:100) using the same Ventana Benchmark ULTRA staining platform with an ultraView Universal AP Red Detection Kit (Ventana Medical Systems, Tucson, AZ, USA). The external positive/negative controls for PD-L1- and H&E-stained slides were also established.

Statistical analysis

The consistency of PD-L1 IHC expression among matched samples was analyzed by calculating the kappa coefficient (weak consistency for kappa value<0.4, moderate for kappa value = 0.4–0.74 and good for kappa value ≥0.75), and their difference was compared using the marginal homogeneity test. Pearson’s chi-square test or Fisher’s exact test was used to compare the consistency of the PD-L1 expression rate between matched samples according to different factors (sample interval time, treatment, sampling method of histology samples, and number of VTCs). Fisher’s exact test was also employed to compare the satisfaction rate among histology samples with different sampling methods, while the McNemar-Bowker test was used when MPE cell blocks and matched histology specimens were compared between two groups. The Mann-Whitney U-test was performed to explore the correlation between the PD-L1 expression in patients with their characteristics (sex, age, smoking status, and clinicopathological diagnosis). The Kruskal-Wallis test was applied to compare PD-L1 expression in histology samples for which different sampling methods were used. All the data were analyzed using SPSS 24.0 (IBM Corporation, Armonk, NY, USA), and the two-sided significance level was set at p < 0.05.

Results

Clinicopathological features of patients and specimens

In all, 124 paired NSCLC specimens were collected from 124 patients with a median age of 61 years (range, 29–85 years) (Table 1). Most of the samples were satisfactory, and the adequacy of paired specimens was similar (91.1% vs 89.5%). For histology specimens, the adequacy of various sampling methods was slightly different, but the difference was not statistically significant.

Table 1 Clinical and pathologic details of patients and specimens

PD-L1 expression in matched specimens

Excluding 23 unsatisfactory cases, 101 paired samples were successfully analyzed. Expression of PD-L1 was concordant in most cases (86/101, 85.1%) among the three TPS cut-offs (<1%, 1–49% and ≥ 50%), and the consistency of PD-L1 expression between MPE cell blocks and matched histology samples was confirmed by the kappa test (kappa = 0.774, p = 0.000<0.05). However, compared with matched histology samples, MPE cell blocks had a higher rate of positive PD-L1 expression (39.6% vs 30.7% with TPS ≥50%), and the major difference was a focus on biopsy and corresponding MPE cell block samples (p = 0.005<0.05) (Table 2). PD-L1 expression was discordant in 15 paired cases, which were all ACs (Table 3). Among them, PD-L1 expression was higher in 2 histology specimens and 13 MPE cell blocks compared with their corresponding samples.

Table 2 PD-L1 expression in MPE cell blocks and matched histology samples
Table 3 Detailed information of discordant cases for PD-L1 expression among paired samples

By comparing the SIS of PD-L1 expression in matched samples, we found that the intensity of PD-L1 positive staining in MPE specimens was often stronger than that in corresponding histology samples (p = 0.000<0.05) (Fig. 1) and that the coincidence rate was 73.3%. However, after excluding the 15 pairs of discordant cases, the difference was not significant (p = 0.074) (Table 4).

Fig. 1
figure 1

Inconsistent PD-L1 expression in matched MPE cell block and histology samples from three NSCLC patients (magnification × 40). Representative images of hematoxylin–eosin staining for a lymph node biopsy (a), a pleural biopsy (e), a CTG-CN biopsy (i) and the matched MPE cell blocks (c, g, k). PD-L1 IHC (SP263) shows higher PD-L1 expression and/or stronger staining intensity in MPE cell blocks compared with matched histology samples (d vs b; h vs f; l vs j). PD-L1, programmed cell death ligand-1; MPE, malignant pleural effusion; CTG-CN, computed tomography-guided core needle biopsy

Table 4 Consistency of staining intensity scores for the PD-L1 IHC assay between paired samples

Correlation between PD-L1 expression and various factors

In this study, if PD-L1 expression was inconsistent between the matched samples in one patient, the higher score was considered the final result. Compared with squamous cell carcinoma (SCC) and positive smoking status, patients with AC or nonsmoking status had higher tumor PD-L1 expression rates (p = 0.013 and 0.009). Considering the association between nonsmoking status and lung AC, we separated lung cancers into two independent groups (AC and SCC) and re-studied the relationship between the expression of tumor PD-L1 and smoking status in each group. We observed a possible trend toward statistical significance in AC subgroup (p = 0.052) (Table 5). When 1% was used as the threshold for TPS, smokers in the AC subgroup were more likely to have negative expression (TPS<1%) of PD-L1 (p = 0.043<0.05). No significant difference was observed in PD-L1 expression among samples obtained from different tissue sources or by different sampling methods.

Table 5 PD-L1 expression stratified by clinicopathologic features

IHC double staining with anti-TTF-1 and anti-PD-L1

Twenty-nine of the 32 samples were subjected to IHC double staining with antibodies to TTF-1 and PD-L1. The remaining 3 cases were excluded for insufficient VTCs after the blocks were re-sectioned. As expected, IHC double staining allowed an easier analysis of IHC quantification of PD-L1 expression, especially when the malignant cells were distributed singly and interspersed with non-neoplastic cells (Fig. 2). Unfortunately, a portion of the double-stained cases showed weaker staining intensity (Fig. 3) and a low PD-L1 expression score (p = 0.000<0.05) compared with cases stained using the IHC single PD-L1 assay (Fig. 4).

Fig. 2
figure 2

Immunocytochemical double staining with antibodies against TTF-1 (red) and PD-L1 (brown) in MPE cell block section (magnification × 40). a Hematoxylin-eosin. b Double staining easily distinguishes difficult-to-identify tumor cells from nonneoplastic cells

Fig. 3
figure 3

Discrepancy of tumor PD-L1 expression between immunohistochemical (IHC) single and double staining (magnification × 40). Single PD-L1 IHC staining (a, c) shows a higher tumor proportion score and stronger staining intensity compared with double IHC staining with anti-PD-L1 and TTF-1 (b, d) in both histology sample (a vs b) and MPE cell block (c vs d)

Fig. 4
figure 4

Comparison of tumor proportion scores (TPS) for PD-L1 expression between immunohistochemical (IHC) single and double staining. Double staining results in a lower TPS for PD-L1 expression compared with single staining. IHC single staining was performed using a VENTANA PD-L1 (SP263) Rabbit Monoclonal Primary Antibody assay. IHC double staining was performed using an automated Ventana IHC assay for TTF-1 (dilution 1:100; SPT24 clone, Leica, USA) with an ultraView Universal AP Red Detection Kit (Ventana Medical Systems, Tucson, AZ) on the basis of the IHC PD-L1 single-staining process

Discussion

The feasibility of using cytology samples for PD-L1 expression testing has been well reported [15, 20, 23, 25, 26]. However, to the best of our knowledge, the relevant data on the application of MPE cell block samples in PD-L1 testing are limited. And there is also no clear understanding of the specificity of this type of sample for PD-L1 detection. The results of the current study further confirmed the concordance between MPE cell blocks and matched histology samples in PD-L1 expression detection, which would certainly increase the confidence in the clinical application of MPE cell blocks to predict the efficacy of immunotherapy.

To better contrast our work with that in previous studies, we used three TPS cut-offs (< 1, 1–49% and ≥ 50%), which were used in the 22C3 assay, to evaluate PD-L1 expression. This decision was carefully considered and was based on the high concordance between the 22C3 and SP263 assays [14, 24, 27, 28]. In our study, the expression rate of PD-L1 with TPS ≥50% as a cut-off for either histology (29.7%) or cytology (37.7%) was similar to that noted in two previous studies of consecutive sample analyses [23, 25]. More importantly, we observed a significant difference in PD-L1 expression between MPE cell blocks and biopsy samples, which suggests that MPE cell block samples may be more suitable for PD-L1 detection than small biopsy samples.

The reason for the higher positive rate of PD-L1 expression that is often detected in cytology samples has remained unclear. Heterogeneity of tumor PD-L1 expression (including inter-tumoral and intra-tumoral) may explain this phenomenon [19, 29]. However, whether there are other potential causes is worth further exploring. We speculate that the suspended distribution of tumor cells in pleural fluid might be related to a higher PD-L1 positive rate of the MPE cell block. On the one hand, the dispersed suspension distribution may partially mitigate the effects of heterogeneity of tumor PD-L1 expression. However, due to the small size of the material, the results of PD-L1 expression obtained by evaluating small biopsies are significantly affected by intratumoral heterogeneity [30, 31]. The underestimation on the PD-L1 status of a whole tissue sample based on evaluating single biopsy from patient has been reported [32]. On the other hand, the distribution pattern of tumor cells with PD-L1 positivity has been described in previous literature, and it was found that these cells are more likely to be located at the tumor-stroma interface [19]. Single or tiny clusters of tumor cells suspended in pleural effusion may have more opportunities to interact with the immune microenvironment, in a similar manner as the tumor cells at the tumor-stroma interface. In fact, a previous study has suggested that there may be an immune interaction between pleural effusion tumor cells and macrophages [22]. Moreover, studies have confirmed that both macrophages and T cells in the tumor immune microenvironment can induce tumor cells to express PD-L1 through their distinct patterns [33, 34]. Therefore, it is reasonable to believe that PD-L1 expression in tumor cells in pleural effusion may be enhanced by an activated immune response.

To better clarify the possible reasons for the difference in PD-L1 expression between cytology and histology samples, we analyzed a number of relevant factors, including sample interval time, treatment differences, sampling types of histology samples, and the number of VTCs, but no significant correlations were found. The only difference before analysis was the fixation time in 10% NBF (15–30 min for MPE cell blocks vs 2–24 h for histology samples). Although the fixation method is indeed an important pre-analytical factor that affects the IHC results, [35, 36] in terms of PD-L1 detection, the current data indicated that different fixatives (formalin only vs methanol/alcohol only vs both) did not have significant effects on the evaluation of PD-L1 expression [18, 23]. Moreover, based on years of clinical practice, a 15-min fixation was considered to be sufficient for scattered single cells or tiny cell clusters because the cells were well mixed with 10% NBF in an oscillator. Nevertheless, the possible effects of fixation time should be further analyzed by more rigorous comparative tests. Furthermore, potential errors in the analytic and postanalytic phases were excluded by repeated testing and interpretation by multiple pathologists (for 15 pairs of inconsistent samples). The expression patterns of TTF-1 from IHC double-stained sections were also taken as references to exclude errors that could be caused by mistaking non-tumor cells for tumor cells.

Much research has reported the correlation between clinicopathological features and PD-L1 expression. However, no consensus has been reached. In our study, we found that patients with lung AC were more likely to have PD-L1-positive tumors than patients with SCC, which was consistent with the results of a recent meta-analysis [37] but different from a report on East Asian populations [38]. Since most of the samples included in this study were AC, we cannot exclude the influence of the small sample size of SCC on the reliability of the results. In addition, we found that for patients with advanced lung AC, positive smoking status was shown to be a negative factor for PD-L1 expression, which was different from the findings of some previous studies [22, 37, 39]. This contradiction may be related to the finding that self-reported smoking status does not accurately represent the presence of a molecular smoking signature [40].

Due to the cluster structure and malignant morphology of tumor cells, the evaluation of PD-L1 expression in MPE samples was not a problem for most cases. However, when these features were not common enough and tumor cells were interspersed with nontumor cells, the calculation of TPS was challenging. In this case, additional IHC stains must be utilized to identify tumor cells. To the best of our knowledge, no studies on the use of IHC double staining for PD-L1 detection have been published. We found only one study in which the author used double staining to detect PD-L1 and CD68, but they did not present relevant data [14]. Herein, we clarify the value of double staining in the interpretation of PD-L1 expression, but we also found that this method impaired PD-L1 expression detection compared with the results of a standard single staining process. We postulated that this might be related to the secondary antigen retrieval and multiple wash steps in the use of a second dye, which results in the elution of some of the protein-bound chromophores that had been bound to the tissues. We believe this can be resolved through further process optimization and calibration settings, which will be a focus of future studies.

As one of the limitations of the current study, small biopsy tissues accounted for the vast majority (91%) of the histology samples. The application of biopsy samples for tumor PD-L1 detection has its own limitations, which may affect the reliability of the results in the present study to some extent. Unfortunately, it isn’t feasible to compare the results between samples from surgical resection and MPE cell block as patients with advanced NSCLC have little chance to do surgery. In addition, the sections used for double staining were not obtained by continuous sectioning, which increased the impact of intratumoral heterogeneity on the results. Finally, this study was unable to further clarify the value of the clinical application of a relatively higher PD-L1 expression rate in MPE cell blocks due to a lack of relevant clinical treatment information, such as immunotherapy response rate and prognosis.

Conclusions

In summary, our results demonstrate that MPE cell block samples are good candidates for PD-L1 expression detection in advanced NSCLC patients. The mechanism and clinical significance of the higher PD-L1 expression rate of MPE cell blocks compared with small biopsy samples should be evaluated prospectively.