Background

Small cell lung cancer (SCLC) is the most aggressive and deadly form of lung cancer characterised by rapid growth, early metastasis, and high rates of acquired therapeutic resistance [1, 2]. Due to the nature of the disease, the majority of patients have metastatic disease at time of diagnosis leading to poor overall survival (OS) [3]. Over the last decades, improvements in cancer treatment have led to improved survival in non-small cell lung cancer (NSCLC) [4], but in SCLC patients, this impact on OS has been absent until lately, where the introduction of immunotherapy has shown promising results in clinical trials for this patient group [5, 6]. Though not all patients benefit from the available treatments, and for some patients, the course of the disease at time of diagnosis is fast and aggressive, therefore, the clinicians need guiding tools to predict the patient’s prognosis and the natural history of the disease. Moreover, to make improvements in the treatment of SCLC patients, we need prognostic markers that can identify patients who are at high risk of an inferior survival. By doing so, patients can be stratified into optimal treatment regimens or follow-up programmes which hopefully will lead to improved patient survival.

As one of the hallmarks of cancer [7], inflammation has been suggested as a prognostic marker [8]. Hence, general inflammation markers like C-reactive protein (CRP), leucocytes, or lymphocytes have been studied and shown some potential as prognostic markers in several cancers, even though results have been conflicting [9, 10]. Using individual inflammation markers as a measure of the inflammation status is a simplistic approach to a complex system. Therefore, inflammation scores that combine these general inflammation markers have been developed and proven to be prognostic markers of inferior survival in several cancers including NSCLC [10,11,12,13,14,15]. In SCLC, however, the prognostic value of inflammation scores is less studied, just as studies have shown inconsistent results [16, 17]. Therefore, we performed a systematic review to explore the literature on inflammation scores in SCLC. Furthermore, we performed a meta-analysis to investigate the prognostic value of pre-treatment neutrophil-to-lymphocyte ratio (NLR) and platelet-to-lymphocyte ratio (PLR) in SCLC patients.

Materials and methods

Data sources and search strings

A systematic search was carried out investigating the existing literature of inflammation scores in SCLC. The review was performed following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [18]. The search was made in the databases PubMed, MEDLINE, Embase, and Web of Science on the 20th of March, 2020 with no time restriction. All databases were filtered for English, and PubMed and Embase were filtered for “not animals” in addition. Studies were selected using terms defining Lung cancer (“Lung cancer”, “Lung neoplasm*”, “Lung Neoplasms”[Mesh], “Lung carcinoma”), general inflammation markers (“Lymphocyte*”, “Lymphocytes”[Mesh], “Lymphocyte Count”[Mesh], “Neutrophil*”, “albumin*”, “Neutrophils”[Mesh], “CRP”, “C-reactive protein*”, “C-Reactive Protein”[Mesh], “albumin”, “Albumins”[Mesh]) and inflammation based scores (“glasgow prognostic score*”, “neutrophil to lymphocyte ratio*”, “neutrophil-lymphocyte ratio*”, “lymphocyte ratio*”, “inflammation score”, “inflammation-based score”, “inflammation index”). The full search string is available in Supplementary Text S1.

Inclusion and exclusion criteria

The studies included in this review met the following inclusion criteria: (1) original data, (2) human studies, (3) patients with a pathologically proven histology of SCLC, and (4) studies evaluating a combination of general inflammation markers as a prognostic factor. Studies were excluded based on the following exclusion criteria: (1) language other than English; (2) papers without original data as reviews, meta-analyses, guidelines, editorials, comments, and letters to the editor; (3) conference abstracts or case reports including fewer than five cases; and (4) animal or in vitro studies. In case of two publications based entirely or partly on the same study population, the study containing the highest number of patients was included. According to the inclusion and exclusion criteria, two authors (BSP and AWL) screened the first 500 titles and abstracts to validate the inclusion and exclusion criteria. Disagreements were settled by discussion and consensus. The remaining titles and abstracts were screened by BSP. Two authors (BSP and AWL) read and included/excluded 30 randomly selected articles, and the remaining articles were assessed by BSP. The reference management tools Endnote (Clarivate Analytics) and Covidence (covidence.org) were used for identification of duplicates.

Data extraction and quality assessment

Data extracted from the studies included name of first author, publication year, inclusion period, sample size, study design, and follow-up time. Furthermore, clinical characteristics of the study population and information on the inflammation score including cut-off, and risk estimates of the association with OS were extracted. Studies were split into two, and data extraction was performed by two authors (BSP and AWL); each author extracting data from half of the studies. All data extraction were checked by the other author. Both authors quality assessed the articles included in the study based on a modified version of the Quality of Prognosis Studies Tool (QUIPS) [19], and articles were rated as high quality, moderate, or low quality. The protocol for the study was submitted to the PROSPERO database (registration number CRD42020188553).

Statistical analyses

For the individual inflammation score, a meta-analysis was performed if the score was evaluated in at least five studies with extractable risk estimates. Risk estimates included in the study were HR for the inflammation scores association with OS along with a 95% CI values or a beta coefficient and a standard error. Publication bias was evaluated by visual inspection of a funnel plot and by the Begg’s and Egger’s tests. Heterogeneity between the included studies was tested by using the Cochran Q and I2 [20], where I2 < 50% and p > 0.10 were set as cut-offs to define heterogeneity. In case of no significant heterogeneity, a fixed-effects model was applied; otherwise, a random-effects model was used. Sensitivity analyses were performed by excluding the low-quality studies and studies with predefined cut-offs to assess the robustness of the pooled estimate. Data were analysed by Stata software version 15.1 (Stata Corporation, College Station, TX, USA), and all p values were two-sided and considered significant if < 0.05.

Results

Study selection

A total of 5563 publications were identified through searching the online databases; 2570 of these were excluded due to duplication. Titles and abstracts were screened for eligibility which led to exclusion of 2891 irrelevant articles. Full texts of the remaining 102 articles were thoroughly reviewed, and 70 articles were excluded due to various reasons: abstract, n = 26; not possible to extract data on SCLC, n = 23; no data on outcome, n = 8; no data on SCLC patients, n = 5; no original data, n = 3; other language than English, n = 2; overlapping cohort, n = 2; wrong subject, n = 1. Finally, 33 articles met the inclusion criteria for the current systematic review. The inclusion and exclusion procedures are illustrated in Fig. 1.

Fig. 1
figure 1

Flow chart of the inclusion and exclusion procedures. SCLC, small cell lung cancer

Study description and quality assessment

Baseline characteristics of the studies included are listed in Table 1. In summary, all included studies were retrospective studies published between 2008 and 2020. The majority of studies were conducted in Asia (n = 23) [17, 21, 23,24,25, 27,28,29,30, 32, 33, 36,37,38, 41,42,43,44, 46,47,48,49], primarily China, whereas eight studies [16, 22, 26, 31, 34, 35, 50, 51] originated from Europe and two from the USA [39, 40, 45]. A total of 7762 patients with SCLC were included with the number of patients included in each study ranging from 46 to 938. In more than half of the studies (n = 19) [17, 21, 23,24,25, 27,28,29,30, 33, 36, 37, 41, 42, 44, 45, 48,49,50], the included patients were a mixture of patients with limited disease (LD) and patients with extended disease (ED), whereas only patients with LD were included in eight studies [16, 26, 31, 34, 40, 43, 46, 47] and only patients with ED were included in five studies [32, 35, 38, 39, 51]. For one study, the stage of disease was not described [22]. The presence of liver metastasis was described in seven studies [30, 32, 35, 38, 45, 48, 51] and ranged from 16 to 47% of patients with ED. The median/mean age of included patients varied from 56 to 72 years, and 6–50% of the included patients were female. Smoking was assessed in 26 studies [21, 23,24,25,26,27, 29,30,31,32,33,34, 36, 38,39,40,41,42,43,44,45,46,47,48, 50, 51] demonstrating that 28–100% of the patients had a history of smoking. Furthermore, the performance status was estimated in 26 studies [16, 17, 21,22,23,24,25,26,27, 29, 32,33,34, 36, 38,39,40, 42,43,44,45, 47,48,49,50,51] demonstrating that 33–100% of the patients were in a good performance defined by Eastern Cooperative Oncology Group Performance Status < 2 or Karnofsky performance scale ≥ 80.

Table 1 Studies investigating inflammation scores as prognostic biomarkers in small cell lung cancer patients (n = 33)

Eight different inflammation-based scores were identified. The inflammation-based scores neutrocyte-to-lymphocyte ratio (NLR) and platelet-to-lymphocyte ratio (PLR) were the most frequently evaluated (NLR: n = 23; PLR: n = 22). Other identified inflammation-based scores were modified Glasgow prognostic score (mGPS, n = 5), lymphocyte-to-monocyte ratio (LMR, n = 2), monocyte-to-lymphocyte ratio (MLR, n = 2), advanced lung cancer inflammation index (ALI, n = 2), systemic immune-inflammation index (SII, n = 2), and haemoglobin, albumin, lymphocyte, and platelet score (HALP, n = 1).

For mGPS, cut-offs were identical and predefined in all studies. For the remaining scores, predefined cut-offs were defined in seven studies [16, 17, 24, 26, 43, 44, 50], while data-dependent cut-offs were employed in 19 studies [21, 23, 25, 27, 28, 30, 31, 33,34,35,36,37, 40,41,42, 46,47,48, 51]. In two studies evaluating different scores, NLR was evaluated based on a predefined cut-off, whereas the cut-off for PLR were data dependent [39, 45]. As a consequence, cut-offs varied substantially for NLR, 2.3–5.0; PLR, 110.43–250; LMR, 2.615–6.08; ALI, 19.5–31.1; and SII, 748.51–1600. The inflammation scores were evaluated as prognostic biomarkers based on a blood sample collected at diagnosis of the lung cancer (n = 8) [21, 24,25,26,27, 29, 33, 50] or before start of treatment (n = 23) [16, 17, 22, 23, 28, 30,31,32, 34,35,36,37, 39,40,41,42,43,44,45,46,47, 49, 51]. For two studies [38, 48], the time of blood sampling was not reported. Based on the quality assessment, included studies were ranked from low to high quality; 21 studies were ranked low quality [16, 21, 22, 24, 26,27,28,29,30, 33,34,35, 37, 38, 41, 42, 45,46,47, 50, 51], ten studies ranked moderate quality [17, 23, 25, 31, 36, 39, 40, 43, 44, 48], and two studies ranked high quality [32, 49].

Inflammation scores and overall survival

In the identified 33 studies, an adjusted risk estimate of mortality risk between patients with a low versus a high score could only be retrieved in 25 studies (NLR, 16 [17, 21, 24,25,26, 30, 31, 36, 39, 40, 42, 43, 45, 48, 50, 51]; PLR, 7 [24, 25, 38, 40, 42, 45, 46]; LMR, 2 [42, 48]; MLR, 2 [34, 51]; mGPS, 4 [29, 32, 38, 49]; SII, 3 [24, 41, 42]; ALI, 1 [23, 27]). Rating and adjustment variables for the individual study are listed in Table 1. The studies were rated from low quality to high quality. The applied adjustment variables were reported in 20 studies [17, 23,24,25,26, 30,31,32, 36, 38,39,40,41,42,43, 45, 46, 48,49,50] while no information on adjustment variables were reported in five studies [21, 27, 29, 47, 51]. For inflammation-based scores evaluated in more than five studies, a meta-analysis was performed.

NLR and overall survival

A substantial between-study heterogeneity was observed in the 16 studies evaluating NLR (Q = 43.62 on 16 df; P < 0.0001; I2 = 63.3%; p = 0.03), why a pooled HR was estimated using a random-effects model. A high NLR was found to be associated with a 39% increased risk of death in patients with SCLC (HR = 1.39 (95% CI, 1.23–1.56), Fig. 2).

Fig. 2
figure 2

Forrest plot of hazard ratio for the association between neutrophil-to-lymphocyte ratio and overall survival in patients with small cell lung cancer. N, number of included patients; CI, confidence interval; LD, limited disease; ED, extended disease. # Two cut-points (NLR 3 and 4) were evaluated. aNLR were evaluated in two individual subgroups: LD and ED

Due to an observed asymmetry in the funnel plot, publication bias was suspected (Supplementary Figure 1), which was supported by the Eggers test (p = 0.04) and a tendency observed in the Begg’s test (p = 0.06). Hence, a Trim and Fill analysis was performed to account for absent studies, and an adjusted pooled random-effects HR was calculated. The pooled HR remained significant, even though the estimate was slightly reduced (HR = 1.23 (95% CI, 1.13–1.42)).

Since the meta-analysis included a large number of studies ranked low quality, a sensitivity analysis was performed including only the moderate quality ranked studies. Evaluating only the nine studies of moderate quality, we found a pooled HR of 1.51 (95% CI, 1.29–1.81). Furthermore, in nine studies, data-dependent cut-points were applied; hence, a sensitivity analysis was performed including only studies with predefined cut-points. Here we found a pooled HR of 1.34 (95% CI, 1.15–1.58).

PLR and overall survival

In the studies of PLR, a considerable between-study heterogeneity was also detected (Q = 26.87 on 7 df; P < 0.0001; I2 = 74%; p = 0.12), and again, a pooled HR was estimated using a random-effects model. A high PLR was associated with a 20% increased risk of death in patients with SCLC; however, the estimate of the risk increase was ranged from − 4 to 51% (HR = 1.20 (95% CI, 0.96–1.51), Fig. 3). An asymmetry in the funnel plot was observed and publication bias was suspected (Supplementary Figure 2). However, Begg’s test (p = 0.17) and Egger’s test (p = 0.22) did not identify publication bias.

Fig. 3
figure 3

Forrest plot of hazard ratio for the association between platelet-to-lymphocyte ratio and overall survival in patients with small cell lung cancer. N, number of included patients; CI, confidence interval; LD, limited disease; ED, extended disease. aPLR were evaluated in two individual subgroups: LD and ED

Discussion

In this systematic review, we explored the literature on inflammation scores as prognostic biomarkers in SCLC. We identified eight different inflammation scores and evaluated their ability to predict OS in SCLC. By the use of meta-analyses, we demonstrated that NLR leads to a 39% increase in mortality in SCLC patients. This finding was confirmed after taking the risk of publication bias into account, even though the pooled HR was reduced. Opposite, no association with OS could not be confirmed for PLR. All studies included in the review were retrospective studies, just as the overall quality of the studies was low. Nevertheless, the association was confirmed after omitting low-quality studies from the analysis. Regardless of these hesitations, as the first, this review collects the available literature on inflammation scores in SCLC and evaluates their potential as prognostic biomarkers in SCLC.

Inflammation is recognised as one of the hallmarks of cancer, why inflammation scores based on general inflammation markers have been demonstrated as a prognostic biomarker in several cancers [10,11,12,13,14,15]. Though, so far a hypothesis explaining the biological mechanisms behind these various inflammation scores, and especially, a hypothesis explaining why inflammation scores are associated with mortality, has been absent. A potential part of this puzzle could be interleukin-1β, as inhibition of interleukin-1β was shown to lead to a decrease in lung cancer incidence as well as mortality in atherosclerotic patients [52], and high levels of interleukin-1β has been observed along with anaemia, neutrophilia, lymphopenia, low levels of albumin, and increased CRP in patients with rheumatic disease [53]. In this comprehensive review, all identified inflammation scores, but mGPS, were based on lymphocyte count in various combinations with neutrophils, platelets, monocytes, albumin, haemoglobin, and BMI. NLR was the most frequently evaluated score (n = 23) closely followed by PLR (n = 22). For NLR, which can be a reflection of interleukin-1β associated neutrophilia and lymphopenia, high scores were related to inferior survival. Similarly, in the NLR-modified scores (ALI and SII) a high score was associated with inferior survival. For PLR, however, the association with OS could not be confirmed. A possible explanation for this could be that the PLR only includes one general inflammation marker (lymphocyte) affected by the interleukin-1β. Consequently, the HALP score, which is a modification of the PLR by including haemoglobin and albumin, should have an improved association with survival theoretically. Unfortunately, however, survival data were not reported for the HALP score [37]. The only score not including lymphocyte count is the mGPS, which is based on two other markers potentially reflecting interleukin-1β: albumin and CRP. We identified five studies [22, 29, 32, 38, 49] evaluating the prognostic potential of mGPS in a total of 954 SCLC patients with both LD and ED. A high mGPS score was overall associated with reduced OS, though one of the studies [22] did not report an adjusted HR. Due to the low number of studies, a meta-analysis could not be performed. However, in other cancer types like NSCLC, the prognostic value of mGPS has been established even in meta-analyses [54, 55] indicating that mGPS could be a valuable biomarker in SCLC as well.

In all scores except mGPS, data-dependent cut-offs based on survival were frequently applied (n = 19). When survival is used to decide how to define a cut-off in a given biomarker, the likelihood of the given biomarkers’ ability to be able to predict survival is enlarged enormously. Thus, this could be a potential bias to this study. Therefore, we performed a subgroup analysis excluding studies with data-dependent cut-offs and demonstrated only a slight reduction of the combined risk estimate indicating that data-dependent cut-offs were not a major bias in our study. Furthermore, due to the data-dependent nature of cut-offs, the cut-offs varied tremendously for all scores with a tendency of higher predefined cut-offs compared to data-dependent cut-offs.

Until now, only one systematic review on NLR in SCLC has been performed [56] identifying an association between NLR and OS. In the previous review, 21 studies were included of which six were posters from international conferences and two of these could not be identified in the PubMed, Embase, WOS, or Scopus databases. Moreover, a quality assessment was not performed in the previous review. Besides, in this study, 13 additional studies evaluating NLR as a prognostic marker in SCLC were identified owed to the comprehensive search, leading to a more wide-ranging evaluation of NLR as a prognostic biomarker.

The strength of this study is the comprehensive systematic review of the available literature on inflammation scores in SCLC. We used four internationally recognised databases applying broad search terms to include all inflammation scores available. Though we included specific search terms for well-known scores as NLR, PLR, and mGPS, thus, the likelihood of retrieving more results on these specific scores is present. To counter this potential skewness in identified scores, we included broad search terms in our search strategy including individual inflammation markers and various terms covering the inflammation score term. An additional strength of the review is the quality assessment performed by two authors. The quality assessment is essential to identify biases of a magnitude to affect study results. We applied a modified version of the QUIPS [19], which assesses six important domains being: study participation, study attrition, prognostic factor measurement, confounding, measurement and account, outcome measurement, and analysis and reporting.

Nevertheless, the study faces some limitations. Firstly, all studies were retrospective and the overall quality of the included studies was low. Additionally, substantial heterogeneity was observed between the included studies as I2 values of 63% for NLR and 74% for SCLC were observed. The I2 measure the variation in the estimates caused by-study differences. In general, an I2 > =50% indicate moderate heterogeneity and an I2 > 75% indicate substantial to considerable heterogeneity [57]. These thresholds are arbitrary. However, the I2 cannot stand alone but should be considered in combination with the forest plot. If the estimates vary but point towards the same conclusion in the forest plot as is seen for NLR (Fig. 1) and PLR (Fig. 2), a substantial heterogeneity can be present, but it would be of questionable clinical importance [57]. Finally, a language bias cannot be excluded as we only included studies written in English.

Conclusion

This review identifies that inflammation scores based on general inflammation markers do have some potential as prognostic markers in SCLC patients. The conducted meta-analyses demonstrated that NLR was associated with mortality. Furthermore, mGPS showed some potential as a prognostic marker of inferior survival, though only in a limited number of studies. Hence, inflammation scores as NLR could be clinically relevant as a prognostic marker in the treatment of SCLC patients.