Lymph node ratio (LNR) as a complementary staging system to TNM staging in salivary gland cancer

Purpose The role of lymph node ratio (LNR, ratio of metastatic to examined nodes) in the staging of multiple human malignancies has been reported. We aim to evaluate its value in salivary gland cancer (SGC).
Methods Records of SGC patients from Surveillance, Epidemiology, and End Results database (SEER, training set, N = 4262) and Fudan University Shanghai Cancer Center (FUSCC, validating set, N = 154) were analyzed for the prognostic value of LNR. Kaplan–Meier survival estimates, the Log-rank χ2 test and Cox proportional hazards model were used for univariate and multivariate analysis. Optimal LNR cutoff points were identified by X-tile.
Results Optimal LNR cutoff points classified patients into four risk groups, R0, R1 (≤ 0.17), R2 (0.17–0.56) and R3 (> 0.56), corresponding to 5-year cause-specific survival in SEER patients of 88.6%, 57.2%, 53.1% and 39.7%, disease-free survival in FUSCC patients of 69.2%, 63.3%, 34.6% and 0%, and disease-specific survival in FUSCC patients of 92.3%, 90.0%, 71.4% and 0%, respectively. Compared with TNM staging, TNM + R staging showed smaller AIC values and higher C-index values in the Cox regression model in both patient sets.
Conclusions LNR classification should be considered as a complementary system to TNM staging and LNR classification based clinical trials deserve further research.
Electronic supplementary material The online version of this article (10.1007/s00405-019-05597-0) contains supplementary material, which is available to authorized users.


Introduction
Salivary gland cancer (SGC) accounts for 7-12% of head and neck cancers and has an increasing incidence [1][2][3]. Lymph node (LN) involvement is among the most important prognostic factors in SGCs [3][4][5][6]. Patients with pathological lymph node metastasis (pN+) are recommended for postoperative radiotherapy according to the National Comprehensive Cancer Network (NCCN) guideline [7]. Based on the number, laterality and size of metastasized lymph nodes, they are classified as N1-N3 according to the American Joint Committee on Cancer (AJCC) staging system [7].
To improve the prognostic system, one would intuitively take into account not only the number of positive LNs but also the number of LNs examined (LNE). Lymph node ratio (LNR), defined as the number of involved nodes divided by LNE, was found to improve prognostic information in breast, gastric, colorectal, bladder and skin cancers [8][9][10][11][12]. A subsequent study showed that the LNR also improved comparisons between institutions relative to AJCC N staging [13]. However, to date, there have been no reports using the LNR in the staging of SGCs.
This study examined whether patients with SGC can be classified into meaningful risk categories based on the LNR (R classification). A TRM staging system, which substitutes the R classification for the N classification of the TNM staging system, was also developed and compared with the TNM staging system alone and with the combined TNM and R staging system (TNM + R), to determine the potential clinical significance of the LNR.
Bo-Wen Lei, Jia-Qian Hu and Peng-Cheng Yu contributed equally to this work.

Patients
The datasets of two groups of patients were used in the current study. This research was approved by the institution's ethics committee. The SEER (Surveillance, Epidemiology, and End Results Program, https://www.seer.cancer.gov) database has been widely used for the analysis of LNR staging in colon, breast and skin cancer [11,14,15]. For the analysis of LNR staging of SGCs, the study population consisted of patients with a determined LNR and detailed N staging information. For the survival analysis, SEER cause-specific survival (CSS) was analyzed: deaths attributed to the cancer of interest were treated as events, and deaths from other causes were treated as censored observations. Cases in which the N classification was not concordant with the number of regional positive nodes were excluded. Finally, 4262 patients with detailed lymph node examination information were included, of whom 1210 were pN+, and all TNM stages were re-checked according to the 2010 AJCC staging system. Cases with unclassified T classification, M classification, grade or other variables were also kept in the analysis set to avoid loss of information and selection bias, and were defined as the Tx, Mx and unknown groups. For the comparison of the TNM and TRM staging systems, 3944 patients were analyzed, because 318 patients lacked sufficient information (Tx and Mx) to determine the AJCC 7th edition stage.
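The cause-specific coding described above can be illustrated with a minimal sketch. The status labels (`dead_of_cancer`, `dead_other_cause`, `alive`) and the function names are hypothetical and do not correspond to SEER field names; the product-limit estimator shown is the standard Kaplan-Meier formula, not the software used in this study.

```python
from collections import Counter


def css_event(vital_status: str) -> int:
    """Return 1 for death from the cancer of interest, 0 (censored) otherwise.

    The status labels are illustrative assumptions, not SEER codes.
    """
    return 1 if vital_status == "dead_of_cancer" else 0


def kaplan_meier(times, events):
    """Return [(time, survival)] at each distinct event time (product-limit)."""
    deaths = Counter(t for t, e in zip(times, events) if e == 1)
    leaving = Counter(times)  # every patient leaves the risk set at their own time
    at_risk = len(times)
    survival = 1.0
    curve = []
    for t in sorted(leaving):
        d = deaths.get(t, 0)
        if d:
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        at_risk -= leaving[t]
    return curve


# Toy follow-up data: (time in years, vital status)
records = [
    (1, "dead_of_cancer"),
    (2, "dead_other_cause"),  # censored for CSS
    (3, "dead_of_cancer"),
    (4, "alive"),             # censored at last follow-up
]
times = [t for t, _ in records]
events = [css_event(status) for _, status in records]
curve = kaplan_meier(times, events)
```

In this toy example the death from another cause at year 2 lowers the number at risk but does not lower the survival estimate, which is the practical meaning of treating it as a censored observation.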
The validating patient set was derived from the Fudan University Shanghai Cancer Center (FUSCC) dataset [5,6]. In total, 246 consecutive SGC patients underwent primary surgery at the Department of Head and Neck Surgery between January 1998 and January 2010. Anatomic compartment and level-based neck dissections were performed in 62.6% (154/246) of cases. To avoid bias caused by occult LN metastasis, only the 154 patients with neck dissection were enrolled in the current study. Pathologic examinations showed that 42.9% (66/154) of patients were pN+, which included 14 pN1 and 52 pN2 according to the AJCC staging system. Follow-up strategies for all patients include

Statistical analysis
The analysis was performed in four stages. At every stage of the survival analysis, we used the Log-rank χ2 test to identify the factors associated with mortality, followed by a multivariate analysis using forward stepwise regression with a Cox proportional hazards model. Survival rates and curves were calculated using the Kaplan-Meier method. Harrell's concordance index (C-index) and the Akaike information criterion (AIC) value of the Cox regression model were used to compare the predictive ability and relative goodness-of-fit of the regression models [16]. A smaller AIC value and a higher C-index value indicated a more desirable model for predicting the outcome. A P value < 0.05 was considered statistically significant. All statistical analyses were carried out using SPSS software version 17.0 (SPSS Inc., Chicago, IL, USA) and R software version 2.14.0 with the boot, MASS and survival packages. First, we evaluated the prognostic value of LNR as a continuous variable, adjusting for other covariates associated with CSS in the 1210 SEER pN+ cases. The stability of the results was tested by a bootstrap procedure, which applies the proportional hazards computations to full-size random samples of the patients drawn with replacement. We ran 10,000 iterations of this procedure.
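The two model-comparison measures named above can be made concrete with a minimal sketch. It assumes that each patient has a follow-up time, an event indicator and a model-derived risk score, and that the fitted model reports a (partial) log-likelihood; the function names are illustrative, pairs with tied follow-up times are simply skipped, and this is not the R code used in the study.

```python
def harrell_c_index(times, events, risk_scores):
    """Fraction of comparable pairs in which the patient who failed earlier
    has the higher predicted risk (ties in risk count as 0.5)."""
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if events[i] != 1:
            continue  # a pair is anchored on a patient with an observed event
        for j in range(n):
            if times[i] < times[j]:  # j was still under observation when i failed
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


def aic(log_likelihood, n_parameters):
    """Akaike information criterion: 2k - 2 log L (smaller is better)."""
    return 2 * n_parameters - 2 * log_likelihood


# Toy data: shorter survival paired with higher risk gives perfect concordance
toy_times = [1, 2, 3, 4]
toy_events = [1, 1, 0, 1]
c_perfect = harrell_c_index(toy_times, toy_events, [4, 3, 2, 1])
c_reversed = harrell_c_index(toy_times, toy_events, [1, 2, 3, 4])
c_constant = harrell_c_index(toy_times, toy_events, [1, 1, 1, 1])
```

A C-index of 0.5 corresponds to a model with no discriminative ability, which is why values closer to 1 are read as better prediction when two staging systems are compared on the same patients.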
In the second stage, we determined the most appropriate cutoff points for categorizing LNR into high-, medium- and low-risk groups. The optimal thresholds for LNR and the most appropriate method for deciding cutoff points differ among investigators [9,10,17]. In the current study, LNR modeling using spline smoothing functions was used to evaluate the effect of LNR on SEER CSS [17]. Two pairs of cutoff points were identified using different methods and compared with LNR as a continuous variable to identify the optimal cutoff points. The first pair of cutoff points was identified by tertiles, which split the patients into equal-sized groups [17]. The second pair of cutoff points was calculated by X-tile using the minimum P values from Log-rank χ2 statistics [18].
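As an illustration of the two ingredients mentioned above, the sketch below computes tertile cut points and a two-group Log-rank χ2 statistic. It is a simplified stand-in under stated assumptions: X-tile itself searches pairs of cut points over three groups and handles multiple testing, none of which is reproduced here, and the function names are hypothetical.

```python
import statistics


def tertile_cutoffs(values):
    """Two cut points that split the values into three roughly equal groups."""
    low, high = statistics.quantiles(values, n=3)
    return low, high


def logrank_chi2(times_a, events_a, times_b, events_b):
    """Two-group Log-rank chi-square statistic (1 degree of freedom)."""
    pooled = [(t, e, 0) for t, e in zip(times_a, events_a)]
    pooled += [(t, e, 1) for t, e in zip(times_b, events_b)]
    event_times = sorted({t for t, e, _ in pooled if e == 1})
    observed_minus_expected = 0.0
    variance = 0.0
    for t in event_times:
        n = sum(1 for s, _, _ in pooled if s >= t)                # at risk, both groups
        n_a = sum(1 for s, _, g in pooled if s >= t and g == 0)   # at risk, group A
        d = sum(1 for s, e, _ in pooled if s == t and e == 1)     # events, both groups
        d_a = sum(1 for s, e, g in pooled if s == t and e == 1 and g == 0)
        observed_minus_expected += d_a - d * n_a / n
        if n > 1:
            variance += d * (n_a / n) * (1 - n_a / n) * (n - d) / (n - 1)
    if variance == 0:
        raise ValueError("statistic undefined: no variance in the risk sets")
    return observed_minus_expected ** 2 / variance


# Toy example: every patient in group A dies before any patient in group B
chi2 = logrank_chi2([1, 2], [1, 1], [3, 4], [1, 1])
low, high = tertile_cutoffs([0.05, 0.10, 0.15, 0.20, 0.30, 0.40, 0.60, 0.80, 1.00])
```

A cut-point search in this spirit would evaluate the statistic for each candidate split and keep the one with the largest χ2 (smallest P value); because that search inflates the type I error, a correction or validation step is needed before the resulting P value is interpreted.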
In the third stage, to determine the clinical usefulness of LNR staging, the 4262 patients with detailed lymph node staging information were classified into four groups, R0-3, using the identified cutoff points (R classification). Either the N or the R classification, together with the variables associated with CSS, was entered into the multivariate Cox regression analysis to compare the predictive ability of the two classifications. The predictive accuracy for SEER CSS of the TNM, TRM and TNM + R staging systems (for TNM + R, both TNM staging and R classification were entered in the Cox model as two variables) was compared by entering each system individually into the Cox regression model with the covariates associated with CSS in the 3944 patients with detailed TNM staging information. Finally, the prognostic significance of LNR staging was validated in FUSCC patients. Because the SEER dataset lacks certain clinicopathologic characteristics of the tumor, such as lymphatic/vascular/extracapsular invasion and tumor size, the survival predictive model was not considered adequate for validation in a new patient set. We therefore validated the LNR classification itself in the FUSCC set. The N classification, R classification, TNM staging system and TNM + R staging system were each adjusted for all the covariates associated with DFS and DSS of the 154 FUSCC patients, as identified by the Log-rank χ2 test, to compare the predictive ability of the different staging systems.

LNR as a prognostic factor of SGCs by univariate and multivariate analysis
From the SEER dataset, 1210 pN+ patients were analyzed for the prognostic significance of LNR; their clinical details are presented in Table 1. Univariate analysis identified primary site, histologic subtype, grade, and T, N and M classification as prognostic factors of SEER CSS (cause-specific survival, Supplement Table 1). The Cox regression model confirmed that primary site, T, N and M classification, number of lymph nodes examined, LNR and age were significant prognostic factors of SGC SEER CSS (Table 2). Examination of 10,000 bootstrap resamplings of the data, with the AIC selection repeated in each, showed that primary site, T, N and M classification, LNE, LNR and age were retained as significant factors in 86.81%, 100%, 96.05%, 100%, 99.93%, 100% and 100% of the random samples, respectively.
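The retention percentages above come from repeating variable selection on bootstrap resamples. The sketch below shows the general pattern only: `select_variables` is a hypothetical callable standing in for the AIC-based Cox selection, and `toy_selector` is a deliberately crude placeholder, not a survival model.

```python
import random
from collections import Counter


def bootstrap_retention(rows, select_variables, n_iterations=1000, seed=0):
    """Percentage of bootstrap resamples in which each variable is retained.

    `select_variables` is an assumed callable that takes a resampled list of
    rows and returns the names of the variables kept by the selection step.
    """
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(n_iterations):
        sample = rng.choices(rows, k=len(rows))  # full-size sample, with replacement
        counts.update(set(select_variables(sample)))
    return {name: 100.0 * c / n_iterations for name, c in counts.items()}


def toy_selector(sample):
    """Placeholder selection rule: keep 'lnr' when patients with an event have
    a higher mean LNR than patients without one."""
    dead = [r["lnr"] for r in sample if r["event"] == 1]
    alive = [r["lnr"] for r in sample if r["event"] == 0]
    if dead and alive and sum(dead) / len(dead) > sum(alive) / len(alive):
        return ["lnr"]
    return []


toy_rows = [
    {"lnr": 0.8, "event": 1},
    {"lnr": 0.6, "event": 1},
    {"lnr": 0.1, "event": 0},
    {"lnr": 0.2, "event": 0},
]
retention = bootstrap_retention(toy_rows, toy_selector, n_iterations=1000, seed=0)
```

A variable retained in nearly all resamples, as reported for LNR, is unlikely to owe its significance to a few influential patients.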

Cutoff points identification of LNR
In the second stage, cutoff point identification, spline smoothing showed a linear trend between LNR and SGC mortality. The lower and upper tertile points of LNR were 0.17 and 0.55, which were defined as the first pair of cutoff points. The X-tile program then identified 0.17/0.56 as the second pair of cutoff points. Using each pair of cutoff points (0.17/0.55 and 0.17/0.56), we classified patients into three risk groups, R1, R2 and R3. Table 3 summarizes the univariate Log-rank χ2 test and the Kaplan-Meier survival estimates according to the risk groups defined by the specific LNR categories. Using the multivariate model identified in Table 2, the predictive accuracy of categorical LNR was compared with that of continuous LNR by the C-index and AIC value. As listed in Table 3, the cutoff points 0.17/0.56 gave a homogeneous patient grouping, the largest Log-rank χ2 value, the highest C-index and the smallest AIC value, indicating the best prognostic stratification and predictive accuracy. Therefore, the cutoff points 0.17/0.56 were used for further analysis.

R classification as a complementary system to TNM staging
Using the cutoff points 0.17/0.56, we then classified SEER patients with detailed pN staging (N = 4262) into four risk groups, R0-3. In contrast to the overlapping pN2 and pN3 CSS curves, the R classification CSS curves of the SEER set were clearly separated and did not cross (Fig. 1a, b). R classification also showed a higher C-index and a lower AIC value than N classification (Table 4) in the multivariate Cox regression model that included either R classification or N classification together with the variables identified by univariate analysis as covariates (Supplement Table 2). For the 3944 patients with detailed TNM staging information, when TNM, TRM and TNM + R staging were compared in a Cox regression model that included the variables associated with CSS (Supplement Table 3), TNM + R staging showed the highest C-index and the lowest AIC value, suggesting that TNM + R staging had the best predictive accuracy, followed by TRM staging (Table 4).
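The R classification can be written down as a short sketch. The cutoffs 0.17 and 0.56 and the group boundaries (R1 ≤ 0.17, R3 > 0.56) are taken from this article; the function names and the input validation are illustrative assumptions.

```python
def lymph_node_ratio(positive_nodes: int, examined_nodes: int) -> float:
    """LNR = number of involved nodes / number of lymph nodes examined (LNE)."""
    if examined_nodes <= 0:
        raise ValueError("at least one lymph node must be examined")
    if not 0 <= positive_nodes <= examined_nodes:
        raise ValueError("positive nodes must lie between 0 and examined nodes")
    return positive_nodes / examined_nodes


def r_classification(lnr: float, low: float = 0.17, high: float = 0.56) -> str:
    """Map an LNR to R0-R3 using the cutoff points 0.17/0.56."""
    if lnr == 0:
        return "R0"  # no involved nodes
    if lnr <= low:
        return "R1"
    if lnr <= high:
        return "R2"
    return "R3"


# Illustrative patients: (positive nodes, nodes examined)
examples = [(0, 20), (2, 25), (4, 16), (9, 12)]
groups = [r_classification(lymph_node_ratio(p, n)) for p, n in examples]
```

Two patients with the same number of positive nodes can fall into different R groups when the extent of nodal examination differs, which is the information that the N classification does not use.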

Validation of R classification in FUSCC patient set
To validate LNR staging in the FUSCC patient set, 66 pN+ cases were analyzed; the median number of LNE, the median number of positive LNs and the median LNR were 25 (range 4-86), 4 (range 1-75) and 0.250 (range 0.026-1), respectively. The multivariate Cox regression model confirmed that continuous LNR [hazard ratio (HR) 10.503; 95% confidence interval (CI) 3.084-35.770] and postoperative radiation (HR 0.406; 95% CI 0.185-0.892) were significant prognostic factors for DFS (disease-free survival) in the FUSCC patients with pN+. The Cox regression model also identified LNR (HR 15.72; 95% CI 2.364-104.5) as the only significant prognostic factor for DSS in these patients. When the R classification (categorical LNR using cutoff points 0.17/0.56) identified in the SEER set was compared with continuous LNR in predicting FUSCC patient DSS, the AIC value and C-index were 60.32 vs 65.55 and 0.802 vs 0.774, respectively (Supplement Table 4), which suggested that R classification performed better than continuous LNR in predicting DSS in FUSCC patients. For the clinical use of R staging, 154 FUSCC cases with neck dissection were analyzed. The Cox regression model included either N or R classification and the variables listed in Supplement Tables 5 and 6, and identified R classification as an independent prognostic variable for DFS and DSS in the set (Table 5).
Compared with pN classification, R classification showed better separation of survival curves (Fig. 1c-f), lower AIC values and a higher C-index for predicting DSS (Table 5). The TNM + R staging system was also superior to TNM staging in predicting SGC mortality (higher C-index and lower AIC values, Table 5).
Fig. 1 Kaplan-Meier survival estimates according to pN classification and R classification: cause-specific survival (CSS) of the SEER set with different pN classification (a) and R classification (b); disease-free survival (DFS) of the FUSCC set with different pN classification (c) and R classification (d); disease-specific survival (DSS) of the FUSCC set with different pN classification (e) and R classification (f); compared with the overlap of the pN classification survival curves, R classification showed better discriminatory ability for salivary gland cancer survival

Discussion
Extensive studies have demonstrated that LNR is superior to pN classification in predicting patient prognosis in breast, gastric, and colorectal cancer [8][9][10]14]. To the best of our knowledge, this is the first paper to discuss the role of LNR in SGC staging. Our analysis revealed that LNR is one of the most important prognostic factors of SGCs (retained as a significant prognostic factor in 100% of 10,000 bootstrap resamplings). The discriminative ability of the Kaplan-Meier curves (Fig. 1) and the smaller AIC and larger C-index values of the Cox regression models (Tables 4, 5) all support that the R classification and the TNM + R staging system predict survival more accurately than the N classification and TNM staging, respectively [16,19]. According to our findings, LNR staging was a better predictor of SGC survival and should thus be used to complement the TNM staging system.
Although the categorization of a continuous covariate (LNR) discards data and introduces measurement error, it brings simplicity and is therefore preferred in daily clinical practice [8]. Appropriate cutoff points are critical in the categorization of LNR because they provide consistent groupings between studies and ensure that each group contains an adequate number of individuals and events [17]. In this paper, we adopted two pairs of cutoff points for LNR. One pair was based on fixed centiles, specifically tertiles. The other pair was calculated by the X-tile program, which selects cutoffs by the minimum P values from Log-rank statistics and uses cross-validation to control the inflated type I error and limit the loss of information that arise from multiple testing [9,18]. When compared with continuous LNR, the regression model with categorical LNR (for both pairs of cutoff points) showed lower AIC and higher C-index values, supporting that R classification does not decrease the statistical power for survival prediction. We recommend the cutoff points 0.17/0.56 identified by X-tile for further validation and clinical use because of the homogeneous patient grouping and the lowest AIC and highest C-index values in both patient sets. However, the issue of optimal cutoffs remains open, and cutoffs based on an individual dataset need further validation.
Identifying high-risk patients and selecting patients for postoperative therapy are two clinically important requirements of a novel staging system. Only 0.68% (29/4262) of SEER cases were classified as N3, whereas 9.29% (396/4262) were classified as R3, while the 5-year CSS decreased from 43.3% (N3) to 39.7% (R3). This shift in staging was also confirmed in the FUSCC set, which had no pN3 patients, whereas 10.39% of FUSCC patients were identified as R3, with 5-year DFS and DSS both 0%. Postoperative radiation is recommended for all pN+ cases in SGC [7]. Concurrent chemoradiotherapy has also been reported to result in excellent local control in a subgroup of SGC patients with adverse prognostic factors [20]. However, within specific N classifications, no survival benefit of radiation was observed in either the SEER or the FUSCC set. As presented in the Results and Fig. 2, no survival benefit was achieved in R1 FUSCC patients. For R2 patients, improvements in both DFS and DSS from postoperative radiotherapy were observed. For R3 patients, although significantly improved 3-year DSS (from 0 to 10.2%) and DFS (from 0 to 25%) were observed, the survival rates were still low, suggesting the need for more intensified therapy. Compared with N classification, R classification is superior for selecting high-risk patients and better predicts treatment benefit. R classification-based analysis and design of clinical trials deserve further research.
The strength of the present study draws on the complementary data collection systems and the cross-validation of the SEER and FUSCC datasets. SEER data are extracted retrospectively from registries comprising 26% of the US population, which is considered representative of the entire population, so selection bias, recall bias, treatment fads, the influence of loss to follow-up and other oversights associated with a single institution's data collection were minimized [14,15]. Nevertheless, inter-institution differences in patient management and unrecorded details of pathologic reports and covariates may compromise the outcome analysis in the SEER data. The FUSCC data were from a single center, with the same team of oncologists and pathologists managing all patients, and all the potential covariates were analyzed in the current study. There is no generally accepted guidance for the postoperative management of SGC. Nevertheless, an appropriate postoperative follow-up strategy for SGC patients is critical to both the management of the disease and the accuracy of survival data. De Felice et al. recommend a routine combination of a complete head and neck exam and diagnostic imaging exams (DW-MRI and/or CT with contrast) [2]. In our institution, apart from physical examination and head MRI, SGC patients also receive a neck ultrasound exam for the early detection of suspicious lymph nodes. While the patient series in one institution is small and the results only represent that center's experience, the current study validated the LNR staging system in two independent patient sets and showed the valuable predictive ability of LNR as a prognostic factor.
Table 4 Different staging systems for SEER cause-specific survival (CSS) of SEER patients with salivary gland cancer
a For N classification and R classification, a total of 4262 patients with the number of lymph nodes examined were analyzed. For TNM staging, TRM staging and TNM + R staging, a total of 3944 patients with sufficient TNM staging information were enrolled
b N classification and R classification were adjusted for primary site, histologic type, histologic grade, site-directed surgery, radiotherapy, the number of lymph nodes examined, T classification, M classification and age of the SEER patients (variables identified in Supplement Table 2). TNM staging, TRM staging and TNM + R staging (TNM staging and R classification as two variables enrolled in the Cox regression together) were adjusted for primary site, histologic type, histologic grade, site-directed surgery, radiation, the number of lymph nodes examined and age of the SEER patients (variables identified in Supplement Table 3)
c TRM staging system was defined as I: T1-R0-M0; II: T2-R0-M0; III: T3-R0-M0, T1-R1-M0, T2-R1-M0, T3-R1-M0; IVA: T4a-R0-M0, T4a-R1-M0, T1-R2-M0, T2-R2-M0, T3-
Although the LNR is superior to the N classification for prognostic staging, the R classification still has limitations. It can only be used for postoperative staging, and the number of nodes harvested is an important determinant of the LNR. Although LNE was a significant prognostic factor in the SEER dataset (Table 2), it was rejected by multivariate analysis in the FUSCC dataset (Table 5). One possible explanation is that standardized anatomic compartment and level-based LN dissection, together with pathologic review of at least 4 LNs, in one high-volume hospital could lower the effect of the LNE in the prognostic model. Therefore, compartment-based neck dissection and examination of a sufficient number of LNs may still be necessary for accurate LNR staging of SGCs.

Conclusion
We identified the LNR as an independent prognostic factor of SGC, and the R classification (LNR = 0, 0 < LNR ≤ 0.17, 0.17 < LNR ≤ 0.56 and LNR > 0.56) stratifies SGC mortality adequately. R classification is complementary to the TNM staging system. R classification-based stratification of patients for postoperative therapy and clinical trials deserves further research.