Introduction

Breast cancer is the most common neoplasm in women. Nodal status as determined by pathologic examination of lymph nodes has repeatedly been shown to be the single most important predictor of survival in breast cancer [1]. The absolute number of pathologically involved nodes has also been shown to be an important prognostic factor in breast cancer survival [26]. The extent of lymph node involvement is incorporated into prognostic indices such as the Nottingham Prognostic Index [79] (see Additional files 1,2,3,4,5,6). Old lymph node stage measures categorized cases according to whether they had none, one to three, or four or more involved nodes, and recently according to more detailed subdivisions [10, 11] (see Additional files 2,3,4,5 and 7).

However, several authors have noted the inherent confounding by the number of excised nodes [12, 13]. To address the variability of nodal examination, an intuitive approach is to use the proportion or the percentage of involved nodes, as was suggested by Rostgaard and coworkers [14]. The proportion can immediately be derived from pathology reports that clearly state the total number of lymph nodes examined and the total number of involved nodes [1, 15]. The proportion has received increasing attention in the literature, providing a reference base on which its clinical relevance may be discussed [13, 14, 1627].

In this report the modeling utility of the proportion of involved nodes is compared with the absolute numbers of involved nodes and of examined nodes. There is a one-to-one correspondence between proportion and ratio between involved and uninvolved nodes. A previous study hinted at an apparent linear relationship with survival between involved and uninvolved nodes (Fig. 1) [28], and therefore this report also examines the utility of expressing ratios as odds instead of proportions.

Figure 1
figure 1

Joint effect of the numbers of involved nodes (npos) and uninvolved nodes (nneg) on survival in T1–T2 breast cancer. Part of the contour plot was partially filled at the corners by padding. The pattern of isoprobability contours radiating from the origin suggests that similar ratios of involved/uninvolved nodes were associated with similar Kaplan–Meier survival estimates (for example. 8 npos/10 nneg has approximately the same 75% [contour line 0.75] 5-year survival chance as 4 npos/5 nneg). Reproduced with permission from Vin-Hung and coworkers [28]. Colors were omitted in the original publication.

The absolute numbers considered in the study were the number of nodes examined (excised; nx), the number of involved nodes (np), and the number of uninvolved nodes (nn).

Methods

The SEER (Surveillance, Epidemiology, and End Results) program of the USA [29] provides extensive cancer incidence data from 11 population-based registries. The data used in the present study were extracted from nine of those registries: San Francisco-Oakland, Connecticut, Metropolitan Detroit, Hawaii, Iowa, New Mexico, Seattle (Puget Sound), Utah, and Metropolitan Atlanta.

Selected patients were women without a previous history of cancer who presented with a noninflammatory invasive breast carcinoma, which was histologically confirmed and diagnosed between 1988 and 1997, with specified tumor size no larger than 50 mm (T1 and T2), strictly confined to breasts without distant metastasis, and in which curative surgery and axillary lymph node dissection were performed with removal of at least one node. Cases with involvement of skin, hypodermis or pectoral muscles, or with deep fixation were excluded. Patients who had undergone subcutaneous mastectomy, radical mastectomy, or preoperative or intraoperative radiotherapy were excluded. Data on systemic treatment were not available and therefore could not be taken into account. Certain records were rejected because of data quality concerns: uncertain sequence of treatment, nonhospital based data records, month of diagnosis unknown, or race unknown. Examination of outliers (scarce and extreme values) resulted in further exclusion of cases with more than 50 nodes examined, '0 months' of follow up, and age at diagnosis under 25 years or older than 95 years.

The follow-up cut-off date was 31 December 1999. The survival end event was defined as death from breast cancer.

The proportions of involved nodes were expressed as percentages ([np/nx] × 100%). The log odds of nodal involvement were computed using the empirical logistic transform: L = Loge([np + 0.5]/ [nn + 0.5]) [30]. The transform, also called the sample logit, avoids singularities caused by null observations, and is the least biased estimator of the true log odds [31]. (Note that, with hindsight, Fig. 1 shows a logarithmic relationship.) Unadjusted mortality (the number of patients who died divided by the number of patients at risk) as a function of the ratios was used for descriptive purposes.

The utilities of the percentage and log odds were evaluated in different multivariate Cox proportional hazards models [32]. The numbers np and nx, the percentage (np/nx) × 100%, and the L transform were entered as quantitative continuous variables in different combinations. The corresponding hazard ratios were each time computed within a Cox model that included tumor size, age at diagnosis, and year of diagnosis modeled as quantitative continuous variables; and the registry area, race, marital status, tumor topography, histologic type and grade, estrogen and progesterone receptor status, type of primary surgery, and administration of postoperative radiotherapy modeled as qualitative variables. The qualitative variables were converted or expanded as needed into dummy variables to allow binary coding ('married' versus 'not married', 'high grade' versus 'not high grade', for example, and so on). A first order interaction between type of surgery and postoperative radiotherapy was included for consistency with a previous analysis [33]. The models were computed in all cases irrespective of nodal status, and then as a function of positive or negative nodal status. The functional forms were assessed using the generalized additive model procedure [34].

The Nagelkerke R2 index (R2 N) was used to score the different Cox models [35]. R2 represents the proportion of variation explained by covariates in regression models [3537]. R2 N divides R2 by its maximum attainable value to scale it to within the range 0–1. R2 N is close to 1 for a perfectly predictive model, and close to 0 for a model that does not discriminate between short and long survival times.

Statistical analyses were performed using Splus (Insightful Corporation, Seattle, WA, USA) statistical software.

Results

In the 2002 SEER release [29], 188,410 women were diagnosed with breast tumors from 1988 to 1997, of whom 132,457 had a hospital based histopathologic diagnosis of unilateral invasive carcinoma. A total of 83,686 cases matched the selection criteria; 58,070 were node-negative and 25,616 node-positive. The median follow-up time was 73 months (range 1–143 months) for patients still alive at the follow-up cut-off date (31 December 1999). Characteristics of the patients were presented elsewhere [33]. Except for some additional cases due to updated registration minus the exclusion of outliers resulting in 90 fewer cases, there were no noticeable differences in the distribution of the characteristics.

The median number of nodes examined was 15 (range 1–50, mean ± standard deviation 15.4 ± 6.5). Among the node-positive patients, the median number of involved nodes was 2 (1–46, 4.1 ± 4.8).

Table 1 shows the distribution of the percentages of involved nodes. Figure 2 is a plot of the corresponding breast cancer mortality, which appears to increase linearly with the np/nx percentage.

Table 1 Distribution of the percentages of involved nodes and corresponding unadjusted mortality
Figure 2
figure 2

Unadjusted breast-cancer mortality as a function of the percentage of involved nodes in T1–T2 breast cancer based on the SEER (Surveillance, Epidemiology, and End Results) program data. Dot size computed as a step function of the number of patients at risk: smallest dots 1–20 patients and the largest dots >200 patients. The straight line highlights the trend but should not be interpreted as the basis for extrapolation.

Table 2 shows the distribution of the log odds for nodal involvement. Figure 3 plots the corresponding breast cancer mortality. There is an initial, almost flat segment for values of L ≤ -3, which is followed by a steeply sloping upward segment. The initial flat segment corresponds mostly to node-negative cases. The sloping upward segment corresponds to node-positive cases, with more positive L values indicating more involved nodes and/or fewer uninvolved nodes. There is an overlap between node-negative and node-positive cases for L values between -3.5 and -1.

Figure 3
figure 3

Unadjusted breast cancer mortality as a function of the estimated log odds of nodal involvement in T1–T2 breast cancer. Red dots are node-negative patients, and blue are node-positive patients. The smallest dots represent 1–20 patients and the largest dots represent >200 patients. The straight lines highlight the different slopes but should not be interpreted as the basis for extrapolation (they would extrapolate to <0% or >100% mortalities).

In multivariate analyses, np and nx exhibited marked nonlinearity and widely diverging confidence intervals (Fig. 4a,4b). The linearity improved for the percentage (np/nx) × 100% and the L transform, which also showed more homogeneously distributed confidence intervals (Fig. 4c,4d).

Figure 4
figure 4

Adjusted breast cancer mortality in T1–T2 node-positive breast cancer as a function of (a) number of nodes examined, (b) number of involved nodes, (c) percentage of involved nodes, and (d) log odds of involved nodes. Dotted lines indicate the 95% confidence interval. Plots are based on multivariate models that included all non-nodal covariates listed in the Methods section and a single nodal covariate. Analysis of cases pooled irrespective of nodal status revealed a more marked nonlinearity in plot c between 0% and 20% involved nodes. Otherwise, different combinations of nodal covariates gave similar plots.

The upper section of Table 3 shows a comparison between proportional hazards models that included different combinations of np, nx, np/nx, and L for all patients, irrespective of nodal status. Based on R2 N, the best predictive covariate was L (model 6), with a small improvement contributed by nx (model 10). In the model with nx alone (model 3), nx was statistically significant but its contribution to global model fit appeared negligible because the R2 N did not change from the baseline 0.069 (model 1). The contribution of np alone was substantial, with a change of R2 N from baseline 0.069 to 0.093 (model 2). However, adding np and/or nx onto L or onto np/nx provided no improvement, except in the already mentioned model 10.

The middle section of Table 3 shows multivariate analysis performed for node-positive cases only. Models based on separately expressed numbers provided the lowest R2 N (models 2–4). The largest R2 N values were all observed in models incorporating L or np/nx (models 5–11). The simplest model appeared to be based on np/nx alone (model 5; R2 N = 0.108). A small improvement was contributed by np (model 7; R2 N = 0.109).

The lower section of Table 3 shows the analysis performed for node-negative cases only. Because np, by definition, equals 0, there are only four models. They show that nx (model 3) and L (model 6) are statistically significant, but these variables either alone or in combination did not improve the index R2 N.

The multivariate computations were also performed by considering death from any cause as the end-point. There were no notable discrepancies.

Discussion

Although many data were evaluated in the present study, there are weaknesses. The data are heterogeneous. Histopathologic characteristics such as grade could not be verified. Neoadjuvant systemic treatment might have modified the yield of nodes [38]. Important information such as how patients were selected for any particular treatment is missing. Undocumented comorbidity might have affected the extent of nodal dissection. An imbalance in the delivery of chemotherapy or hormone therapy could have affected the distribution of deaths. For all of these reasons, the present results should be considered explorative and must be validated independently.

Since about 1999 a growing number of studies have investigated nodal ratios. In the studies that compared the numbers of involved nodes with ratios in multivariate models, the majority found that ratios were better than numbers as prognostic indicators [13, 16, 18, 19, 24, 26]. Ratios (expressed as percentages or log odds) have a better prognostic impact than do isolated numbers and, unlike numbers, they are not associated with inconsistent findings. Part of the explanation might be that a ratio can be interpreted as a form of standardization in which the number of involved nodes found in a patient is standardized to the number of nodes examined in that same patient [20]. It is noteworthy that the hazard ratios for np/nx were almost unaffected by the model (column np/nx [%] in Table 3), whereas the hazard ratios for np and/or nx exhibited more variability (columns np and nx in Table 3).

As a prognostic factor, np/nx appears the most convenient. Figure 2 shows that, for node-positive cases presenting with 0–10% involved nodes, the crude breast cancer mortality risk for an average follow up of 6 years is about 5%, and with 90–100% involved nodes the mortality is about 45%. For any intermediate value for the percentage of node involvement, the mortality risk is easily interpolated.

The estimated log odds L provided results very similar to those with np/nx. Overall, L improves on np/nx when all cases are considered together (column L in Table 3). The log odds appears useful for integrating node-negative and node-positive cases while avoiding more complex modeling, which we performed previously [39]. However, there is a range of L values in which node-negative and node-positive cases overlap (Table 2, Figure 3). In an analysis of all cases pooled, the overlap might blur the prognostic difference between node-negative status based on a very small number of excised nodes, and node-positive status based on a large number of excised nodes but with few involved nodes. The literature on the log odds of node involvement is scarce, and the utility of the L transform needs independent confirmation.

Table 2 Distribution of the estimated log odds of nodal involvement, and corresponding unadjusted mortality
Table 3 Comparison of models

The present findings indicate that the favorable survival attributed to higher numbers of nodes removed, as suggested by Krag and Single [40], might be due to different model specifications. The number of patients is huge and statistical significance can easily be demonstrated but without necessarily implying any major clinical impact. Undoubtedly, the uncertainty about node negativity increases when nx (the number of excised nodes) is small. However, the predictive utility attributable to nx is exceedingly small (Table 3, lower section). This dissociation between statistical significance and predictive utility appears counterintuitive. Nevertheless, it is in keeping with findings from Fisher and coworkers [41], who noted that prognosis was unaffected by the number of excised nodes when nodal status was reported to be negative. This is also supported by a recent report based on 3800 patients [42] in which the number of excised nodes was predictive of the risk for recurrence in node-positive but not in node-negative patients.

Sentinel node biopsy has gained wide acceptance since 1997 and it is used to determine the need for axillary dissection [43]. Because our selection of patients was from 1988 until 1997, it is unlikely that sentinel nodes could have represented any substantial part of the present study. The prognostic impact of one involved node in patients who had one node removed in this study cannot be extrapolated to the patient found with one involved node in a sentinel node procedure. However, in the prediction of nonsentinel node involvement when one or more sentinel nodes are found to be involved, Cserni and coworkers [44] reported that the number of sentinel nodes and the percentage of positive sentinel nodes were jointly significant predictors. A closely related finding that also highlights the predictive role of ratios was reported in a recent Australian study [45], in which the prediction model was determined by patient age, by the number of sentinel nodes, and by the proportion of involved sentinel nodes.

Conclusion

We found the percentage of involved nodes to be the most directly useful indicator of nodal involvement, but this is limited to node-positive cases. The log odds of nodal involvement performed equally well in node-positive and node-negative patients. The log odds might provide a unified approach to the modeling of nodal involvement. The present results and the growing literature argue that ratios should be considered in the staging of axillary dissection.