Papillary thyroid carcinoma (PTC) is the most common type of differentiated thyroid carcinoma, and its age-adjusted incidence has doubled in the last 25 years.1 Despite its relatively good prognosis, with a cancer-specific survival above 90 %, locoregional recurrence (LRR) is common.2 With recognition of the stepwise progression of lymph node metastasis (LNM) originating from the central (level VI) to the lateral compartment (levels II–V), a growing number of surgeons are advocating routine prophylactic central neck dissection (pCND) at the time of the total thyroidectomy.3 Although the role of pCND has remained controversial because there is little evidence to suggest that it improves long-term outcomes when compared to no pCND, analysis of short-term markers (such as stimulated thyroglobulin) (sTg) seems to suggest pCND may improve short-term outcomes.47

Because patients with clinically apparent or palpable LNM in the lateral compartment (or N1b) are at significantly higher risk of developing LRR and distant metastases when compared to those with clinically inapparent LNM, size of LNM is believed to be a prognostic factor in patients with PTC.814 However, to our knowledge, no study has specifically addressed the significance of size of central LNM (CLNM) in patients who routinely undergo pCND. For patients who undergo pCND after total thyroidectomy, the incidence of histologically proven CLNM ranged 40–50 %.47 However, because these CLNM are often small in size and may only be seen under high-power magnification, their clinical significance have been questioned.12,13 A recent study reported a series of 72 patients who underwent a pCND and found that none had CLNM ≥2 mm in size (macrometastasis or pN1mac), whereas only 18 patients (25.0 %) had CLNM <2 mm (micrometastasis or pN1mic) present in the central lymph nodes (CLNs).15 We hypothesized that size of CLNM might have a negative impact on prognosis, with larger CLNM leading to poorer prognosis. Postsurgical detectable sTg level (DsTg) after radioiodine (RAI) ablation is a good surrogate for persistent or recurrent disease after total thyroidectomy.1619 Therefore, the present study aimed to evaluate the association between size of CLNM and rate of postablation DsTg in patients who underwent an unilateral pCND with particular emphasis on the incidence and significance of pN1mic and pN1mac.

Patients and Methods

From 2005 to 2011, a total of 250 consecutive patients with PTC underwent surgery in our institution. All were managed by the same surgical team. Of these, 142 (56.8 %) underwent a routine unilateral pCND at the time of the total thyroidectomy. None had evidence of CLNM preoperatively on ultrasonography or intraoperatively. Patients with concomitant lateral lymph node metastases (N1b) or distant metastases (M1) were excluded. All operations were performed by two endocrine surgeons. The decision for a prophylactic unilateral central neck dissection (pCND) was based on the personal preference of the operating surgeon and not based on tumor size or other tumor characteristics. However, there was a tendency to perform more pCND in the latter part of the study period. During the study period, all resected specimens were examined by the same group of pathologists in our institution by a standardized technique. Specimens containing <3 CLNs retrieved during pCND (n = 38) were excluded to reduce the possibility of nodal under-staging.20 Also, patients with anti-thyroglobulin (Tg) autoantibody titer of >400 (n = 10) were excluded. Therefore, a total of 94 patients were eligible for analysis. Of these, 40 (42.6 %) had no demonstrable LNM on hematoxylin and eosin stain (or pN0) (group I), whereas 54 (57.4 %) had ≥1 demonstrable metastatic focus in CLN on hematoxylin and eosin stain. To measure the size of the CLNM, the relevant slides were retrieved and reexamined by two independent pathologists (A.H.T., T.W.S.). All metastatic foci were measured to the nearest tenth of a millimeter with a built-in computerized measurement program on the microscope (Nikon Eclipse E600). For patients with multiple metastatic foci, only the largest microscopic focus was recorded. To further ensure that the study only evaluated the significance of size of CLNM in the prophylactic setting, 5 patients with CLNM ≥1 cm were excluded because they could not be considered as prophylactic. To evaluate the association between size of CLNM and other clinicopathologic characteristics, patients were categorized into 3 groups: group I (pN0) (n = 40), group II (size of CLNM <2 mm) (pN1mic) (n = 20) and group III (size of CLN metastasis ≥2 mm) (pN1mac) (n = 29).

Methods

All relevant clinical, laboratory, radiologic and perioperative data were collected prospectively, and follow-up data were regularly updated in a computerized database. The present study protocol was approved by the local institutional review board. Patient clinicopathologic features, sTg and postoperative outcomes were compared between the three groups.

Management of PTC

Details of surgical treatment, criteria for RAI ablation, postoperative care and follow-up protocol have been described previously.11 In brief, total thyroidectomy was the preferred procedure for all patients with a preoperative diagnosis of PTC. A similar extent of unilateral pCND was performed for all patients, regardless of tumor size or extent.6 The pCND consisted of the removal of all nodes and fibro-fatty tissue extending vertically from the hyoid bone to the thoracic inlet and laterally from the medial border of common carotid artery to the midline of the trachea. The ipsilateral recurrent laryngeal nerve was mobilized and skeletonized along its entire cervical course. Parathyroid autotransplantation was readily performed. sTg was defined as a Tg level measured in the presence of thyroid-stimulating hormone (TSH) >30 mIU/L either by 4-week thyroxine withdrawal or recombinant TSH injections. The preablation sTg levels were taken approximately 2 months after surgery (at the time of RAI ablation), while the postablation levels were taken approximately 9 months after surgery (6–7 months after RAI ablation or at the time of the whole body scan). Tg autoantibodies were measured at the same time. The decision for RAI ablation was based on the presence of at least one or more risk factors such as tumor size >1 cm, LNM, age older than 45 years, extrathyroidal extension, macroscopic postoperative residual disease in the neck and distant metastasis, and were not dependent on the preablative sTg level. Three giga becquerels or 80 mCi I131 was the standard dose. TSH suppression to <0.1 μg/L was recommended for high- and intermediate-risk patients.

Follow-up Protocol

All postsurgical patients were followed up within 4 weeks in a specialized combined oncology clinic. A follow-up visit was conducted at 3-month intervals in the first 2 years, every 6 months in the subsequent 3 years, and annually thereafter. Clinical examination, neck ultrasonography and nonstimulated Tg level were done during follow-up visits. LRR was confirmed by fine-needle aspiration cytology or histology.

Laboratory Methods

All postoperative sTg levels were measured at the same laboratory by the same immunometric assay. The assay used was Immulite 2000 (Diagnostic Products, Roche, Los Angeles, CA). This was calibrated against the CRM-457 standard. A sTg level of ≥0.5 μg/L was considered a detectable stimulated thyroglobulin (DsTg). Normal reference range was <0.5–55 μg/L, and sensitivity was <0.2 μg/L).

Statistical Analyses

Statistical analyses was performed by χ2 or Fisher’s exact test to compare categorical variables, and Mann–Whitney U or Kruskal–Wallis test was used to compare continuous variables between groups. To evaluate the correlation between two continuous variables, the Spearman rank correlation test was performed. Continuous variables were expressed as medians with ranges. Variables that were significant in the univariate analysis were entered into multivariate analysis. Binary logistic regression analysis with a variable entrance criterion of 0.05 or less was conducted to identify factors associated with postablative DsTg after surgery. Disease-free survivals were estimated by the Kaplan–Meier method and compared with the log-rank test (Fig. 1). All statistical analyses were performed by SPSS software, version 18.0 (SPSS, Chicago, IL).

Fig. 1
figure 1

Cumulative disease-free survival curves of papillary thyroid carcinoma between those with no central lymph node (CLN) metastasis (pN0) (group I), with CLN micrometastases (pN1mic) (group II), and with CLN macrometastases (pN1mac) (group III)

Results

In our cohort, most were women (75.3 %) and ethnic Chinese (89.9 %). The median age at operation was 48.0 (range 8.1–81.3) years, and the median follow-up period was 29.1 (range 9.3–90.1) months. The median primary tumor size was 1.5(range 0.3–6.0) cm, with 8 (9.0 %) being microcarcinomas (<1 cm). The median number of metastatic CLNs was 1 (range 1–18), and the median number of CLNs collected was 6 (range 3–21). The median size of CLNM was 2.6 (range 0.1–6.9) mm.

Table 1 shows a comparison of patient clinicopathologic features, tumor, node, metastasis staging system (TNM) tumor stages, CLN ratio, and metastases, age, completeness of surgery, invasion, and size (MACIS) score between the three groups. The median age at operation was significantly different between the three groups (p = 0.003), with group III having the youngest median age of 43.0 years. There was an inverse correlation between age and size of CLNM (ρ = −0.315, p = 0.028). Gender, clinical presentation and tumor characteristics such as tumor size, multifocality, capsular invasion, extrathyroidal extension and coexisting thyroiditis were not significantly different between the three groups. Stage of PTC by TNM, MACIS score, postoperative complications and anti-Tg autoantibody titer were also similar between the three groups. Group III had significantly higher number of CLNs retrieved (p = 0.007), metastatic CLNs excised (p < 0.001) and metastatic CLN ratio (p = 0.010) than the other two groups. There was a direct correlation between size of CLNM and number of CLNs retrieved (ρ = 0.343, p = 0.016), size of CLNM and number of metastatic CLNs excised (ρ = 0.541, p < 0.001) and size of CLNM and metastatic CLN ratio (ρ = 0.387, p = 0.006). Group III also tended to have higher proportion of patients receiving RAI ablation after surgery than groups I and II (96.6 vs. 80.0 and 90.0 %, respectively), but this was not statistically significant (p = 0.198). In the preablation period the median sTg was similar, whereas in the postablation period the median sTg level was significantly different between the three groups (p = 0.012). The proportion of DsTg in groups I and II was reduced from 60 to 17.5 and 55 to 15 %, respectively, after RAI ablation, whereas the proportion in group III was reduced from 82.8 to 62.1 %. At the time of analysis, all patients were alive. In groups I and II, no patient developed LRR, whereas in group III, 2 developed ipsilateral lateral recurrences and 1 developed bilateral lateral recurrence. These three recurrences developed at 19.4, 21.9 and 42.9 months after surgery. All three recurrences had postablation DsTg 9 months after surgery, and their median sTg level was 87.0 (range 9.7–103.0) μg/L. The cumulative disease-free survival in groups I, II and III were not significantly different (p = 0.151), and the 5-and 10-year disease-free survivals in group III were 85.1 and 85.1 %, respectively.

Table 1 A comparison of patient characteristics between group I, II and III

Table 2 shows the univariate analysis of clinicopathologic factors for postablation DsTg (i.e., > 0.5 μg/mL). Age, gender, clinical presentation and most tumor characteristics such as multifocality, capsular invasion, extrathyroidal extension and thyroiditis were similar between those with postablation DsTg and with undetectable sTg. However, relative to those with undetectable sTg, those with postablation DsTg had larger median primary tumor size (2.0 vs. 1.5 cm, p = 0.018), greater number of metastatic CLNs (2 vs. 1, p = 0.013), higher CLN ratio (66.7 vs. 20.0 %, p = 0.003) and larger size of CLNM (4.8 vs. 1.6 mm, p < 0.001), but the proportion of patients with metastatic CLN (pN1a) was similar between the two groups (p = 0.114). When size of CLNM was categorized into types of metastasis (pN0, pN1mic and pN1mac), those with postablation DsTg had a significantly higher proportion of pN1mac (67.7 vs. 10.9 %) but a lower proportion of pN0 (20.6 vs. 60.0 %) and pN1mic (11.8 vs. 29.1 %) than those with undetectable sTg (p = 0.002). There was a significant direct correlation between the size of CLNM and postablation sTg level (ρ = 0.428, p = 0.003), but such correlation was not observed between size of CLNM and preablation sTg (ρ = 0.264, p = 0.083).

Table 2 Univariate analysis of clinicopathologic risk factors for detectable postablation stimulated thyroglobulin level (>0.5 μg/mL)

Table 3 shows the multivariate analysis of clinicopathologic risk factors for postablative DsTg 9 months after surgery. Variables that were significant in the univariate analysis were entered into the multivariate analysis. The number of metastatic CLNs was not entered because it significantly correlated with the CLN ratio (ρ = 0.896, p < 0.001), and the CLN ratio was shown to be a more significant factor.21,22 Because the size and type of CLNM were similar covariates, they were entered separately into the multivariate analysis with CLN ratio and tumor size. In the multivariate analysis, after adjusting for CLN ratio and tumor size, size of CLNM (expressed as a continuous variable) (odds ratio 1.56, 95 % confidence interval 1.09–2.24, p = 0.015) was the only independent factor for postablation DsTg. Similarly, when the type of CLNM was entered in the multivariate analysis with CLN ratio and tumor size, relative to pN0 and pN1mic, pN1mac was the only independent risk factor for postablation DsTg (odds ratio 1.76, 95 % confidence interval 1.22–27.70, p = 0.027).

Table 3 Multivariate analysis of clinicopathologic risk factors for postablative detectable stimulated thyroglobulin 9 months after surgery

Discussion

Previous studies have found that size of LNM in the lateral compartment is a significant risk factor for LRR, distant metastasis and poor overall survival.811,14 Sugitani et al. reported that for patients older than 50 years, LNM ≥3 cm was an independent factor for cancer-specific survival from PTC.10 Similarly, it was observed that those with clinically apparent or palpable LNM were at significantly higher risk of developing LRR and distant metastases than those with clinically inapparent or nonpalpable LNM.813 In fact, this was the rationale for performing therapeutic selective neck dissection in the lateral compartment and not prophylactic selective neck dissection in patients with PTC.13 However, in comparison to lateral selective neck dissection, pCND is associated with lower surgical morbidity and can be performed within the same incision as the thyroidectomy.23 In addition, it provides more information regarding the nodal status as required by the TNM and may lower LRRs.23 Several studies confirmed that those who underwent pCND had a significantly lower postsurgical sTg than those without pCND. This perhaps suggests that pCND was able to provide a more oncologically complete resection than thyroidectomy alone by excising micrometastases harbored in the central compartment.46 This hypothesis was supported by the fact that over 50 % of CLNs resected harbored metastases.47 However, questions remain regarding the size of these metastases and their importance to patient outcomes. Given that the size of LNM in the lateral compartment is a prognostic factor, we hypothesized that the size of CLNM in pCND would also be a prognostic factor, although this has not yet been demonstrated.

To ensure that our study only evaluated CLNs taken out of pCND, 5 patients with CLNM ≥1 cm were excluded. In our analysis, we found that even when the size of CLNM was limited to <1 cm, it was an independent factor for postablation DsTg 9 months after surgery. There was a significant direct correlation between the size of CLNM and postablation sTg level (ρ = 0.428, p = 0.003), although not between size of CLNM and preablation sTg (ρ = 0.264, p = 0.083). When the size of CLNM was correlated with other clinicopathologic characteristics, we observed that the size of CLNM was inversely correlated with age (i.e., younger patients had larger-sized CLNM), but this was directly correlated with greater number of CLNs, metastatic CLNs and higher metastatic CLN ratio. This latter association implied that larger-sized CLNM meant more extensive nodal involvement, at least within the central compartment. This would account for the higher rate of DsTg 9 months after surgery in the pN1mac group. In the multivariate analysis, after adjusting for tumor size and CLN ratio, patients with pN1mac were almost 6 times more likely to have postablation DsTg than those with pN0. Despite the small number of patients within each group, it is worth noting that the odds ratio for pN1mic did not reach statistical significance; this may imply that pN1mic did not have the same significance level as pN1mac on postablation DsTg.

However, because the size of CLNM is a histopathologic finding, its clinical application could only be limited to the postsurgical management. Nevertheless, one possible application would be to use it as a factor for deciding on the dose of RAI ablation. Because most patients with either pN1mic or pN1mac would receive at least one ablative dose, it would be rational to have those with pN1mac receiving a higher dose of ablation (instead of the standard dose of 3 GBq) than those with pN1mic or pN0 when RAI ablation is indicated. In our study, it was interesting to observe that up to 42.5 % of patients with either pN0 or pN1mic became athyroglobulinemic after 3 GBq of RAI ablation, whereas only 20.7 % in the pN1mac group became athyroglobulinemic. This means that the latter group might perhaps benefit from a higher dose of RAI ablation if athyroglobulinemia is the aim. If future studies could confirm the size of CLNM as an independent factor for outcomes, reporting the size of CLNM in the histopathology reports could become important.

Despite these findings, we acknowledge shortcomings with our study, including the relatively small number of patients within each group and potential biases in the selection for pCND. As a measure of outcome, some would argue that a sTg of 0.5 μg/L might be set too low and a level taken at 9 months after surgery might be too early. However, as shown previously, a sTg of <0.5 μg/L at 1 year had a >98 % likelihood of identifying patients completely free of disease at follow-up, and a sTg level taken at 1 year after surgery was as good as a level taken 2–3 years after surgery.24,25 Therefore, we believe sTg <0.5 μg/L at 9 months after surgery is a reasonable surrogate for future recurrences. Although not statistically significant, it is worth noting that of the 3 LRR, all belonged to the pN1mac group and had postablation DsTg with a median sTg level of 87.0 (range 9.7–103.0) μg/L. In contrast, none of the patients in pN0 or pN1mic developed recurrences.

In conclusion, pN1mic accounted for 20 (22.5 %) of 89 patients undergoing pCND. Larger-sized CLNM was significantly associated with younger age, greater number of CLN retrieved, greater number of metastatic CLN excised, higher metastatic CLN ratio and higher postablation sTg level. Tumor size, metastatic CLN ratio and size of CLNM were significantly associated with postablation DsTg, but only size of CLNM turned out to be an independent factor for postablation DsTg. Patients with pN1mac were almost 6 times more likely to have postablation DsTg than those with pN0 or pN1mic.