Introduction

Differentiated thyroid cancer (DTC) is rare in children [1, 2]. The initial treatment consists in surgery followed by radioactive iodine (RAI) therapy in selected patients [3]. DTC usually presents as a more extended disease in children and adolescents than in adult patients, with extensive regional nodal involvement and more frequent lung metastases, but the survival rate at 30 years is close to 90% [4,5,6]. The rate of recurrence is high in pediatric patients, in particular in those with high post-operative serum thyroglobulin (Tg) values [7, 8].

The American Thyroid Association (ATA) risk stratification classification may help to recognize at the time of initial treatment patients with a high risk of persistent or recurrent disease [9]. On the other hand, the extent of nodal disease is defined according to central or lateral location of lymph node involvement [9]. To refine this classification some authors considered the criteria used in the adult ATA guidelines, such as the number and size of the metastatic lymph nodes [7, 10, 11]. This modified ATA risk stratification was able to predict the outcome in pediatric DTC patients [10, 11]. In addition, a dynamic risk stratification system based on biochemical and imaging data that evaluates the response to initial therapy at 12 months predicted the outcome in adult patients [12, 13]. However, this prognostic approach has not been fully addressed in children.

The aim of this study was to evaluate the prognostic role of the response to therapy assessed at 12 months after initial treatment in pediatric patients with DTC.

Materials and methods

Study population

We retrospectively evaluated 122 pediatric DTC patients (age < 18 years) referred to our Department between 1992 and 2010. All patients underwent a total thyroidectomy, with or without central and/or lateral neck dissection, followed by RAI therapy. Before RAI administration, L-thyroxine was discontinued until the serum thyroid stimulating hormone (TSH) level has increased above 30 mIU/l. At the time of RAI, serum thyroglobulin (Tg) was determined and a 131I activity according to the body weight and the amount of disease (37 to 111 MBq/Kg) was administered. From 1992 to 1996, serum Tg levels were determined by an immunoradiometric assay (Dynotest Tg, Henning, Berlin, Germany) with a sensitivity of 1 ng/mL. From 1997, Tg was determined by a chemiluminescence system (Immulite, Diagnostic Products Corp, Los Angeles, CA, USA) with a detection limit of 0.2 ng/mL. According to post-operative Tg values obtained at the time of RAI therapy, patients were categorized into two groups: > 10 ng/ml and ≤ 10 ng/ml [7, 14]. Five to seven days after 131I administration, a post-therapy whole body scan (WBS) was performed as previously described using a dual-head gamma camera (E.CAM, Siemens Medical Systems, Hoffman Estates, IL, USA) equipped with thick crystals and high energy collimators [15].

ATA initial risk stratification

The risk of persistence or recurrence was initially estimated according to histopathological findings and the results of 131I post-therapeutic WBS, based on ATA pediatric guidelines [9]. Patients were classified as low risk (disease grossly confined to the thyroid, with N0 or Nx disease or with incidental N1a metastasis), intermediate risk (extensive N1a or minimal N1b disease) or high risk (extensive N1b disease or T4 tumor with gross tumor extension beyond the thyroid capsule, with or without distant metastasis). To consider nodal involvement as extensive, we used the number and size of lymph nodes, for N1b > 5 or any lymph node metastases ≥ 3 cm or the presence of any clinically detected lymph node metastases, according to the ATA adult guidelines [10, 11].

Therapy response evaluation

The response to therapy at 12 months after RAI administration was assessed with serum Tg measurement obtained on LT4 treatment or following thyroid hormone withdrawal, neck ultrasound and imaging findings. According to the 2015 ATA guidelines for adults [10], definitions of response to therapy were: (1) excellent response (ER), negative imaging and either Tg < 0.2 ng/mL on LT4 treatment or TSH-stimulated Tg < 1 ng/mL; (2) indeterminate response, non-specific findings on imaging studies and Tg levels on LT4 treatment that are detectable but < 1 ng/mL or stimulated Tg levels between 1 and 10 ng/mL or stable or declining titer of anti-Tg antibodies over time; (3) biochemical incomplete response, negative imaging and Tg ≥ 1 ng/mL on LT4 treatment or stimulated Tg ≥ 10 ng/mL or rising titer of anti-Tg antibody over time (4) structural incomplete response, structural evidence of disease with any level of serum Tg or of anti-Tg antibodies. Patients without neck ultrasound at the 12-month evaluation were excluded from the study.

Follow-up

After the evaluation at 12-months, all patients were followed every 6–12 months with serum Tg determinations (on L-thyroxine and in some patients off L-thyroxine therapy) and with imaging procedures. Disease status was recorded at each evaluation. Disease-free survival was measured from the date of surgery to the first observation of structural persistent disease, structural recurrent disease or additional treatment (e.g., surgery, RAI). Structural disease was defined as “persistent” in patients that one year after treatment had detectable Tg, detectable Tg antibodies or abnormal neck ultrasound, and as “recurrent” in patients who achieved remission but then developed detectable Tg, detectable Tg antibodies, or abnormal neck ultrasound. Suspicious nodal abnormalities at neck ultrasonography were confirmed by fine needle aspiration cytology and Tg determination in the aspirate fluid, histology or presence of RAI uptake; uptake in the thyroid bed at post-therapy WBS was considered a structural disease only when it corresponded to abnormal findings at neck ultrasonography [10] Patients last known to be alive and without structural disease were censored at the date of last contact.

Statistical analysis

Continuous data are expressed as mean ± standard deviation and categorical data as percentage. Student’s two-sample t test and χ2 test were used to compare the differences in continuous and categorical variables, respectively. Univariate and multivariate logistic regression analyses were performed to identify the variables associated with 12 months response to initial treatment. Hazard ratios with 95% confidence intervals (CI) were also calculated by univariate and multivariate Cox regression analyses. Variables showing a p-value < 0.05 at univariate analysis were considered for multivariate analysis. Disease free survival analysis was performed using the Kaplan-Meier method and log-rank test. Statistical analysis was performed with Stata 12 software (StataCorp, College Station, Texas USA).

Results

Among 122 pediatric patients with DTC, 28 without serum Tg determinations and/or neck ultrasound at the 12-months evaluation were excluded, leaving 94 subjects for the analysis. At initial evaluation, 62 (66%) patients were classified as ATA low risk, 17 (18%) as ATA intermediate risk and 15 (16%) as ATA high risk. Of these latter patients at high risk, 3 had lung metastases at post-therapy WBS.

At the 12 months evaluation, 62 (66%) patients had ER to initial therapy and 32 (34%) no-ER (Table 1). Among the no-ER patients, 11 were classified as indeterminate response, 17 as biochemical incomplete response and 4 as structural incomplete response for lung metastases (n = 3) or for lymph node metastases (N1b) confirmed by fine needle aspiration biopsy (n = 1). The 3 patients with lung metastases underwent a second RAI treatment and the patient with lymph node metastases was treated with surgery. Among 32 no-ER patients, 15 (47%) were initially classified as ATA low risk and 17 (53%) as ATA intermediate/high risk. Serum Tg at the time of RAI treatment was > 10 ng/mL in 25 (78%) no ER patients. Twelve (37%) patients were younger than 14 years at the time of initial treatment (Table 1). Among 62 patients initially classified as low ATA risk, pre-therapy Tg values and prevalence of serum Tg > 10 ng/mL were significantly higher in patients with no-ER (n = 15) as compared to those with ER (n = 47) (both p < 0.01) while the other baseline findings were comparable among the two groups (Supplementary Table). At multivariate logistic regression analysis, a pre-therapy Tg > 10 ng/mL was the only independent predictor of no-ER (p < 0.001) (Table 2).

Table 1 Baseline characteristics according to the 12-months response to initial treatment
Table 2 Univariate and multivariate logistic regression analyses with 12 months no-ER as dependent variable

Predictors of outcome

During a median follow-up of 86 months (range 9–517 months), 19 patients experienced an event occurred (20% cumulative event rate). Twelve of the patients with event required additional RAI therapy. In particular, 2 patients were treated for recurrence in the thyroid bed, 4 for persistent disease in the thyroid bed, 4 for persistent disease in both thyroid bed and lymph nodes, and 2 for lung metastases. The remaining 7 patients with event underwent both additional surgery and RAI therapy for nodal disease (6 patients for persistent disease and 1 for recurrent disease).

Patients with events were younger at initial treatment (p < 0.05), had a higher prevalence of ATA intermediate/high risk (p < 0.01) and pre-therapy Tg > 10 ng/mL (p < 0.001) (Table 3). The rate of events was significantly higher in the 32 patients with no-ER (n = 16, 50%) at 12 months as compared to the 62 patients with ER (n = 3; 5%) (p < 0.001). Individual characteristics of the 3 patients with ER who had recurrent disease are reported in Table 4. At multivariate Cox analysis (Table 5), no-ER remained the only independent predictor of events (p < 0.001).

Table 3 Baseline characteristics according to the occurrence of events
Table 4 Individual characteristics of the ER patients with events during follow-up
Table 5 Univariate and multivariate predictors of events

At Kaplan Meier analysis, the disease-free survival was lower in patients with no-ER as compared to those with ER (177 ± 32 vs. 477 ± 23 months, p < 0.001). The worst prognosis was observed in patients with no-ER and intermediate/high ATA risk (Fig. 1) and in patients with no-ER and pre-therapy Tg > 10 ng/mL (Fig. 2) (both p < 0.001).

Fig. 1
figure 1

Disease free survival by Kaplan-Meier according to 12-month response to therapy and ATA risk. Patients with no-ER and intermediate/high ATA risk had the worst outcome compared to the other groups (p < 0.001)

Fig. 2
figure 2

Disease free survival by Kaplan-Meier according to 12-month response to therapy and post-operative serum Tg levels. Patients with no-ER and pre-therapy Tg > 10 ng/mL had the worst outcome compared to the other groups (p < 0.001)

Discussion

In this retrospective analysis, we found that the response to initial treatment evaluated at 12 months with serum Tg determination and neck ultrasound, is able to predict the mid-term outcome of pediatric DTC patients treated with surgery and RAI.

At initial diagnosis, pediatric patients with DTC present with more extensive disease as compared to adults [1,2,3,4]. Moreover, in children lung metastases are more frequently observed and persist also after multiple RAI therapies [16, 17]. However, death is uncommon and usually occurs several decades after diagnosis [18]. In children with DTC the survival rate at 30 years close to 90% [1,2,3,4, 19]. However, the rate of recurrence is higher in pediatric than in adult patients, in particular in those with multifocal papillary thyroid cancer, with gross tumor extension beyond the thyroid capsule and in those with extensive lymph node involvement [20]. Pediatric patients with pre-RAI therapy Tg levels > 10 ng/ml have a higher risk of structural persistent disease, in particular those classified as ATA intermediate risk [8, 9, 21]. The initial risk stratification in pediatric patients with DTC is performed according to pediatric ATA guidelines, where the extent of nodal disease is defined according to central or lateral location of lymph node involvement [9]. We used a refined classification considering the criteria used in the adult ATA guidelines, such as the number and size of the metastatic lymph nodes [7, 11], and this modified ATA risk stratification was able to predict outcome in a cohort of 260 children with DTC [11]. In adult patients the dynamic risk stratification has been proposed to evaluate the response to therapy during follow-up. The addition of laboratory and imaging findings obtained during the first 12–24 months after treatment, can improve the initial risk assessment in adult [13] and also in pediatric patients [8, 21].

In this retrospective analysis of 94 pediatric patients, 62 (66%) were initially classified as ATA low risk. At 12 months evaluation, most (76%) of these low risk patients, had an ER, in agreement with the series of Sapuppo et al. [11]. However, the other 15 (24%) low-risk patients had no-ER, suggesting that a further stratification in these patients is needed.

An incomplete primary surgery may explain these findings, also considering that patients with no-ER showed higher pre-therapy Tg values as compared to those who had ER. Incomplete primary surgery could be due to non-accurate preoperative staging, including ineffective evaluation of lymph node status and local extension. The identification of persistent disease after surgery may be challenging, considering that local postoperative inflammation can affect neck ultrasound findings and RAI uptake in the thyroid bed is frequently observed at post-therapy WBS scan. In our population 92 of 94 patients showed neck uptake; moreover, it should be considered that at the time of treatment single photon emission computed tomography/computed tomography/CT) hybrid technology was not available.

From our results it emerged that, despite no-ER was more frequent in younger patients, the presence of pre-therapy Tg values > 10 ng/mL was the only independent predictor of no-ER. Predicting 12 months response to treatment is crucial, considering that the presence of no-ER is associated with unfavorable outcome. In our series, the rate of events was higher in no-ER patients, and the presence of no-ER remained the only independent predictor of outcome at multivariable analysis. Interestingly, lung metastases were observed in only 3 patients at diagnosis. Despite the evidence of structural incomplete response at 12-month evaluation and the need for a second RAI treatment, at the end of follow-up two of these patients had an excellent response to treatment. The complete remission in patients with lung lesions is uncommon, despite it has been suggested that an early diagnosis and the absence of defined nodules may influence the response to therapy [22]. In our series, the 2 patients whit lung metastases who achieved an excellent response had a micronodular pattern, and the absence of nodules may have resulted in a more effective response to RAI. In our population the rate of persistent disease was higher in patients with biochemical incomplete response at 12 months. Moreover, 11 patients showed also indeterminate response and 9 of them ended the follow-up without any evidence of disease.

In adult patients with low-risk thyroid cancer, the probability of persistent disease is low in patients with low or undetectable post-operative TSH stimulated Tg level, and the benefits of RAI administration have been recently questioned [23, 24]. This approach should be further investigated in in low risk children in order to reduce their irradiation whenever it is not beneficial. On the contrary, in patients initially at higher risk, the evaluation at 12-months might better identify patients in whom a more aggressive follow-up is needed.

Conclusions

Our data suggest that in pediatric patients with DTC the response to therapy evaluated 12 months after initial treatment has an independent prognostic impact and is able to predict mid-term outcome. Patients with no-ER at 12 months after RAI therapy should be closely followed-up.