Introduction

Thyroid cancer is the most common endocrine malignancy with an incidence of 2.9% in the United States for the year 2018 alone [1]. There are multiple types of thyroid cancer with papillary thyroid carcinoma (PTC) being the most common, accounting for ~80% of all cases [2, 3]. In general, well-differentiated PTC is regarded as a fairly indolent tumor with long-term survival rates >95% [4], however there are certain variants of PTC that are more aggressive in nature with less-favorable disease-free survival and overall survival. These variants differ in histology, cytology, molecular markers, treatment strategies, and outcomes [5]. The hobnail variant of papillary thyroid carcinoma (HVPTC) has been recently identified as an aggressive variant of PTC, whose histological features were first described by Kakudo et al. in 2004 and further elaborated by Motosugi et al. in 2009 [6, 7]. A case series of eight patients from the Mayo Clinic further described HVPTC and found this variant to be associated with more aggressive behavior including increased radioactive iodine (RAI) refractoriness, rate of distant metastasis, and mortality when compared with classical PTC [8]. Additional studies have furthered our understanding of the clinicopathologic characteristics and prognostic implications of HVPTC.

Histologically, Kakudo et al. [6] initially noted an elevated nuclear position (described as analogous to a hobnail appearance) and proposed this loss of cellular polarity as a characteristic of poor cellular differentiation. Asioli et al. [8] refined the histopathological diagnosis of HVPTC and described three diagnostic criteria: (1) nonsolid type of PTC, (2) ≤10% of the tumor with tall/columnar or diffuse sclerosing features, and (3) loss of polarity/cohesiveness with hobnail features in ≥30% of the tumor cells. In addition, subsequent authors also identified those with hobnail features present in 10–30% of tumors cells demonstrated worse outcomes, indicating PTC with <30% hobnail features are still aggressive [9]. Along with the hobnail appearance of cells, HVPTC demonstrates abundant cytoplasm with a higher nuclear-cytoplasmatic (N/C) ratio, a micropapillary pattern, and increased nuclear atypia compared with classical PTC (Fig. 1) [8, 10]. HVPTC is more common in women, and when compared with classical PTC tends to present at a higher stage with increased rates of regional and distant metastases as well as a higher mortality rate [7, 9, 11,12,13,14,15,16,17,18].

Fig. 1
figure 1

The hobnail variant of papillary thyroid carcinoma exhibits (a) complex papillary and micropapillary structures (b) lined by neoplastic follicular cells with eosinophilic cytoplasm, loss of cellular polarity, apocrine snouting, and apically located nuclei, giving the cells the hallmark “hobnail” appearance. (c) Classic papillary thyroid carcinoma nuclear features are present, including intranuclear inclusions and nuclear grooves

A systematic review of the existing literature describing HVPTC has yet to be conducted. Therefore, we performed a systematic review to unify the current literature in order to define the baseline clinicopathological characteristics of HVPTC and ascertain its prognostic implications.

Materials and methods

Search strategy

This systematic review queried four databases (PubMed, Scopus, Cochrane Library, and OVID) for any relevant studies for inclusion. This search included studies published from inception of the database to March 20th, 2020. This systematic review was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [19]. Search criteria included using subject headings or keywords: hobnail, thyroid, cancer, papillary, carcinoma, and micropapillary. References were then uploaded to EndNote (Clarivate Analytics, Philadelphia, PA, USA) and screened for relevance.

Study selection and eligibility criteria

Studies examining the prognostic significance of the HVPTC were included. Specific inclusion criteria included (a) examination of HVPTC with defined diagnostic criteria; (b) patient demographic data including age and sex; and (c) pathological tumor data. We elected to examine all HVPTC cases with any proportion of tumor with hobnail features, as Asioli et al. [9] demonstrated even those HVPTCs with <30% hobnail features demonstrate a worse pattern of disease. Exclusion criteria include: (a) non-human studies; (b) reviews; (c) non-English language; (d) incomplete or non-extractable data; and (e) data represented in another study. Two reviewers (LBD and FY) independently assessed titles and abstracts of all studies to screen for articles satisfying the inclusion criteria. Articles not satisfying inclusion criteria were then excluded. Full-text assessments were then performed to ensure satisfaction with inclusion criteria, and any discrepancies were then reviewed by a third author (SAN) for determination.

Critical appraisal and quality assessment

Articles were critically appraised to assess the level of evidence using the Oxford Center for Evidence-Based Medicine criteria [20]. The risk of publication bias was assessed according to the Cochrane Handbook for Systematic Reviews of Interventions 5.1.0 [21]. Two authors (LBD and FY) performed a pilot assessment on three studies to check for consistency of assessment. Both then performed independent risk assessment on the remaining studies. All disagreements were resolved by the way of discussion with a third author (SAN). Risk of bias items included the following: bias due to confounding, selection of participants into the study, classification of interventions, deviations from intended interventions, missing data, measurement of outcomes, and selection of the reported result. The risk of bias for each aspect is graded as “low,” “unclear,” or “high.” (Supplemental Fig. 1).

Data extraction

Data extracted from studies included: author, publication year, study design, study characteristics, and patient demographics. Collected patient data included: age, sex, and comorbidities. Collected pathologic data included: stage, TNM classification, tumor size, presence of lymphovascular invasion (LVI), extrathyroidal extension (ETE) and/or necrosis, tumor percentage of hobnail features, distant metastasis, lymph node metastasis, identification of immunohistochemical and/or molecular markers, and follow-up data such as disease status, locoregional recurrence (LRR), and mortality. If only TNM classification was reported, but not overall tumor stage was given, these cases were then staged according to the 8th Edition of the American Joint Committee on Cancer (AJCC) Staging Manual [22].

Statistical analysis

All statistical analyses were performed using MedCalc 18.10.2 (MedCalc Software, bvba, Belgium). All analyses were weighted according to the number of patients affected. Meta-analysis of proportions was performed to pool the prevalence rate of HVPTC in all PTC cases. Heterogeneity was assessed by the I2 test with the I2 value directly correlating with degree of heterogeneity (lower I2 value meaning lower degree of heterogeneity). A fixed statistical effect model was used if the I2 value was < 50%; for all other values, a random statistical effect model was used. The random effects model provides a more conservative estimate (i.e., with a wider confidence interval [CI]), however the two models are often in agreement when there is no heterogeneity.

Finally, the Sterne and Egger tests were performed for further assessment of risk of publication bias [23, 24]. Potential publication bias was evaluated by visual inspection of the funnel plot and Egger’s regression test. In a funnel plot, treatment effect is plotted on the horizontal axis and the standard error is on the vertical axis [25]. The vertical line represents the summary estimated derived using fixed-effect meta-analysis. Two diagonal lines represent (pseudo) 95% confidence limits (effect ±1.96 SE) around the summary effect for each standard error on the vertical axis. These show the expected distribution of studies in the absence of heterogeneity or of selection bias. In the absence of heterogeneity, 95% of the studies should lie within the funnel defined by these diagonal lines. Publication bias results in asymmetry of the funnel plot.

We further provided a pooled summary of the studies meeting inclusion criteria. Of note, multiple studies provided patient-level data for which we were able to delineate patients <30% and ≥30% hobnail features and patients <55 years old versus ≥55 years old. We chose 30% hobnail features as a cutoff as Asioli et al. [8] first used this threshold as a diagnostic criterion for HVPTC, however later defined tumors with <30% features to also have a poor prognosis. We used 55 years of age at diagnosis as a cutoff for staging, as per AJCC8, patients ≥55 years of age are automatically classified as stage III or IV [22]. A Chi square test was used to compare features of two groups (HVPTC with <30% versus ≥30% hobnail features; HVPTC <55 years old versus ≥55 years old). If a significant difference was found for a particular variable, then an odds ratio (OR) with a 95% CI was calculated by using a two-by-two frequency table where a = no. of exposed cases, b = no. of exposed non-cases, c = no. of unexposed cases, d = no. of unexposed non-cases, and OR = ad/bc. A p value of < 0.05 was considered to indicate a statistically significant difference for all statistical tests.

Results

Search results

A total of 125 articles were identified through our literature search. Of these, 65 articles remained after removing duplicates. Initial screening by title and abstract left 27 articles for full-text review. A total of 16 studies met inclusion criteria and underwent quantitative analysis (Fig. 2) [7, 9, 11, 12, 17, 18, 26,27,28,29,30,31,32,33,34,35]. Based on the Oxford Level of Evidence, 13 of the 16 studies are deemed as level 4 evidence with the remaining 3 deemed as level 3 evidence. A funnel plot was generated to assess risk of publication bias and did not reveal any points outside the plot, indicating low publication bias (Supplemental Fig. 2).

Fig. 2
figure 2

Search strategy according to PRISMA guidelines

Overview of included studies

This systematic review included 16 studies with a total of 124 patients with HVPTC (Table 1). Of the included patients, 81 (65%) were female and 43 (35%) male. The mean age at diagnosis was 52.3 years, with a range of 21–92 years. Mean follow-up time was 49.9 months, with a range of 4–274 months. Of the eight studies that included prevalence data, the pooled overall prevalence of HVPTC in PTC was 1.08% (95% 0.69–1.55) (Fig. 3) [9, 11, 12, 17, 18, 27, 31, 34]. Eighty-five patients had ≥30% hobnail component, 29 patients had <30% hobnail component, and 25 cases did not have the percentage of hobnail features available for review. Of 114 cases that had tumor size data available, mean HVPTC tumor size was 3.1 cm (range 0.5–9 cm). Twelve studies [12, 17, 18, 26,27,28,29,30,31,32,33,34] included focality of HVPTC, with 30 patients (36%) having multifocal disease and 53 patients (64%) having unifocal disease.

Table 1 Demographic overview of included studies
Fig. 3
figure 3

Forest plot demonstrating pooled prevalence of HVPTC in PTC cases

Treatment

All patients underwent thyroidectomy, with 95 (77%) receiving a total thyroidectomy, 2 (2%) receiving a hemithyroidectomy, and 26 (21%) for which extent of thyroidectomy is unknown. In addition, 77 (63%) patients received lymph node dissection; however, laterality and extent were not well-elucidated. Fifty-six (46%) patients were reported to have received adjuvant RAI and 8 (7%) received radiation therapy.

Clinicopathologic features and outcomes

The prevalence of three high-risk pathologic features (LVI, ETE, and necrosis) were examined (Table 2). Of the 13 studies reporting LVI, 69/111 (62%) tumors presented with LVI [7, 9, 11, 17, 18, 26, 27, 29,30,31,32,33,34]. Of the 11 studies reporting ETE, 32/64 (50%) tumors presented with ETE [7, 11, 12, 26, 27, 29, 31,32,33,34,35]. Of 104 cases with T-category available, 9 were classified as T4a or T4b, indicating extensive ETE to larynx, trachea, recurrent laryngeal nerve (T4a), or invading prevertebral fascia or encasing carotid arteries (T4b). Of the 8 studies reporting necrosis, 9/58 (16%) tumors presented with necrosis [7, 9, 12, 17, 27, 30,31,32]. B-type raf kinase (BRAF) p.V600E mutations were present in 62/85 (73%) cases with evaluable mutational data. Forty-six (47%) cases were stage I, 13 (14%) were stage II, 18 (19%) were stage III, and 20 (21%) were stage IV at presentation.

Table 2 Clinicopathologic overview of studies

Multiple outcome measures, such as rates of LRR, persistent disease, lymph node metastasis, distant metastasis, and overall survival were also evaluated. Of the 13 studies reporting LRR or persistent disease, 43/121 (36%) patients developed LRR or had persistent disease. The overall rate of lymph node metastasis was 80/122 (66%). The overall rate of distant metastasis was 27/116 (23%). Sites of distant metastasis included lung (18/27 [67%]), bone (10/27 [37%]), liver (5/27 [19%]), brain (4/27 [15%]), soft tissue/muscle (5/27 [19%]), pancreas (1/27 [4%]), spinal cord (1/27 [4%]), epiglottis/larynx (2/27 [8%]), and nasopharynx (2/27 [8%]). Of the 11 studies reporting survival status of their patients [11, 12, 17, 18, 26, 27, 29,30,31, 34, 35], 53/96 (55%) had no evidence of disease, 24/96 (25%) were alive with disease, 10/95 (10%) died of disease, and 9/96 (9%) were lost to follow-up.

Comparison of <30% and ≥30% hobnail

Tumors with ≥30% hobnail morphology had an odds ratio of 2.6 (95% CI 1.1–6.2) of developing lymph node metastasis over tumors with <30% hobnail morphology. No significant differences were seen in rates of distant metastasis or in high-risk pathological features such as LVI, ETE, and necrosis. BRAF mutation rates were similar in both groups (Table 3).

Table 3 Comparison of HVPTC with <30% versus ≥30% hobnail featuresa

Comparison of HVPTC <55 and ≥55 years old

Patients <55 and ≥55 years of age with HVPTC were compared. Patients ≥55 years old had an increased likelihood of developing distant metastasis (OR 4.5, 95% CI 1.6–12.6) and having LVI (OR 4.7, 95% CI 1.8–12.7) as a high-risk pathological feature compared with patients <55 years old. No significant differences were seen in rates of lymph node metastasis or BRAF mutation rates. Other high-risk pathological features such as ETE and necrosis were not significantly different (Table 4).

Table 4 Comparison of HVPTC by agea

Discussion

In part owing to the recent identification of HVPTC, tumor characteristics and clinical implications of this high-risk PTC variant remain poorly understood. The aim of our study was to systematically review the current literature to analyze clinicopathological features of HVPTC in order to further our understanding of this disease. Our study demonstrated an overall prevalence of 1.08% of HVPTC in PTC cases. HVPTC remains a rare but aggressive variant of PTC as evidenced by the greater rate of high-risk pathological features (LVI, ETE, and necrosis) and worse outcomes than that reported of classical PTC.

We have compared our findings for HVPTC to that of the reported literature values regarding classical PTC (Table 5). PTC is diagnosed at a mean age of 45 and shows a female predominance of 60–80% [36]. We found that HVPTC is diagnosed in older individuals with a mean age at diagnosis of 52.3 years and with a slight female predominance (65% female) similar to that of classical PTC. The mean size of all thyroid cancers at diagnosis has been reported to be 1.9 cm with 82% of thyroid cancers 3.0 cm or smaller in size [37]. Comparatively, our study demonstrated HVPTC tumors to have a larger mean size of 3.1 cm. Increased tumor size in classical PTC has been found to predict poor prognosis [38, 39]. Thus, the larger mean size of HVPTC at presentation highlights advanced T-category in comparison with classical PTC.

Table 5 Comparison of PTC and HVPTC as reported in the literature

Well established high-risk pathologic characteristics of PTC include, but are not limited to, LVI, ETE, and necrosis. In a single institution retrospective review, Hay et al. [40] found a total of 328/2444 (13%) PTC patients to have the primary tumor invade into surrounding extrathyroidal soft tissue. Comparatively, our study demonstrated HVPTC tumors to have a much greater proportion of ETE at 50%, indicative of the more aggressive nature of HVPTC. Furthermore, a large retrospective cohort series by Xing et al. [41] examining classical PTC patients found a significant association between BRAF mutations and aggressive features and mortality, with 845/1849 (45.7%) patients possessing a positive BRAF mutation. Our study demonstrated a substantially greater percentage of tumors possessing BRAF mutation (73%). This demonstrates the greater risk of cancer-related mortality in HVPTC patients as BRAF mutational genotypes are associated with higher rates of disease recurrence, decreased disease-free survival, and decreased overall survival [42, 43]. Targeted therapies for BRAF-mutated PTC, such as BRAF-specific inhibitors like vemurafenib and dabrafenib, have had promising results [44, 45]. In BRAF-mutated anaplastic thyroid carcinoma, these inhibitors, in combination with mitogen-activated protein kinase kinase inhibitors, have led to an overall response rate of 69% [46]. Consequently, the dual blockage of the MAPK pathway may prove to be significantly efficacious in HVPTC and is an area of ongoing investigation [46]. Additionally, the role of BRAF inhibition to sensitize RAI-refractory thyroid cancers to RAI may also play a role in the management of advanced HVPTC, with promising evidence revealing that vemurafenib restored RAI uptake in RAI-refractory BRAF V600E mutated thyroid cancers [47]. Other molecular markers such as RET/PTC, TERT, p53, and IPK3CA were not consistently evaluated in the majority of studies, however can be the focus for future molecular characterization of HVPTC in order to improve prognostic risk stratification and allow for development of potential targeted therapeutic options.

In regards to disease outcomes, HVPTC portends worse outcomes as compared with classical PTC. Differentiated thyroid carcinomas, including classical PTC, will develop local or regional metastasis in 5–20% of cases, and distant metastases in 10–15% of patients, notably to the lungs and bones [48, 49]. We found that HVPTC tends to have an increased local and regional recurrence rate, with 43/121 (36%) patients found to have recurrence in the 13 studies analyzed. In addition, the overall rate of distant metastasis was 27/116 (23%), with lung and bone being most commonly affected.

The impact of percentage of hobnail features has previously been evaluated. Asioli et al. [9] reviewed 24 HVPTC cases and stratified them into two groups: (1) PTC cases with ≥30% hobnail/micropapillary features and (2) PTC cases with <10% hobnail/micropapillary features. Their analysis found increased hobnail/micropapillary features were associated with an increased risk of death, distant metastases, and decreased disease-free survival. No significant differences were found in regards to gender, age, tumor size, LVI, ETE, RAI therapy and stage between the groups. Similarly, Watutantrige-Fernando et al. [18] analyzed differences between two groups: PTC cases with ≥30% hobnail component (16/25) and PTC cases with <30% hobnail component (9/25). Interestingly, their study found no statistically significant differences between the groups with regards to lymph node involvement, distant metastasis, and outcomes. They also reported no differences in age, tumor size, multifocality, staging, and mutational status. Our pooled analysis found an odds ratio of 2.6 for lymph node metastasis when comparing cases with ≥30% hobnail features versus cases with <30% hobnail features, however no significant differences were found in rates of distant metastasis or the previously discussed high-risk pathological characteristics. This may indicate that proportion of hobnail component is prognostically significant and may influence risk of locoregional disease; however, PTC cases <30% hobnail features should still be consistently considered an aggressive variant of PTC and further analysis is warranted.

Patients ≥55 years old with HVPTC had significantly higher rates of LVI and distant metastasis than patients <55 years old (35% vs 11%, p = 0.003), however there were no significant differences in lymph node metastasis rates, LVI, BRAF mutation status or necrosis. The rates of distant metastasis in patients <55 years old (11%) were comparable to that reported in the literature of differentiated thyroid carcinoma (10–15%) [48, 49]. Both cohorts also had high rates of lymph node metastasis at 55% in <55 years old group and 72% in the ≥55 years old group, which is greater than reported median values of lymph node metastasis (20–50%) in classical PTC [50]. In conclusion, although age may predict increased rates of distant metastasis, patients younger than 55 years old still had a comparable risk of lymph node metastasis, BRAF mutation and necrosis to patients ≥55 years old. Consequently, patients of any age with HVPTC require close attention.

Overall, these findings highlight the aggressive nature of HVPTC in comparison with classical PTC. These patients tend to have larger tumors, high-risk pathologic characteristics, and worse outcomes such as higher rates of LRR, distant metastasis, and death. Treatment specific to HVPTC warrants further investigation, as the role of RAI in this patient population is still unknown. Morandi et al. [16] reported HVPTC to have a poor response to radioiodine therapy, with only 6/18 patients demonstrating good iodine uptake and shrinkage of metastatic disease. As more knowledge about this variant of PTC is discovered, targeted therapies may develop to better treat the disease. Further prospective studies are needed to better assess pertinent diagnostic features and genetic drivers behind HVPTC in order to best risk stratify and maximize therapeutic intervention.

Limitations

Limitations of this study include paucity of data directly comparing HVPTC with classical PTC, heterogeneity in definitions of hobnail variant, and different follow-up times that can skew reported outcomes. Additionally, our study aimed to exam molecular markers of HVPTC and could only successfully evaluate BRAF mutational status in 8 studies. Other molecular markers that were considered included RET/PTC, TERT, p53, and IPK3CA, but these were not consistently reported by a majority of studies and may represent a possible focus of future studies. Also, our systematic review included studies of low-level evidence (i.e. case reports) given the rarity of HVPTC. Additionally, certain studies with novel data were excluded due to an overlap in patients previously reported in other studies which reduces overall data volume.

Conclusions

In this systematic review, we found that the HVPTC is associated with more aggressive clinicopathological features and a poorer prognosis as compared with classical PTC. This is seen by increased mean tumor size at diagnosis, higher rates of LVI, ETE, tumor necrosis, and BRAF mutation status. In addition, poorer outcomes as demonstrated by higher rates of LRR, lymph node metastasis, and distant metastasis compared to reports in the literature for classical PTC may support the aggressive nature of this hobnail variant. As HVPTC cases with <30 and ≥30% hobnail component did not significantly differ in rates of distant metastasis or presence of high-risk features, further refinement of diagnostic criteria regarding percentage of hobnail tumor involvement is warranted. Future investigation is required to better characterize these clinicopathological features and the resulting impact on response to treatment.