Neuroendocrine neoplasms in the pancreas occur in two entirely distinct groups recognized based on the morphologic grounds, as poorly differentiated carcinomas (PDNECs, which are essentially pancreatic counterparts of high-grade small-cell and large-cell neuroendocrine carcinomas in the lung) and the well-differentiated pancreatic neuroendocrine tumors (PanNETs; i.e., what used to be called islet cell tumors/carcinomas, or pancreatic counterparts of what used to be called “carcinoid”).1,2 While PDNECs are rapidly fatal malignancies, PanNETs are widely regarded as low-grade neoplasms with relatively limited metastatic potential to an extent that watchful waiting is considered for smaller examples.3,4 However, it is being recognized increasingly that a subset of PanNETs develop metastases and behave aggressively, clearly not as dismal as PDNECs but nevertheless often with dissemination and fatality.4 Some of these patients present to oncologists and skip surgeons’ attention and thus create some disparate views on the nature of PanNETs in respective literature. One parameter that could potentially help to identify this aggressive subset is the grade based on Ki67 proliferation index. It is now fairly well appreciated that those rare grade 3 PanNETs, defined as Ki67 >20% (which had been previously classified together with PDNECs until WHO-2017 when they were moved to PanNET category) have substantial potential for aggressive behavior (albeit nowhere near as bad as PDNECs4,5,6), and adjuvant therapy is typically considered in their management.7,8,9 In contrast, however, currently, G1/G2 (G1 Ki67 < 3%; and G2 Ki67 3–20%) PanNETs are widely viewed as benign-behaving tumors, and in fact those smaller than 2 cm are typically managed by watchful waiting without resection, regardless of the grade.10,11 Along these lines, for resected G1/G2 PanNETs, current management guidelines do not have any specific recommendation other than follow-up. However, as mentioned previously, some of these ultimately develop metastases and exhibit aggressive behavior and which cases are prone to this is currently not clear.3,4

In order to detect the G1/G2 PanNETs that are more likely to exhibit aggressive behavior, there have been efforts to identify histopathologic parameters to help guide the management. Recently, some morphologic variants, such as those with more abundant cytoplasm, single, prominent nucleoli (oncocytic and hepatoid; “metabolic cell phenotype”), and diffuse growth were found to be significantly more aggressive than the more organoid (ductulo-insular and paraganglioma-like) or degenerative (symplastic/pleomorphic) examples.12,13,14 Another parameter associated with more aggressive behavior was the more scirrhous growth pattern,15 infiltration pattern at the edges for which a classification system with strong prognostic correlation was proposed and proved to be even more valuable than grade, which is supported by other studies.16,17 However, as these recently proposed classification schemes are being verified and fine-tuned for incorporation into guidelines, the grade and stage remain as almost the sole tool in resected PanNETs to help guide the subsequent management.

It is under question whether Ki67 proliferation index is being utilized as effectively in the prognostication of G1/2 PanNETs. For example, although recent studies emphasized the usage of more reliable counting methodologies and although “eye-balling” approach is discouraged per consensus manuscripts, it is still being used widely in daily practice.18,19,20,21,22,23 Moreover, the current cutoffs, which were initially determined based on limited number of cases, and mostly extrapolated from the literature on gastrointestinal NETs (previously called “carcinoids”) are now widely acknowledged to have limitations in clinical application for PanNETs.

Recently, mostly in the oncology literature, patients with Ki67 ≥ 10% are increasingly being viewed as the group that warrant further attention, close follow-up, and consideration for somatostatin analog (SSA)/peptide receptor radionuclide therapy (PRRT) and/or even conventional chemotherapy.24,25,26,27,28,29,30,31,32,33,34,35 Although this management approach is mostly based on the Ki67 index on metastatic tumors, it also is being considered for resected PanNETs, yet the literature supporting this approach from pathology perspective is limited.36

In this study, we investigated the relevance and clinicopathologic associations of cutoff of 10% in a cohort of 190 PanNETs and subsequently tested its significance in a separate cohort of 145 cases. We found, in all these cohorts, that indeed the subset of G2 PanNETs that have Ki67 10% to ≤ 20%, which we propose to be documented as G2b, has significantly more aggressive characteristics, including incomparably higher rate of liver metastasis, and as such deserve to be notated separately in alliance with the developing management algorithms that are already placed in practice, although not yet in guidelines.

Materials and Methods

The study construct was reviewed and approved by the institutional review boards of the participating institutions.

Case Selection

For the initial cohort of the study, pathology archives were searched, and the resected PanNET cases encountered in Marmara University, Hacettepe University and Koç University, Türkiye, and Wayne State University and Emory University, USA, were retrieved. Based on the results of the initial analysis, a separate cohort from Verona University, Italy, and Memorial Sloan Kettering Cancer Center, NY, also were retrieved and analyzed as the validation cohort.

Demographic information was obtained from original pathology reports and verified from the databases of the institutes when available. Survival data were collected from hospital records and national health records database.

Poorly differentiated neuroendocrine carcinomas and ambiguous cases in which a clear NET/NEC distinction could not be made were not included.4,6,37,38,39 Also, carefully excluded were cases with mixed acinar-neuroendocrine neoplasm and cases of other nonneoplastic and neoplastic (acinar carcinomas, pancreatoblastomas, solid-pseudopapillary tumors) mimickers of PanNETs.12,40,41

Ki67 Subgrouping

In the primary cohort, for each case, all slides were reexamined by the authors. Ki67 analysis was performed as described in detail previously.18 Briefly, more cellular and atypical foci with mitotic activity were selected for Ki67. Ki67 stained slides were scanned at 10x objective, and the hot spot areas were identified. Camera captured images of the hotspot field were printed and cells were counted manually: tumor cells with nuclear Ki67 staining were circled, while unstained tumor cells were crossed off. Nuclear labeling in the nontarget cells (endothelial cells, lymphocytes, neutrophils, and macrophages) were carefully excluded.

For the validation cohort, Ki67 index was extracted from the surgical pathology reports; a separate count was not performed by the authors in this cohort.

Cases with Ki67 < 3% (including cases with 2.99 but not 3) were classified as G1. The WHO G2 group (Ki67 3–20%) also was evaluated in two subgroups as G2a (Ki67 3% to <10%) and G2b (Ki67 10% to ≤ 20%). Cases with Ki67 > 20% were regarded as G3. As mentioned previously, poorly differentiated neuroendocrine carcinomas were excluded.

Clinicopathologic Parameters

Age at diagnosis and sex were obtained for each case. Tumor size, vascular invasion, and perineural invasion were reevaluated by the authors along with the lymph node metastasis status in the resection specimens. Distant metastasis status was retrieved from the patient medical records. Tumor staging was performed according to the AJCC.42

In addition, in the primary cohort, for each PanNET, the invasiveness pattern at the periphery was evaluated as “non/minimally infiltrative” (NI), “moderately infiltrative” (MI), and “highly infiltrative” (HI) groups, as previously described16 to assess the association of the proposed grading system with this parameter.

Follow-up data concerning clinical, radiological, or histopathologic evidence of tumor recurrence and survival data were extracted from patient records of the institutions and the national electronic databases.

Correlative Analysis

Correlations between Ki67 index grade groups and various conventional established clinicopathologic parameters of aggressiveness (tumor size, vascular invasion (VI), perineural invasion (PNI), lymph node metastasis, liver/distant metastasis) were analyzed in all cohorts. In the primary cohort, the findings were also correlated with the infiltration-based prognostic categories, a histopathologic classification scheme that recently has been to correlate with aggressive behavior. Tumors in the group < 2 cm in size were investigated separately for clinical purposes because this subset of PanNETs are now commonly placed in watchful waiting category.10,11

Statistical Analysis

Descriptive statistics were presented to define continuous variables. The normality of continuous variables was investigated by Shapiro-Wilk’s test. The χ2 test was used for categorical variables along with Fisher exact test when applicable. Logistic regression was used to evaluate the effect of independent variables (which are found statistically significant at univariate analysis) on a dependent variable. Statistical significance was accepted when p < 0.05. Separately, to determine the true numerical cutoff that identifies the more aggressive group, the “maximally selected rank statistics” test was performed. All analysis was done with Jamovi43 and R.44

Results

General Characteristics

1. Initial (primary) cohort (n = 190)

Among 190 PanNET cases included in the primary cohort, the mean age at initial resection was 54.5 years (range 17–84), with a female:male ratio of 1.08. Mean tumor size was 3.5 (range 0.5–12.0, median 3.0) cm. With reference to AJCC 8th edition (2017), the percentages of pT1, pT2, pT3, and pT4 cases were 33%, 35%, 31%, and 1% respectively.

Regarding the infiltration pattern scoring at the advancing edge (assessed in the primary cohort only), 42% of PanNETs were classified as non/minimally infiltrative (NI), 37% as moderately infiltrative (MI), and 21% as highly infiltrative (HI) (Fig. 1).

Fig. 1
figure 1

Macroscopic and microscopic correlations of categories non/minimal infiltrative (A, D); moderately infiltrative (B, E), and highly infiltrative (C, F)

As the significant proportion of the cases were surgical referrals and case identification was conducted mostly through the pathology databases, the functionality status of the cases may not have been reliably documented in each case. In the primary cohort, 13 cases were recorded to have functionality, and these were insulinomas (n = 7), gastrinomas (n = 4), glucagonoma (n = 1), and somatostatinoma (n = 1). Thus, the majority of the cases were nonfunctional per the limited pathology records and clinical records available to the authors, as is the case in most studies nowadays.

Median follow up time was 40 (range 18–95) months. Local (lymph node) and liver-distant metastasis rates at the time of resection were 34% and 22%. During follow-up, 18 patients died from disease-related causes.

2. Validation cohort (n = 145)

The mean age at initial resection was 54.4 (range 30–77) years, with a female:male ratio of 0.88. Mean tumor size was 3.6 (range 0.7–25, median 2.8) cm. The percentages of pT1, pT2, pT3, and pT4 cases were 25%, 36%, 36%, and 3%, respectively. Median follow up time was 72 (range 0.5–276) months. Local (lymph node) and liver/distant metastasis rates at the time of resection were 41% and 30%. During follow-up, 11 patients died from disease-related causes.

Ki67 Grades: Frequency and Correlation with Clinicopathologic Parameters

The mean Ki67 index was 6.84% in primary, 7.45% in validation, and 7.20% in combined cohorts. The distribution of cases into grades G1, G2a, G2b, and G3 were as follows for the primary cohort: G1 48.4% (n = 92), G2a 36.9% (n = 70), G2b 7.9% (n = 15), and G3 6.8% (n = 13). For the validation cohort, these figures were G1 35.2% (n = 51), G2a 37.9% (n = 55), G2b 19.3% (n = 28), and G3 7.6% (n = 11). Please see Table 1 for comparison of cohorts, and Fig. 2 for representative Ki67 examples from each category.

Table 1 Comparison of primary and validation cohorts
Fig. 2
figure 2

PanNETs with Ki67 index 1.2% (A), 5.6% (B), 12.4% (C), and 29.4% (D) corresponding to G1, G2a, G2b, and G3, respectively

In 64 cases in the primary cohort, the Ki67 index rendered in the original pathology reports had not been performed by the authors (and not with the camera-captured/printed image (CCPI) method described) but rather counted by simple eye-balling by other pathologists. The comparison of the indices in the original report versus those elicited in the study in the same patients revealed the median difference to be 1.37 and mean 2.83. In 18 cases (28%), the final grade was different between the original report and the count performed for the study. However, none of the cases jumped by two categories in the subgraded approach proposed (i.e., none of the cases changed from G1 vs. G2b or G2a to G3). In 46 cases, grade did not differ between the original report and CCPI counting.

For conventional findings of aggressiveness, including PNI, VI, and infiltration score, the G2b category showed significantly higher rates than the G2a (Table 2). More importantly, for lymph node metastasis, the rate in G2b as opposed to G2a, this figure was double in the primary cohort (73.3 vs. 35.5%, p = 0.003), and significantly higher also in the validation (60.7 vs. 47.3%, p < 0.001), and combined (65.1 vs. 41.7%, p < 0.001) cohorts. For liver/distant metastasis, it was almost triple in the primary cohort (61.5 vs. 22.6, p = 0.001), almost double, in the validation (57.1 vs. 29.1, p < 0.001), and combined (58.5 vs. 26.1, p < 0.001) cohorts (Table 2).

Table 2 Correlation of Ki67 grades with clinicopathologic features of aggressiveness

Of note, in all these parameters, the G2b category was not only very different than the G2a but also was very similar to the G3 category (and in fact, in the primary cohort, showed numbers even higher than the G3 group, although this unexpected profile was not present in the validation cohort and mostly disappeared when all cases were combined).

Most importantly, compared with the G1 group, the proposed G2b category had incomparably higher rates of aggressiveness features, including the liver/distant metastasis rate (7.5% vs. 61.5% in primary, 3.9% vs. 57.1% in validation, 6.1% vs. 58.5% in combined cohorts).

Tumors < 2 cm and tumors 2–4 cm

With primary and validation cohorts combined, the differences between G2b and G2a persisted when they were size matched as well. Of the 102 PanNETs that were < 2 cm in size, 65.7% (n = 67) were stratified as G1, 24.5% (n = 25) as G2a, 6.9% (n = 7) as G2b, and 2.9% (n = 3) as G3. This group demonstrated statistically significant correlation between higher grade and lymph node metastasis (15.5% for G1; 9.1% for G2a; and 57.1% for G2b, p = 0.020) and liver/distant metastasis (1.6% for G1, 8.7% for G2a, 50% (n = 3/6) for G2b, p < 0.001).

Of the 94 PanNETs that were 2 to 4 cm in size, 50% (n = 47) were stratified as G1, 34% (n = 32) as G2a, 10.6% (n = 10) as G2b, and 5.3% (n = 5) as G3. In this group, the rate of lymph node metastasis (20.9% for G1; 48.3% for G2a; and 80% for G2b, p = 0.003) and liver/distant metastasis (6.8% for G1; 16.7% for G2a; 70.0% for G2b) increased progressively.

Maximally Selected Rank Statistics and Odds Ratios

The “maximally selected rank statistics” analysis performed to determine the main cutoff to predict the adverse outcome in the primary cohort, where Ki67 count was performed with the CCPI method, identified this number as Ki67 index of 12%.

To establish the correlation of conventional risk factors with adverse events (liver/distant metastasis or disease related death), multivariate analysis was performed. In this analysis, the current grading system of G1-G3 was not found to be significant, whereas the proposed grading was found to be independent along with tumor stage and lymph node metastasis. PNI and VI did not reach statistical significance (Table 3).

Table 3 Odds ratios of adverse outcomes (liver/distant metastasis + disease related death) per clinicopathological features for primary, validation, and entire cohorts

Discussion

This study, which is to our knowledge the first systematic pathologic analysis focusing on this issue, elucidates that, among resected PanNETs, by applying standardized counting methodology Ki67 index ≥ 10% identifies a significantly more aggressive and a metastasis-prone group and as such supports the evolving management protocols in the oncology literature that advocate managing this group differently.3,7,9,24,25,26,27,28,29,30,31,32,33,34,35 This category also stood the multivariate analysis along with lymph node metastasis and tumor size/stage.

It is important to point out that Ki67 index is a continuum; as such, it is plausible to pick essentially any cutoff and possibly find some prognostic value and associations. However, the current cutoff of 3–20%, which had been mostly extrapolated from G1 PanNETs proved to have virtually no discriminatory value, such that G1 and G2 PanNETs are currently mostly regarded together as one category (G1/2) for management purposes, leaving almost 95% of cases unstratified. In this study, 10% cutoff was selected and tested for two main reasons: (1) This cut off has recently begun to be employed in the oncology literature to select patients for therapy by somatostatin receptor analogues and others3,7,9,24,25,26,27,28,29,30,31,32,33,34,35; (2) The analysis of the primary cohort in our study in which Ki67 count was performed with special care revealed that with the “maximally selected rank statistics” method the proper cutoff was 12%. Considering this number is fairly close to the rule-of-thumb number of 10%, this figure was selected and indeed proved to have very striking discriminatory value.

Accordingly, evaluating G2 PanNETs that have a Ki67 index rate ≥ 10% separately as G2b (i.e., cases with Ki67 index of 10% to ≤ 20%), this group was found to exhibit significantly higher rates of metastasis than G2a group (i.e., Ki67 of 3% to < 10%). Other signs of aggressiveness also were significantly higher in the G2b group. Perhaps more importantly, these characteristics of G2b group was very similar to the G3 (>20%) group. Increasingly, the oncology literature encourages the consideration of therapy for any G3 PanNETs.28,45 Considering that the G2b cases were found to be very similar to G3 (both in the initial as well as the validation cohorts in this study), the approach applied to G3 cases may have to be considered for the G2b group, which needs to be further investigated by clinical trials. Clearly, if nothing else, this group is a candidate for closer follow-up. It should be reemphasized that his group is currently hidden in the wide G2 category and typically managed along with the G1 cases under the G1/2 umbrella; thus, their aggressiveness skips clinical attention.

The fact that G2b group is comparable to the G3 group suggests that the prognostic impact of Ki67 index peaks above 10% range but also starts to level off after a certain point probably as it approaches to the 20% range. In fact, in the primary cohort, the signs of aggressiveness were even higher in the G2b group than the G3 group. However, this may be related to the small number of cases in that initial cohort, because this difference mostly disappeared when combined with the validation cohort. Regardless, the findings are strongly indicative that G2b group defined is highly comparable to the G3 cases. It should be noted that, although well-differentiated PanNETs are viewed to be molecularly entirely distinct from PDNECs, recent studies are finding G3 cases to have some molecular overlaps with PDNECs5,46 with 35% of the cases showing p53 alteration. Considering G2b group shows aggressive behavior similar to G3, it will be important to analyze this group at the molecular level to see if it bears cases that show overlaps with (or ability to transition to) the PDNECs.5

It is important to reiterate that, in the current daily practice, the differences in aggressiveness between the G1 and the current broad G2 category (3–20%) is not viewed to be striking enough by the majority of the community to justify a differential treatment based on the grade alone.7,25,47 However, this study demonstrates that, when the characteristics of the G2b group is compared to G1, with the lymph node metastasis rate being several folds higher, and liver-distant metastasis rate of very compelling nine- to tenfold (58.5% vs. 6.1%), the significance of separating the G2b category and distinguishing it from the G1 becomes much more crucial.

These results also reemphasize some of the evolving practice recommendations for pathologists. First, the impression that “there is not much difference between a Ki67 of 4% and 19%” (all currently regarded G2), and “in fact, there is not much difference between G1 and G2 PanNETs” which in turn leads to the common application of eye-balling method in daily practice, is no longer valid. The striking differences in the behavior of G2a versus G2b group (which is even more dramatic when G2b is compared with G1) elucidated in this study necessitates more careful counting and numerical documentation of the Ki67 index. It should be reiterated that, after all, Ki67 index is a continuous variable, and that 9% (regarded G2a in this study) is probably not really that different than 11% (regarded G2b). As in any grading or staging system, there are imperfections in the cutoff regions. For this reason, in our opinion, documentation of the specific count (not only the grade) in the surgical pathology report is warranted. This would allow the management team to assess other findings, for example, the presence of other risk factors such as the highly infiltrative pattern16 or oncocytic phenotype,12 and incorporate age, patient expectation/choices and comorbidities to determine the course of action putting all these factors together. For this reason, we believe it is crucial to provide a specific number for Ki67 index, and we also recommend reporting of infiltration pattern and phenotypic classification, all of which altogether could aid in the final clinical decision making, especially for the cases that stand on the fence. Of note, despite the acknowledged shortcomings in applying cutoffs to a continuous process, at the same time, as in any grading and staging system, a cutoff to stratify the patients also is needed for Ki67. In this study (in which the hard stop cutoff was found to be 12% by the “maximally selected rank statistics”) when combined with the evolving impression in oncology literature regarding 10%, points to this rule of thumb number of 10% as the most practical and valid cutoff.24,25,26,27 As such, all of this warrants the creation of a G2b category.

The findings in this study will have significant impact on the treatment of PanNET patients. For management of tumors that have already been resected, it has not been clear which radiologic test to employ, how soon, and with which frequency. The rates of metastasis elucidated in this study for G2b cases (with three quarters of this group showing lymph node metastasis, and almost two thirds showing liver/distant metastasis, the latter 8 times more frequent than in the G1 cases), clearly indicates that this group warrants close follow up at minimum. In fact, depending on the age of the patient and other factors, the eligibility of G2b cases to a management that is similar to G3 (for which somatostatin analogues and other therapies are increasingly being employed as adjuvant treatment) may have to be considered, although, of course, this requires further studies.24,25,26,27,28 However, with a rate > 60% showing liver/distant metastasis, for this group, it is clear that surveillance protocols need to be modified; at minimum frequent liver radiograms are warranted for the G2b group, probably starting with 3-month intervals and extending them as time goes by, which is what we currently recommend for our patients. In contrast, for a G1 PanNET, which overall has less than 10% liver/distant metastasis rate, the surveillance may not have to be that intense, and an adjuvant therapy would obviously not be justifiable.

The results of this study also has significant implications on the management of PanNETs that are smaller than 2 cm. Currently, watchful waiting is considered as an option of these cases, regardless of the grade.10,11 Indeed, in this study, for < 2 cm tumors that are G1, the liver/distant metastatic rate at the follow up period was < 2%, and for even G2a, it was < 10%. However, when it came to the proposed G2b group, the rate of metastasis was 50% even in these small tumors. Granted this constitutes a relatively small percentage of the cases (7% of < 2 cm PanNETs are G2b and 3% G3), but nevertheless it is not a trivial number. These data indicate that if such a case is encountered, resection should be considered strongly.

Naturally, these approaches will have to be modified as more data comes out for the different subgrades, but in the absence of any specific data at this time, the approaches described above are justifiable based on this data and current literature. Clinical trials are warranted to consolidate new guidelines for these groups.