Background

Prostate adenocarcinoma is often morphologically heterogenous, consisting of two or more grades or patterns as defined in the Gleason grading system and its various amendments. Gleason scoring, which is the sum of the primary and secondary grades, was specifically designed to take the various grades present into account in order to provide an overall prognostic assessment of individual cases and requires assessment of the whole tumour. It is an important strength of Gleason grading that it factors in the heterogeneity of the tumour. While combinations of Gleason grade do provide an overview of the morphology of the tumour, the Gleason sum score is less granular on its own, and it is evident that identical individual Gleason scores may be achieved through the presence of apparently unrelated grade combinations. This issue is especially evident in cancers showing morphology of the higher grades. While Gleason scoring has the effect of masking morphological complexity, quotation of the composite grades in conjunction with the score, e.g. 3 + 4 = 7, does provide data that are of clinical utility when reporting a tumour as Gleason score 7. Such information is less evident when tumours are reported according to the five recently described International Society of Urological Pathology (ISUP) grades (also known as grade groups). This grading system was specifically introduced to assist patients understand that Gleason score 6 tumour is now the lowest grade, being designated as ISUP grade 1. This grading classification does differentiate between Gleason score 3 + 4 = 7 and 4 + 3 = 7 (ISUP grades 2 and 3, respectively), but there is no such distinction possible for tumours of higher score [1]. As a consequence, a tumour designated ISUP grade 4 may in reality be Gleason score 3 + 5, 4 + 4 or 5 + 3, each of which possibly reflect a differing prognosis. In the past, several studies have indicated a potential prognostic heterogeneity among these grade combinations [2,3,4,5,6,7,8], but there has been considerable disagreement as to whether there is a similar outcome for some specific combinations, e.g. 3 + 5 and 4 + 4 [6], 4 + 4 and 5 + 3 [2] or 3 + 5 and 5 + 3 [3, 4].

The aim of this study was to investigate prostate cancer-specific and all-cause mortality among subgroups of Gleason score 8 by analysing Gleason scores 3 + 5, 4 + 4 and 5 + 3 in a large population-based cohort taking date of diagnosis and treatment into account.

Materials and methods

The National Prostate Cancer Register (NPCR) in Sweden holds clinical and histopathological data on all prostate cancers diagnosed since 1998 with the aim to increase quality of care and adherence to guidelines [9, 10]. The coverage has been estimated to be more than 96% of all prostate cancers diagnosed in Sweden during this period [11]. For this study, men diagnosed with acinar adenocarcinoma, which were reported to NPCR between the inclusive years 2000–2020, were identified. From this group, all cases of acinar adenocarcinoma with Gleason scores 8 (4 + 4, 3 + 5 or 5 + 3) in needle biopsies were included in the study. Data were extracted utilising a similar methodology as in our recent registry study on cancers with Gleason scores 9 to 10 [12]. Briefly, 18,988 cases (9.5%) with incomplete or inconsistent data entries were excluded from further analysis. In addition to Gleason grade/scores, NPCR was searched for data relating to patient age, clinical TNM categories and serum prostate-specific antigen (PSA) levels at presentation. The primary treatment was classified as androgen deprivation therapy (ADT), deferred treatment (DT), radical prostatectomy (RP) or radical radiotherapy (RRT).

In Prostate Cancer data Base Sweden (PCBaSe), 5.0 data in NPCR has been linked to other nationwide population-based health care registers and demographic databases such as the cause of death register, the patient register and the Longitudinal Integrated Database for Health Insurance and Labour Market Studies (LISA), a socioeconomic database, by using the Swedish personal identity numbers as identifiers [10]. Deaths were recorded as death due to prostate cancer or death due to other causes. In brief, comorbidity was assessed by use of the Charlson co-morbidity index (CCI) based on data retrieved from the patient register for the 10-year period before the diagnosis of prostate cancer, as previously described [13, 14].

Data on education level were extracted from the Longitudinal Integrated Database for Health Insurance and Labour Market Studies (LISA) of Statistics Sweden [15].

This study was approved by Research Ethics Authority.

Statistical analysis

Multivariable Cox regression models were used for time-to-event analyses of death from prostate cancer and death from all causes and were censored at 10 years of follow-up. Multivariable analyses included Gleason score; patient age; T, N and M clinical staging categories; Charlson co-morbidity index; primary treatment; period of diagnosis; and education level.

Prostate cancer-specific mortality (PCSM) and all-cause mortality (ACM) were calculated at 5 and 10 years, using cumulative incidence functions (CIF) that considered competing events (death from other causes than prostate cancer) for PCSM using the nonparametric Aalen-Johansen estimator. Time interval from diagnosis to death was registered. The last death record was from March 2023. Follow-up was calculated from date of diagnosis to date of death, date of emigration or last date of follow-up 2023–04-01 whichever came first. Graphs were generated for PCSM and ACM.

Results

During the period of the study, a total of 199,620 men were reported to the NPCR with newly diagnosed prostate cancer. Following exclusion of cases where there was incomplete data entry or where cancers were diagnosed in transurethral resection specimens (Suppl. Figure 1), 172,112 men with prostate cancer diagnosed on needle biopsy remained for analysis. Of these, 18,281 (11%) were Gleason score 8. This group of cases consisted of 2085 (11%) Gleason score 3 + 5, 15,776 (86%) Gleason score 4 + 4 and 420 (2.3%) of Gleason score 5 + 3 tumours. The mean age at diagnosis was 71.5, 73.3 and 73.3 years in men with Gleason score tumours of 3 + 5, 4 + 4 and 5 + 3, respectively, while the median PSA values were 12, 23 and 23 ng/ml in men with Gleason scores of 3 + 5, 4 + 4 and 5 + 3, respectively. The distribution of T, N and M stage was quite similar between the three subgroups. When all of the cases in the series were grouped according to the year of diagnosis, the proportion of cancers that were assigned a Gleason score of 8 was 11% (4068/35,665) in 2000–2005, 11% (8745/78,112) in 2006–2014 and 9% (5468/58,335) in 2015–2020 (Suppl. Figure 2). Other clinical and demographic data are shown in Table 1. For men with Gleason score 8 tumours, the most common primary treatment was androgen deprivation therapy (55%), while 21% underwent radical radiotherapy and 16% radical prostatectomy. Deferred treatment was used as primary treatment strategy in 8% of cases.

Table 1 Descriptive data of men diagnosed with prostate cancer Gleason score 3 + 5, 4 + 4 and 5 + 3 in Prostate Cancer data Base Sweden (PCBaSe) 4.0 in 2000–2020

Survival data were available for all 18,281 men with Gleason score 8 tumour, and 38% of men were alive after a median follow interval of 10.8 years (Q1 = 6.6, Q3 = 15.0). During this same period, 32% men died of prostate cancer, and 30% died of other causes. The PCSM and ACM after 5- and 10-year follow-up are shown in Table 2. PCSM for men with Gleason scores 3 + 5, 4 + 4 and 5 + 3 at 5 years of follow-up were 0.10 (95% CI 0.09–0.12), 0.22 (0.22–0.23) and 0.32 (0.27–0.36), respectively, and at 10 years of follow-up 0.19 (95% CI 0.17–0.22), 0.34 (0.33–0.35) and 0.44 (0.39–0.49), respectively. ACM for men with Gleason 3 + 5, 4 + 4 and 5 + 3 at 10 years of follow-up was 0.50 (95% CI 0.47–0.53), 0.63 (0.63–0.64) and 0.70 (0.65–0.75), respectively (Fig. 1).

Table 2 Prostate cancer-specific mortality (PCSM) and all-cause mortality (ACM) at 5 and 10 years
Fig. 1
figure 1

A Prostate cancer-specific mortality (PCSM) of men with Gleason score (GS) 3 + 5, 4 + 4 and 5 + 3. B All-cause mortality (ACM) of men with Gleason score 3 + 5, 4 + 4 and 5 + 3. Curves are truncated at the time point of the last event

Separate analysis of constituent Gleason patterns showed substantial differences in PCSM. For men with a Gleason score of 5 + 3, the PCSM was higher than for those with a Gleason score of 4 + 4 after both 5- and 10-year follow-up. Similarly, the PCSM of men with a Gleason score of 4 + 4 was higher than for those with a Gleason score of 3 + 5. This pattern was repeated in men who had received ADT, while the mortality rate among men treated with radical prostatectomy or radical radiotherapy was generally so low that it was difficult to demonstrate significant differences in outcome (Table 2). In all men with Gleason score 8, cancers mortality was lower after radical prostatectomy or radical radiotherapy than after androgen deprivation therapy or deferred therapy (Table 2). Plots of PCSM and ACM after these treatment options are shown in Fig. 2 and Suppl. Figure 3, respectively. There was a substantial decrease in mortality from the first to the latest calendar period; plots of PCSM and ACM of all Gleason score 8 cancers diagnosed during the time periods of 2000–2005, 2006–2014 and 2015–2020, respectively, are shown in Fig. 3 and Suppl. Figure 4.

Fig. 2
figure 2

Prostate cancer-specific mortality (PCSM) of men with Gleason score 8 prostate cancer managed by androgen deprivation therapy (ADT), deferred therapy (DT), radical prostatectomy (RP) or radical radiotherapy (RRT), respectively. Curves are truncated at the time point of the last event

Fig. 3
figure 3

Prostate cancer-specific mortality (PCSM) of men with Gleason score 8 prostate cancer diagnosed in 2000–2005, 2006–2014 and 2015–2020, respectively. Curves are truncated at the time point of the last event

In a multivariable Cox analysis including Gleason score; age; cT, cN and cM staging categories; Charlson co-morbidity index; primary treatment; period of diagnosis; and education level, Gleason scores 4 + 4 and 5 + 3 are found to be significant predictors of PCSM, with hazard ratios (HR) of 1.32 (95% CI 1.18–1.49) and 1.96 (95% CI 1.62–2.37), respectively, utilising Gleason score 3 + 5 as the referent (Table 3).

Table 3 Multivariable Cox analysis of prostate cancer-specific (PCSM) and all-cause mortality (ACM) at 10 years

Discussion

In this population-based register study on the prognosis of Gleason score 8 prostate cancer in needle biopsies, there was a significantly higher mortality in men with Gleason score 5 + 3 cancer compared to those with 4 + 4 cancer and in men with Gleason score 4 + 4 cancer compared to those with 3 + 5 cancer. A majority of these men were treated with androgen deprivation therapy. Those who received treatment with curative intent in general had a very low prostate cancer mortality (0.04–0.15 after 10 years), and significant differences between the grade subgroups were not observed.

Prognostic heterogeneity in Gleason score 8 prostate cancer has been demonstrated in most previous studies [2,3,4,5,6,7]; however, outcomes between Gleason scores 3 + 5, 4 + 4 and 5 + 3 vary between the studies. In a systematic review and meta-analysis, Lu et al. analysed eight reports and concluded that the meta-analysis was hampered by the widely differing methodologies of the studies [4]. In particular, there were differences in the specimens used for the studies with cases consisting of biopsy specimens [2, 3, 5, 7, 8], radical prostatectomy specimens [2] or a combination of the two [6]. Other differences were that some studies relied on single-centre data [3], while others were based on multiple-centre data [2, 8] or registry data [5,6,7]. A problem in several studies was the low number of some of the grade categories. This was in particular true for Gleason score 5 + 3 tumours, and as a consequence, results from these studies need to be interpreted with caution [5, 8]. Selection bias most likely accounts for some observed outcome differences between treatments, such as the low prostate cancer mortality in men who underwent radical prostatectomy or radical radiotherapy in the current study.

Using Surveillance, Epidemiology and End Results (SEER) database registry data, Mahal et al. found that Gleason score 5 + 3 cancers had a significantly worse prognosis than those with a Gleason score of 3 + 5 or 4 + 4 [6]. However, the methodology of grade reporting in SEER was not uniform throughout the study period, and grading was based on the highest Gleason score in either the preoperative needle biopsies or the subsequent radical prostatectomy specimen. As a consequence, the results of this latter study are not comparable with those of our study. By contrast, Gandaglia et al. found a similar outcome in tumours with Gleason scores 4 + 4 and 5 + 3, but their data were based on radical prostatectomy specimens only [2]. Huynh et al. and Rusthoven et al. on the other hand combined cases of Gleason scores 3 + 5 and 5 + 3 and compared them against outcomes of Gleason score 4 + 4 tumours based on the assumption that the presence of a Gleason pattern 5 would be the most critical marker of prognosis [3, 7].

Gleason score 8 cancers on needle biopsy are Gleason score 4 + 4 in the vast majority of cases, and only 9% [3]–21% [8] are being reported as 3 + 5 or 5 + 3. In the present study, 86% were 4 + 4, which is in line with the other large register-based study [6]. Of all Gleason score 8 cancers in earlier series, only ≤ 5% were reported as 5 + 3 [5, 6, 8]. As noted above, some of the previous studies merged Gleason scores 3 + 5 and 5 + 3 due to a paucity of cases and did not report separate results [3, 7].

At the 2014 ISUP consensus conference on Gleason grading, it was recommended that the grading of prostate cancer be based on 5 ISUP grades (also known as grade groups) for reporting purposes. These so-called ISUP grade groups were defined on the basis of Gleason scores, i.e. Grade 1, Gleason score 3 + 3 = 6; Grade 2, Gleason score 3 + 4 = 7; Grade 3, Gleason score 4 + 3 = 7; Grade 4, Gleason scores 3 + 5 = 8, 4 + 4 = 8 or 5 + 3 = 8; and Grade 5 4 + 5 = 9, 5 + 4 = 9 or 5 + 5 = 10. It is apparent that while ISUP grades 1–3 consist of single Gleason scores, this does not apply to ISUP grades 4 and 5, and concerns have been raised that this grouping would have considerable disadvantages by not revealing the exact Gleason grade composition present in individual cases [16]. It has been shown that any percentage of Gleason pattern 5 tumour in a needle biopsy is associated with a worse prognosis when compared to Gleason score 4 + 3 tumours [17]. In a recent study, we analysed the outcome of ISUP grade 5 cancers and found significant differences in mortality between the three grade subgoups (Gleason scores 4 + 5, 5 + 4 and 5 + 5) confirming that a grouping of these Gleason scores would cause loss of granularity of data which, in turn, would diminish any prognostic information derived from grading [12]. In this present study, we found further support for this hypothesis. By restricting our analysis to the subgroups of tumours that constitute ISUP grade 4, we now also demonstrated a prognostic heterogeneity within this ISUP grade. Interestingly, the PCSM for Gleason score 5 + 3 cancers was very similar to that of Gleason score 4 + 5 cancers, reported in our earlier study, with PCSM after 10 years at 0.44 (0.39–0.49) and 0.45 (0.44–0.46), respectively [12].

Thus, while we have demonstrated significant differences in prognosis between the components of ISUP grades 4 and 5 when analysed separately, there was also an overlap between some components of ISUP grade 4 and ISUP grade 5. These data further emphasise that by grouping of Gleason scores to generate ISUP grades, valuable prognostic information is lost.