Background

Most urological societies recommend aorto-abdominal computed tomography (CT) or magnetic resonance imaging (MRI) for primary lymph node staging of patients with intermediate and high-risk prostate cancer [1,2,3]. These recommendations have been consistent for many years despite the modest positive and negative predictive values of anatomical imaging for lymph node metastases [4]. Positron emission tomography (PET) tracers like F-18- or C-11-choline have not shown superior to anatomical imaging methods [5].

Within the last couple of years, a high number of publications have indicated an excellent diagnostic performance of prostate-specific membrane antigen (PSMA) PET/CT for prostate cancer, primarily in biochemical recurrence [6,7,8,9]. Also, some reports have shown encouraging data with PSMA PET/CT for the detection of lymph node metastases for primary staging. A notable number of systematic reviews have been published on this topic since 2018 [8,9,10,11,12,13]. These systematic reviews comprise a limited number (five to eight) original reports with primary staging with pathology as reference. We are aware of many more studies with PSMA PET/CT for primary staging than previously reported. Thus, the objective was to provide an overview of trials with PSMA PET for staging of prostate cancer.

Methods

Aim

The aim was to provide an up-to-date overview of diagnostic test accuracy trials of preoperative PSMA PET for primary lymph node staging on a patient level in prostate cancer patients with a final pathology reference for lymph node metastasis.

Study design

This expedited systematic review was mostly compliant with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines for reporting of systematic review [14]. The term ‘expedited’ indicates that some aspects of systematic review methodology were kept at a minimum, e.g., one person did the entire eligibility screening, and assessment of bias comprised assessment of study design and recruitment only. Due to the sequential search of papers, a flow diagram of papers was not feasible. The review synopsis for this study was not published in a public database. No synthesis of results (meta-analysis) was performed. The expedited methodology was chosen since the publication velocity of papers within this area is so fast, that a full systematic review would be largely outdated at the time of publication.

Literature search

A literature search was performed in PubMed, which was searched from inception up to August 16, 2019. We used the following search string: (lymph node metastasis OR staging) AND (positron emission tomography OR PET) AND (prostate specific membrane antigen OR psma) AND (prostate malignancy OR prostatic carcinoma OR prostate cancer OR prostatic neoplasm). We did several searches in PubMed since end-2018 with repetitive and overlapping updates. A final search performed on August 16, 2019, provided 413 references (see Additional file 1). In addition, The Cochrane Library was searched from inception through July 2, 2019, using the search word ‘prostate specific membrane antigen’ or ‘prostate-specific membrane antigen’ (all text search). A total of 123 hits were identified, none of which were relevant supplementary hits compared to the references identified in PubMed. A final search on August 16 also showed 123 references. The validity of the PubMed search string was verified by the identification of all references presented in the latest systematic reviews of 68Ga-PSMA PET/CT for primary staging with pathology as the reference [8,9,10,11, 13]. All references were imported into EndNote X8 (Clarivate Analytics, Philadelphia, US) and were evaluated for inclusion in the review. We did not perform additional literature searches (‘grey’ literature, review of reference lists of eligible papers, or contact to authors).

Eligibility criteria

According to the PICOS concept (patient, intervention, comparator, outcome, study type), the following eligibility criteria were used: 1) The trial must comprise patients diagnosed with prostate cancer; 2) the patients underwent preoperative PSMA PET for primary lymph node staging, and 3) the reference test was histopathology. In addition, 4) the trial must report individual data or summary diagnostic accuracy findings on a patient level; and 5) it should include a minimum of five patients per study fulfilling all the criteria mentioned above. Based on the criteria mentioned above, 6) the study design was limited to diagnostic test accuracy studies. All languages were accepted, and any.

PSMA ligands were eligible. Among papers with mixed types of cancers or settings (e.g., primary and secondary staging), data should be extractable in accordance with the eligibility criteria mentioned above.

Sorting and data extraction

All papers were initially reviewed for eligibility by the reading of title and abstract by a trained reader (LJP). Apparently ineligible papers were excluded. Potentially eligible papers were reviewed in full text by reading supported by electronic searches within the text for relevant words and phrases, e.g., lymph node, lymph node metastasis, sensitivity, and true positive, and false positive. For papers finally included in the review, two readers (LJP and HDZ) independently extracted the data. A consensus decision was made in the cases of disagreement.

Each paper was reviewed for the reporting of diagnostic performance versus pathology reference (e.g., sensitivity, specificity, positive and negative predictive values) on a patient level. If diagnostic accuracy data were reported by the authors, these figures were used. If data were not reported (or incompletely reported), diagnostic accuracy was determined from 2 × 2 tables of true and false positive and negative cases. Any data available for comparative index tests were also extracted. Some papers reported findings with pathology as the reference in a subset of the patients. In these cases, we restricted the study population to the patients with a pathology reference. If the prostate cancer risk classification was not reported separately for the subset of patients with a pathology reference, the risk class for classified as ‘not reported’. If the study population in a paper comprised patients with primary and secondary lymph node dissection (LND), the mean (or median, if reported) number lymph nodes removed from the entire population was reported if data was not available from patients with primary LND.

Assessment of bias

Information about patient enrollment (prospective versus retrospective) and selection of patients (consecutive or non-consecutive enrolment) was extracted. The assessment of trial methodology was based on the actual reporting. Thus, a trial was classified as prospective only if the word ‘prospective’ was mentioned, the terminology was clear (e.g., “we enrolled”), or the trial was classified as an interventional trial. Finally, consecutive recruitment of patients was acknowledged only if the phrase “consecutive” or “unselected” was used, or it was clear that the trial included all patients or an unbiased selection of patients examined in a well-defined period, and the eligibility criteria were specified.

Statistics

Weighted analysis of sensitivity and specificity was performed using the reported diagnostic accuracy data corrected for the number of patients in each trial. No analytical statistics were used.

Results

Description of the included papers

The total search strategy comprised 512 references, 389 references from PubMed (Additional file 1) and 123 references from The Cochrane Library. Due to the sequential and overlapping searches, a flow chart cannot be made of the papers read by full text and the reasons for exclusion. The main reason for exclusion of papers after full text reading was the lack of a pathology reference of lymph node metastasis (reporting of detection rates only). One paper selectively reported diagnostic accuracy data on a lesional level with pathology as the reference but no data on the patient level [15].

Eighteen studies, originating from ten countries (based on the country of the first author) and published from 2016 and onwards (PubMed availability), reported data on a patient level (Table 1). Most trials came from Germany (n = 4) and Australia (n = 3). Most papers used Ga-68-PSMA-11 (PSMA HBED-CC). PET/CT was most frequently used; a few studies used PET/MR (entirely or predominantly). The majority of studies were retrospective in design. Half of the studies recruited consecutive patients; there was only one prospective study with consecutive recruitment. Generally, all papers reported the distribution of patients with intermediate and high-risk or reported proportions of patients with specific T-stages, PSA levels, or Gleason scores. Even though some papers did not report risk classes among patients with pathology as a reference, most of these papers entire or preferable recruited intermediate and high-risk patients [23, 32]. All papers but one performed LND with radical prostatectomy; one study solely included patients who had laparoscopic LND prior to curatively intended radiotherapy [33].

Table 1 Study demographics. Data are from patients undergoing staging for newly diagnosed disease in whom a pathology reference was applied

Diagnostic accuracy for PSMA PET for lymph node metastasis

Diagnostic accuracy for the detection of lymph node metastasis on a patient basis showed wide variations (Table 2). Sensitivity ranged from 23 to 100%, specificity 67–100%, positive predictive value 20–100%, and negative predictive value 41–100%. However, 15 of 18 studies reported a sensitivity of 50% or greater. Specificity was generally good, 90% or better in most studies. A weighted analysis showed a sensitivity of 59% (specificity 93%) based on the reported diagnostic accuracy data and the number of patients in the respective trials. The disease prevalence varied notably from 4 to 65%. There was some variation of the positive and negative predictive values, but the majority of trials presented values of 80% or better for both positive and negative predictive values.

Table 2 Diagnostic performance on a patient basis for the detection of lymph node metastasis with pathology as reference. Data accuracy data from any comparative index test(s) are shown in parenthesis

Comparison of PSMA PET with other index tests

A subset of reports compared PSMA PET with other index tests (Table 2). Four papers compared PSMA PET with guideline-recommended anatomical imaging (CT or MRI). In all four reports, PSMA PET numerically outperformed cross-sectional anatomical imaging. A few studies applied multi-parametric or diffusion-weighted MRI for lymph node staging with heterogeneous findings when compared to PSMA PET.

Discussion

Cross-sectional anatomical imaging has been recommended for patients with unfavourable intermediate and high-risk prostate cancer for primary lymph node staging across urological guidelines for years despite known insufficient diagnostic accuracy for both CT and MRI [4]. With a weighted sensitivity of 42%, a specificity of 82%, and a positive predictive value of 32%, CT is not the ideal imaging method in this setting. Here we summarised findings from 18 diagnostic test accuracy trials with PSMA PET and a valid reference test and found a weighted sensitivity of 59%, a specificity of 93%, and positive and negative predictive values of 80% or higher in the majority of the trials. These findings indicate that PSMA PET should be considered for primary staging in prostate cancer.

PSMA PET has shown encouraging diagnostic properties in prostate cancer since the first clinical trial was published with Ga-68-PSMA-11 in 2012 [34]. Since then, the use of PSMA PET has been evaluated in numerous settings, including primary staging and biochemical recurrence, as shown in recent systematic reviews [8,9,10,11,12,13]. However, these recent systematic reviews only present a fraction of the knowledge of PSMA PET for primary lymph node staging. Even though our systematic literature search included all PSMA ligands, 16 of 18 trials were with Ga-68-PSMA. The clinical trial documentation with PSMA PET originated from 2016 and onwards with most trials published in 2018 and later. It may be speculated that the tremendous work with systematic reviews and meta-analysis, including duplicate search and selection, searched across several databases and the grey literature, detailed bias assessment, may impair the ability to show up-to-date evidence. With rapidly evolving methods like PSMA PET, expedited systematic literature reviews may better provide a status of the current knowledge. We certainly acknowledge the place for systematic reviews and meta-analyses in particular to synthesise cumulative evidence in other areas of medicine.

The 18 trials showed clinical compelling diagnostic performance in those patients where primary lymph node staging is indicated (high-risk and unfavourable intermediate-risk categories), even though some variations in diagnostic accuracy were observed across the trials. The reasons for the wide variations may, among others, be related to research design, technical aspects of the PET/CT scanner and image reconstruction, reader experience, the surgical procedure, and handling of the pathology specimens. Most studies were of modest size, which together with variations in the proportion of LN metastatic patients (4 to 65%) may influence precision of the diagnostic test characteristics. Also, it has been documented that wide disease spectrum, non-consecutive recruitment, open-label reading of tests, and retrospective data collection were associated with higher estimate of diagnostic accuracy [35]. In this review [33, 36], the proportion of high-risk patients varied from 17 to 100%, approximately 50% of the trials stated consecutive recruitment, and blinding of readers were seldom reported. There were five prospective trials, but only one prospective, interventional trial with consecutive enrollment [33]. That study was fully compliant with The Standards for Reporting of Diagnostic Accuracy (STARD) [37], performed according to the Good Clinical Practice requirements, but unfortunately performed with older PET/CT scanners. Reader experience with PSMA PET should also be taken into consideration [38]; reader experience was seldom mentioned in the 18 trials. In addition to the extent of surgical resection, details for pathology assessment, including criteria for a pathology-positive lymph node, should be considered; such details were infrequently reported.

The weighted sensitivity and specificity of PSMA PET outperformed the reported diagnostic accuracy of CT and MRI published in the latest meta-analysis [4]. Four trials directly compared PSMA PET with cross-sectional imaging. In all trials, PSMA PET numerically outperformed CT and MRI. A few trials compared PSMA PET with functional MRI; the findings were heterogeneous, so no firm conclusion could be made.

The differences in diagnostic accuracy among anatomical and functional imaging methods may mainly reflect the criteria for classifying an index test as positive for malignancy. With CT and MRI, a size criteria of 10 or 15 mm, with or without additional radiological criteria, are generally applied for pathological lymph nodes [4, 39]. PET criteria combine anatomical and functional findings with quite consistent observer agreement across reporting guidelines [40]. Equivocal imaging findings on PSMA (e.g. PSMA-RADS-3) still represent a challenge; however, most studies in this systematic review reported PSMA results on a dichotomous scale. Handling of equivocal findings can notable influence diagnostic accuracy [41]. Recent studies with PSMA PET have revealed that most lymph node metastases are in fact quite small. In our own recent trial, false-negative metastases on PSMA PET/CT had a median diameter of 4 mm, with only 3 of 30 lymph nodes being larger than 10 mm; 31% of the patients with lymph node metastases had micrometastasis (< 0.2 mm) only [33].

When we searched for eligible papers for this review, it was striking that approximately 50% of the papers published in this area reported detection rates only, not correlative findings with pathology. At staging, it is possible to do pathological staging, at least on a patient basis. In other settings, e.g. biochemical recurrence, pathology may be difficult to obtain. Such results should be carefully evaluated due to possible verification bias [42, 43] [33, 37]. In order to move PSMA PET higher in the hierarchy of clinical evidence, we not only need large prospective diagnostic test accuracy trials [44], but we also need trials showing improvement of patient benefit, e.g. improved outcome of having a preoperative PSMA PET rather than CT or MRI. There are indications that such a benefit can be documented; in a recent study by van Leeuwen et al. [25], the biochemical recurrence rate after radical prostatectomy was 17% among patients who had no signs of lymph node metastasis on the PSMA PET/CT compared to 50% of those in whom PSMA identified lymph node involvement. Once clinical benefit with PSMA PET has been documented, strong recommendations in clinical practice guidelines will follow.

Conclusion

The current level of evidence indicated interesting diagnostic performance of PSMA PET/CT for primary lymph node staging of high-risk prostate cancer patients though there were some variations across the studies. PSMA PET/CT showed superior diagnostic performance compared to anatomical imaging for lymph node staging in studies where comparative data were reported. Since studies have shown PSMA PET/CT to outperform bone scans, with or without SPECT/CT, for the detection of skeletal metastases [45,46,47,48], PSMA PET/CT may turn out to be a one-stop modality for staging in these patients.