Introduction

The introduction of multiparametric MRI (mpMRI) has led to a paradigm shift in the diagnostic pathway for prostate cancer. By incorporating prostate MRI before biopsy, overdiagnosis and overtreatment of indolent cancers have been reduced due to a decrease in unnecessary biopsies [1,2,3,4]. In addition, MRI-guided biopsies detect at least as many clinically significant prostate cancers as traditional approaches using systematic transrectal ultrasound-guided biopsies. Consequently, the integration of MRI into prostate cancer guidelines has led to a substantial increase in routine use [5]. Multiple other applications of prostate MRI have emerged including local staging, population screening and the active surveillance of patients with indolent disease.

High-quality MR images are a prerequisite for any MRI-driven prostate cancer diagnostic pathway, directly influencing the accuracy of cancer detection and subsequent management decisions [6,7,8]. Despite the implementation of the Prostate Imaging Reporting and Data System (PI-RADS) version 2.1 technical standards, wide variability in image quality persists in community practice [9,10,11]. This variation is not only caused by poor adherence to the PI-RADS technical requirements, but is also influenced by factors such as patient motion, metalwork, patient preparation, and scanner hardware including receiver coils, scanner performance, software level and field strength [12,13,14].

Consequently, there is a growing recognition among urological and radiological societies of the need for an easy-to-use tool to objectively assess the quality of prostate MRI images [15, 16]. The first attempt to standardise the assessment of image quality was the prostate imaging quality (PI-QUAL) scoring system [17] developed by researchers of the PRECISION trial [18]. The first version of PI-QUAL (PI-QUAL v1) categorises image quality on a 5-point scale by evaluating each MRI sequence against a defined set of technical criteria along with subjective assessments of image quality for each multiparametric sequence [T2-weighted imaging (T2-WI), diffusion-weighted imaging (DWI), and dynamic contrast-enhanced (DCE) MRI].

PI-QUAL v1 has been evaluated in different patient cohorts and it has been demonstrated that image quality has a direct bearing on cancer detection and biopsy planning [14]. However, PI-QUAL v1 has limitations [19, 20]; the most important being that it is suitable only for mpMRI examinations, thus excluding examinations without intravenous contrast medium. Furthermore, it does not enable an objective evaluation of image quality distinct from biopsy implications, restricting its use in other clinical scenarios (tumour staging, active surveillance, population screening and follow-up of patients with prior negative/positive scans).

This document sets out the updated version of PI-QUAL (v2). It has been developed by an extended European Society of Urogenital Radiology (ESUR) prostate cancer working group to overcome the limitations mentioned above by (i) accommodating MRI without intravenous contrast medium; (ii) simplifying the scoring process and (iii) ensuring that the scoring system provides a reproducible assessment of the image quality, applicable to a wider range of patients in the clinical routine.

By introducing PI-QUAL v2, we aim to further enhance the standardisation and reliability of prostate MRI quality assessment, thereby optimising the diagnostic accuracy and subsequent management of patients with known or suspected prostate cancer.

Prostate MR image quality

High image quality is a prerequisite for any MRI-based diagnostic pathway. Suboptimal image quality can result in under or overcalling lesions, inadequate characterisation and staging, or the inability to confidently call negative scans [14, 21]. Poor image quality has been associated with increased rates of indeterminate MRI findings (i.e. PI-RADS 3 lesions) [22]. Thus, image quality can influence subsequent steps in the diagnostic pathway, such as MR-targeted biopsies, risk stratification, and treatment decisions. Prostate MR image quality depends on several aspects, including adherence to the technical standards defined within the PI-RADS guidelines, sufficiently high signal-to-noise ratio (SNR), image contrast, and image sharpness to distinguish and delineate relevant structures in and around the prostate gland.

Technical recommendations

The technical recommendations outlined in PI-RADS v2.1 [9] encompass critical aspects, including the field strength and basic image acquisition parameters, such as which and how many b values should be used for DWI, and the temporal resolution of DCE sequences. Furthermore, the use of endorectal coils is discouraged and phased-array surface coils are preferred for signal reception. These recommendations are specified to harmonise prostate MRI practice across institutions and enhance the reproducibility of MRI for detecting and characterising prostate lesions. An overview of the technical recommendations of PI-RADS v2.1 is presented in Table 1.

Table 1 PI-RADS v2.1 technical recommendations

Prostate MRI artefacts

Despite the PI-RADS technical recommendations, prostate MR quality remains variable and can be influenced by various factors [14]. Prostate MRI is susceptible to a variety of artefacts including:

  1. 1.

    Motion artefacts, which are among the most prevalent. Given the location of the prostate in the pelvis, involuntary motion, such as respiratory, peristalsis, bladder filling and pelvic floor movements can lead to image blurring.

  2. 2.

    Susceptibility artefacts on DWI are another common concern and are primarily attributed to the presence of diverse tissue types around the prostate with differing magnetic susceptibilities. Challenges arise from the presence of rectal gas or the presence of metals, such as hip prostheses. These artefacts manifest as signal distortions and loss, particularly at the recto–prostatic interface, where a gas-filled rectum can cause inhomogeneities of the magnetic field and geometric distortions of the prostate gland.

  3. 3.

    Chemical shift artefacts can also be observed because of the distinct resonant frequencies of fat and water protons. This phenomenon can lead to the appearance of dark or bright bands at the fat–water interface, potentially obscuring the boundaries of the prostate gland.

  4. 4.

    Aliasing or wrap-around artefacts occur when the field of view is too small for the imaged anatomy. Wrap-around occurs when structures appear at unexpected locations within the image. Adjusting the field of view or using larger matrices can help to reduce these artefacts.

  5. 5.

    Finally, poor SNR is problematic when surface coils are used for signal reception in patients with high body mass index, specifically with large hip circumferences.

Key factors in the optimisation of image quality

Effectively managing common artefacts is crucial for optimising image quality. The key factors for image quality optimisation can be categorised into three different perspectives.

Patient preparation

Several patient-centred measures can enhance image quality, but there is no consensus on the optimal strategy for patient preparation. To reduce involuntary motion artefacts, anti-peristaltic agents (e.g. anticholinergic agents, glucagon) can be used, aiming to reduce small bowel and rectal movements. These agents typically have an immediate antiperistalsis effect lasting approximately 20–30 min. The downsides of anti-peristaltic agents include adverse effects, rebound, and additional costs. Moreover, not all patients can receive these medications due to contraindications (e.g. glaucoma in the case of anticholinergic agents) or regulatory limitations. While these agents have been shown to improve the quality [23,24,25], they have not demonstrated benefits in cancer detection [24] or staging [26]. Other techniques include the use of micro-enemas, dietary restrictions, and thin rectal catheters to relieve rectal air build-up [27]. Due to conflicting results and low levels of scientific evidence on the effectiveness of these measures, along with the lack of diagnostic impacts on prostate cancer diagnosis, centres are urged to investigate and report the local success of different patient preparation methods to help define best practices.

Hardware

The topics that have been most widely studied are the magnetic field strength (1.5 T vs 3 T) and the use of an endorectal coil. In theory, there is a benefit of using a higher magnetic field strength because of the higher SNR, but the disadvantage to this is a higher risk of susceptibility artefacts. Limited evidence from small studies shows higher image quality from 3 T scanners compared with 1.5 T. However, there is a consensus that adequate image quality is still possible at at 1.5 T with optimised protocols [14]. The PI-RADS v2.1 guidelines recommend the use of 3 T, except in cases of metallic implants or devices [9]. Endorectal coils can be used to increase the SNR mainly for 1.5 T scanners and may also be beneficial for patients with large body mass index [28]. A meta-analysis revealed that the use of an endorectal coil did not show benefits for detecting extra-prostatic extension, and only demonstrated marginal improvement in sensitivity for seminal vesicle invasion [29]. The disadvantages of patient discomfort, additional time, and costs must be balanced with the potential benefits of using an endorectal coil.

Image acquisition team

It is crucial to recognise that good image quality can only be achieved through teamwork. Centres with state-of-the-art MR scanners, specialist radiographers or radiology technologists, and genitourinary radiologists should, when working together, produce consistently high-quality MR images. The ESUR and the European Association of Urology Section of Urologic Imaging (ESUI) have published recommendations regarding prerequisites for reporting, experience levels, supervision of prostate MRI reporting radiologists, and training of radiographers/technologists [15]. Radiologists and radiographers or radiology technologists are advised to participate in training programs that include prostate MRI image quality as part of the quality assurance processes of diagnostic centres. Enhancing awareness of optimal image quality among radiologists and radiographers necessitates the implementation of standardised and objective metrics, enabling them to impartially assess scans with a critical perspective. Another recommendation is that image quality should be recorded in routine clinical reporting for clinical audits and quality control. These recommendations can serve as a starting point for quality-assuring a prostate imaging pathway.

PI-QUAL version 2

Methodology

PI-QUAL v2 is a result of a collaborative international effort that engaged 20 experienced genitourinary radiologists specialising in prostate MRI, along with a group of six urologists actively incorporating prostate MRI into their routine clinical practice. This initiative brought together experts from Europe and North America who were working members of the Prostate Subgroup of the ESUR and ESUI and selected invited members from the Society of Abdominal Radiology (SAR) prostate cancer disease focused panel. Note that this work product is not endorsed by the SAR with members contributing in their individual capacities.

The working group was chaired by two radiologists (M.d.R. and F.G.) who facilitated the discussions through virtual and in-person meetings, as well as email correspondence between June 2021 and February 2024. Initial drafts were refined stepwise through testing on multiple cases to improve objectivity and increase reliability.

To gauge the initial inter-reader agreement of the scoring system, six out of the 20 radiologists who were not part of the initial development of PI-QUAL v2, independently evaluated 50 studies with varying image quality. These studies comprised 25 MRI scans without contrast medium and 25 mpMRI scans obtained from different vendors and magnets, selected randomly by the two chairs. The inter-reader agreement, calculated using the percentage of agreement with linear weighting, was 61%.

All authors involved in this study played an active role in defining, refining and editing PI-QUAL v2.

PI-QUAL v2 scoring system

All centres conducting prostate MRIs should be aware of and largely comply with the technical recommendations outlined in the PI-RADS v2.1 guidelines [9] (Table 1). However, certain PI-RADS technical parameters carry greater importance for obtaining images with optimal diagnostic capabilities. Adjustments in certain parameters such as field of view and in-plane resolution can help improve SNR [30]. Therefore, it was unanimously agreed that the new version of PI-QUAL should include only critical technical prerequisites for each sequence before proceeding with image quality assessments (Table 2).

Table 2 Essential technical prerequisites per sequence

The scoring sheet for PI-QUAL v2 comprises 10 criteria that include the ability to clearly delineate the relevant structures in the prostate (e.g. the capsule, seminal vesicles, ejaculatory ducts, neurovascular bundles, and external urethral sphincter) and the assessment of the most prevalent artefacts and image degradations that severely affect the prostate for each sequence individually (Fig. 1).

Fig. 1
figure 1

PI-QUAL v2 scoring sheet that includes the basic mandatory PI-RADS v2.1 technical prerequisites for T2-WI, DWI and DCE, and the table to derive the PI-QUAL v2 score

There are:

  • Four criteria for T2-WI (maximum score: 4/4) (Fig. 2)

    Fig. 2
    figure 2

    The four T2-WI criteria: examples with degraded axial (A, C, and E), coronal (G) and sagittal (H) T2-WI. Axial T2-WI of optimal image quality (B, D, and F). Axial T2-weighted images are from the same patients (before and after correction)

  • Four criteria for DWI (maximum score: 4/4) (Fig. 3)

    Fig. 3
    figure 3

    The four DWI criteria: examples with degraded (A, C, E, F, H and I) and optimal (B, D, and G—this latter is axial T2-WI) image quality. Significant susceptibility artefacts can cause displacement/distortion, which is regarded as severe when the displacement is > 5 mm versus axial T2-WI (G) at the posterior surface of the prostate (HI)

  • Two criteria for DCE sequences (dichotomised score: ‘+/−’) (Fig. 4).

    Fig. 4
    figure 4

    The two DCE criteria: examples with degraded (A) and optimal (B) image quality

In keeping with the PI-RADS v2.1 recommendations, T2-WI and DWI are the dominant sequences and can score up to four quality points each, while there are only two separate criteria for DCE sequences (and only when both criteria are met, are DCE sequences considered of optimal diagnostic quality). This aligns with the lesser role of DCE in the PI-RADS v2.1 scoring system, where it is primarily used to upgrade PI-RADS 3 to PI-RADS 4 lesions in the peripheral zone, with no role in the transition zone for category assignment of detected lesions.

The new PI-QUAL score is shown in Fig. 1 and the rules used to derive the PI-QUAL score for MRI without intravenous contrast medium and for mpMRI are shown in Table 3.

Table 3 Flowchart to assess the PI-QUAL score using MRI without or with intravenous contrast medium

Summary quality categories for MRI without intravenous contrast medium:

  • PI-QUAL score of 1: the image quality is inadequate (i.e. T2-WI and/or DWI score ≤ 2/4). The examination does not meet the critical technical/image quality requirements.

  • PI-QUAL score of 2: the image quality is acceptable (i.e. T2-WI and DWI score at least 3/4).

  • PI-QUAL score of 3: the image quality is optimal (i.e. T2-WI and DWI both score 4/4).

Summary Quality Categories for mpMRI:

  • PI-QUAL score of 1: the image quality is inadequate (i.e. T2-WI and/or DWI score ≤ 2/4 and either only one criterion or no criteria for DCE sequences are satisfied). The examination does not meet the critical technical/image quality requirements. However, if both criteria for DCE sequences are satisfied and at least one sequence (either T2-WI or DWI) scores 4/4, the PI-QUAL score is upgraded to PI-QUAL 2.

  • PI-QUAL score of 2: the image quality is acceptable (i.e. T2-WI and DWI score at least 3/4). This score cannot be upgraded or downgraded by DCE sequences.

  • PI-QUAL score of 3: the image quality is optimal (i.e. T2-WI and DWI both score 4/4 and both criteria for DCE sequences are satisfied). However, if only one criterion or no criteria for DCE sequences are met, the PI-QUAL score is downgraded to PI-QUAL 2.

It is important to focus on the differences between the two scanning protocols (with or without intravenous contrast medium).

Upgrading or downgrading of the PI-QUAL score can occur in three scenarios when using mpMRI:

  • Scenario 1: full scores (i.e. 4/4) both for T2-WI and DWI, but DCE of suboptimal quality.

The PI-QUAL score would be 3 for MRI without intravenous contrast medium but PI-QUAL 2 on mpMRI. Note that a PI-QUAL score of 2 is also given when either T2-WI or DWI is not optimal (i.e. not 4/4) for MRI without intravenous contrast medium.

  • Scenario 2: full scores (i.e. 4/4) for T2-WI, but not for DWI (i.e. 1 or 2 out of 4), but DCE of optimal diagnostic quality.

The PI-QUAL score would be 1 for MRI without intravenous contrast medium but PI-QUAL 2 on mpMRI. This would occur in the presence of pelvic metalwork. In this scenario, optimal DCE sequences represent a ‘safety net’.

  • Scenario 3: full scores (i.e. 4/4) for DWI, but not for T2-WI (i.e. 1 or 2 out of 4), but DCE of optimal diagnostic quality.

The PI-QUAL score would be 1 for MRI without intravenous contrast medium but PI-QUAL 2 on mpMRI. Again, optimal DCE sequences represent a ‘safety net’.

Further remarks:

  1. i.

    The panel agreed that the all-prostate gland relevant acquired images be evaluated together for their net contribution to PI-QUAL image quality. Thus, mpMRI quality must be evaluated with DCE images for an overall quality assessment. Selective reassignments of image quality based on the quality of DCE images is strongly discouraged.

  2. ii.

    There is no overall sum score. For example, a score of 5/8 for MRI without intravenous contrast medium is not advised. This implies that if either T2-WI or DWI scores ≤ 2/4, the PI-QUAL score for an MRI without intravenous contrast medium is automatically 1 (i.e. scan of inadequate diagnostic quality) irrespective of the possibility that the other sequence (either T2-WI or DWI) may still achieve a higher score.

  3. iii.

    It is recommended to include the separate score of each sequence along with the final PI-QUAL v2 score, to gain insight into which sequence needs to be improved. Describing why a particular sequence is limited is advisable, for instance: Image quality assessment: T2-WI: 3/4; DWI: 2/4 and both DCE criteria met (i.e. ‘+’), resulting in a final PI-QUAL score of 1. This indicates inadequate image quality attributed to an insufficient T2-WI and DWI sequence, resulting for instance from minor motion artefacts on T2-WI, the lack of discrimination of the peripheral zone from the transition zone on DWI and significant susceptibility artefacts on the apparent diffusion coefficient (ADC) image due to rectal air.

  4. iv.

    Ideally, poor-quality sequences should be repeated while the patient is undergoing the examination, and the pre-contrast T1 sequences should be checked for the absence of artefacts before intravenous injections.

  5. v.

    The differences between PI-QUAL v1 and PI-QUAL v2 are shown in Table 4.

    Table 4 Major differences between PI-QUAL v1 and PI-QUAL v2

Clinical recommendations

PI-QUAL v2 describes the quality of the MR images and should be used to guide clinical decisions about whether it is necessary to repeat an examination. That is, the PI-QUAL score should inform, but not determine, clinical decision-making. For example, in some cases it is still possible to identify a large lesion in an examination of inadequate diagnostic quality (PI-QUAL score 1), allowing a targeted biopsy to be performed without delay, but in such cases, inaccurate staging can still occur thereby affecting the treatment pathway.

It is important to stress that when the diagnostic quality of a scan is inadequate, the PI-RADS or Likert cancer likelihood scores should not be given. Specifically, it is suggested that an inadequate quality scan should NOT be allocated a PI-RADS/LIKERT score of 3. In case of a scan with inadequate quality, the imaging team should investigate and aim to remedy the cause(s). If inadequate diagnostic quality stems from patient-related factors (e.g. movement), while the scanner parameters are satisfactory, measures should be implemented to alleviate this issue. If the inadequate diagnostic quality stems from machine-related factors, the patient should be scanned using another MR system with better performance.

Optimal diagnostic quality (PI-QUAL score 3) is of particular importance when assessing patients on active surveillance or after treatment, where it is crucial to rule in and rule out the presence of clinically significant disease to assess the degree of radiological change over time or the presence of residual/recurrent disease with high confidence.

It should be also kept in mind that a PI-QUAL score of 2 does not mean that rescanning is always needed. Only if the scan remains doubtful or deemed of insufficient quality to make a diagnosis, rescanning is highly advised.

Future directions

PI-QUAL v2 is a tool for assessing imaging quality, and testing its effectiveness in both research and clinical practice in diverse clinical settings is strongly encouraged. One of the key aspects of the successful adoption of PI-QUAL v2 is teaching its application to those involved in prostate MRI acquisition and reporting. Training on image evaluations by PI-QUAL can be done effectively by dedicated teaching including hands-on sessions [31, 32]. We advocate that dedicated hands-on courses on image quality should become the cornerstone for the successful delivery of the MRI diagnostic pathway for radiologists and radiographers, as well as for radiology trainees and urologists [26].

Image quality assessment should also be a crucial aspect of prostate MRI reports and prostate cancer clinical research. Future and ongoing studies should always report on image quality as part of the scientific reports. Diagnostic centre accreditations should also incorporate PI-QUAL assessments as part of quality assurance and control processes.

Although PIQUAL v2 is designed to provide an objective way to assess image quality, users’ experience and the inherently subjective nature of image quality may lead to some inter- and intra-reader variability. Understanding the extent of this variability will be crucial. For an effective scoring system, intra-reader variability should be less than inter-reader variability and further studies evaluating this aspect are encouraged.

Automated methods based on deep learning have the potential to provide a more reproducible and standardised assessment of image quality. The collection of cases where image quality is routinely annotated would be helpful for model training. Preliminary studies of automated systems utilising convolutional neural networks have undergone testing and demonstrated their capability to accurately identify low-quality prostate MR images [33, 34]. Artificial intelligence assessments of image quality during the scanning process with timely corrective measures could enhance the workflow and overall quality of the MRI diagnostic pathway.

Conclusions

PI-QUAL v2 for assessing image quality of prostate MRI rectifies the limitations of PI-QUAL v1. The updated scoring system has been simplified by focusing on assessing compliance with critical technical and image quality parameters. Additionally, it applies both to mpMRI and MRI without intravenous contrast medium. This new version, through education, has the potential for broad adoption. As a living document, refinements based on future research and experience in clinical practice are welcome.

Our aspiration is that PI-QUAL v2 will be a key tool in the global effort to improve prostate MR image quality and thus the clinical utility of prostate MRI.