Background

Knowledge-based treatment planning is a promising solution to reduce the planning time [14] and planner-dependence of plan quality [511]. As a commercial knowledge-based optimizer, Varian RapidPlan (Varian Medical Systems, Palo Alto, CA) has been validated on patients with head & neck, lung, esophageal, breast, hepatocellular and prostate cancer [1217] based on similar cases for model training and dosimetric testing. However, no attempts have been made to extend the domain of model applications to less-similar patients so far.

Considering the treatment techniques (VMAT/IMRT) and patient setup orientations (supine/prone) may vary even for the same disease, it would be very time-consuming if not impractical to train specific models for all clinical varieties. In addition, the existing similar plans of a special type may be insufficient for the configuration of a qualified model. Therefore, it is highly desirable to explore and validate other potential possibilities of using an existing model more efficiently. Considering the full-arc VMAT covers all possible field angles of IMRT, and the geometry-based expected dose (GED) algorithm of RapidPlan is independent from patient orientations, this study aims to investigate the feasibility and dosimetric performance of applying a DVH estimation model trained on supine VAMT plans to the knowledge-based optimization of IMRT plans of both supine and prone setup orientations.

Methods

Model configuration and validation

A DVH estimation model was configured with 81 simultaneous-integrated-boosting VMAT plans for pre-operative rectal cancer patients of supine setup. All training plans were manually created by senior dosimetrists based on Eclipse treatment planning system (V11.0 or before) following consistent dose prescriptions and planning protocols (50.6 Gy and 41.8 Gy to 95 % of PGTV and PTV respectively/ 22 fractions, 1–2 full arc, 5° collimator rotation, and 10 MV photon). As recommended by Varian’s manual [18], model validation was conducted on similar cases before publication. Specifically, 10 historical plans of the same type that were not used for the model training were duplicated, and the model was applied to re-optimize the plan copies (referred as RP-VMAT plans). The RP-VMAT plans were compared with the original plans to check if the model's robustness of its designed purpose was clinically acceptable.

Knowledge-based IMRT planning

To avoid the bias of comparing the knowledge-based plans with the manual plans that were made suboptimal intentionally, the duplications of all 30 testing IMRT plans (20 prone and 10 supine, due to very limited supine patients treated with IMRT historically at our centre) were retrospectively selected from the clinically approved and treated cases of identical prescriptions as for the model. Using sliding window technique, each original IMRT plan was manually developed with five fields of 10 MV photon beams. The knowledge-based re-planning maintained all other settings except the MLC sequences were re-optimized using the estimates and objectives generated by the supine VMAT model (referred as RP-IMRT plans). The Photon Optimizer for IMRT (PO_13535), DVH Estimation Algorithm (v. 13.5.35) and Anisotropic Analytical Algorithm (AAA_13535) were selected for the automatic RP-IMRT optimization. Relative to the conventional DVO and PRO algorithms for the manual optimization of IMRT or VMAT plans, the new PO algorithm for RapidPlan is applicable to both techniques, which uses one single matrix over the image to define the structures, DVH calculation and dose sampling spatially [18]. Based on GED, PO partitions OAR voxels into four sub-volumes and predict the most likely landing range for the DVH curves, which were generated as optimization objectives for the knowledge-based planning. To base the dosimetric comparison on similar target dose coverage, all RP and original plans were renormalized to satisfy the dose prescriptions for both PGTV and PTV. Visual inspection of sectional dose distribution was routinely performed to examine the target coverage and hot spots appearance.

Dosimetric assessment and statistical comparison

Using the following DVH metrics, the dosimetric features were compared between the original vs. RP-VMAT, original vs. RP-IMRT plans and prone vs. supine setup orientations respectively: 1. homogeneity index of PGTV (HI_PGTV) and PTV (HI_PTV), defined as (D 2 % − D 98 %)/D 50 %; 2. conformity index of PGTV (CI_PGTV) and PTV (CI_PTV), defined as V 100 %/V t arg et ; 3. the relative volume exceeding 107 % of the prescribed dose to PGTV (V107 %, i.e. V54.14Gy); 4. Global maximum dose (Dmax) and near maximum dose in PGTV (D2 %) [19]; 5. The dose to 50 % of the femoral head volume and urinary bladder volume (D50 %_FH and D50 %_UB); 6. The mean dose to the femoral head and urinary bladder (Dmean_FH and Dmean_UB). 7. The total monitor units (MUs) of each plan.

To assess the differences between the original plans and the knowledge based re-planning, paired samples t-test was conducted for normally distributed data (tested by Shapiro-Wilk method), otherwise Wilcoxon signed ranks test was performed using SPSS (version 21.0). To appraise the impact of supine and prone setup orientations on the dosimetric outcomes, independent sample t-test and Mann-Whitney U Test were carried out respectively for the data of normal and non-normal distributions. The equality of variances was examined by Levene's test. P < 0.05 was considered as statistically significant (2-tailed). Based on the tabular-formatted DVH data exported from Eclipse system, an in-house MATLAB code was programmed to calculate the mean DVHs of 30 patients that were either manually planned or knowledge-based re-planned. Plotting was performed using SigmaPlot software (Version 10.0, Systat, San Jose, CA).

Results

Model validation on similar cases

Table 1 displays the model validation results by optimizing similar supine VMAT cases. Relative to the manually optimized clinical plans, RapidPlan has brought negligible changes to HI_PGTV, HI_PTV, CI_PGTV, CI_PTV, Dmax and D2 %. Negligible V107 % values were observed in 3/10 original plans (magnitude ≤0.18 %), but only in 1/10 RP-VMAT population. On the other hand, RP-VMAT plans have significantly and largely relieved the normal organ exposure than the clinical plans.

Table 1 Validation of DVH estimation model by applying it to similar cases: dosimetric comparison between the 10 clinical supine VMAT plans and their knowledge-based re-optimizations

Original vs. RP-IMRT plans

Knowledge-based DVH estimations and objectives could be automatically generated for the RP-IMRT optimization using the VMAT model. As an explicit comparison between the original and RP-IMRT plans regardless of the setup orientations, Fig. 1 illustrates the average DVHs of the 30 patients stratified by the planning methods.

Fig. 1
figure 1

Mean DVHs of 30 IMRT patients planned by experience-based (original) or knowledge-based (RapidPlan) methods regardless of the setup orientations

Relative to the original plans (Table 2), the model-assisted re-optimization has significantly reduced the dose to the urinary bladder and femoral head. The marginal changes of CI_PGTV and D2 % were insignificant. Significant but negligible increases of HI were observed in RP-IMRT plans. However, knowledge-based IMRT planning using the VMAT model has significantly increased the dose conformity index to PTV and the global maximum dose. In addition, more cases (6/30 patients) with larger V107 % values were observed in the RP-IMRT group (ranging from 0.01 ~ 1.76 %, not shown in Table 2) than in the original plans (hardly noticeable).

Table 2 Dosimetric comparison between the clinical IMRT plans (of both supine and prone setups) and their knowledge-based re-optimization using the DVH estimation model trained on supine VMAT plans

Supine vs. prone setup orientations

Conflict was not encountered during the optimization of prone patients using the supine model. Table 3 compares the dosimetric outcomes between the supine and prone patients. Regarding the HI and CI of the targets, no or slight disparities were observed between the two setups for any planning methods. The absolute inter-orientation dose differences of Dmax, D2 %, D50 %_UB and Dmean_UB were no more than 0.41 Gy. Prone patients received significantly lower D50 %_FH and Dmean_FH in both original and RP-IMRT plans.

Table 3 Dosimetric statistics between the supine and prone IMRT patients

Discussion

To appraise the contribution of the new PO optimizer, the RapidPlan-generated objectives were applied to the 10 VMAT validation plans using the old PRO optimizer, and the results were more close to the RP-VMAT than the original plans. Therefore, without changing any settings other than redesigning the MLC sequences, the dosimetric changes of knowledge-based re-optimization were mostly if not solely attributable to the estimates and objectives generated by the RapidPlan model based on patient-specific evaluations of structure sets and field geometry. However, as a key component of RapidPlan solution package, the minor role of the possible stronger PO algorithm cannot be excluded, which may deserve more investigations in the future.

Consistent with the published successful implementations of RapidPlan models to prospective patients of the same type [1217],our supine VMAT model could generate clinically acceptable plans for similar validation patients as it was configured for. Comparable target dose homogeneity and conformity were achieved, but the dose to the critical organs was largely reduced than the clinical VMAT plans that were developed manually (Table 1). As a new attempt of extending the model application domain, significant improvement of critical organ sparing was also achievable by the model preconfigured on another technique and orientation (Table 2 and Fig. 1). Using the same model, RP-IMRT achieved greater improvement magnitudes of D50 %_UB and Dmean_UB than RP-VMAT plans, yet the decreasing amplitudes of D50 %_FH and Dmean_FH were slightly to the opposite. Therefore, the VMAT model did not necessarily work better in the similar technique than in the different, which can be very helpful in detecting and improving the sub-optimal manual IMRT plans. As a result of knowledge-based re-optimization of both IMRT and VMAT plans, the decreasing amplitudes of the dose to the urinary bladder were much larger than that to the femoral head. We ascribed this different magnitude to the greater geometric varieties of urinary bladder, which have made the experience-based subjective judgement of achievable goals more challenging in the conventional planning. Some unreliable decisions could be avoided by RapidPlan using the personalized quantitative evaluation algorithm, hence reducing unnecessary normal tissue complication risks associated with suboptimal planning [20]. Additional evidences of reducing inter-planner variety using knowledge-based planning were the smaller standard deviations and narrower 95 % confidence intervals of the dose metrics to the critical organs (Tables 1, 2 and 3), which is also consistent with pervious observations on other cancer types [511].

As shown in Fig. 1, the nearly overlapped lines of GTV, PGTV, CTV and PTV of the original and RP-IMRT plans indicate that the aforementioned comparisons of organ sparing are based on similar target dose coverage after renormalization. Slight but considerable sharper dose gradient in the transitional region from PGTV to PTV can be observed in the RP-IMRT plans (approximately between the dose range of 42 ~ 49 Gy), which can be ascribed to the fact that ‘PTV-(PGTV + 5 mm)’ was included as an ‘organ-at-risk’ (rather than a ‘target’) in the model configuration which generated upper constraints for this structure in the knowledge-based IMRT planning to shape a good dose fall-off. The volume of ‘PTV-(PGTV + 5 mm)’ was created by deducting PGTV and its 5 mm outer margin from PTV, which was also optimized during the manual planning.

As shown in Table 2, the significantly increased CI_PTV of RP-IMRT plans indicated deteriorated dose fall-off beyond the border of PTV, which might not be the ‘fault’ of the DVH-estimation model though. Indeed, the model itself does not generate knowledge-based predictions and estimations for the targets. Instead, these fixed objectives shall be manually assigned and can be incorporated into the optimizer as templates to facilitate an automated planning process. Although these combined parameters functioned well for the automated RP-VMAT planning, it was in line with our clinical experience that IMRT optimization usually adopts different constraints and priorities than the VMAT especially for the target dose coverage and hot volume control. Although this study focused on the feasibility and dosimetric evaluation of cross-applying the identical RP-VAMT optimizer, it is advisable to investigate the target-objective revision to better readapt the VMAT optimizer to the IMRT planning, and to test the capability of IMRT model in the knowledge-based VMAT planning in the future.

The increased Dmax and V107 % of RP-IMRT plans may be associated with the significantly more MUs than the original plans. As a comparison, the MU escalation of RP-VMAT plans was minor, where the hot spot did not increase considerably (it is also true that VMAT technique is less likely to produce hot spots than IMRT). Moreover, emerging hot spots were usually segmented timely and suppressed with high priority during the iteration of conventional optimization, and/or manually erased by editing the fluence map afterwards [21]. These steps were all missing during the semi-automated knowledge-based optimization. Therefore, manual examination and elimination of hot spot is highly recommended especially for other treatment regions involving serial organs at risk.

As shown in Table 3, the impact of setup orientations on the dosimetric outcomes of RP-IMRT plans was very tiny: the differences of HI_PGTV, HI_PTV, CI_PGTV, CI_PTV, Dmax, D2 %, D50 %_UB and Dmean_UB between the prone and supine patients were negligible. The magnitudes of inter-setup variances were comparable to that of the original plans. The comparable results of knowledge-based IMRT planning between patients of opposite setups suggested that orientation variety did not affect the performance of a preconfigured model.

It was also noticed that the femoral head of prone patients received consistently lower dose for both original and RP-IMRT plans, but the difference may not be attributable to the orientation-disparity between the modelling and planning candidates, because the supine model has largely reduced the femoral exposure for both supine and prone patients. An alternative explanation was that some sub-optimal field geometry in the original supine IMRT plans may have involved more femoral head volume into the fields, hence induced more exposure for both original and RP-IMRT plans using identical field organizations. Nevertheless, RapidPlan reduced the magnitude of inter-setup dose disparities of D50 %_FH and Dmean_FH, suggesting that even under suboptimal field arrangement (which is not optimizable by RapidPlan), knowledge-based optimization still behaved superiorly in terms of plan quality and consistency.

Conclusions

A supine VMAT model can automatically estimate optimization objectives for the knowledge-based IMRT planning of either supine or prone patients, yielding superior organ sparing and quality consistency than the conventional method. If incorporated as part of the optimizer, the manually added objectives and priorities for the targets should be adjusted in accordance with the selected treatment technique. Manual processing of the hot spots is highly recommended after the semi-automated knowledge-based IMRT planning.