Introduction

Nasopharyngeal carcinoma (NPC) is one of the most common malignancies in Southeast Asia and China1. Radiotherapy has become the preferred treatment due to its high efficacy in the management of this disease. With recent development in technology, VMAT, as an advanced form of radiation delivery, has been widely used in the treatment of NPC2. Compared to intensity-modulated radiation therapy (IMRT), VMAT generally improves the target coverage and OAR sparing for head and neck tumors3,4,5. However, due to the highly complex, irregular tumor shapes and numerous radiation sensitive surrounding OARs, the radiotherapy plan design and optimization for NPC is technically challenging and resource demanding to meet strict dosimetric criteria. In order to obtain a high-quality individualized treatment plan, radiotherapy physicists spend significantly more time and effort iteratively modifying optimization functions and evaluating the results when compared to planning for other sites. Furthermore, the quality of treatment plans usually depend largely on the planners’ skill and experience, which could vary considerably among physicists and treatment centers6,7.

As a promising solution, automatic planning has been proposed to reduce planning time, improve quality and consistency with minimal manual intervetion8. Commercial softwares have been developed and introduced to clinics, including RapidPlan (Varian Medical Systems, Palo Alto, CA)9,10, Multi-Criteria Optimization (RaySearch Laboratories, Stockholm, Sweden)11,12 and Auto-Planning (Philips Medical Systems, Best, The Netherlands)13,14,15. The Auto-Planning module integrated in Pinnacle3 treatment planning system (TPS) could automatically place treatment isocenter, elect beam angles, contour auxiliary structures, setup optimization goals, dose constraints, and tweak weighting factors among the goals and constraints14. It has been demonstrated that automatic planning was feasible for many areas of treatment sites, including head and neck13,15,16, lung17, breast18, esophagus7,19, pelvic20 and so on. In most studies, automatic planning could achieve similar or better results compared with manual plans. However, there have been few reports on the usage of automatic planning for NPC, especially for locally advanced NPC21.

Due to the irregular shape of the target volumes and numerous nearby OARs, the design of treatment plans for locally advanced NPC was really time-consuming and challenging. In particular, the parotid glands and brainstem are often close to or partially overlapping with the targets. Common side effects like dry mouth22 and fatigue23 caused by radiation to the sensitive organs could seriously affect patients’ quality of life after radiotherapy. It is difficult to optimize a VMAT plan that could provide adequate target coverage while spare OARs as much as possible, even for a skillful and experienced physicist. Therefore, automatic planning for locally advanced NPC could potentially bring significant improvement in plan quality, consistency and clinical workflow efficiency. In this paper, the Auto-Planning module in Pinnacle3 was used to generate VMAT plans for 22 patients with locally advanced NPC. The feasibility and efficacy of Auto-Planning were evaluated by comparing dosimetry against the corresponding manual VMAT plans generated from a skilled planner. Furthermore, the difference of plan quality as a factor of the overall stages was separately analyzed for a more comprehensive evaluation.

Material and methods

Patient characteristics

Between October 2020 and February 2021, 22 locally advanced NPC patients who received treatment in Fujian tumor hospital were retrospectively studied. There were 17 males and 5 females aged 30 years to 76 years (median age: 48 years). The overall stage distribution was stage III: 50% (11) and IVA/B: 50% (10 IVA and 1 IVB), according to the Chinese 2008 staging system for NPC. The specific staging information was listed in Table 1. The study has been approved by the ethics committee of Fujian Cancer Hospital (ethics number: SQ2016-048-01) and all patients provided written informed consent prior to enrollment in the study. All methods were performed in accordance with the Declaration of Helsinki as well as relevant guidelines and regulations.

Table 1 The characteristics of patients with nasopharyngeal carcinoma (n = 22).

Target volume delineation and dose prescription

All patients were immobilized using a thermoplastic mask in the supine position. Planning CT with a slice thickness of 3-mm (Brilliance CT Big Bore, Philips Medical Systems Inc., Cleveland, OH, USA) and pretreatment enhanced magnetic resonance imaging (Philips Achieva 3.0 T) were acquired. The target volumes were contoured by experienced physicians in accordance with institutional protocols. The primary nasopharyngeal tumor (GTV-T) and definitive left and right lymph nodes (GTV-NL and GTV-NR) were determined from imaging studies, endoscopic examinations and clinical exams. A high risk region (CTV1) was defined as GTV-T with a margin of 5–10 mm, including the nasopharyngeal mucosa, while a low risk region (CTV2) was defined as potentially involved regions. The bilateral low-risk nodal regions (CTV-NL and CTV-NR) included disease at levels II–V. The seven planning target volumes (PTVs) were obtained by 3 mm uniform expansion from corresponding target volume, including GTV-T-P, CTV1-P, CTV2-P, GTV-NL-P, GTV-NR-P, CTV-NL-P and CTV-NR-P. The OARs, including lens, eyes, optic nerves, optic chiasm, brainstem, spinal cord, parotid glands, temporal lobe, mandible, temporomandibular joint, oral cavity and thyroid were also delineated and verified by the same oncologist.

Treatment planning and dose prescription

The prescribed dose was 69.96 Gy to GTV-T-P/GTV-NL-P/GTV-NR-P, 60.06 Gy to CTV1-P, 56.1 Gy to CTV2-P and 52.8 Gy to CTV-NL-P/CTV-NR-P. Manual VMAT plans (MPs) were generated in the Pinnacle3 (version 16.2, Philips Radiation Oncology Systems, Madison, WI). The Auto-Planning module was used to create automatic VMAT plans (APs). Both plans were designed by the same physicist with Chinese Linear Accelerator Physicist Certificate and 7 + years experience. All plans were created for an Elekta Synergy accelerator using a pair of 6MV coplanar full arcs (178–182) with opposite 10 degree collimator rotation from their neutral position. The gantry spacing was set to 4° in each arc. Treatment goals for MPs included 100% of prescription dose to cover 95% of the target volumes and the OAR dose limitation listed in Table 2. Meanwhile, the AP template (Table 3) was used for all APs and the template parameters could be adjusted based on the patients’ anatomy. All “Ring” structures are also automatically generated. Up to three slight manual interventions were allowed in AP when deemed necessary by the planner.

Table 2 The criteria of OAR for manual planning.
Table 3 OARs optimization goals in treatment planning.

Plan evaluation and statistical analysis

For quantitative comparisons, several dosimetric parameters were collected. Planning target volumes (PTVs) dose corresponding to 2% of volume (D2), 95% of volume (D95) and 98% of volume (D98), conformity index (CI = (Vprescription in PTV/VPTV)*(Vprescription in PTV/Vprescription)) and homogeneity index ((HI = (D2 − D98)/Dprescription)) were all evaluated. For parallel OARs such as parotid glands, mean dose (Dmean) or Vx (the percentage volume receiving × Gy dose) were analyzed. For serial OARs such as spinal cord, Dmax or D2cc (max dose or dose corresponding to 2 cc volume) were calculated. Meanwhile, the monitor unit (MU) per fraction and planning time were also recorded for comparison.

The Wilcoxon’s signed rank test was carried out between APs and MPs for dosimetric parameters previously described. Statistical package for the Social Sciences (SPSS 21.0; SPSS Inc., Chicago, IL, USA) was used to perform these tests and p < 0.05 was considered statistically significant.

Result

Targets dose comparison

Most of the plans including APs and MPs met the prescribed requirement of targets, as shown in Table 4. In general, the passing rate of dose criteria and dose distribution in the targets were similar in the two groups of plans. For GTV-NL-P, GTV-NR-P, CTV1-P and CTV2-P, there are no statistical differences between APs and MPs. However, compared to the MPs, the D2 and HI of GTV-T-P was slightly higher in the APs by 0.7% and 2.8% (p < 0.05), indicating the existence of hotter dose volumes in the APs in the target volumes, e.g., D95 for CTV-NL-P and CTV-NR-P were higher in the APs.

Table 4 Dosimetric comparison of PTVs in manual and automatic VMAT (mean ± SD).

OARs dose comparison

The dosimetric parameters for all OARs were summarized in the Table 5. The passing rate of dose criteria for all OARs was similar or increased in the APs compared with MPs, except for V30 of right parotid gland. Meanwhile, most dose parameters for APs were lower than MPs. The Dmax of left and right optic nerves, chiasm and brainstem were decreased by 1.9 Gy, 2.4 Gy, 1,2 Gy and 1.0 Gy in the APs, respectively (p < 0.05). The V50 of brainstem, D2cc of mandible and Dmean of oral cavity were also statistically lower in the APs, by 1.32%, 1.0 Gy and 1.5 Gy. However, the max dose of spinal cord was increased by 1.0 Gy in the APs (p < 0.05), although such increase at dose levels around 40 Gy was clinically insignificant. In addition, the volume of the low-dose (< 30 Gy) regions was significantly decreased from 2497.8 cc in MP to 2395.6 cc (p < 0.05), indicating an improvement in dose conformity and overall better sparing in normal tissues.

Table 5 Dosimetric comparison of OARs in manual and automatic VMAT (mean ± SD).

MU and planning time comparison

The average MU was 643.59 ± 45.42 in APs and 672.12 ± 51.82 in MPs respectively. The APs reduced the average MU by 4.2% (p < 0.05).

The overall treatment planning time of APs, including the manual intervention time and computer calculation time, were statistically reduced by 26.3% in AP plans, from 144 to 106 mins (p < 0.05).

Stratified analysis by the overall stage

The locally advanced NPC patients were divided into two groups, according to the overall stage. The differences in selected dosimetric characteristics between APs and MPs for the targets and OARs were calculated separately for the two groups, as shown in Table 6. The results suggested that the dose difference for the targets was independent of overall stage as the values were similar and without statistical difference (not listed in Table 6).

Table 6 Differences in dosimetric parameters between manual and automatic VMAT for the OARs stratified by overall stage (mean ± SD).

In both groups, APs might result in superior dose sparing for most OARs than MPs, except spinal cord. The improvement appeared most for optic chiasm and brainstem in Stage III, and parotid glands in Stage IV. Although most differences were statistically insignificant, V50 of brainstem reduced more evidently for stage III in APs (2.2% vs 0.4%, p < 0.05). V30 and Dmean of parotid glands reduced more evidently for stage IV. Nevertheless, the maximum dose to the spinal cord was lower in MPs for both stages and the difference was greater in stage III (p < 0.05). However, the differences are unlikely to be clinically significant at 40 Gy levels.

Discussion

A good amount of clinically used IMRT plans could be further optimized and improved, especially for those designed with limited time constraints, inadequate computational resources or less experienced planners24. Recently, automatic IMRT planning was fastly developed to potentially improve the plan quality and clinical efficiency. For example, the Auto-Planning module in the Pinnacle TPS is able to adjust optimization parameters and generate clinically acceptable plans automatically, based on an optimization algorithm with minimal planner intervention14. In this study, we demonstrated the feasibility and efficiency of Auto-Planning module in the VMAT planning for locally advanced NPC.

For both APs and MPs, the dose criteria of targets and OARs could not be fully met because some of the OARs were close to or overlapping with the targets. In general, the target dose coverage of APs was similar to that of MPs. It was notable that while the dose uniformity for GTV-T-P was superior in the MPs, dose inhomogeneity in tumors could be of less clinical concern in the era of imaging guided radiotherapy (IGRT) and inter fractional adaptive planning. In addition to providing preferable brainstem sparing, AP passing rates for target dose goals were equal to or slightly higher than MPs.

For most of the OARs, the dosimetric parameters of APs were superior to that of MPs, while the passing rates were usually higher than or equal to MPs, as also concluded by Yang et al.25 and Wang et al.26. APs could automatically generate a number of auxiliary structures for the dose limiting, which was practically impossible difficult to accomplish manually. However, the average Dmax of spinal cord for MPs was 1.03 Gy lower than that of APs (p < 0.05). In this particular case, a better balance between the particular OAR dose constraints and targets might be reached by an experienced physicist repeatedly adjusting the related parameter settings14.

For locally advanced NPC, dose differences for most OARs were similar between AP and MP plans regardless of overall stage. However, the APs provided better brainstem sparing in some stage III patients. As shown in Fig. 1A and B, there might be sufficient anatomic distance between brainstem and tumor targets in this patients’ cohort. Zhang et al. have reported that automatic plan would be more effective in sparing the brainstem if the anatomic distance between targets and the pons was greater than 5 mm27. In addition, the parotid glands sparing in patients with stage IV seemed superior for the automatic plan. However, when focusing on particular patients with stage IV, the parotid glands could be overprotected in the APs at the cost of reduced dose coverage for GTV-NL-P and GTV-NR-P. The parotid glands were more or less overlapping with the target in Stage IV. A typical dose distribution for a patient in this cohort was shown in Fig. 1C and D. The parotid glands were clearly better protected in the AP, but there was a significant underdose in the overlapping region between targets and parotid glands. In this case, the automatic plan could still not meet the dose criteria for parotid glands, which was usually not acceptable by the clinician. Our finding suggested that the balance between parotid glands protection and target dose coverage was still a challenge even for the Auto-Planning, especially for stage IV patients. For example, if the exposure dose of parotid glands was close to the dose criteria (V30 ≤ 50%), the clinician in our institution would tend to selectively reduce the CTV margins in favor of the protection of parotid glands. Conversely, if the exposure dose of parotid glands was far exceeded the dose criteria, adequate target coverage would be the preferred choice. For some other cases, the plan quality of APs was comparable to MPs. As shown in Fig. 1E and F, dose distribution of the two plans was quite similar in this patient. In our study, when specific dose distribution and dosimetric parameters were considered, the plan quality of APs was superior or equal to MPs in most cases, and inferior to MPs in several patients with stage IV.

Figure 1
figure 1

The dose distributions for two representative NPC patients were displayed. (A): manual plan, (B): automatic plan for a patient with stage III (T3N2M0); (C): manual plan, (D): automatic plan for a patient with stage IV (T4N3M0); (E): manual plan, (F): automatic plan for a patient with stage IV (T3N1M0).

Overall, the design of a conventional VMAT radiotherapy for locally advanced NPC could benefit from the automatic planning. In most cases, automatic plans were expected to achieve a similar or better plan quality. However, in several stage IV patients, automatic planning might have over-protected certain OARs such as the parotid glands. In such cases, it would require manual intervention from an experienced physicist and further clarifications for clinical preferences from the clinician. In addition, automatic planning could improve the planning efficiency. AP was usually based on artificial intelligence (AI) through the application of predictive models and decision supporting systems (DSS) optimization. It was particularly suitable for repetitive iterative work28. The overall planning time was decreased by 26% in the automatic planning, consistent with previous studies21,29. In fact, the improvement in planning efficiency was much greater than a flat reduction in planning time since the physicist can work on other duties while the automatic planning is being carried out by treatment planning computers backstage. This feature shall be particularly beneficial to institutions with a large number of patients but limited planners. Meanwhile, the patients who needed frequent replanning due to rapid changes in anatomy during a course of therapy could benefit from AP as the turnaround time of replan is expected to be much shorter. In this study, a uniform template parameter setting was used to start the AP process. The lack of individuality in this initialization may place unnecessary challenges to the AP algorithm in finding an optimized dose distribution with respect to a patient’s individual anatomy. Recent advancements in AI-based automatic planning are developing rapidly and potentially they could take the individual anatomy into account30,31,32,33. Bai et al. has developed a neural network-based IMRT treatment planning technique for locally advanced NPC33. Then automatic IMRT plan could be generated based on the individual’s anatomy, with comparable dosimetric qualities to manual plan33. However, these automatic plannings were usually difficult to be integrated into a commercial TPS platform. It demands high quality data management and computer skills among physicists. Conversely, the Auto-Planning module in its current format was more clinically adapted and easier to implement in practice.

Conclusion

For locally advanced NPC, Auto-Planning module could generate VMAT plans with similar or superior plan quality compared to manual VMAT plans for most patients. However, manual approach could be preferred in certain stage IV patients, due to a better control of the balance between the OARs and targets by an experienced physicist. In general, automatic VMAT could greatly improve clinical efficiency and should be an option for the implementation of locally advanced NPC VMAT treatment planning after careful institutional validation.