Novel in-house knowledge-based automated planning system for lung cancer treated with intensity-modulated radiotherapy

Shao, Yan; Guo, Jindong; Wang, Jiyong; Huang, Ying; Gan, Wutian; Zhang, Xiaoying; Wu, Ge; Sun, Dong; Gu, Yu; Gu, Qingtao; Yue, Ning Jeff; Yang, Guanli; Xie, Guotong; Xu, Zhiyong

doi:10.1007/s00066-023-02126-1

Novel in-house knowledge-based automated planning system for lung cancer treated with intensity-modulated radiotherapy

Original Article
Open access
Published: 21 August 2023

(2023)
Cite this article

Download PDF

You have full access to this open access article

Strahlentherapie und Onkologie Aims and scope Submit manuscript

Novel in-house knowledge-based automated planning system for lung cancer treated with intensity-modulated radiotherapy

Download PDF

Yan Shao¹,
Jindong Guo¹,
Jiyong Wang²,
Ying Huang¹,
Wutian Gan³,
Xiaoying Zhang⁴,
Ge Wu⁵,
Dong Sun⁶,
Yu Gu⁷,
Qingtao Gu⁸,
Ning Jeff Yue⁹,
Guanli Yang¹⁰,
Guotong Xie^5,11,12 &
…
Zhiyong Xu¹

1257 Accesses
Explore all metrics

Abstract

Purpose

The goal of this study was to propose a knowledge-based planning system which could automatically design plans for lung cancer patients treated with intensity-modulated radiotherapy (IMRT).

Methods and materials

From May 2018 to June 2020, 612 IMRT treatment plans of lung cancer patients were retrospectively selected to construct a planning database. Knowledge-based planning (KBP) architecture named αDiar was proposed in this study. It consisted of two parts separated by a firewall. One was the in-hospital workstation, and the other was the search engine in the cloud. Based on our previous study, A‑Net in the in-hospital workstation was used to generate predicted virtual dose images. A search engine including a three-dimensional convolutional neural network (3D CNN) was constructed to derive the feature vectors of dose images. By comparing the similarity of the features between virtual dose images and the clinical dose images in the database, the most similar feature was found. The optimization parameters (OPs) of the treatment plan corresponding to the most similar feature were assigned to the new plan, and the design of a new treatment plan was automatically completed. After αDiar was developed, we performed two studies. The first retrospective study was conducted to validate whether this architecture was qualified for clinical practice and involved 96 patients. The second comparative study was performed to investigate whether αDiar could assist dosimetrists in improving the quality of planning for the patients. Two dosimetrists were involved and designed plans for only one trial with and without αDiar; 26 patients were involved in this study.

Results

The first study showed that about 54% (52/96) of the automatically generated plans would achieve the dosimetric constraints of the Radiation Therapy Oncology Group (RTOG) and about 93% (89/96) of the automatically generated plans would achieve the dosimetric constraints of the National Comprehensive Cancer Network (NCCN). The second study showed that the quality of treatment planning designed by junior dosimetrists was improved with the help of αDiar.

Conclusions

Our results showed that αDiar was an effective tool to improve planning quality. Over half of the patients’ plans could be designed automatically. For the remaining patients, although the automatically designed plans did not fully meet the clinical requirements, their quality was also better than that of manual plans.

A knowledge-based intensity-modulated radiation therapy treatment planning technique for locally advanced nasopharyngeal carcinoma radiotherapy

Article Open access 03 August 2020

Automatic IMRT treatment planning through fluence prediction and plan fine-tuning for nasopharyngeal carcinoma

Article Open access 20 March 2024

Feasibility Study of the Fluence-to-Dose Network (FDNet) for Patient-Specific IMRT Quality Assurance

Article 12 November 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Artificial intelligence (AI) has been applied in medical imaging-based diagnosis and prognosis and has shown significant advantages with regard to application [1,2,3,4,5]. Although recent work has demonstrated the effectiveness of AI in radiotherapy [6], e.g., AI segmentation of planning target volume (PTV) and organs at risk (OARs) [7,8,9,10] and the AI prediction of dose images [11, 12], its application is still limited.

In clinical practice, a radiotherapy treatment plan is generated by configuring prescription, optimization algorithm, dose calculation algorithm and grid resolution, settings and options of the optimizer and optimization parameters (OPs) including field geometry, number of fields, and optimization goals. Then, the planner iteratively modifies OPs until the plan meets the clinical constraints. This is a very time-consuming and laborious process. And to some extent, the selection of the OPs and the plan modification process are based on the planner’s experience [13]. Thus, the quality of radiotherapy treatment plans may vary between planners, and some patients may be treated with suboptimal plans [14, 15].

In order to minimize the variations of plan quality between planners and improve the plan quality, automatic treatment planning (ATP) methods [16,17,18] were developed. Some commercial ATPs are available, such as RapidPlan [19] and HyperArc [20] from Varian, auto-planning [21] from Philips, RayStation autoplanning modules, EZfluence [22] from Radformation, and Elements [23] from Brainlab. Some other noncommercial systems were also developed by researchers, such as iCycle [24, 25] which utilized an a priori approach with multicriteria optimization. However, all these commercial and noncommercial systems still require planners to select OPs as the input to generate a treatment plan. However, because the selection of OPs depends largely on the experience of planners, suboptimal plans may result. Further work is needed to mitigate the dependence on planners and to generate invariant treatment plans of high quality.

In order to solve the above problems, researchers proposed several solutions to improve the quality, uniformity and processing efficiency of planning, such as the use of a complicated objective function, the use of multiobjective optimization and the introduction of knowledge-based automated planning methods (KBAP) [26,27,28,29]. Zhang et al. [30] proposed a semi-automatic radiotherapy treatment planning process by combining the ideas of machine learning automated planning and multicriteria optimization (MCO). In their workflow, handcrafted features were introduced. KBAP is a two-step approach to realize the automatic design of radiotherapy treatment planning. Clinically acceptable doses were first predicted and then the predictions were converted into deliverable plans with an optimization algorithm [31]. Babier et al. [32] developed and evaluated a new inversed optimization model. In this model, the weight of the objective function estimated from the clinical dose–volume histogram (DVH) was used to generate the inversed plan. DVH is two-dimensional and cannot include spatial information. Therefore, automated planning based on three-dimensional (3D) dose distribution may be more advantageous.

In the present study, we introduce a novel method named αDiar. It can automatically design plans for lung cancer patients treated with intensity-modulated radiotherapy (IMRT) based on predicted dose images. First, the optimal 3D dose distribution of the new plan was predicted based on the A‑Net model [33] developed by our team. Then an image retrieval model was constructed with AI. The features of the predicted 3D dose distribution and the dose distribution of the clinical plan in the database, respectively, were extracted with this model. Finally, the similarities between the features of predicted and clinical dose distributions were compared. The feature in the database that was most similar to the feature of the predicted 3D dose distribution was found, and its corresponding OPs were assigned to the new plan. Then the automated design of the new plan was completed.

Two experiments were conducted to evaluate and investigate the clinical applicability of this architecture. First, αDiar was used to automatically generate treatment plans. The quality of these auto-generated plans was evaluated to determine their clinical feasibility. Second, a comparison between plans with and without the assistance of αDiar was performed. To verify the potential value of αDiar, dosimetric parameters group of clinical metrics were calculated and compared with the clinical acceptance criteria. Both of the experiments comprised a pilot step to employ the technology of image retrieval in the automatic design of radiotherapy treatment planning, and the proposed architecture may enable automated treatment planning that does not rely on the experience of planners.

Methods

Database

From May 2018 to June 2020, 612 IMRT treatment plans of lung cancer patients were retrospectively selected. All these clinical plans were designed by three experienced dosimetrists. Each of these original clinical plans consisted of four to seven coplanar 6 MV photon beams. An Edge linear accelerator (Varian, Palo Alto, CA) was selected for dose delivery. All plans were normalized; thus, 95% of the PTV received 100% of the prescription dose. Each treatment plan consisted of a computed tomography (CT) scan, PTV contour(s), OAR contours, prescription dose, beam arrangement, optimization goals, and clinically delivered dose distribution that was calculated in the Pinnacle 9.10 treatment planning system (TPS; Philips Healthcare, Fitchburg, WI, USA). All the contours of PTV and OARs were delineated by junior radiation oncologists and reviewed by experienced radiation oncologists. OARs included left lung, right lung, total lung, spinal cord, and heart in this study. Total lung was defined as the union of left lung and right lung excluding gross tumor volume (GTV).

Architecture

Figure 1 shows the architecture of αDiar, which is the automated knowledge-based treatment plan design system proposed in this study. The system contained two parts: the in-hospital workstation and the search engine in the cloud. In the in-hospital workstation, CT scan, PTV, and OARs contours of a new patient were transferred to the in-hospital workstation with only one click using a TPS script. The CT scan was registered with the phantom chest CT scan [34] via the registration toolbox—elastiX [35]. The transformation matrix gained from the registration was applied to the corresponding PTV and OARs masks afterward. The transformed PTV and OAR masks automatically predicted a series of virtual dose images through the dose prediction The AI model [33] was supervised by the ground truth of clinical dose images. With no other information, the predicted virtual dose images were subsequently transferred to the cloud for computation of a feature vector. To predict the feature vector, a 3D convolutional neural network (3D CNN) model was trained through the virtual and clinically delivered dose images. The derived feature vector of the virtual dose images was compared to the feature vectors derived from clinical dose images of treatment plans in the database. The Euclidean distances between the feature vectors of the virtual dose images and stored clinical dose images were calculated. A link between the feature vector of the virtual dose images and the most similar feature vectors (with the smallest Euclidean distance) in the database was established. The corresponding OPs of the most similar stored feature vectors were transferred back to the TPS through the in-hospital workstation. Finally, in the TPS, the auto-planning module was utilized to optimize the plan of the new patient with the downloaded OPs.

No human intervention was required in the design process of the radiotherapy plan. The optimal dose distribution prediction, OPs retrieval and plan optimization of the new plans were done entirely by the system and did not depend on the experience of planners. Thus, αDiar achieved a fully automatic design of treatment plans. The specific step-by-step flowchart is shown in Fig. 2.

Prediction of virtual dose images

In previous research, A‑Net was used to predict virtual dose images from the contours of PTV and OARs [31] and included an encoding part and a decoding part in A‑Net. In the encoding part, there were four stages with two squeeze-and-excitation (SE) [36] blocks in each stage. And there were two stages in the decoding part. A‑Net is an end-to-end network through which the virtual images of a case can be predicted in only one try. Its inputs were the masks of PTV and OARs and the size of the input images was 384 × 384 × 128. Its outputs were the virtual dose images and the size of the output dose images was 96 × 96 × 128. During the process of model training, the ground truth was the clinically delivered dose images. The performance of A‑Net was demonstrated in the article [31].

Building and training of image retrieval model

A small and simple 3D CNN was built in the training of the image retrieval model. The inputs were the dose images, and the output was the feature vector of the dose images. As shown in Fig. 3, there were five sets of convolutional layers. The size of the input images was 96 × 96 × 128. First, it went through a set of 3D convolution (kernel size: 3 × 3 × 3, the number of output channels: four times the number of input channels, stride: 1 × 1 × 1), ReLU, and a batch normalization (BN). Then it went through three blocks of 3D convolution and 3D max-pooling. Each 3D convolution (kernel size: 3 × 3 × 3, the number of output channels: doubling the input channels, stride: 1 × 1 × 1) was followed by a BN, and 3D max-pooling (kernel size: 2 × 2 × 2, stride: 2 × 2 × 2). Finally, a fully connected layer converted the feature map to a 32‑d feature vector, and the output was a feature vector with a size of 32 × 1.

In the training phase, the network consisted of three inputs: (i) anchors—the virtual dose images, denoted as A; (ii) positives—the clinically delivered dose images of the same patient to the virtual dose images, denoted as P; (iii) negatives—the clinically delivered dose images of another dissimilar patient, denoted as N. A, P and N composed a triplet [37]. A, P and N separately went through the image retrieval model and their corresponding feature vectors a, p and n were generated respectively. During the training phase, A and P were the virtual dose images and the clinically delivered dose images of the same patient. In case of the random selection of N may not train a robust AI model well, an online hard example mining procedure proposed in [38] was used in the selection of N. In a mini-batch, N could be selected from other triplets’ A and P.

The distance matrix in a mini-batch was defined by the following equation:

$$\left(\begin{array}{ccc} d\left(a_{1}{,}p_{1}\right) & \cdots & d\left(a_{1}{,}p_{k}\right)\\ \vdots & \ddots & \vdots \\ d\left(a_{k}{,}p_{1}\right) & \cdots & d\left(a_{k}{,}p_{k}\right) \end{array}\right)$$

(1)

where k was the batch size, a_i and p_i were the feature vectors of ith anchor and ith positive in the mini-batch, and d(a_i,p_j) was the distance between a_i and p_j. d(a_i,p_j) was defined as:

$$d\left(a_{i}{,}p_{j}\right)=\sqrt{2-2a_{i}{p}_{j}^{T}}$$

(2)

In order to find a hard negative sample, the minimum distance amount d(a_i,p_j) and d(a_q,p_i) ($j{,}q=1\ldots k$) were denoted as d(a_i,p_jmin) and d(a_qmin,p_i). The triplet loss function could be written as:

$$Loss=\frac{1}{k}{\sum }_{i=1{,}n}^{n}\max\Big(0{,}\textit{margin}+d\left(a_{i}{,}p_{i}\right)\\ -\min\left(d\left(a_{i}{,}p_{\mathrm{jmin}}\right){,}d\left(a_{\mathrm{qmin}}{,}p_{i}\right)\right)\Big)$$

(3)

where the margin was a constant, set as 1.

The complete proposed network was trained on a Tesla V100 GPU with 16 GB memory and was randomly initialized with Glorot normal distribution. The Adam optimizer with “poly” learning rate decay policy was employed to minimize the loss function. The “poly” learning rate decay policy can be formulated as follows:

$$lr_{\text{itertion}}=lr_{\mathrm{init}}*\left(1+\frac{1}{\textit{decay*iteretion}}\right)$$

(4)

where lr_itertion was the learning rate in this iteration, lr_init was the initial learning rate assigned to 0.001, and decay was initialized to 0.01. Given the limitation of GPU memory size, the batch size was one.

Inference of image retrieval model

In the inference phase, the trained image retrieval model was applied to infer the virtual dose images and the obtained feature vector was denoted as f_v. The clinical dose images of each patient in the database were fed to this image retrieval model procedure and the output feature vectors were denoted as f_c. When retrieving a plan, the Euclidean distance between f_v and every f_c in the database was calculated. The smaller the Euclidean distance was, the more similar the two features were. Finally, the OPs corresponding to the f_c that were most similar to f_v were used to generate planning for the new patient. The formula of Euclidean distance was as follows:

$$dist\left(X{,}Y\right)=\sqrt{{\sum }_{1}^{n}({x_{i}}-{y_{i)}}^{2}}$$

(5)

where X and Y represented the feature vector of the predicted virtual dose image and clinical dose images stored in the database, respectively.

Study of fully automated usage of αDiar

A study was conducted to validate whether the searched results of OPs could successfully and automatically initialize the auto-planning module. It may also validate whether the fully automated treatment plans could meet clinical criteria. From April 17 to July 16, 2020, 96 lung cancer patients treated in our department were selected and analyzed. The prescription dose of all the treatment plannings was 60 Gy.

αDiar was used on each of the 96 cases and three most similar plans (denoted as Search 1, 2 and 3) were retrieved. The OPs of the three plans were respectively and manually fed into the auto-planning module in Pinnacle TPS. Three treatment plans were eventually generated for each of the 96 cases. The best one was denoted as AP_AI. The quality of the plans was evaluated according to the criteria set forth by the Radiation Therapy Oncology Group (RTOG) [39], the National Comprehensive Cancer Network (NCCN) [40], as well as the clinical standard used in our department. For the purpose of comparison, the corresponding treatment plans which were clinically delivered to patients were also collected and denoted as AP_Clinical.

Comparison of treatment planning with or without αDiar

It was not likely that all AP_AI plans met the clinical criteria. Thus, a study was also conducted to investigate whether AP_AI could assist dosimetrists in clinical practice. From April 25, 2018 to March 5, 2020, 26 patients were involved in this comparison experiment. AP_AI plans of all patients did not meet the clinical criteria. Based on these AP_AI plans, two junior dosimetrists independently modified the searched OPs once and performed the plan optimization with the auto-planning module. These optimized treatment plans were denoted as $AP_{AI+\text{Human}}$. Separately, the same two dosimetrists designed the treatment plans from scratch using the same CT images, PTV and OARs. These plans were denoted as AP_Human.

This experiment practically consisted of two phases, and they were separated by a 4-week wash-out time. In the first phase, $AP_{AI+\text{Human}}$ plans were designed for 13 of 26 cases (group A), and AP_Human plans were designed for the remaining 13 patients (group B). In the second phase, AP_Human plans were designed for group A and $AP_{AI+\text{Human}}$ plans were designed for group B. During these two phases, cases were randomly ranked across group A and group B.

Metrics

NDCG

Normalized discounted cumulative gain (NDCG) was defined as

$$NDCG_{K}=\frac{DCG_{K}}{iDCG_{K}}$$

(6)

$$DCG_{K}={\sum }_{i=1}^{k}\frac{2^{r\left(l\right)}-1}{lb\left(i+1\right)}$$

(7)

where NDCGK was the cumulative gain of the first K positions, lb(i + 1) was the reciprocal of the impact factor of the solution at the i position, and r(l) was the relevance level of Search 1.

D2, D98 and D99

D2, D98 and D99 (units: Gy) were the radiation doses delivered to 2, 98 and 99% of PTV, respectively.

CI and HI

Conformity index (CI) [41] was defined as:

$$CI=\frac{V_{T{,}\mathrm{ref}}}{V_{T}}\times \frac{V_{T{,}\mathrm{ref}}}{V_{\mathrm{ref}}}$$

(8)

where V_T,ref was the volume of PTV covered by prescription dose, V_T was the volume of PTV, and V_ref was the volume covered by prescription dose.

Homogeneity index (HI) [42] was defined as:

$$HI=\frac{D2-D98}{D_{P}}$$

(9)

where D_P was the prescription dose.

V5, V20, V30, V40, V45 and V60

V5, V20, V30, V40, V45, and V60 were the volume percentages of OARs receiving over 5 Gy, 20 Gy, 30 Gy, 40 Gy, 45 Gy and 60 Gy, respectively.

MLD, MHD, Dmean and Dmax

Mean lung dose (MLD) was the mean dose of total lung, and mean heart dose (MHD) was the mean dose of heart. Dmax and Dmean were the maximum dose and mean dose of PTV or OARs, respectively.

Metrics’ usage

As shown in Table 1, the following metrics were used to evaluate the differences between plans. For example, D2, D98, D99 and CI and HI were used for evaluating PTV only. V5 and V20 were applied to evaluate total lung.

Table 1 Metrics used in different regions

Full size table

Statistical analysis

Data analyses were performed with SPSS 20.0 (IBM Corp., Armonk, NY, USA) statistical software. For normally distributed data, paired samples t‑test was used to compare the differences of dosimetric parameters between two groups. Wilcoxon signed rank test was used to compare the differences of dosimetric parameters between two groups for the data with nonnormal distribution. p < 0.05 was considered statistically significant.

Results

Validation of the search model

In order to validate the performance of the proposed searching model, NDCG was employed. It was designed for ranking tasks with more than one relevance level. NDCG ranged from 0 to 1. The closer NDCG is to 1, the higher the accuracy is. In this research, the NDCG of the first three search results was 0.69 ± 0.09.

Experiment of the automated usage of αDiar

Three-dimensional dose distribution

In Fig. 4 the dose distribution of a randomly selected patient from the 96 patients is shown, where Fig. 4a, b, c represent the dose distribution of AP_Clinical, while Fig. 4d, e, f represent the dose distribution of AP_AI. Figure 5 shows the difference of DVH between AP_Clinical and AP_AI for the same patient as in Fig. 4. The difference between AP_Clinical and AP_AI was small for PTV and OARs except for spinal cord. To further compare the difference in DVH between AP_Clinical and AP_AI, we calculated the DVH of all patients and plotted a mean DVH (Fig. 6).

Comparison of dosimetric parameters between AP_Clinical and AP_AI

Table 2 shows the comparison results of PTV in AP_Clinical and AP_AI plans for the 96 clinical cases. D2, D98 and Dmean in AP_Clinical were slightly higher than those in AP_AI plans. There were no significant differences except D98 and HI. Figure 7 shows the difference of D2, D98, D99 and Dmean between AP_Clinical and AP_AI. Overall, the evaluation metrics of AP_Clinical and AP_AI plans were similar.

Table 2 PTV metrics of AP_Clinical and AP_AI plans of 96 cases, and their organs at risk (OAR) metrics comparing to three standards (RTOG0623, NCCN, the standard in our department)

Full size table

Figure 8 shows the difference of V20, V5 of total lung and V30, V40, V45, V60 of heart between AP_Clinical and AP_AI. As shown in Table 3, the metrics in AP_AI and AP_Clinical of all patients met the criteria of RTOG0623 [39], except for Dmax of spinal cord. 52 AP_AI plans and 40 AP_Clinical plans met the dosimetric criteria of RTOG0623 [39]. Among the 44 AP_AI plans which did not meet the spinal cord dosimetric criterion in RTOG0623 [39], 33 of the corresponding AP_Clinical plans did also not meet the spinal cord criterion. All metrics of 92 AP_Clinical plans and 89 AP_AI plans met the dosimetric criteria set in NCCN [40]. More AP_Clinical plans met the clinical dosimetric criteria followed in our department, except for spinal cord (40 vs 52).

Table 3 OARs metrics of AP_Clinical and AP_AI plans in 96 cases, and comparison with three standards (RTOG0623, NCCN, the standard in our department)

Full size table

As shown in Fig. 9a, c and e, the percentages of AP_AI and AP_Clinical plans that met all of the RTOG0623 [39] dosimetric criteria were 54.17 and 41.67%, respectively. The percentage of AP_Clinical plans (95.83%) meeting the NCCN criteria [40] was slightly higher than that of AP_AI (92.71%). The percentages of AP_AI and AP_Clinical plans that met the dosimetric criteria of our department were 43.75 and 40.63%, respectively. The figures also showed that the numbers of plans meeting the criteria were similar between AP_AI and AP_Clinical. Slightly more AP_AI plans met the criteria of RTOG0623 [39] and our department standards than the AP_Clinical plans, but slightly fewer AP_AI than the AP_Clinical plans that met the NCCN criteria [40].

As shown in Fig. 9b, d and f, the numbers of AP_AI plans that met the dosimetric criteria of RTOG0623[39], NCCN [40] and our department standards were 52, 89 and 42, respectively. The number of cases in which both AP_AI and AP_Clinical met the three criteria was 29, 88 and 25, respectively. The number of cases that AP_AI plans met the RTOG0623 [39] criteria while the corresponding AP_Clinical plans did not meet the criteria was 23. This was higher than the number of cases of AP_Clinical plans that met RTOG0623 [39] criteria but the corresponding AP_AI plans did not. Similar results were obtained when the dosimetric criteria of our department were adopted. For the NCCN [40] dosimetric criteria, the number of cases that AP_AI met the criteria but AP_Clinical did not was fewer than the number of cases that AP_Clinical met the criteria but AP_AI did not. However, these two numbers were very close.

An experienced radiation oncologist reviewed all AP_AI and AP_Clinical plans and was blinded to the information about how these plans were designed. Based on this evaluation, 9 AP_AI plans were better than AP_Clinical, 43 AP_AI plans were similar to AP_Clinical, and 44 were worse. In conclusion, 54.17% of the AP_AI were better than or comparable to the AP_Clinical, and could be directly applied in clinical practice.

Comparison experiment

Three-dimensional dose distribution

The dose distribution of a randomly selected patient from the 26 patients is shown in Fig. 10, (a) the dose distribution of $AP_{AI+\text{Human}}$ designed by dosimetrist A, (b) the dose distribution of AP_Human designed by dosimetrist A, (c) the dose distribution of $AP_{AI+\text{Human}}$ designed by dosimetrist B, and (d) the dose distribution of AP_Human designed by dosimetrist B. Visually, for dosimetrist A and B, the dose distribution of $AP_{AI+\text{Human}}$ was significantly better than AP_Human.

Comparison of dosimetric parameters between $\boldsymbol{AP}_{\boldsymbol{AI}+\textbf{Human}}$ and AP_Human

Table 4 shows the metrics of the $AP_{AI+\text{Human}}$ and AP_Human plans. All metrics of PTV in $AP_{AI+\text{Human}}$ plans designed by dosimetrist A were better than those of AP_Human. For dosimetrist B, all PTV metrics of $AP_{AI+\text{Human}}$ plans were better than those of AP_Human , except HI.

Table 4 Metrics results of planning target volume (PTV) and organs at risk (OARs) in AP_AI ₊ _Human and AP_Human

Full size table

As shown in Table 4, the metrics of total lung and spinal cord in $AP_{AI+\text{Human}}$ plans designed by dosimetrist A were slightly better than those in AP_Human. For the plans designed by dosimetrist A, V45 and V60 of heart in $AP_{AI+\text{Human}}$ were slightly better than those in the corresponding AP_Human. V30, V40 and MHD of heart in $AP_{AI+\text{Human}}$ were slightly worse than those in the corresponding AP_Human. MLD of total lung, V45 and V60 of heart and Dmax of spinal cord in $AP_{AI+\text{Human}}$ designed by dosimetrist B were better than those in the corresponding AP_Human.

Discussion

In this study, a novel architecture to automatically retrieve treatment plans in the database via the agent of virtual dose images was proposed. As a knowledge-based method to implement an automated design of planning for lung cancer patients treated with IMRT, the virtual dose images were inferred from the masks of PTV and OARs. And the whole procedure of retrieval and planning can be implemented in a fully automated system. In order to validate the performance of αDiar, two experiments were conducted. The first experiment was to investigate the quality of αDiar-initialized plans without any planner intervention, and the second experiment was to compare the impact on the planning quality with and without the aid of αDiar. The first experiment revealed that over half of the tested αDiar-initialized plans could be directly used in clinical practice, and the second experiment revealed that the αDiar-initialized planning procedure could improve plan qualities.

A comparison of isodose distributions and DVH of AP_AI and AP_Clinical plans for one patient are shown in Fig. 11. The results showed that although AP_AI and AP_Clinical plans met the clinical requirements, quality differences still existed. And the use of αDiar may lead to better quality.

In 44 cases (PTV size: 251.93 ± 117.48cc), the qualities of AP_AI plans were worse than those of the corresponding AP_Clinical plans. Compared with AP_Clinical plan, 13 AP_AI plans exhibited worse conformability despite the lower doses received by OARs. In one AP_AI plan, the metrics of the plan met all dosimetric criteria but failed to provide individualized protection to the unilateral lung. The doses of OARs in 30 AP_AI plans were higher than those in the corresponding AP_Clinical plans. It was observed that a larger PTV often led to poorer quality of AP_AI plans than their corresponding AP_Clinical plans. The AP_AI plans tended to over-protect OARs, while decreasing the conformability. Moreover, the αDiar process could not consider the oncologists’ preferences which should be pursued in the future research.

In this study, one case (PTV size: 119.70cc) was excluded because its OPs could not be used in auto-planning module. Since the OPs did not present any abnormality, this may be due to an internal error in Pinnacle, which needs further investigation.

In the second experiment, two junior dosimetrists designed plans for 26 lung cancer patients with and without the assistance of αDiar, respectively. Each plan was optimized only once. For dosimetrist B, based on the results of metrics, the qualities of treatment plans [43] designed without αDiar were generally inferior to those designed by dosimetrist A. However, the quality differences of the plans initiated with αDiar decreased remarkably between the two dosimetrists, which showed that the αDiar process may have the potential to improve quality differences between planners. Figure 10a and b displays the isodose distributions of a case designed by dosimetrist A with and without αDiar. Figure 10c and d displays the corresponding isodose distributions designed by dosimetrist B. Evidently, isodose distributions were improved with the use of αDiar.

As an image retrieval architecture, αDiar could be very useful in taking advantage of the entire treatment plans database in the department of radiation oncology and making it available as a knowledge base which can be accessed by all dosimetrists in the future. This architecture may not only make it possible to “share” the knowledge of experienced dosimetrists, but also help to improve the overall qualities of treatment plans.

The utilization of αDiar may change the workflow of radiotherapy treatment planning. In the current workflow, upon the radiation oncologist determined the prescription and approved the contours of PTV and OAR, the dosimetrist designed the treatment plan on TPS by configuring the prescription, dose calculation algorithm and grid resolution, optimization algorithm and OPs. The treatment planning design process was iterated until the treatment plan was clinically acceptable. Upon implementation of αDiar, the workflow may be changed as follows. If the retrieved OPs could be used to generate a satisfactory treatment plan, the plan could be directly applied to clinical treatment with the approval of the dosimetrist and radiation oncologist. If the αDiar-initialized plan does not meet the clinical requirements, it could also help dosimetrists to start with a semi-ready plan to achieve a plan that meets clinical standards.

This proposed architecture provides a scenario where no patients’ images are transferred out of hospitals. In the process, a patient’s CT scan images are transferred to the in-hospital workstation for the purpose of rigid registration, and the transformation matrix gained from the registration is employed to transform the masks of PTV and OARs. The registered masks of PTV and OARs are utilized to predict virtual dose images which serve as substitutes for CT scans in content-based image retrieval (CBIR). Once the most similar clinical dose images are found in the database, a link between the searching plan and the stored plan in the database is established, and the corresponding stored OPs can be transferred and applied to the new plan. Furthermore, redundant information in CT scans is not necessary for image retrieval. For example, the anatomic information of muscles, bones, vessels, and airways may lead to over-complicated AI-model training. Replacing these anatomic structures with OAR masks as well as low-dose areas in dose images can simplify the training of the image retrieval model and increase the searching speed.

Traditionally, the training of a CBIR system has often been challenged by the lack of similar pairs of samples [44]. In a database with T samples, a physician theoretically needed to review T pairs of samples to find the most similar one. Finding exhaustive pairs of possibly similar samples in a database with T samples required T×T times of review, which was time-consuming and labor-intensive. As proposed in this research, the virtual dose images and their clinical dose images were naturally a similar pair. Thus, the labor-intensive work of identifying similar pairs to train the CBIR model could be avoided by utilizing virtual dose images as the agent.

In the future, the manual input of OPs to the Pinnacle user interface could be replaced by engineering work to embed αDiar in a TPS. Also, due to the knowledge-based method, the performance of αDiar can be expected to improve by expanding the database size without changing the AI models. Compared with the commercial KBP method, the database of αDiar can be expanded to a larger volume. On the other hand, the robustness and feasibility of αDiar still needed further improvement as well as generality through the implementation of αDiar in other institutions. Finally, in this study, there were two layers of auto-planning. One was the proposed in-house KBAP that produced OPs, and the other was the commercial auto-planning module in Pinnacle. At present, we cannot decouple the effect of the first from the second. However, due to the extensive validation of the two layers, the results were credible when comparing the dosimetric parameters.

Conclusion

In this article, the authors proposed a novel knowledge-based architecture for an automated treatment plan design named αDiar. It can automatically retrieve radiotherapy treatment plans from the database through proxy virtual dose images. It was found that 54% of lung cancer patients can be treated with radiotherapy treatment plans that were generated using the fully automated αDiar. The plan quality and interplanner plan quality variation can also be improved with the architecture. The implementation of αDiar may change the radiotherapy workflow. Further investigation is required.

References

Altaf F, Islam SMS, Akhtar N, Janjua NK (2019) Going deep in medical image analysis: concepts, methods, challenges and future directions. IEEE Access 7:99540–99572
Article Google Scholar
Durgadevi P, Vijayalakshmi S (2021) Deep survey and comparative analysis of medical image processing. J Compu Theor Nanos 17(5):2321–2329
Article Google Scholar
Haskins G, Kruger U, Yan P (2019) Deep learning in medical image registration: a survey. Mach Vision Appl 31(1):8
Google Scholar
Liu L, Cheng J, Quan Q, Wu FX, Wang J (2020) A survey on U‑shaped networks in medical image segmentations. Neurocomputing. https://doi.org/10.1016/j.neucom.2020.05.070
Stolte S, Fang R (2020) A survey on medical image analysis in diabetic retinopathy. Med Image Anal 64:101742
Article PubMed Google Scholar
Thompson RF, Valdes G, Fuller CD, Carpenter CM, Morin O, Aneja S, Lindsay WD, Aerts HJWL, Agrimson B, Deville C, Rosenthal SA, Yu JB, Thomas CR (2018) Artificial intelligence in radiation oncology: A specialty-wide disruptive transformation? Radiother Oncol 129(3):421–426
Article PubMed PubMed Central Google Scholar
Samaneh K, Anjali B, Dan N, Sarah MG, Raquibul H, Jiang S, Amir O (2018) Segmentation of the prostate and organs at risk in male pelvic CT images using deep learning. Biomed Phys Eng Expr 4(5):55003
Article Google Scholar
Men K, Dai J, Li Y (2017) Automatic segmentation of the clinical target volume and organs at risk in the planning CT for rectal cancer using deep dilated convolutional neural networks. Med Phys 44(12):6377–6389
Article CAS PubMed Google Scholar
Dong X, Lei Y, Wang T, Thomas M, Tang L, Curran WJ, Liu T, Yang X (2019) Automatic multiorgan segmentation in thorax CT images using U‑net-GAN. Med Phys 46(5):2157–2168
Article PubMed PubMed Central Google Scholar
Zhong Z, Kim Y, Plichta K, Allen BG, Zhou L, Buatti J, Wu X (2019) Simultaneous cosegmentation of tumors in PET-CT images using deep fully convolutional networks. Med Phys 46(2):619–633
Article PubMed PubMed Central Google Scholar
Momin S, Lei Y, Wang T, Zhang J, Roper J, Bradley JD, Curran WJ, Patel P, Liu T, Yang X (2021) Learning-based dose prediction for pancreatic stereotactic body radiation therapy using dual pyramid adversarial network. Phys Med Biol. https://doi.org/10.1088/1361-6560/ac0856
Ma J, Nguyen D, Bai T, Folkerts M, Jia X, Lu W, Zhou L, Jiang S (2021) A feasibility study on deep learning-based individualized 3D dose distribution prediction. Med Phys 48(8):4438–4447
Article PubMed Google Scholar
Batumalai V, Jameson MG, Forstner DF, Vial P, Holloway LC (2013) How important is dosimetrist experience for intensity modulated radiation therapy? A comparative analysis of a head and neck case. Pract Radiat 3(3):e99–e106
Article Google Scholar
Landers A (2018) Fully automated radiation therapy treatment planning through knowledge-based dose predictions
Google Scholar
Nawa K, Haga A, Nomoto A, Sarmiento RA, Shiraishi K, Yamashita H, Nakagawa K (2017) Evaluation of a commercial automatic treatment planning system for prostate cancers. Med Dosim 42(3):203–209
Article PubMed Google Scholar
Voet PW, Dirkx ML, Breedveld S, Al-Mamgani A, Incrocci L, Heijmen BJ (2014) Fully automated volumetric modulated arc therapy plan generation for prostate cancer patients. Int J Radiat Oncol Biol Phys 88(5):1175–1179
Article PubMed Google Scholar
Moore KL (2019) Automated radiotherapy treatment planning. Semin Radiat Oncol 29(3):209–218
Article PubMed Google Scholar
Wang C, Zhu X, Hong JC, Zheng D (2019) Artificial intelligence in radiotherapy treatment planning: present and future. Technol Cancer Res Treat 18:1533033819873922
Article CAS PubMed PubMed Central Google Scholar
Shao Y, Wang H, Chen H, Gu H, Duan Y, Feng A, Li X, Xu Z (2019) Dosimetric comparison and biological evaluation of PET- and CT-based target delineation for LA-NSCLC using auto-planning. Phys Med 67:77–84
Article PubMed Google Scholar
Wong FHC, Moleme PA, Ali OA, Mugabe KV (2022) Clinical implementation of HyperArc. Phys Eng Sci Med 45(2):577–587
Article PubMed Google Scholar
Fan J, Wang J, Zhang Z, Hu W (2017) Iterative dataset optimization in automated planning: Implementation for breast and rectal cancer radiotherapy. Med Phys 44(6):2515–2531
Article CAS PubMed Google Scholar
Yoder T, Hsia AT, Xu Z, Stessin A, Ryu S (2019) Usefulness of EZFluence software for radiotherapy planning of breast cancer treatment. Med Dosim 44(4):339–343
Article PubMed Google Scholar
Shah AP, Meeks DT, Willoughby TR, Ramakrishna N, Warner CJ, Swanick CWCW et al (2020) Intrafraction motion during frameless radiosurgery using Varian HyperArcTM and BrainLab ElementsTM immobilization systems. J Radiosurg SBRT 7(2):149–156
PubMed PubMed Central Google Scholar
Breedveld S, Storchi PR, Keijzer M, Heemink AW, Heijmen BJ (2007) A novel approach to multi-criteria inverse planning for IMRT. Phys Med Biol 52(20):6339–6353
Article PubMed Google Scholar
Breedveld S, Storchi PR, Voet PW, Heijmen BJ (2012) iCycle: Integrated, multicriterial beam angle, and profile optimization for generation of coplanar and noncoplanar IMRT plans. Med Phys 39(2):951–963
Article PubMed Google Scholar
Guthier CV, Orio PF 3rd, Buzurovic I, Cormack RA (2021) Knowledge-based inverse treatment planning for low-dose-rate prostate brachytherapy. Med Phys 48(5):2108–2117
Article PubMed Google Scholar
Momin S, Fu Y, Lei Y, Roper J, Bradley JD, Curran WJ, Liu T, Yang X (2021) Knowledge-based radiation treatment planning: a data-driven method survey. J Appl Clin Med Phys 22(8):16–44
Article PubMed PubMed Central Google Scholar
Bai P, Weng X, Quan K, Chen J, Dai Y, Xu Y, Lin F, Zhong J, Wu T, Chen C (2020) A knowledge-based intensity-modulated radiation therapy treatment planning technique for locally advanced nasopharyngeal carcinoma radiotherapy. Radiat Oncol 15(1):188
Article PubMed PubMed Central Google Scholar
Chen H, Wang H, Gu H, Shao Y, Cai X, Fu X, Xu Z (2018) Study for reducing lung dose of upper thoracic esophageal cancer radiotherapy by auto-planning: volumetric-modulated arc therapy vs intensity-modulated radiation therapy. Med Dosim 43(3):243–250
Article PubMed Google Scholar
Zhang T, Bokrantz R, Olsson J (2022) Probabilistic Pareto plan generation for semiautomated multicriteria radiation therapy treatment planning. Phys Med Biol. https://doi.org/10.48550/arXiv.2110.05410
Babier A, Mahmood R, McNiven AL, Diamant A, Chan TCY (2020) Knowledge-based automated planning with three-dimensional generative adversarial networks. Med Phys 47(2):297–306
Article PubMed Google Scholar
Babier A, Boutilier JJ, Sharpe MB, McNiven AL, Chan TCY (2018) Inverse optimization of objective function weights for treatment planning using clinical dose-volume histograms. Phys Med Biol 63(10):105004
Article PubMed Google Scholar
Shao Y, Zhang X, Wu G, Gu Q, Wang J, Ying Y, Feng A, Xie G, Kong Q, Xu Z (2021) Prediction of three-dimensional radiotherapy optimal dose distributions for lung cancer patients with asymmetric network. IEEE J Biomed Health Inform 25(4):1120–1127
Article Google Scholar
Clark K, Vendt B, Smith K, Freymann J, Kirby J, Koppel P, Moore S, Phillips S, Maffitt D, Pringle M, Tarbox L, Prior F (2013) The cancer imaging archive (TCIA): maintaining and operating a public information repository. J Digit Imaging 26(6):1045–1057
Article PubMed Central Google Scholar
Klein S, Staring M, Murphy K, Viergever MA, Pluim JP (2010) elastix: a toolbox for intensity-based medical image registration. IEEE Trans Med Imaging 29(1:196–205
Article Google Scholar
Hu J, Shen L, Albanie S, Sun G, Wu E (2020) Squeeze-and-excitation networks. IEEE Trans Pattern Anal Mach Intell 42(8):2011–2023
Article PubMed Google Scholar
Schroff F, Kalenichenko D, Philbin J (2015) Facenet: a unified embedding for face recognition and clustering. IEEE conference on computer vision and pattern recognition (CVPR).
Google Scholar
Mishchuk A, Mishkin D, Radenovic F, Matas J (2017) Working hard to know your neighbor’s margins: Local descriptor learning loss. Conference and workshop on neural information processing systems.
Google Scholar
Lilenbaum R, Komaki R, Martel MK (2008) A phase II trial of combined modality therpy with growth factor support for patients with limited stage small cell lung cancer. Radiation Therapy Oncology Group
Google Scholar
Ettinger DS, Wood DE, Aggarwal C, Aisner DL, Akerley W, Bauman JR, Bharat A, Bruno DS, Chang JY, Chirieac LR, D’Amico TA, Dilling TJ, Dobelbower M, Gettinger S, Govindan R, Gubens MA, Hennon M, Horn L, Lackner RP, Lanuti M, Leal TA, Lin J, Loo BW Jr, Martins RG, Otterson GA, Patel SP, Reckamp KL, Riely GJ, Schild SE, Shapiro TA, Stevenson J, Swanson SJ, Tauer KW, Yang SC, Gregory K, Hughes M (2019) NCCN guidelines insights: non-small cell lung cancer, version 1.2020. J Natl Compr Canc Netw 17(12):1464–1472
Article PubMed Google Scholar
Riet AV, Mak AC, Moerland MA, Elders LH, Zee W (1997) A conformation number to quantify the degree of conformality in brachytherapy and external beam irradiation: application to the prostate. Int J Radiat Oncol Biol Phys 37(3):731–736
Article Google Scholar
Yoon M, Park SY, Shin D, Lee SB, Pyo HR, Kim DY, Cho KH (2007) A new homogeneity index based on statistical analysis of the dose-volume histogram. J Appl Clin Med Phys 8(2):9–17
Article PubMed PubMed Central Google Scholar
Hernandez V, Hansen CR, Widesott L, Bäck A, Canters R, Fusella M, Götstedt J, Jurado-Bruggeman D, Mukumoto N, Kaplan LP, Koniarová I, Piotrowski T, Placidi L, Vaniqui A, Jornet N (2020) What is plan quality in radiotherapy? The importance of evaluating dose metrics, complexity, and robustness of treatment plans. Radiother Oncol 153:26–33
Article PubMed Google Scholar
Kumar A, Kim J, Cai W, Fulham M, Feng D (2013) Content-based medical image retrieval: a survey of applications to multidimensional and multimodality data. J Digit Imaging 26(6):1025–1039
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank the patients cared for at Shanghai Chest Hospital, Shanghai Jiao Tong University. We would also like to thank J. Liu, X. Liu and L. Yao.

Author information

Authors and Affiliations

Shanghai Chest Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
Yan Shao, Jindong Guo, Ying Huang & Zhiyong Xu
Shanghai Pulse Medical Technology Inc., Shanghai, China
Jiyong Wang
School of Physics and Technology, University of Wuhan, Wuhan, China
Wutian Gan
School of Information Science and Engineering, Xiamen University, Xiamen, China
Xiaoying Zhang
Ping An Healthcare Technology Co. Ltd., Shanghai, China
Ge Wu & Guotong Xie
School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen, China
Dong Sun
School of Engineering, Hong Kong University of Science and Technology, Hong Kong SAR, China
Yu Gu
School of Medicine and Biological Information Engineering, Northeastern University, Shenyang, China
Qingtao Gu
Department of Radiation Oncology, Rutgers Cancer Institute of New Jersey, Rutgers University, New Brunswick, NJ, USA
Ning Jeff Yue
Radiotherapy Department, Shandong Second Provincial General Hospital, Shandong University, Jinan, China
Guanli Yang
Ping An Health Cloud Company Limited, Shanghai, China
Guotong Xie
Ping An International Smart City Technology Co., Ltd., Shanghai, China
Guotong Xie

Authors

Yan Shao
View author publications
You can also search for this author in PubMed Google Scholar
Jindong Guo
View author publications
You can also search for this author in PubMed Google Scholar
Jiyong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ying Huang
View author publications
You can also search for this author in PubMed Google Scholar
Wutian Gan
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoying Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Ge Wu
View author publications
You can also search for this author in PubMed Google Scholar
Dong Sun
View author publications
You can also search for this author in PubMed Google Scholar
Yu Gu
View author publications
You can also search for this author in PubMed Google Scholar
Qingtao Gu
View author publications
You can also search for this author in PubMed Google Scholar
Ning Jeff Yue
View author publications
You can also search for this author in PubMed Google Scholar
Guanli Yang
View author publications
You can also search for this author in PubMed Google Scholar
Guotong Xie
View author publications
You can also search for this author in PubMed Google Scholar
Zhiyong Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Guanli Yang, Guotong Xie or Zhiyong Xu.

Ethics declarations

Conflict of interest

Y. Shao, J. Guo, J. Wang, Y. Huang, W. Gan, X. Zhang, G. Wu, D. Sun, Y. Gu, Q. Gu, N.J. Yue, G. Yang, G. Xie and Z. Xu declare that they have no competing interests.

Ethical standards

For this article no studies with human participants or animals were performed by any of the authors. All studies mentioned were in accordance with the ethical standards indicated in each case.

Additional information

Authors Y. Shao, J. Guo and J. Wang contributed equally to the manuscript.

Author Responsible for Statistical Analysis

Yan Shao

Availability of data

Research data are not available at this time.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shao, Y., Guo, J., Wang, J. et al. Novel in-house knowledge-based automated planning system for lung cancer treated with intensity-modulated radiotherapy. Strahlenther Onkol (2023). https://doi.org/10.1007/s00066-023-02126-1

Download citation

Received: 28 September 2022
Accepted: 10 July 2023
Published: 21 August 2023
DOI: https://doi.org/10.1007/s00066-023-02126-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Novel in-house knowledge-based automated planning system for lung cancer treated with intensity-modulated radiotherapy

Abstract

Purpose

Methods and materials

Results

Conclusions

Similar content being viewed by others

A knowledge-based intensity-modulated radiation therapy treatment planning technique for locally advanced nasopharyngeal carcinoma radiotherapy

Automatic IMRT treatment planning through fluence prediction and plan fine-tuning for nasopharyngeal carcinoma

Feasibility Study of the Fluence-to-Dose Network (FDNet) for Patient-Specific IMRT Quality Assurance

Introduction

Methods

Database

Architecture

Prediction of virtual dose images

Building and training of image retrieval model

Inference of image retrieval model

Study of fully automated usage of αDiar

Comparison of treatment planning with or without αDiar

Metrics

NDCG

D2, D98 and D99

CI and HI

V5, V20, V30, V40, V45 and V60

MLD, MHD, Dmean and Dmax

Metrics’ usage

Statistical analysis

Results

Validation of the search model

Experiment of the automated usage of αDiar

Three-dimensional dose distribution

Comparison of dosimetric parameters between APClinical and APAI

Comparison experiment

Three-dimensional dose distribution

Comparison of dosimetric parameters between \(\boldsymbol{AP}_{\boldsymbol{AI}+\textbf{Human}}\) and APHuman

Discussion

Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Conflict of interest

Ethical standards

Additional information

Author Responsible for Statistical Analysis

Availability of data

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

Comparison of dosimetric parameters between AP_Clinical and AP_AI

Comparison of dosimetric parameters between \(\boldsymbol{AP}_{\boldsymbol{AI}+\textbf{Human}}\) and AP_Human