Introduction

Research object

In the development process from IMRT to VMAT, accurate radiotherapy technology has been improved. Although the technical level has been greatly developed, the treatment benefit is very limited from the perspective of patients and doctors. In this context, the industrial Internet of Things has brought a ray of sunshine to the development of precision radiotherapy [1]. With the support of Internet of Things and cloud computing, the conventional large-scale IMRT health system takes a second leap forward in development, and the radiotherapy healthcare system can be realized in more primary hospitals with high quality [2].

Research background

Halcyon\(^\mathrm{{TM}}\), a smart radiotherapy platform of Varian, was officially launched in China in 2019 [3]. The Halcyon system is the first medical linear accelerator to realize 100% image guidance in the world [4]. It offers the advantages of high integration, high automation and high intelligence [5]. It further promotes radiotherapy from conventional “precise radiotherapy” to “intelligent precise radiotherapy” [6]. As a new generation of smart radiotherapy platform [7], the Halcyon system can connect with the Varian smart cloud 4.0 radiotherapy platform and provide users with comprehensive intelligent connection services by using big data, artificial intelligence and industrial Internet of Things technology [8]. With the support of the large-scale healthcare platform, the Halcyon system can be deployed in the medical union, which ensures that the primary hospital can carry out treatment safely, normally and effectively under the real-time guidance of the specialist medical union, thus opening the era of the medical alliance radiotherapy cloud [9].

Current problems

With the continuous innovation and development of radiotherapy technology and equipment, the methods of cancer radiotherapy are becoming increasingly diverse [10], including three-dimensional conformal radiation therapy (3D-CRT), static intensity modulation radiation therapy (SIMRT), volumetric-modulated arc therapy (VMAT), helical tomotherapy (TOMO)-IMRT, and stereotactic body radiotherapy (SBRT) [11]. New technology and new equipment continuously improve the treatment of cancer patients. Advances in radiotherapy equipment, such as dynamic multi-leaf collimator with arc therapy, have resulted in more accurate irradiation. With the improvement of the function of the treatment planning system, more precise and optimized treatment plans can be formulated.

To meet the practical needs of clinical applications, the trend in current developments of the software and hardware systems of radiotherapy is to expand from a simple planning design to the entire process of radiotherapy [12]. Radiotherapy system software not only involves a treatment planning system but also unified planning and design for the information management of radiotherapy, the design of radiotherapy plans, the technical services of radiotherapy and the implementation of radiotherapy [13]. Thus, the efficiency, safety and reliability of radiotherapy and the convenience of technical support and services can be improved [14].

The contributions of this article

The complete inverse planning process involves complex system engineering, which mainly includes three modules: a data preparation module, a plan optimization module and a simulation evaluation module [15, 16]. First, we need to acquire computed tomography (CT) medical images. After obtaining these image data, clinicians must delineate each CT image manually to assist the later 3D computer positioning process. At the same time, the physical dosimeter obtains the data of the effect of the cumulative dose of different parts irradiated by different intensities of radiation through photographic film and obtains the dose effect matrix by means of a simulation. Second, inverse planning requires a dose target model and fluence map decomposition model and a solution process. Inverse planning needs to consider the clinical variables, target dose requirements (achieve the prescribed dose) and nontarget dose constraints (inhibition the irradiation dose), as well as the number of beams, the direction of the beam, the intensity of the beam, the hardware requirements of the multileaf collimator and the requirements of doctors for the time of radiation and the number of monitor units of the irradiation machine. Third, doctors need to visualize the simulation results of the calculated radiotherapy plan and objectively evaluate the advantages and disadvantages of the plan according to the dose volume histogram.

A novel Intelligent IoT-based large-scale inverse planning strategy with the key factors of the postmodulation is developed, and the whole process is described with a pseudo code flow. The scheme consists of two steps. The first step is to obtain a relatively optimal combination of gantry angles by considering the dose distribution requirements and constraints and the modulation requirements and constraints. The second step is to optimize the intensity map and smooth the map based on prior knowledge according to the determined angles and then obtain the final modulation scheme according to the relevant objectives and constraints of the map decomposition.

Related work

Volume-modulated radiotherapy (VMRT) has been widely used in the past 10 years, and to some extent, VMRT has proven superior to conventional intensity-modulated radiotherapy, such as modulation speed and dose conformal degree. However, VMRT has necessitated higher requirements for high-precision control, quality control and performance verification of radiotherapy equipment [17]. Such requirements make it too expensive for developing countries to upgrade the multileaf collimator. On the other hand, VMRT has not fundamentally changed the treatment principle of intensity-modulated radiation therapy. Further improvement in the optimal treatment scheme of IMRT can lead to a dose distribution of IMRT which approximates that of the volume modulation scheme. At the same time, conventional IMRT can be achieved only by relying on the conventional multileaf collimator and the conventional dose verification method [18]. The economic benefits cannot be underestimated.

Conventional intensity-modulated radiation therapy involves the step-and-shoot mode [19, 20]. To avoid the back and forth movement of the leaves leading to leaf wearing, sliding windows are usually used in clinical applications. Because static IMRT chooses a limited field to achieve dose modulation, the total numbers of monitor units in the modulation process of the intensity maps obtained by choosing different treatment fields are quite different. Because of the convex and concave physical characteristics of the blade, the tongue and groove effect in the modulation process is also a very important factor [21].

For radiotherapy of a particular type or organ, the doctor usually uses empirical field direction selection [22]. Although the field direction chosen in this way is more representative, the empirical field direction may not be the best choice because of the change of the target location. Therefore, the choice of the direction of the field is the first key problem in the radiotherapy planning system [23]. The conventional criterion for selecting the direction of the field is to ensure that the dose modulation distribution achieved by the fluence map can be optimized, ignoring the total number of monitor units and the tongue and groove effect in the modulation process [24].

In fact, the difference in the dose modulation distributions in a clinic with the objective of ensuring an optimal fluence map does not lead to an essential difference in clinical treatment [25, 26]. In contrast, the total number of monitor units and the tongue and groove effect in the modulation process will lead to a great difference in the quality assurance and verification process of the entire radiotherapy process. In other words, the fluence maps that appear to have similar dose distribution states will produce distinct dose distribution states because of the variable difficulty in the subsequent dose modulation process [27]. This is a major problem that must be considered in inverse planning [28]. The tongue and groove effect is the direct cause of the hot and cold points of a dose. To avoid this effect, it is necessary to avoid double or multiple peaks in the intensity maps: that is, to keep the high dose areas connected as much as possible. From a mathematical point of view, convex constraints cannot be used for the description. Through analysis, we can obtain a priori knowledge regarding the upper left blade and the right lower blade in the adjacent blades, or determine that the lower left blade and the right upper blade cannot collide, but should maintain a convex shape as much as possible during the sliding process [29, 30].

Key parts of the radiotherapy software system

The main hardware of radiotherapy includes an accelerator, a multileaf collimator, and a radiotherapy simulator. The radiotherapy software system mainly consists of a treatment planning system, database management system, treatment control system and radiotherapy simulation and verification system. Data exchange between systems is carried out through a high-speed network connection. The complete radiotherapy system acquires image data from external image acquisition equipment (such as CT and MRI) through a gigabit network. The composition and main functions of each system in the radiotherapy system are described below.

Radiotherapy treatment planning system

The treatment planning system obtains current patient image data from external image acquisition equipment, such as a CT machine, through a high-speed network. According to these image data and preset physical parameters of the therapeutic accelerator machine, the current treatment plan of patients is formulated, and the relevant data are sent to the database system. The main contents of the treatment plan include the parameters of the gantry, the parameters of the treatment bed, the multileaf collimator parameters, the treatment grading parameters and the radiation dose distribution parameters. A well-made treatment plan and patient-related image data will be sent to the database system in the form of digital imaging and communications in medicine (DICOM) files.

Radiotherapy database management system

The radiotherapy database management system consists of three parts: a database management program, a patient information database and a patient data file. The database management program is mainly used to create DICOM files and generate and manage database records of patient information. When the treatment planning system completes the treatment plan, the content of the plan and the related image data will be sent to the database system in the form of DICOM files, which are stored in the patient information database by the database management program according to the DICOM protocol. When the IMRT system carries out the image acquisition process, an X-ray image and a CT image will also be sent to the database system through the network in the form of DICOM files. The patient information database is mainly used to store basic patient information, treatment plan data and patient-related image data. The basic information of the patients and the parameters of the gantry and the bed are obtained from the treatment plan file through the database management program. This information will be added to the relevant tables of the patient information database by the database management program. After the DICOM image data files and plan data files related to patients are stored in the patient data folder, their storage locations will also be recorded in the patient information database. Patient data files keep DICOM image data files and plan data files that are related to all patient treatment plans. The database management program extracts DICOM image data files and treatment plan data files and stores them separately. The specific storage locations will be stored in the related tables of the patient information database in the form of database records. X-ray images and CT image data will also be stored in the corresponding patient data file.

Radiotherapy control system

The control system is connected to the database system through the network. It can directly obtain the patient information and treatment plan data in the patient information database. The treatment control system downloads the treatment plan data related to the patient from the patient information database according to the currently selected information. When the treatment control system obtains the basic information and treatment plan data of the current patients from the patient information database according to the doctor’s choice, the treatment control system will send start-up instructions to the IMRT system through the network. When the treatment control system receives the response message of the IMRT system, the treatment control system sends the basic information of the current treatment patients to the IMRT system. The IMRT system will use this information as an index to search the current treatment plan data, related image data and plan data file storage locations in the patient information database through the network and download them to the computer. When the IMRT system completes the position correction operation of the treatment bed through registration, the treatment control system will control the accelerator and multileaf collimator for the treatment of patients according to the current patient treatment plan data acquired previously. After all treatment operations are completed, the information of the treatment results is stored in the patient information database through the network.

Radiotherapy simulation and verification system

The radiotherapy simulation and verification system completes the DICOM interface with the radiotherapy planning system and radiotherapy control system and feeds the information of the reset and verification back to the database management subsystem. The simulation and verification system of radiotherapy can realize the functions of cone beam CT reconstruction and a 3D simulation and verification of the plan.

Methodology of the large-scale planning system

Simplicity and controllability of the modulation by fluence map smoothing

Smoothing technology is used to reduce the complexity of the fluence map, which is similar to image denoising in image processing. Fluctuations in the fluence map have a great influence on leaf sequencing, especially for the high-frequency region. Therefore, intuitively, if there are fewer fluctuations in the fluence map, the subsequent decomposition process based on a multileaf collimator will become much easier. The conventional smoothing methods include a smoothing strategy outside of the objective function, an implicit smoothing strategy in the objective function and an explicit smoothing strategy in the objective function.

The outside strategy adopts a low-pass filter to smooth the optimized fluence map directly. Linear filters are commonly used; typically, a linear convolution is used along the line direction. The main consideration is that the lead blades of the multileaf collimator are modulated by moving along the line direction. \(x_{i, j, k}^{\prime }=\sum _{\lambda =-L}^{R} c_{\lambda } \cdot x_{i, j+\lambda , k} \),where \(c_{\lambda }\) are the convolution coefficients and x is the intensity, and L and R represent the lengths of the convolution domain areas. The main advantages of the smoothing strategy outside of the objective function are that the postprocessing is simple, the convolution coefficient is flexible, and the fluence map can be adjusted according to different filtering requirements. The disadvantage is that the smoothed results may seriously violate the prescription dose requirements.

The implicit strategy directly defines the specific smoothing penalty function, such as the curvature function (Eq. 1), L1-norm function (Eq. 2) or L2-norm (Eq. 3), where x is the intensity, m and n are the intensity map size, and l is the beam number.

$$\begin{aligned} F(X)= & {} \frac{2 x_{i, j, k}-x_{i, j-1, k}-x_{i, j+1, k}}{\left( 1+\frac{1}{4}\left( x_{i, j-1, k}-x_{i, j+1, k}\right) ^{2}\right) ^{2}}\end{aligned}$$
(1)
$$\begin{aligned} F(X)= & {} \sum _{i=1}^{l} \sum _{j=2}^{m} \sum _{n=2}^{n}\left( \left| x_{i, j, k}-x_{i, j-1, k}\right| \right. \nonumber \\&+\left. \left| x_{i, j, k}-x_{i, j, k-1}\right| \right) \end{aligned}$$
(2)
$$\begin{aligned} F(X)= & {} \sum _{i=1}^{l} \sum _{j=1}^{m-1} \sum _{n=1}^{n-1}\left( \left( x_{i, j, k}-x_{i, j+1, k}\right) ^{2}\right. \nonumber \\&\left. +\left( x_{i, j, k}-x_{i, j,k+1}\right) ^{2}\right. \nonumber \\&\left. + \frac{1}{2}\left( x_{i, j, k}-x_{i, j+1, k+1}\right) ^{2}\right. \nonumber \\&\left. +\frac{1}{2}\left( x_{i, j, k}-x_{i, j+1, k-1}\right) ^{2}\right) \end{aligned}$$
(3)

The advantage of the implicit strategy is that it reduces the smoothing postprocessing and secondary optimization calculation, and the model design is more concise. The disadvantage is that the design of the penalty function varies greatly. For a model that is easier to calculate, the designed penalty function is often relatively simple, which results in neglecting the other radiotherapy factors and a lack of flexibility in smoothing postprocessing.

The explicit strategy obtains the smoothed intensity map through low-pass filtering. On this basis, an optimization calculation with the objective function is carried out again to add a smoothing penalty function \(\sum _{i=1}^{I} \sum _{j=1}^{m} \sum _{k=1}^{n} w_{i, j, k} \cdot \left( x_{i, j, k}-x_{i, j, k}^{\prime }\right) ^{2}\) to the objective item of the original fluence map optimization model. Quadratic optimization with feedback of the smoothing results of the intensity maps allows the reverse plan to meet the prescription dose distribution requirements to the greatest extent, and the calculation process is simple and clear. The disadvantage is that the term of the smoothing penalty function describes the difference between the actual fluence map and the smoothed fluence map. The intensity maps calculated by this strategy are often not optimal in terms of dose distribution.

Directional filtering based on a partial differential equation (PDE) is an effective image denoising method. On the other hand, denoising based on nonlocal means (NLM) provides the possibility of smoothing based on prior information. NLM makes use of redundant information in the image, which can retain the details of the image as much as possible while smoothing. The estimated value of the current pixel is obtained by a weighted averaging of the pixels in the image with similar neighborhood structure. The smoothing value of the pixel to be smoothed in the NLM is determined by a similarity measure between the local and global image blocks around the pixel. The algorithm makes full use of prior knowledge of the fact that image blocks in natural images are highly repeatable. The similarity is defined as \(w(x, y)=\exp \left( -\frac{\Vert V(x)-V(y)\Vert ^{2}}{h^{2}}\right) /\sum _{y} \exp \left( -\frac{\Vert V(x)-V(y)\Vert ^{2}}{h^{2}}\right) \). The original intensity smoothing algorithm is generally an isotropic smoothing strategy, and according to NLM denoising, we can introduce the prior knowledge mentioned above.

Therefore, the method of smoothing the fluence map can employ NLM to make full use of prior knowledge on the basis of PDE smoothing: that is, to limit the smoothing direction by considering not only the projection direction of the target area in the direction of the irradiation field but also the smoothing direction of the reducing MU.

Fig. 1
figure 1

The fluence maps of two typical irradiation fields with two vertical angles

The influence of the monitor unit with different gantry angle selections

To illustrate the problems arising from the selection of the beam angles, we provide the following test examples (Fig. 1). We assume that the prescription dose for the target area is 10 and that the dose of organ at risk (OAR) is no more than 5. A typical selection of the two perpendicular angles for the irradiation field is shown in the figure. For the combined field of angles 1 and 2, the total number of monitor units is 15, and the dose of OAR is 5. For the combined field of angles 3 and 4, the total number of monitor units is 20, and the OAR dose is 0. Thus, we can see that the optimization degree of the clinical dose distribution is inconsistent with the total number of monitor units when some target areas surround the distribution of OAR. For a certain optimization model, the value and direction of the angle of the irradiation field also need to be included in the complete optimization process.

Fig. 2
figure 2

Typical two-step modulation strategy for the intensity map in a diagonally isolated distribution (the upper left figure is the first step of modulation, the upper right one is the second step, and the lower part is the modulation result)

Fig. 3
figure 3

Collision by the single modulation when the fluence map is diagonally distributed (the left image shows the modulation process without collision constraint of adjacent leaves, and the right figure shows the modulation result)

Fig. 4
figure 4

There is no leaf collision when the fluence map is correlated up and down (the left image shows the modulation process with collision constraint of adjacent leaves, and the right figure shows the modulation result)

The influence of the dose modulation with different fluence maps

IMRT is usually modulated by a multileaf collimator. For most target areas of different shapes and sizes, only a few leaves are in the range of the effective irradiation field, while the other blades outside the range of the effective field should be combined in pairs to prevent radiation leakage. However, to avoid the mechanical damage caused by the relative collision of the paired blades, a small clearance is usually included. A minimum external rectangular field must be specified for the conventional treatment collimator of the accelerator. This can not only shield the intermittent leakage radiation from each pair of the incompletely closed blades in the field but also shield the small external rectangular field between adjacent leaves. The corresponding control data are transmitted to the corresponding control system to realize the automatic setting of the minimum rectangular field and the multileaf collimator(MLC) effective field.

Due to the motion constraints of the upper and lower leaves of the multileaf collimator, it is generally required that the upper and lower leaves should not overlap with each other. Therefore, for the bimodal fluence map in the figure, a two-step modulation is usually needed (Fig. 2). If a single-step modulation is used, a collision between the upper and lower adjacent leaves will inevitably occur (Fig. 3). However, if the position of the bimodal intensity map is corrected, as shown in Fig. 4, it may be achieved by a one-step modulation. A one-step modulation means that the modulation process can be realized with less TNMU. It can be determined that to reduce the number of monitor units, and at the same time avoid a collision of the leaves, the optimization of the fluence map must also take this demand into account.

The design of the leaf section shape is an important factor that must be considered in a multileaf collimator. To ensure the minimum leakage dose between adjacent leaves and relative leaves when they are closed, the sides of the leaves should have convex grooves. The positions of the grooves can be machined in the middle of the leaf. However, this kind of structure requires a high processing accuracy. Sometimes, leaves fail because of the high resistance to motion. Many manufacturers have used step structures to produce the leaf. The tongue and groove effect is an unavoidable problem when the multileaf collimator is used for the dose modulation, which will lead to an underdose when the effect is serious.

Fig. 5
figure 5

A sketch of the fluence map projection

Fig. 6
figure 6

The outline of the target area and OAR for the test case

Fig. 7
figure 7

Gantry angle comparison after the optimization

The solution strategy proposed in this paper

The complete process of inverse planning software

In summary, in addition to the clinical dose requirement, many factors need to be considered in the complete dose modulation process because these factors exert a great influence on the simplicity of the dose modulation process, the stability of the dose distribution in the modulation process, and the possible dose leakage during the modulation process. Therefore, the smoothness of the intensity map, the selection of the field angle, the optimization of the dose intensity map, the reduction in the total machine hops, the number of subfields and the reduction in the tongue–groove effect should be taken into account as much as possible in the early stage.

The entire inverse optimization process is mainly divided into two steps. The first step is to obtain a relatively optimal combination of gantry angles by considering the dose distribution requirements and constraints and the modulation requirements and constraints. The second step is to optimize the intensity map and smooth the map based on prior knowledge according to the determined angles, and then to obtain the final modulation scheme according to the relevant objectives and constraints of the map decomposition (leaf sequencing). The whole process is described in programming language as follows.

figure a

Optimal angle combination by a finite beam angle

First, the initial frame angle is given, and the dose distribution requirement and the postmodulation requirement are given by the big data through the Internet of Things. In this step, through an ant colony algorithm, the constraints and optimization objectives and iteration steps are set to determine the optimal firing angle. We employ fast field selection based on a heuristic algorithm. The [31] ant colony algorithm is a typical heuristic intelligent algorithm. The ant colony algorithm is a bionic algorithm inspired by the foraging behavior of ants in nature. In nature, ant colonies can always find an optimal path from nests and food sources in the process of foraging. Thus, we consider that a group of individuals exhibit intelligent behavior through collaboration, which provides the possibility for solving complex problems. The ant colony algorithm is a probabilistic algorithm that is used to find the optimal path. We consider using this algorithm to filter the angle of the field. Here, we provide the definition of pheromones.

Table 1 Dose requirements for the head and neck case

The fluence map is projected in the X direction, and we can obtain the projection curve in the X direction \(p_x\)  (Fig. 5).

Then, a parameter is defined to describe the overall distribution of the intensity maps. The quality of the intensity maps is not a measure of the complexity of the intensity maps, but mainly considers the graded complexity of the dose modulation process. Because of the directionality of the multileaf collimator leaf motion, the grading complexity needs to be considered according to the direction of the leaf motion. Thus, we propose a definition \(\eta =\frac{\sum \varDelta p_{x}^{+}}{\int p_{x} \mathrm{{d}} x}\), where \(p_{x}\) is the fluence map projection as shown in Fig. 5, \(\varDelta p_{x}^{+}\) is the positive term after difference. This parameter represents the fluctuation characteristics in the X direction. Obviously, the large-area average irradiation of a single peak will lead to a reduction in this parameter as much as possible. Then, the absolute value of the difference in the variance between the projection function in the XY and YX directions is calculated to prevent the tongue–groove effect, which is defined as \(\theta =\left| \sigma _{x y}-\sigma _{y x}\right| \cdot \left( \sum \sum X\right) \). Considering the objective function value of the dose distribution and the total number of monitor units, the pheromone of any fluence map is defined as

$$\begin{aligned} \phi =\alpha \cdot \eta +\beta \cdot \theta +\gamma \cdot \text{ objValue }+\lambda \cdot TNMU, \end{aligned}$$
(4)

where \(\alpha +\beta +\lambda =1\), \(\eta \) is the smoothness parameter, \(\theta \) is the tongue–groove parameter, \( \text{ objValue }\) is the dose conformity parameter, and TNMU is the irradiation duration parameter.

Smoothing fluence map by a PDE with prior knowledge

Linear diffusion filters blur useful structural information while denoising; consequently, confusion of the structural information occurs. Therefore, we present the smoothing algorithm by a PDE with prior knowledge. Nonlinear heterogeneous diffusion equations require minimization of the gradient energy. We set the gradient energy function as \(E(u)=\int |D u(x, y)| \mathrm{{d}}x \mathrm{{d}}y\).

The objective is to solve the zero-flow boundary variational problem \(\left\{ \begin{array}{l}J(u)=\min E \\ u \in \left\{ u \in C^{2}(\varOmega )|u|_{\partial \varOmega }=0\right\} \end{array}\right. \) by using Green’s formula, where Euler’s necessary conditions and the zero-flow boundary conditions are as follows.

$$\begin{aligned} \int \varOmega \left[ \frac{\partial F}{\partial u^{*}}-\frac{\partial }{\partial x}\left( \frac{\partial F}{\partial u_{x}^{*}}\right) -\frac{\partial }{\partial y}\left( \frac{\partial F}{\partial u_{y}^{*}}\right) \mid v \mathrm{{d}} x \mathrm{{d}} y=0\right. \end{aligned}$$
(5)

The solution satisfies the heterogeneous diffusion equation \(\partial u / \partial t={\text {div}} \frac{D u}{|D u|}\).

If we set the energy function as

$$\begin{aligned} E(u)= & {} \int g(v)|\nabla u(x, y)|+\beta \left( u-u_{0}\right) ^{2} \mathrm{{d}} x \mathrm{{d}} y, v \nonumber \\= & {} D\left( G_{\sigma }{ }^{*} u\right) , \end{aligned}$$
(6)

the corresponding heterogeneous diffusion equation is as follows.

$$\begin{aligned} \partial u / \partial t= & {} g(v)|D u| {\text {div}} \frac{D u}{|D u|}+D g(v)g D u \nonumber \\&-\beta |D u|(u-u_{0}) + \left( \frac{\partial g(v)}{\partial u_{x}}, \frac{\partial g(v)}{\partial u_{y}}\right) \nonumber \\&g |D u| d i v|D u|+{\text {div}}\left( \frac{\partial g(v)}{\partial u_{x}}, \frac{\partial g(v)}{\partial u_{y}}\right) g |D u|^{2}.\nonumber \\ \end{aligned}$$
(7)

We set the fluence map as the image to be denoised. Anisotropic diffusion occurs with the above function according to the vertical direction of the MLC leaf movement and other related directions, and the projection direction of the OAR projection on the projection field is also considered.

Leaf sequencing while minimizing total number of monitor units

For the i-th fluence map \(\overline{\varvec{X}}_{i}\), we assume that \(\varvec{I}_{i}^{T}\) and \(\varvec{I}_{i}^{L}\) are the leaf trajectory matrices of the trailing and precursor, respectively. When the last rear-drive leaf covers the last beam block in the intensity map, the entire modulation process of the intensity map of the field ends. Therefore, the monitor unit of this fluence map is \(\mathrm{{M U}}_{i}=\max _{j}\left\{ \varvec{I}_{i}^{T}(j, n)\right\} \). The total number of monitor units is \(\mathrm{{T N M U}}=\sum _{i=1}^{l} M U_{i}=\sum _{i=1}^{l}\left[ \max _{j}\left\{ \varvec{I}_{i}^{T}(j, n)\right\} \right] \). If we add constraints to the dose model, which represent the upper bound of the total number of monitor units, then we form a map optimization model with the total machine hop constraints for the MLC modes.

Dose distribution objective function in target area and OAR area S.t.

$$\begin{aligned} {\left\{ \begin{array}{ll} \ \mathrm{{Dose constraints in target and non-target areas}} \\ \ 0 \le x \\ \ x_{i, j, 0}=0, i=1,2,\ldots ,l; j=1,2,\ldots ,m \\ \ \mathop {\sum }\nolimits _{i=1}^{l}\left[ \mathop {\max }\nolimits _{j} \mathop {\sum }\nolimits _{k=1}^{n}\left\{ \max \left\{ \left( x_{i, j, k}-x_{i, j, k-1}\right) , 0\right\} \right\} \right] \le TNMU_{0}. \end{array}\right. } \end{aligned}$$
(8)
Table 2 The objective value and monitor unit comparison after the angle optimization
Fig. 8
figure 8

Fluence map comparison between the initial (left) and optimized (right) gantry angles

Fig. 9
figure 9

Fluence map comparison between conventional smoothing (a) and PDE-based smoothing (b)

Design of the experiments

The computer used has an Intel 2.13 GHz Pentium P6200 CPU with 2 GB of memory. The software package Computational Environment for Radiotherapy Research (CERR), developed and published by the University of Washington, USA, is used as the inverse planning design platform of IMRT. The calculation of the intensity matrix in our experiment is based on the QIB dose calculation engine, which is also a part of IMRTP in the CERR development platform. All test cases are public cases downloaded from the website published by CERR. The prescription dose of the target area and dose constraints of the nontarget area are shown in the table as follows (Table 1).

Experimental results and analysis

As shown in the figure, the selection of the gantry angle should be adapted to the distribution of the target area and the OAR because the target area is semicircumferential to the OAR (Fig. 6).

In the experiments, the smoothness parameter is set as 0.2, the tongue–groove parameter is set as 0.2, the dose conformity parameter is set as 0.5, and the irradiation duration parameter is set as 0.1. That is, we are more concerned about the dose conformity. After the gantry angle selection (Fig. 7), we can obtain a better result.

As shown in Table 2, because a higher weight is set for the dose conformity, and the smoothness and tongue–groove effects are taken into account, the weight of the TNMU is lower; therefore, after the angle optimization, the value of the objective function varies greatly before and after optimization, especially in the target area, and the target value decreases by approximately 10%. Meanwhile, the total number of monitor units remains the same.

As shown in Fig. 8, due to the smoothing and convex-groove effects of the fluence maps, the optimized fluence maps have a smoother distribution. Furthermore, the latter has a lower diagonal distribution than the former.

After the angle selection is completed, we smooth the fluence map by a PDE with prior knowledge and a minimization of the total number of monitor units.

After determining the optimal gantry angles, it can be determined from the comparison of the fluence maps (Fig. 9) that by considering the smooth orientation and prior information, it is easier to modulate the fluence map due to isotropic smoothing, but the smoothed map ignores the protection of the OAR. In contrast, the fluence maps obtained by our smoothing and subfield decomposition process no longer exhibit radiation from large areas, but show relatively regular radiation blocks, which better reflects the protection of the OAR and other nontarget tissues.

By comparing the dose volume histogram (DVH) and dose distribution of the target area, we can see that the dose distribution of the target area does not change significantly after the new algorithm is adopted (Fig. 10). However, by comparing the DVH and dose distribution in the nontarget area, we can see that the dose distribution in the nontarget area is obviously optimized using the new algorithm (Fig. 11). For the OAR regions such as the brain stem and spinal cord, and for the nontarget organization regions such as the mandible and other tissues, the DVH curves are very low. It can also be determined from the dose distribution map that the dose distribution obtained by the new algorithm decreases more steeply from the target area to the brainstem (Fig. 12).

It is also shown from the objective function value that the target value is essentially unchanged for the target area, while for the nontarget area, the value decreases by 16% (Table 3). Meanwhile, the TNMU remained essentially unchanged.

Fig. 10
figure 10

Target dose volume curve comparison between conventional smoothing (solid line) and PDE-based smoothing (dotted line)

Fig. 11
figure 11

Nontarget dose volume curve comparison between conventional smoothing (solid line) and PDE-based smoothing (dotted line)

Fig. 12
figure 12

Dose distribution comparison between conventional smoothing (a) and PDE-based smoothing (b)

Table 3 The objective value and monitor unit comparison after PDE smoothing

Conclusion

Because the key factors of postmodulation are taken into account as much as possible in the early optimization, the intelligent IoT-based large-scale inverse planning strategy proposed in this paper has been effectively optimized in many problems, such as dose distribution optimization, gantry angle selection and fluence map smoothing. Compared with the conventional divide-and-conquer and step-by-step optimization strategies, the proposed large-scale healthcare planning method considering key factors of the postmodulation process provides an integrated solution that can also be applied to other inverse planning systems. Next, we will further consider the rapid application of medical consortium in inverse planning.