1 Introduction

Non-invasive high-intensity focused ultrasound (HIFU) thermal treatment has demonstrated great potential for tumor treatment. The physical principle of this interventional approach is to apply focused ultrasound waves to the tumor tissue, such that heating of the tissue causes its necrosis [1]. The concept of HIFU was first proposed by Lynn et al. in 1942 [2], though it was not until 1960 that Fry et al. used it to treat various neurological disorders. Ultrasound was demonstrated to be able to destroy tumor cells in a region without affecting the nearby area [3]. After years of research, according to literature reviews, it is apparent that HIFU therapy is clinically practiced to treat various cancers, such as prostate tumors [4,5,6], uterine fibromas [7, 8], and liver tumors [9,10,11].

HIFU ablation systems require a medical imaging capability and a positioning device for tumor positioning. Current HIFU ablation systems primarily use magnetic resonance imaging (MRI) or ultrasound imaging (USI) to detect the target tumor. For example, Kim et al. [12] used MRI to support a HIFU ablation system to treat 33 uterine fibroid patients. The targeting accuracy values (displacement in absolute values) in the LR, CC, and AP directions were 0.9 ± 0.7 mm, 1.2 ± 0.9 mm, and 2.8 ± 2.2 mm, respectively. Holbrook et al. [13] focused on the distance error problem in tumor positioning due to respiration, and employed MRI to guide the HIFU focal point and track the moving phantom. Sakuma et al. [14] combined a HIFU system with an USI system, integrating an ultrasound probe with a robotic arm, using which the rotatable probe was able to obtain a 3D image of the tumor phantom. The average positional error was 3 mm.

However, respiration causes displacement of abdominal organs due to pressure from the diaphragm, which further affects the accuracy of tumor positioning for treatment. In fact, displacement is mostly seen in the superior–inferior direction [15]. An evaluation by Davies et al. [16] pointed out that during normal respiration, the average displacement of the liver and diaphragm in the superior–inferior direction was 10 ± 8 mm and 12 ± 7 mm, respectively, and under deep respiration, the average displacement could be as great as 37 ± 8 mm and 43 ± 10 mm, respectively. Song et al. [17] calculated the average displacement of the liver to be 12.8 ± 4.6 mm during a respiratory cycle. Therefore, it is essential to develop a target-tracking approach to improve the positioning accuracy for HIFU ablation treatment of liver tumors during respiration.

As detection of organ displacement relies on medical imaging, and is currently limited by the acquisition time delay during HIFU treatment, it is not easy to obtain continuous data on the displacement of organs, and thus, real-time tracking is difficult. Siebenthal pointed out that an internal or external marker could be a way by which to trace the displacement of an organ during respiration [15]. An internal marker might use X-ray imaging to calculate the displacement of the diaphragm and a pulmonary tumor [18, 19]. An external marker could, for example, measure the variation in the air flow rate of the lung and its temperature [20]; assess the status of the heaving chest or abdomen using laser measurement [21, 22]; or measure the change in abdominal pressure as a signal to track pulmonary tumors and abdominal lymph nodes [23]. Thus, if using an external respiratory signal to acquire the displacement of a tumor, it is then possible to compensate for such a change and continue to focus the focal point of HIFU on the target tumor, allowing the patient to breathe normally even during HIFU therapy. Seppenwoolde [24] established a model of respiratory signal and tumor displacement. In 8 patients each with a pulmonary tumor, 4 metal markers were implanted, and under normal free respiration, the position of the tumor and the state of the heaving abdomen were recorded. Then, using a polynomial function (which was obtained from those data), the displacement in the tumor position under respiration was simulated to obtain the closest trace of movement, enabling the robotic arm to continue tracking based on the model. Seo et al. [25], on the other hand, used two ultrasound probes to scan a moving kidney phantom. A coagulated lesion in the phantom was first burned and an image scanned; then, by image processing, the displacement of the lesion was calculated. In the experiment employing a robotic arm and a HIFU probe, the average positioning error of the coagulated lesion was 2 mm.

The size of the focal point of HIFU is generally similar to that of a rice grain, which is typically smaller than the target tumor. Complete ablation of the target tumor requires a long time, and thus, the positioning of the HIFU focal point must be synchronized with the target point of the target tumor, which changes position due to respiration. The use of a robotic arm can improve the positioning accuracy and stability of the HIFU focal point. This research employed an experimental ultrasound imaging-guided robotic HIFU ablation system created for the conduction of a previous experiment [26] to further develop a synchronous method for tracking the displacement of a tumor during respiration and evaluated the accuracy of the method.

2 Materials and Methods

2.1 System Structure

Figure 1 illustrates the structure of the ultrasound imaging-guided robotic HIFU ablation system. This system was based on a previously-designed experimental ultrasound imaging-guided robotic HIFU ablation system [26] and combined with a respiratory simulation device in order to simulate the displacement of a tumor during the respiratory cycle. This system integrated an optical tracker (Northern Digital Inc., Polaris Spectra), dynamic reference frames (DRF, a tool with three IR reflective marker spheres), an ultrasound imaging system (Hitachi-Aloka Medical, ProSound Alpha 6), a HIFU ablation system (Sonic, H-106 probe with Instek, GFG-8255 signal generator and AR, 150A100B power amplifier), a robotic arm (YAMAHA, YK400XG), an experimental water tank, a notebook (Dell, M4500), and the novel respiratory simulation device.

Fig. 1
figure 1

System hardware architecture and respiration simulation device

An optical tracker and DRF were applied in this study for positioning purposes. The ultrasound probe scans the tumor phantom to obtain the position information of the tumor. The movement of the ultrasound probe is controlled by the motor-driven linear slide and detected by the optical tracker through the DRF, which is a reference coordinate frame tracked by the optical tracker. Through coordination transformation, the position of the tumor phantom relative to the ultrasound image frame can be transferred to and represented by the robot frame [26]. The robotic arm is thus able to bring the focal point of the HIFU transducer to aim at the tumor phantom. The signal generator and power amplifier are used to enable the HIFU transducer to generate high-intensity sound power for thermal therapy.

The upper right image in Fig. 1 shows the set-up of the respiration simulation device. The heaving chest was simulated by a cam, and a phantom that represented the liver tumor was installed at the end of the linkage, which was designed according to the displacement as described in [15]. The shape of the cam was designed in accordance with the actual up-and-down changes of the chest of a real person during respiration. A full-circle rotation of the cam driven by a motor indicated one respiratory cycle, which also generated forward–backward movement of the tumor phantom. The maximum displacement of the tumor phantom was 46.8 mm. The change in height of the cam was measured using a laser rangefinder (Keyence, IL-100), the value of which was transmitted to the computer in order to calculate the position of the tumor phantom.

The tumor phantom was developed based on a mixed gel of acrylamide and proteins (Tung et al. [27]). According to the protein characteristics, the progress of the HIFU ablation was monitored. In addition, Takegami [28] discovered a decremental relationship between the protein volume ratio and sound speed transmission. In order to better simulate human soft tissue, the volume ratio of protein was increased to 40% from 30% to obtain an attenuation coefficient of 2.507 (Np/m).

2.2 System Execution Procedure

A flowchart of the execution procedure of the ultrasound imaging-guided robotic HIFU ablation system is shown in Fig. 2. The procedure can be separated into three main sequential sections: acquisition of tumor images and heaving chest data, and 3D reconstruction of the tumor; establishment of the correlation between the heaving chest and tumor position; and development of a compensatory method of the robotic arm for the tracking delay to facilitate HIFU ablation. Details of the system execution procedure are given in the following sections.

Fig. 2
figure 2

System execution flowchart

2.2.1 Tumor Imaging, Heaving Chest Positional Data Acquisition and 3D Reconstruction

The position of the tumor changes and the chest heaves continuously during respiration. Therefore, in this study, six different heaving chest states in a respiratory cycle were chosen for position information recording and scanning of the tumor using an ultrasound probe. The six different states were as follows: maximal exhalation; maximal inhalation; 2 during the exhalation process; and 2 during the inhalation process. Meanwhile, positional information for the heaving chest was obtained by laser rangefinder measurement. Subsequently, the ultrasound images of the tumor were subjected to image processing. Along with its position information, we employed the marching cube method to reconstruct the tumor as a 3D model (Fig. 3). Details of the image processing and 3D reconstruction procedures are presented below.

Fig. 3
figure 3

Image processing and 3D reconstruction of the tumor

In the original tumor ultrasound images detected by the ultrasound probe, it was not easy to detect the edge position information directly due to noise. In order to obtain the edge position information for 3D reconstruction, image denoising was the first priority in image processing. First, we determined the region of interest (ROI) in each ultrasound image. Then, the erosion-dilation method was applied to the ROI of the original tumor ultrasound images for denoising. Subsequently, for boundary-finding purposes, the inner contour line of the tumor image had to be identified first. Otsu’s method was used in this study to ascertain threshold value, using which the tumor ultrasound images could be thresholded, following by Canny edge detection to ascertain the inner contour line of the tumor ultrasound images. Next, the boundary of the tumor ultrasound image was identified using marker-controlled watershed segmentation, under which the inner contour line of the tumor ultrasound image was taken as the inner marker and the boundary of the ROI was taken as the outer marker. Finally, the position information of the tumor had already been obtained by the optical tracker during scanning of the tumor. Therefore, the boundary information of the tumor ultrasound images (after image processing) could also be obtained by calculation. Along with this boundary position information, we employed the marching cube method to reconstruct the tumor as a 3D model.

OpenCV was selected as the tool for image processing in this study. The video capture card in this system was UPG-306, the ultrasound image resolution was 640*480, and the capture speed was 30 fps. In this study, approximately 300 ultrasound images were captured during each scan, and the image processing duration was approximately 15 s/per scan (of 300 ultrasound images). The resolution of the ROI in each tumor ultrasound image was approximately 90*90. The Visualization Toolkit (VTK) was selected as the tool for 3D reconstruction in this study, and the 3D reconstruction duration for one tumor in one position state was approximately 10 s (using Win 7 OS; CPU i72600; resolution of 640*480).

2.2.2 Establishment of the Correlation Between the Heaving Chest and Tumor Position

In order to obtain the tumor position from the positional information of the heaving chest, it was necessary to ascertain the correlative polynomial function between the heaving chest and the tumor position. First, the transformation matrices among the 3D tumor model frames under different heaving chest states were ascertained using the Iterative Closest Points (ICP) algorithm. By inputting the Source \({\text{S}} = \left\{ {P_{{s_{i} }} ,i = 0,1,2, \ldots ,k} \right\}\) and the Target \({\text{T}} = \left\{ {P_{{t_{i} }} ,i = 0,1,2, \ldots ,u} \right\}\), the transformation relationship between S and T was obtained. Equation 1 shows that the coordinates between the points in the S-group and T-group were correlated by the transformation matrix \(T_{s}^{t}\) between \(F_{t}\) (frames of the T-group) and \(F_{s}\) (frames of the S-group).

$$P_{t} = T_{s}^{t} P_{s}$$
(1)

Following establishment of the transformation matrix, any target ablation point \(P_{s}\) under frame \(F_{s}\) could be transformed by applying the transformation matrix \(T_{s}^{t}\) to obtain \(P_{t}\) (which is the ablation point under frame \(F_{t}\)). S and T represent different heaving chest states. Thus, finding the transformation matrix between two heaving chest states renders it a simple task to calculate the coordinate information of the target ablation point in the subsequent heaving chest state. The transformation relationship between S and T is shown in Fig. 4a.

Fig. 4
figure 4

a Transformation relationship of the scheduled target point of the frame system. b Relationship between the position value of the heaving chest and tumor position

Another key variable used to generate the heaving-chest-and-tumor-position polynomial function was the position value of the heaving chest \(L_{j}\), where “j” represented the heaving chest state (e.g., j = 0, maximal exhalation). The 3rd-order polynomial function was generated from the corresponding positions of the target point \(P_{tj}\) and the heaving chest \(L_{j}\) in each of the X, Y, and Z directions. Then, the heaving chest and tumor position correlation was established in each exhalation and inhalation state (Fig. 4b).

2.2.3 Compensatory Mechanism of the Robotic Arm for Tracking Delay and HIFU Ablation

Ideally, the robotic arm would be able to track the tumor accurately according to the heaving-chest-and-tumor-position polynomial function calculation results. However, the system operation procedure, information transmission, computer calculation and robotic arm movement caused tracking delay of the robotic arm (Fig. 5), which affected the accuracy of ablation. Therefore, the establishment of a compensatory mechanism was necessary in this case.

Fig. 5
figure 5

Illustration of the positional delay in tracking of the target point by the robotic arm

The method employed to calculate the delay was to record the position value of the HIFU focal point \(\left\{ {P_{{H_{iu} }} } \right\}\) without a compensatory mechanism by setting a DRF at the end of the robotic arm (\(i\): the number of the points; \(u\): the sampling interval, with 0.1 s as the unit time). The results were compared with the tumor target position \(\left\{ {P_{{L_{iu} }} } \right\}\), which was calculated using the heaving-chest-and-tumor-position polynomial function along with the position value of the chest (Fig. 4b). As shown in Fig. 5, there existed a time delay between HIFU focal point position \(P_{{H_{iu} }}\) and target point \(P_{{L_{iu} }}\). In order to ascertain the compensation time, \(ku\) was defined as the compensation time at the beginning (\(k\): an unknown integer; \(k\) = 0, 1, 2, …). An iterative search for the ku value was executed until the minimal summation error between \(P_{{H_{iu} }}\) and \(P_{{L_{iu} }}\) was found (Eq. 2). At this time, \(ku\) was taken to be the compensation time required for the robot to track the movement of target point.

$${\text{min}}\sum\nolimits_{i}^{N} {\left\| {P_{{H_{{\left( {i + k} \right)u}} }} - P_{{L_{{iu}} }} \;} \right\|} \;{\text{N}}:{\text{ number of target points}}$$
(2)

3 Results of Experiments

This research used a respiration simulation device to simulate the movement of the heaving chest, as well as the positional displacement of a liver tumor (using a tumor phantom). A DRF was set on the linkage of the respiration simulation device, as shown in the upper right image in Fig. 1, in order to track and detect the coordinates of the tumor phantom. Before the final experiment, the compensation time for the robotic arm had to be obtained in order to minimize the tracking error. The final experiment was conducted to evaluate the synchronized tracking accuracy. The experiment was separated into two parts: single-point tracking and cross-sectional ablation. Both used a thermocouple to measure the ablation temperature, and the tracking accuracy was judged according to the coagulation of the lesion.

3.1 Robotic Arm Tracking Compensation Experiment

The rotational speed of the respiration simulation device was driven by a motor, and the cycle duration was set to 9 s, which matched the normal respiratory cycle duration of an adult. During the experiment, the robotic arm controlled the HIFU focal point to track a target point (on the tumor phantom). The moving positions of the HIFU focal point and the target point were recorded simultaneously during a respiration cycle, as shown in Fig. 6a. These heaving chest positional data were then imported into the heaving-chest-and-tumor-position polynomial function to calculate the compensation time in order to minimize the distance error. Figure 6b shows that the focal point and the target point had the minimal distance error when the compensation time was 0.2 s.

Fig. 6
figure 6

a Correlation between the target points and focus points before and after compensation. b Correlation between the compensation time and the minimal average distance error. c HIFU tracking error after time compensation

After the compensation time had been found, the accuracy of target point tracing during three respiratory cycles was measured. The correlation between the heaving chest state and the compensated HIFU tracking error was as shown in Fig. 6c. The average tracking error after time compensation was 0.65 ± 0.31 mm.

3.2 Synchronous Tracking and Ablation Experiment

3.2.1 Single-Point Tracking and Ablation Experiment

As shown in Fig. 7a, a thermocouple was inserted into the tumor phantom, the top of which was the target point in this experiment. A needle-type probe of the optical tracker measured the tip of the thermocouple (the target point) in order to obtain the coordinates of the target point, which were then used as a control group and compared with those calculated using the heaving-chest-and-tumor-position polynomial function to obtain the tracking error of the HIFU focal point.

Fig. 7
figure 7

a Arrangement of the tumor phantom and thermocouple. b Variations of temperature and positioning error during ablation. c Relationship between positioning error and heaving chest state

In the experiment, the respiration simulation device was activated for 45 s, and the HIFU focus point driven by the robotic arm tracked and ablated the target point simultaneously. The temperature of the target point was recorded continuously in addition to the positioning error of the HIFU focal point, and the results are shown in Fig. 7b. The overall temperature was around 80 °C, which would be sufficient to kill tumor cells in a real case. The results also showed that when the positioning error increased, the temperature decreased significantly. The relationship between the HIFU focal point positioning error and the heaving chest state was investigated in a subsequent experiment, as described below.

A duration of three respiratory cycles was applied as the basis of the experiment conducted to investigate the relationship between the HIFU positioning error and the heaving chest state. The average positioning error over three respiratory cycles was 1.72 ± 1.26 mm. Figure 7c shows every positioning error and the position values of the heaving chest during the three respiratory cycles. Repeatability in positioning error was observed. In addition, significantly increasing positioning errors occurred during the inhalation and exhalation transition periods (blue dotted frames). This can be explained by the difference between the simulated trajectory and the actual trajectory of the target point, as detailed below.

3.2.2 Cross-Section Ablation Experiment

In order to more closely represent the real situation, Fig. 8a shows an oval-shaped tumor phantom (4 cm in length, 2 cm in width) that was wrapped in graphite agar and placed at the end of the linkage. The experimental process was the same as that used for the previous experiment. An optical tracker was also employed to track the coordinates of the tumor phantom, and the results were compared with the coordinates calculated using the heaving-chest-and-tumor-position polynomial function to obtain the positioning error of the HIFU focal point. In addition, a cross-section of the 3D tumor model was chosen as the target plane. Figure 8b shows the 25 points on the plane chosen to be ablated; the distance between each point was 2 mm, and the ablation route was of an “S” shape.

Fig. 8
figure 8

a Tumor phantom and b ablation plane for the cross-sectional ablation experiment. c Positioning error during synchronous tracking. d Comparison of ablation area

In the experiment, the respiration simulation device ran for 7 respiration cycles in order to ablate 25 target points. The average positioning error after importing the compensation time was 3.04 ± 1.24 mm. The results of the average positioning error and the position value of the heaving chest during the different heaving chest states are presented in Fig. 8c. It was observed that there was regularity in the positioning error, and larger positioning errors occurred during the inhalation and exhalation transition periods.

Figure 8d shows the planned route for ablation on the cross-section and the cross-section of the tumor phantom after ablation. The white coagulated lesion in the area of the planning route was obvious (owing to protein denaturation when the temperature was above 60 °C).

4 Discussion and Conclusion

To summarize the experimental results, the robot compensation time was 0.2 s, and the average positioning error in the single-point ablation experiment was 1.72 ± 1.26 mm, allowing the ablation temperature at the target point to remain stabilized at 80 °C, denaturing the proteins in the tumor phantom. Moreover, the results of the 3D cross-section ablation experiment showed an average positioning error of 3.04 ± 1.24 mm, which was larger than that observed in the single-point ablation experiment. In addition, in both the single-point ablation experiment and the cross-section ablation experiment, larger positioning errors occurred at the moment of change from inhalation to exhalation. It was speculated that the number of sampling points was too few to enable establishment of the heaving-chest-and-tumor-position polynomial function, resulting in positioning error between the simulated trajectory and the actual trajectory. However, according to the ablation results of the tumor phantom, this system achieved the expected outcome, the target ablation region being filled with the white coagulated lesion.

In this research, an ultrasound-guided robotic HIFU ablation system with respiration displacement and time delay compensation functions was developed. The main characteristic of this system to note is that the robotic arm guided the HIFU focal point to track and ablate the target point automatically and synchronously by importing the position values of the heaving chest and the compensation times. In future study, the number of tumor sampling positions will be increased in order to better approximate the actual tumor trajectory by the simulation trajectory, and real animal tissue will be used for further testing. In addition, different HIFU powers, tumor tissue categories, and thicknesses will be included as factors in order to evaluate the variation in positioning accuracy of the HIFU focal point.