Background

Laparoscopic surgery is a technique in which a laparoscope and surgical instruments are inserted into the patient's body through an artificial or natural opening, and the surgeon then operates the instruments while viewing the monitor image captured by the laparoscope [1]. Compared with open surgery, laparoscopic surgery has many advantages, such as shorter hospitalization times, a lower physical burden on patients and better cosmetic outcomes [2]. Although laparoscopic surgery has many advantages for patients, the technique is so difficult to perform that surgeons must undergo long-term training at a specialized medical training center. Even after such professional training, they still suffer from high mental stress during operations, which may reduce their dexterity or judgment [3]. One cause of these problems is that laparoscopic surgery requires another operator to hold the laparoscope for the surgeon; the coordination between operators therefore has a significant effect on the process and outcome of laparoscopic surgery [4].

Robotic technology is an effective way to tackle the problems mentioned above. Naviot [5] offers surgeons the possibility of solo surgery, but requires the surgeon to hold a controller in one hand whenever the direction of the laparoscope needs to be altered. The Da Vinci Surgical System [6] allows the operator to switch control between the surgical manipulators and the endoscopic manipulator using a foot pedal. However, such methods do not allow the surgeon to alter the operative field while simultaneously manipulating tissue [7].

Other systems, such as the automatic endoscope optimal positioning system (AESOP) from Computer Motion Inc. [8] and ViKY EP [9], use voice control to allow the operator to steer a robotic endoscope holder. However, rather than being shortened by voice control, surgical time is often increased because of its slow response [10].

Alongside voice control, eye tracking is another intuitive hands-free method that can be used as an input signal for manipulating the laparoscope holder. As a practical example, Ubeda et al. [11] used the electrooculography signal from electrodes attached around the user's eye to control a manipulator. In the video-oculography field, Noonan et al. [12] used a stand-alone eye tracker to alter the laparoscope direction based on gaze.

Single-port surgery (SPS) is a form of laparoscopic surgery that requires only one incision port, and it has attracted increasing attention from patients because of its cosmetic advantages [13]. Although robots enhance the performance of standardized laparoscopic techniques [14], current systems, including those mentioned in the previous paragraphs, are not suitable for SPS because they require multiple incisions. Therefore, our laboratory has developed two prototypes of an SPS assistive robot system. Prototype 1 [15] uses separate controllers for the tool manipulators and the laparoscope direction; in Prototype 2 [16], the control mode is switched between the tool manipulators and the endoscope direction by pressing a foot pedal, as in the Da Vinci Surgical System [6].

The purpose of this paper is to introduce a control method for altering the endoscope direction using pupil tracking into the Prototype 2 [16] system; this is achieved via image processing and a webcam. The system translates the captured image data into an output signal. We propose a threshold on the pupil movement distance: the sheath manipulator, which alters the endoscope direction, activates when the user's pupil movement distance exceeds the threshold value and remains static when it does not. Therefore, an appropriate threshold for the output signal needs to be determined to judge whether the movement state of the sheath manipulator should be dynamic or static. To determine this, we tested four threshold values one by one in a horizontal movement experiment. The experimental outcome variable was the time taken to move the field of view to a specific target, so that the most appropriate threshold could be judged from the minimum completion time. The first part of this paper describes how the pupil movement data are obtained, followed by the general system framework, including how the system integration is achieved. The second half of this paper describes an experiment to verify the effectiveness of the system and to obtain a proper activation threshold value, which is important for the operability of pupil tracking.

Methods

Acquisition method of pupil movement

In this study, a webcam was used to perform eye tracking. Webcams have an advantage over wearable gaze-tracking systems in that they need not be mounted on the user's head. Head-mounted displays or glasses-type wearable devices may induce additional physical and mental stress in the surgeon [17], whereas the presence of a webcam places little or negligible burden on the surgeon. During the development of this system, a near-infrared (IR) LED webcam was selected for one reason: under visible light, the pupils of East Asian people appear the same color as the iris, whereas under near-infrared light the pupil reflects white [18]. The near-infrared reflection of the pupil was therefore exploited in the image processing of the pupil movement data. It is also worth mentioning that near-infrared light is harmless to the eye; this kind of light has long been used in retina identification technology [19]. An infrared LED webcam (DC-NCR13U, Hanwha Q CELLS Japan Co., Ltd.) with a resolution of 1.3 megapixels was used in this research. Figure 1a shows the control console of the prototype [20] and the position of the webcam. Generally, a webcam is set on top of the display, but this may cause problems in capturing the whole eye because eyelashes or bangs may interfere. Thus, we set up the webcam in the position shown in Fig. 1b.

Fig. 1
figure 1

a Control console. b Set position of webcam

Applying non-rigid face tracking

An image processing method, non-rigid face tracking, was used to collect the pupil movement data in this study. This method involves machine learning. Its main merit over traditional face tracking methods is that, being non-rigid, it can detect an individual face in the entire image and can also measure the relative distances between feature positions, e.g., the corners of the mouth, the canthi and the pupils. Thus, this method was used for pupil movement detection. The program used for image processing was OpenCV 2.4.4, and the specific implementation of non-rigid face tracking follows [21]. An overview of the image processing is shown in Fig. 2. The complete procedure consists of image capture, annotation, geometrical constraints, training the detector, pupil tracking and setting the threshold.

Fig. 2
figure 2

Overview of image processing

Image capture

The first step was to use the IR LED webcam to capture as many of the operator's facial images as possible (Fig. 3). The more images the webcam captures, the more data the system has for training a robust face detector, because the data then cover different lighting conditions, facial positions and eye directions.

Fig. 3
figure 3

Image capture in different conditions

Annotation

The second step was to annotate tracking feature points on the facial images that highlight the movement of the eyes. In computer vision, corner points are easily detected, so we annotated the eye corners. Because our system must also track the pupils, we annotated the pupils as feature points as well (Fig. 4). After annotation, we indexed the symmetry points; for example, the two inner eye corners and the two pupils can be regarded as pairs of symmetry points. The symmetry indices can be used to mirror the captured images, which enlarges the training dataset. We then connected the annotated points (Fig. 5); these connections are used to visualize the pupil-tracking result. Finally, the remaining images, all of the same size and resolution, were processed using the previous steps. The annotation points from the previous image remain and appear on the next image; because they deviate from the actual facial feature positions, the mouse pointer is used to drag each annotation point onto the corresponding feature (Fig. 6).

Fig. 4
figure 4

Annotation of the feature points

Fig. 5
figure 5

Connecting the feature points

Fig. 6
figure 6

Correcting the positions of annotation points
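To make the mirroring step concrete, the following is a minimal sketch in C++ with OpenCV (the library used in this study) of how an annotated image can be flipped and its points remapped through the symmetry indices; the function name and argument layout are illustrative assumptions rather than the actual annotation tool.

```cpp
// Sketch only: mirror one annotated training sample using the symmetry indices.
// symmetry[i] gives the index of the point that is the left-right mirror of point i.
#include <opencv2/core/core.hpp>
#include <vector>

void mirrorSample(const cv::Mat& image,
                  const std::vector<cv::Point2f>& points,
                  const std::vector<int>& symmetry,
                  cv::Mat& mirroredImage,
                  std::vector<cv::Point2f>& mirroredPoints)
{
    cv::flip(image, mirroredImage, 1);  // flip around the vertical axis

    mirroredPoints.resize(points.size());
    for (std::size_t i = 0; i < points.size(); ++i) {
        // Take the mirrored counterpart of point i and reflect its x coordinate.
        const cv::Point2f& p = points[symmetry[i]];
        mirroredPoints[i] = cv::Point2f(image.cols - 1 - p.x, p.y);
    }
}
```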

Geometrical constraints

The set of annotations should correspond to physically consistent locations on the human face. During face tracking, the face in the image translates, rotates and scales, but the relative arrangement of the facial features, also known as the facial geometry, does not change. Therefore, the relationship between the annotation points should also be constrained according to this facial geometry. The annotated points were combined into a geometrical model using Procrustes Analysis [22], which handles variations in the pose of the user's face in the image such as translation, rotation and scale, and a shape model was thereby established (Fig. 7).

Fig. 7
figure 7

Shape model established by Procrustes Analysis
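The following minimal sketch, in C++ with OpenCV types, illustrates the core of this alignment step: a single annotated shape is registered to a reference shape with the least-squares similarity transform (translation, rotation and scale) used in Procrustes Analysis. It is a simplified illustration under our own naming, not the prototype's implementation, which iterates this alignment over the whole training set.

```cpp
// Sketch only: align one shape to a reference shape with a 2D similarity transform.
#include <opencv2/core/core.hpp>
#include <vector>
#include <cmath>

std::vector<cv::Point2f> alignShape(const std::vector<cv::Point2f>& shape,
                                    const std::vector<cv::Point2f>& ref)
{
    const int n = static_cast<int>(shape.size());

    // 1. Remove translation: center both shapes on their centroids.
    cv::Point2f cs(0, 0), cr(0, 0);
    for (int i = 0; i < n; ++i) { cs += shape[i]; cr += ref[i]; }
    cs *= 1.0f / n;  cr *= 1.0f / n;

    // 2. Least-squares scale and rotation mapping the centered shape onto the reference.
    float a = 0, b = 0, d = 0;
    for (int i = 0; i < n; ++i) {
        cv::Point2f p = shape[i] - cs, q = ref[i] - cr;
        a += p.x * q.x + p.y * q.y;   // sum of dot products
        b += p.x * q.y - p.y * q.x;   // sum of cross products
        d += p.x * p.x + p.y * p.y;   // squared norm of the centered shape
    }
    const float s = std::sqrt(a * a + b * b) / d;   // scale
    const float theta = std::atan2(b, a);           // rotation angle

    // 3. Apply scale, rotation and translation to every point.
    std::vector<cv::Point2f> out(n);
    const float c = s * std::cos(theta), sn = s * std::sin(theta);
    for (int i = 0; i < n; ++i) {
        cv::Point2f p = shape[i] - cs;
        out[i] = cv::Point2f(c * p.x - sn * p.y, sn * p.x + c * p.y) + cr;
    }
    return out;
}
```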

Training image patches

To make the annotation points track the facial features, we need to train discriminative patch models [21]. The annotated images mentioned in the Annotation section comprise the training dataset. An image patch is trained independently for every annotated point, and the trained patch is cross-correlated with an image region containing that point. The patch then responds strongly to the part of the image containing the facial feature and weakly everywhere else. Figure 8 shows the training process, in which the patches for detecting the eye corners and pupils are trained; the result of the patch training is shown in Fig. 9.

Fig. 8
figure 8

Process of training image patches

Fig. 9
figure 9

Trained facial feature patches
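As an illustration of how a trained patch is used at run time, the following minimal sketch (C++ with OpenCV 2.4) cross-correlates a patch with a search window around the previous feature position and returns the location of the strongest response; the search-window size and function name are illustrative assumptions.

```cpp
// Sketch only: detect a facial feature by cross-correlating its trained patch
// with a search window around the previous feature location.
#include <opencv2/imgproc/imgproc.hpp>

cv::Point2f detectFeature(const cv::Mat& grayFrame,   // current camera frame (8-bit gray)
                          const cv::Mat& patch,       // trained patch (e.g., around a pupil)
                          const cv::Point2f& prev,    // previous feature position
                          int search = 20)            // half-size of the search window
{
    // Clip the search window to the image borders.
    cv::Rect roi(cv::Point(cvRound(prev.x) - search, cvRound(prev.y) - search),
                 cv::Size(2 * search + patch.cols, 2 * search + patch.rows));
    roi &= cv::Rect(0, 0, grayFrame.cols, grayFrame.rows);

    // Normalized cross-correlation of the patch with the search window.
    cv::Mat response;
    cv::matchTemplate(grayFrame(roi), patch, response, CV_TM_CCORR_NORMED);

    // The peak of the response map gives the new feature location.
    cv::Point peak;
    cv::minMaxLoc(response, 0, 0, 0, &peak);
    return cv::Point2f(roi.x + peak.x + patch.cols * 0.5f,
                       roi.y + peak.y + patch.rows * 0.5f);
}
```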

Pupil tracking

Before performing pupil tracking, the last step is to combine the shape model and the image patches into one tracker model (Fig. 10). This step is necessary because an image patch may respond strongly at an incorrect facial feature point. The geometrical model provides an accurate estimate of each facial feature position, and the features are then detected with the cross-correlation image patches, yielding good robustness and efficiency; in other words, the shape model restricts the matching region of the image patches. In addition, initializing pupil tracking requires placing the tracker model on the user's face in the image. For this step we use OpenCV's built-in face detector to find the bounding box of the user's face in the captured images, and the tracker model then starts tracking from that box region.

Fig. 10
figure 10

Combination of the shape model and the image patches
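The initialization step can be illustrated with the following minimal sketch using OpenCV's built-in Haar cascade face detector; the cascade file and parameter values are the stock OpenCV defaults rather than project-specific settings.

```cpp
// Sketch only: find the face bounding box in the first frames to initialize the tracker.
#include <opencv2/objdetect/objdetect.hpp>
#include <opencv2/highgui/highgui.hpp>
#include <opencv2/imgproc/imgproc.hpp>
#include <vector>

cv::Rect initFaceRegion(cv::VideoCapture& cam, cv::CascadeClassifier& detector)
{
    cv::Mat frame, gray;
    std::vector<cv::Rect> faces;

    while (faces.empty() && cam.read(frame)) {
        // Convert to grayscale (skip if the webcam already delivers single-channel frames).
        if (frame.channels() == 3)
            cv::cvtColor(frame, gray, CV_BGR2GRAY);
        else
            gray = frame;
        cv::equalizeHist(gray, gray);
        // Detect the largest frontal face in the frame.
        detector.detectMultiScale(gray, faces, 1.1, 3,
                                  CV_HAAR_FIND_BIGGEST_OBJECT, cv::Size(80, 80));
    }
    return faces.empty() ? cv::Rect() : faces[0];
}

// Usage (illustrative):
//   cv::CascadeClassifier detector("haarcascade_frontalface_alt.xml");
//   cv::VideoCapture cam(0);
//   cv::Rect faceBox = initFaceRegion(cam, detector);
```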

After completing the above steps, the face tracker was able to perform pupil tracking, because the positions of both inner canthi and both pupils could be obtained from the tracking trajectory. As shown in Fig. 11a, the midpoint of the two pupils is labeled as the active point (red circle), while the midpoint of the two inner canthi is labeled as the static point (blue circle). As shown in Fig. 11b, the variation in the distance between the active point and the static point was taken as the output signal.

Fig. 11
figure 11

a The static point is the middle position between the two inner canthi, and the active point is the middle position between the two pupils. b The position of the active point changes based on the movements of the pupils
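As a concrete illustration of how the output signal is formed from the tracked points, the following minimal sketch computes the static point, the active point, the inter-canthus distance D and the horizontal offset ΔX that feed the control law described below; the structure and function names are illustrative.

```cpp
// Sketch only: form the control signal from the four tracked feature points.
#include <opencv2/core/core.hpp>
#include <cmath>

struct EyeSignal { double D; double dX; };

EyeSignal computeSignal(const cv::Point2f& leftCanthus, const cv::Point2f& rightCanthus,
                        const cv::Point2f& leftPupil,   const cv::Point2f& rightPupil)
{
    // Static point: midpoint of the inner canthi (does not move with gaze).
    cv::Point2f staticPt = (leftCanthus + rightCanthus) * 0.5f;
    // Active point: midpoint of the two pupils (moves with gaze).
    cv::Point2f activePt = (leftPupil + rightPupil) * 0.5f;

    EyeSignal s;
    cv::Point2f dc = rightCanthus - leftCanthus;
    s.D  = std::sqrt(dc.x * dc.x + dc.y * dc.y);   // inter-canthus distance D [pixel]
    s.dX = activePt.x - staticPt.x;                // horizontal offset dX [pixel]
    return s;
}
```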

Besides up–down and left–right movements, human eye movement also includes unconscious movements such as blinking, saccades and tremor [23]. The output signal caused by these movements is regarded as noise by the control system; therefore, the moving average method was used to filter the noise. The average was calculated over 10 samples, giving a filter window of 400 ms at the pupil-tracking frequency of 25 Hz.
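A minimal sketch of such a moving-average filter is shown below: the last 10 samples (400 ms at 25 Hz) are averaged before the value is passed to the control system. The class name is illustrative.

```cpp
// Sketch only: moving-average filter for suppressing blink and saccade noise.
#include <deque>
#include <cstddef>

class MovingAverage {
public:
    explicit MovingAverage(std::size_t window = 10) : window_(window), sum_(0.0) {}

    double update(double sample)
    {
        buf_.push_back(sample);
        sum_ += sample;
        if (buf_.size() > window_) {        // drop the oldest sample
            sum_ -= buf_.front();
            buf_.pop_front();
        }
        return sum_ / buf_.size();          // filtered value
    }

private:
    std::size_t window_;
    std::deque<double> buf_;
    double sum_;
};
```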

Figure 12 shows the internal processing of pupil tracking for the control system. Two parameters, D (pixel) and ΔX (pixel), can be obtained quantitatively via image processing: D represents the distance between the two inner canthi, and ΔX represents the horizontal distance between the static point and the active point. With the introduction of a parameter t, D/t can be set as an adjustable activation threshold. If the absolute value of ΔX exceeds the activation threshold D/t, as shown in Eq. (1), the sheath manipulator starts to move.

Fig. 12
figure 12

Quantization: a D represents the distance between the two inner canthi; b ΔX represents the horizontal distance between the static point and the active point

$$\left| \Delta X \right| > \frac{D}{t}$$
(1)

Conversely, if the absolute value of ∆X does not exceed the activation threshold value D/t, as shown in Eq. (2), the sheath manipulator will remain static.

$$\left| \Delta X \right| \le \frac{D}{t}$$
(2)

In this control system, Eq. (3) is used to control two servomotors, which are the actuators of the sheath manipulator.

$$V = \frac{TargetAngle - PresentAngle}{T_{cycle}}$$
(3)

In Eq. (3), V indicates the angular velocity of the servomotors, TargetAngle indicates the target angle of the servomotors, PresentAngle indicates their current angle, and Tcycle indicates the sampling period of the servomotors. ΔX can be regarded as the input signal that controls the servomotors according to Eq. (4).

$$P \cdot (TargetAngle - PresentAngle) = \Delta X$$
(4)

P indicates the gain value and P∙Tcycle = 800.

By combining Eqs. (1)–(4), Eq. (5) is obtained.

$$V = \begin{cases} \alpha \Delta X & \left( \left| \Delta X \right| > \frac{D}{t} \right) \\ 0 & \left( \left| \Delta X \right| \le \frac{D}{t} \right) \end{cases} \qquad \left( \alpha = \frac{1}{P \cdot T_{cycle}} \right)$$
(5)

As shown in Eq. (5), ΔX is sent to the servomotors in a proportional manner when its absolute value exceeds the activation threshold D/t. Conversely, no signal is sent to the servomotors when the absolute value of ΔX does not exceed the threshold. Eq. (5) is graphed in Fig. 13, where Limitation indicates the limits of the pupil movement range.

Fig. 13
figure 13

Relationship between pupil tracking and output signal. Horizontal axis indicates the horizontal distance value. Vertical axis indicates the output signal value
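For clarity, Eq. (5) can also be written directly as code. The following minimal sketch converts the filtered horizontal offset ΔX into a servomotor velocity command, using α = 1/(P·Tcycle) = 1/800 from Eq. (4); the function name is illustrative.

```cpp
// Sketch of Eq. (5): proportional velocity command above the activation threshold D/t.
#include <cmath>

double commandVelocity(double dX,  // horizontal offset of the active point [pixel]
                       double D,   // distance between the inner canthi [pixel]
                       double t)   // threshold divisor (e.g., 14, 18, 22 or 24)
{
    const double alpha = 1.0 / 800.0;   // alpha = 1 / (P * Tcycle), with P * Tcycle = 800
    const double threshold = D / t;     // activation threshold D/t [pixel]

    if (std::fabs(dX) > threshold)
        return alpha * dX;              // manipulator moves proportionally to dX
    return 0.0;                         // below the threshold the manipulator stays static
}
```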

Performance analysis

As mentioned above, the frequency of image processing was 25 Hz, so the corresponding time delay was 40 ms. A frequency of 25 Hz or greater is regarded as real time [24, 25]. Moreover, the response time of the manipulator was less than 100 ms [15]. Therefore, the overall delay was acceptable because it did not exceed 330 ms, which has been estimated as the maximum time delay compatible with the safe performance of surgical manipulations [26].

Before designing an experiment to determine the proper activation threshold, we needed to decide the range of D/t. If D/t is too large, the user has to rotate their eyes so far that their gaze leaves the screen; if D/t is too small, the sheath manipulator activates even when the user keeps their eyes still. Therefore, three participants, engineering graduate students who did not wear glasses, tested the performance of the pupil-tracking method. During this test, we asked the participants to perform three kinds of eye motion: rotating the eyes to the maximum angle, gazing at the edge of the monitor, and keeping the eyes still while gazing straight ahead. Meanwhile, we recorded the variations in D and ΔX.

First, we built a personal tracker model for each of the three participants following the procedures described in the previous sections. Then, we asked the participants to perform the three kinds of eye motion.

For rotating the eyes to the maximum angle, we instructed each participant to rotate their eyes to the left and to the right twice each. The purpose of this task was to obtain the maximum value of ΔX/D. The results are shown in Fig. 14: on the horizontal axis, Nos. 1–4 represent the results for Participant 1, Nos. 5–8 those for Participant 2, and Nos. 9–12 those for Participant 3; the units of the vertical axis are pixels. The recorded values of D and ΔX are attached to the bottom of the graph. The average ΔX/D was calculated as 1/5.

Fig. 14
figure 14

The values of D and \(\varDelta X\) shown in the graph were obtained when participants were rotating their eyes to the maximum angle

Then, we asked the participants to gaze at the edge of the monitor shown in Fig. 1a. The purpose of this task was to observe the maximum value of ΔX/D when the range of visibility remained within the screen, because the user's gaze should not leave the monitor screen when controlling the endoscopic manipulator via pupil tracking. The distance between the screen and the participants' eyes was kept at 600 mm. The participants gazed at the edge of the monitor for 3 s in every trial, and were instructed to gaze at the left and right edges twice each. The results are shown in Fig. 15: on the horizontal axis, Nos. 1–4 represent the results of Participant 1, Nos. 5–8 those of Participant 2, and Nos. 9–12 those of Participant 3; the units of the vertical axis are pixels. The recorded values of D and ΔX are attached to the bottom of the graph, and the D and ΔX for each trial are averaged over the 3 s. Finally, the average ΔX/D was calculated as 1/7.

Fig. 15
figure 15

The values of D and \(\varDelta X\) shown in the graph were obtained when participants were gazing at the edge of the monitor screen

Ideally, ΔX is equal to 0 when the eyes remain still and look straight ahead. However, ΔX shows some variation that needs to be observed. In this task, every participant was asked to keep their eyes still and look straight ahead for 3 s. The results of the three participants are shown in Fig. 16. The average ΔX/D values shown in Fig. 16a, b are 1/40, and the average ΔX/D value shown in Fig. 16c is 1/25.

Fig. 16
figure 16

The units of the horizontal axis are seconds; the units of the vertical axis are pixels. a indicates the results from Participant 1; b indicates the results from Participant 2; c indicates the results from Participant 3

In the next section, we identify an appropriate activation threshold that can be used to determine whether the sheath manipulator should move or not. Based on the above results, we confirmed that the activation threshold should be selected from the range between D/7 and D/25.

Experiment

Purpose of the experiment

The aim of the present experiment is to verify the effectiveness of the image processing method applied to the sheath manipulator control system. Moreover, to judge whether the sheath manipulator is dynamic or static, an appropriate output signal threshold needs to be obtained. In this experiment, the activation thresholds in four conditions were evaluated using operation time.

Sheath manipulator

In this experiment, we used the sheath manipulator of SPS robot prototype 2 [15] as an experimental platform. The sheath manipulator, designed for adjusting the direction of the endoscope, bends through double-screw-drive mechanisms (Fig. 17) to change the view orientation. The bending portion consists of three double-screw-drive mechanisms so that it can achieve several kinds of movement: up–down, left–right and diagonal turning. Furthermore, the screws between each double-screw-drive mechanism are connected by universal joints. Figure 18 shows that the upper two universal joints allow the endoscope to bend, while the one underneath is for support. If the upper two joints rotate at the same speed and in the same direction, the sheath manipulator will bend in the vertical direction. Conversely, if they rotate at the same speed but in opposite directions, the sheath manipulator will bend in the horizontal direction. In addition, if only one upper joint rotates, the sheath manipulator will bend in the diagonal direction.

Fig. 17
figure 17

Double-screw-drive mechanisms

Fig. 18
figure 18

Universal joints

Communication between master and slave

The SPS robot has a remote control function based on a master–slave structure. The master is a control console programmed in C/C++ on a Windows PC. The slave is a dedicated computer that reads the encoders of the servomotors in real time and drives the servomotors according to the signal received from the master PC. The communication between master and slave is realized using the User Datagram Protocol (UDP) over Ethernet. Figure 19 shows the complete program architecture of this system.

Fig. 19
figure 19

Program architecture

The master computer ran Windows XP. The pupil-tracking program we propose was merged into the operation software of Prototype 2 and compiled with Visual Studio 2008. As shown in Fig. 19, the infrared LED webcam mounted on the master side first captures the images. The pupil movement data are then extracted from the captured images by the pupil-tracking program and converted into an output signal for controlling the servomotors. Subsequently, the output signal is sent to the slave via UDP. After receiving the output signal, the slave computer activates the servomotors to drive the double-screw-drive mechanisms [15], thus adjusting the direction of the endoscope.
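To illustrate the master-to-slave communication, the following minimal sketch sends one command value as a UDP datagram using the Winsock API available on Windows XP; the IP address, port number and packet layout are illustrative assumptions, not the actual prototype configuration.

```cpp
// Sketch only: send one servomotor command from the master to the slave over UDP.
#include <winsock2.h>
#include <cstring>
#pragma comment(lib, "ws2_32.lib")

bool sendCommand(double velocityCmd)
{
    WSADATA wsa;
    if (WSAStartup(MAKEWORD(2, 2), &wsa) != 0) return false;

    SOCKET sock = socket(AF_INET, SOCK_DGRAM, IPPROTO_UDP);
    if (sock == INVALID_SOCKET) { WSACleanup(); return false; }

    sockaddr_in slave;
    std::memset(&slave, 0, sizeof(slave));
    slave.sin_family = AF_INET;
    slave.sin_port = htons(50000);                     // hypothetical slave port
    slave.sin_addr.s_addr = inet_addr("192.168.0.2");  // hypothetical slave IP address

    // Send the command as a single datagram (packet layout is illustrative).
    int sent = sendto(sock, reinterpret_cast<const char*>(&velocityCmd),
                      sizeof(velocityCmd), 0,
                      reinterpret_cast<const sockaddr*>(&slave), sizeof(slave));

    closesocket(sock);
    WSACleanup();
    return sent == static_cast<int>(sizeof(velocityCmd));
}
```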

Experimental conditions

The participants in this experiment were five graduate students who had no experience of endoscopic surgery. Figure 20 shows the experimental setup. The distance between the display and the participant's face was set between 50 and 70 cm, in accordance with the most ergonomic view [27]. As shown in Fig. 21, the operators used their pupils to operate the sheath manipulator, moving the image center point from "point 0" to "point 3" or "point 4", which were displayed on a chessboard with a 25 × 25 mm lattice. The chessboard was used to make the changes in the image easy to observe. The state in which the image center of the endoscope matched "point 0" was set as the initial state. The activation threshold D/t was tested under four conditions: t equal to 14, 18, 22 or 24. As mentioned in the Performance analysis section, the four conditions were selected from the range between D/7 and D/25. We selected the first condition as D/14, half the value of D/7. Because D/25 was the value obtained when participants kept their eyes still and looked straight ahead, the manipulator could not remain static with a threshold smaller than D/25; we therefore chose D/24 as the smallest condition. Between these, we selected two additional conditions.

Fig. 20
figure 20

Overview of experimental setup

Fig. 21
figure 21

Chessboard

Experimental procedure

Before starting the experimental procedure, a personal eye tracker model was established for each of the five participants as described in the "Methods" section. The flow of the experiment then proceeded as follows:

(a) The activation threshold was set for the control system.

(b) The system was tested to confirm that it altered the direction of view with the participant's pupil movement (Additional file 1).

(c) Initialization: the image center of the endoscope was moved to match "point 0" (Fig. 22).

Fig. 22
figure 22

Experimental initialization

(d) Each participant moved the image center of the endoscope from "point 0" to "point 3" four times, and from "point 0" to "point 4" four times. The time taken for every trial was recorded.

(e) The threshold value was changed and steps (b) to (d) were repeated.

The change order of the threshold was D/14, D/18, D/22 and D/24.

For the present experiment, it was important that the system reflected the operator's viewing intention so that the participants could confirm the position of the visual field during the experiment. To judge whether the sheath manipulator had reached the static state, each participant verbally confirmed when the operation was completed, i.e., when the image center of the endoscope had reached the target and stopped. The time measurement therefore ended when the participant reported that the movement was complete.

Results and discussion

All participants could alter the direction of the sheath manipulator with their pupil movement. Figures 23 and 24 show the results of the experiment: Fig. 23 shows the results of moving the image center of the endoscope from "point 0" to "point 3" (the endoscope turning to the left, Fig. 25), and Fig. 24 shows the results of moving it from "point 0" to "point 4" (the endoscope turning to the right, Fig. 26). The horizontal axis represents the operation time and the vertical axis represents the activation threshold D/t (pixel). As shown in Figs. 23 and 24, the average operation time was shortest and its standard deviation smallest when the activation threshold was set to D/22. In the D/22 condition, the average operation time was 5.2 s with a standard deviation of 1.9 s when the endoscope turned to the left (Fig. 25), and 4.2 s with a standard deviation of 0.8 s when it turned to the right (Fig. 26).

Fig. 23
figure 23

Experimental results from point 0 to point 3

Fig. 24
figure 24

Experimental results from point 0 to point 4

Fig. 25
figure 25

Sheath manipulator turning to the left

Fig. 26
figure 26

Sheath manipulator turning to the right

For the operation system that alters the endoscope direction based on pupil position tracking, the operation time was shortest when the activation threshold was equal to D/22. Similarly, the standard deviation was smallest in both experiments when that threshold was selected. Therefore, it is appropriate to use D/22 as the activation threshold for the operation system of the proposed SPS robot. In addition, we found that operating the sheath manipulator via pupil tracking can provide good stability and response when an appropriate threshold value is used. Furthermore, the typical period between blinks is 6–8 s [28], and blinking at this rate prevents the eyes from fatiguing and drying out. Previous work [29, 30] suggested that the center of the monitor image should be aligned with the center of the operative field. If a target point ("point 0", "point 3" or "point 4") is projected onto the center of the monitor image, ΔX will not exceed the activation threshold, so the operator can stop at the desired target position.

We observed an obvious difference in operation time and standard deviation between the two experiments in the D/22 condition. The limitation of this experiment is that the numbers of participants and conditions were both rather small; therefore, there is limited evidence to support the optimal threshold value. Moreover, the exact relationship between the motion of the manipulator and the movement of the pupils was not confirmed. After analyzing the cause, we found that the difference arose from the intertwining of the flexible shafts set between the manipulators and the servomotors, which affects the stability and speed of rotation. As a solution to this problem, the flexible shafts need to be sheathed in pipes or secured to a fixation device so as to avoid intertwining. Furthermore, we hope to achieve better pupil tracking in the vertical direction by using a higher-resolution webcam.

With a view to inviting surgeons or medical trainees to participate in the manipulation experiment in the future, we presented the system and the experimental results to two endoscopic surgeons. After the presentation, the surgeons agreed on the usefulness of the proposal and stated the following:

1. The proposed system is more intuitive than voice control and foot pedals. Surgeons certainly need a hands-free strategy for manipulating the endoscope. Using a webcam and image processing is a good approach because it does not require the surgeon to wear additional devices, which could increase their burden during an operation.

2. The proposed system is useful because the manipulator stops when the center of the visual field aligns with the target. One of the fundamentals of manipulating the endoscope is aligning the center of the visual field with the center of the operation.

3. The horizontal alteration of the visual field is greater than the vertical, but vertical alteration is still indispensable.

4. A zoom function is indispensable for an endoscopic control system when the surgeon is performing a delicate operation; for example, the surgeon would ideally like to zoom in on the visual field when peeling tissue from around a vessel. Moreover, in a future system, it would be better to be able to adjust the rate of visual field alteration according to the magnification of the lens.

5. An emergency stop button is an indispensable part of the system, to avoid a collision between the endoscope and tissue.

Conclusions

In this paper, a hands-free technique for controlling the alteration of the endoscope direction, using pupil tracking realized through image processing, was introduced. The novelty of the proposed method is its ability to achieve pupil tracking because the positions of the inner canthi and the pupils, and hence the variation in the distance between them, can be obtained from the tracking trajectory. In this method, an appropriate output signal threshold must be obtained for judging whether the sheath manipulator, which alters the endoscope direction, should be dynamic or static. An experiment was performed to verify the effectiveness of the image processing method applied to the sheath manipulator control system and to determine the activation threshold of the control system for horizontal movement of the sheath manipulator. We found an activation threshold value that provides stability and responsiveness simultaneously.

In this study, we verified only the horizontal direction because of the limitations of our method. At present, it is quite difficult for the sheath manipulator to make vertical movements, because the vertical movement range of the eye is much smaller than its horizontal range. To realize vertical movement, we need a higher-resolution webcam to detect the relatively small vertical movement of the eyes. As the surgeons pointed out, a zoom function is indispensable for endoscope manipulation; using only the pupil position parameter, as in this experiment, is not sufficient to achieve such a function. In future work, we aim to develop an algorithm that covers a larger number of operating conditions to judge the movement state and to achieve more types of movement. We will also improve the current control system based on the surgeons' suggestions, and we plan to invite surgeons and medical trainees to participate in future manipulation experiments.