Laparoscopic surgery has advantages in terms of low invasiveness or esthetic outcome. However, complicated surgery using the laparoscopic approach is still challenging compared with open surgery due to technical limitations with the planar image from two-dimensional (2D) monitors. The recent utilization of stereoscopic vision from a three-dimensional (3D) monitor in laparoscopic surgery has overcome a lack in depth perception and improved surgical performance, such as operative time and accuracy [17]. Further, laparoscopic performance using a 3D monitor by novices who had never performed laparoscopic surgery was also improved in relation to superior accuracy and dexterity compared with that under 2D [812]. However, the negative or positive effects of laparoscopic training of novices using 3D vision are still unclear. In this study, we compared operative data, including the path length of forceps, under 3D and 2D vision by 3D optical tracking systems in a laparoscopic training box and clarified the effects of 3D vision for novices in the early phase of training.

Materials and methods

Training equipment

Figure 1A shows the training setup. For the assessment of laparoscopic performance using stereoscopic images captured by a 3D system, we set the monitor on place where a subject directly faced the display with viewing distance ranged about 1.2 m and viewer’s eye level is the same as the middle of the display. This setting was simulated as similar as possible to the usual position of surgeon and monitor in actual laparoscopic surgery in our institution. The distance between the tip of the endoscope and the object of the operation was fixed at 15 cm.

Fig. 1
figure 1

Setup for recording performance of participants. A The apparatus consisted of a monitor, position tracker, and training box including a phantom task. B Device for task 1 (touching markers on the flat disk with a rod). C Device for task 2 (straight rod transfer through single loop). D Device for task 3 (curved rod transfer through two loops)

Devices used were a 3D laparoscope and 3D laparoscopy system (Olympus Medical Systems, Tokyo, Japan). With this laparoscope, two charge-coupled device (CCD) image sensors are located at the distal end of the laparoscope to provide left and right images, respectively. These two image signals are processed by a special-purpose video system to generate a high-resolution 3D image that is then displayed on a 3D monitor and viewed through 3D glasses to provide realistic 3D images. This system can be switched from providing 3D images to 2D images by pressing a switch on the scope.

The micron tracker (Claron Technology, Toronto, Canada) consists of CCD image sensors. This system enables the tracking and recording of 3D coordinates and rotation of forceps with an optical marker. Data are obtained as text data and can be analyzed by statistical software.

Study design

Study participants were 40 novices who had never performed laparoscopic surgery as an operator (20 medical student volunteers and 20 junior resident volunteers). The sample size was determined based on the outcome of our preliminary study. Participants performed three different laparoscopic phantom tasks in the training box five times each (one set) and performances were recorded by the optical position tracker in order to analyze the motion of forceps in a 3D space. Participants were randomly divided into two groups: one group performed a set of tasks initially under the 2D system (2D start group) and the other group performed the tasks under the 3D system (3D start group). Then both groups again performed the same set of tasks after switching to the alternate system (Fig. 2). All participants completed the tasks using 2D and 3D monitors with the same settings and on the same day. After participants completed the second set of tasks, we compared operative times, path lengths of forceps, and technical errors between systems. In addition, we investigated the learning curves for each group throughout all attempts.

Fig. 2
figure 2

Diagram of study design

The purpose of the study was sufficiently explained, and consent was obtained from all participants of this study before the trial.

Tasks

Task 1: touching markers on the flat disk with a rod (a simple operation) (Fig. 1B)

The metal rod on the center of the flat disk was grasped with a lockable, endoscopic needle holder by the right hand, then was moved to touch the metal markers in order from No. 1 to 5. After touching the No. 5 marker, the rod was passed to a needle holder in the left hand and then moved back to touch the metal markers No. 4–1 in that order. Failure to pass the rod from the right to the left hand and not touching the markers were counted as errors.

Task 2: straight rod transfer through single loop (Fig. 1C)

The metal rod on the right side of the flat table was grasped with an endoscopic needle holder by the right hand and then was transferred through the loop on the center of the table, without touching the loop, to the left side. After the transfer, the rod was passed to a needle holder in the left hand and touched the black dot on the left side. The rod was then moved back through the loop to the right side and passed back to the right hand. Finally, the rod touched the black dot on the right side. Accidental touches between the rod and loop were counted as errors.

Task 3: curved rod transfer through two loops (Fig. 1D)

The curved metal rod on the right of the flat table was grasped with an endoscopic needle holder by the right hand and then was transferred through the loop on the center of the table, without touching the loop, to the left side. After its transfer to the left side, the rod was passed to a needle holder in the left hand and touched the black dot on the left side. The rod was then moved back through the other loop to the right side and was passed back to the right hand. Finally, the rod touched the black dot on the right side. Accidental touches between the rod and loop were counted as errors.

Statistical analysis

Measured data were recorded and assessed by a single investigator. Forceps path lengths were calculated as follows:

$$P_{1} \left( {x_{1} , \, y_{1} , \, z_{1} } \right), \, P_{2} \left( {x_{2} , \, y_{2} , \, z_{2} } \right), \, \ldots \, P_{n} \left( {x_{n} , \, y_{n} , \, z_{n} } \right)$$
$${\text{Sum of path lengths}} = \sum\limits_{n = 2}^{N} {\sqrt {(x_{n} - x_{n - 1} )^{2} + (y_{n} - y_{n - 1} )^{2} + (z_{n} - z_{n - 1} )^{2} } },$$

where P 1, P 2, … P n represent tip position of the forceps at each time frame (one time frame = 0.05 s).

Statistical analysis of the data was performed using JMP ver.8.0 software (SAS Institute Inc., Cary, NC, USA). All data are presented as the median value and the mean ± standard error of the mean (SEM). The comparisons between groups were performed by the Wilcoxon signed-rank test or the Wilcoxon rank-sum test. P < 0.05 was considered significant.

Results

3D monitor versus 2D monitor

Figure 3A shows comparisons of the average scores of all participants for each task between 2D and 3D systems. Median operative times were 22.9 and 22.0 s (P = 0.047) for task 1 with 2D and 3D systems, 26.2 and 19.5 s (P < 0.001) for task 2 with 2D and 3D systems, and 40.4 and 36.0 s (P < 0.001) for task 3 with 2D and 3D systems, respectively. Median path lengths were 1407.3 and 1276.6 mm (P = 0.120) with 2D and 3D systems for task 1; 1291.1 and 1009.1 mm (P < 0.001) with 2D and 3D systems for task 2; and 1961.7 and 1759.5 mm (P = 0.001) with 2D and 3D systems for task 3, respectively. Median technical errors (times) were 1.2 and 0.7 (P < 0.001) with 2D and 3D systems for task 1; 2.8 and 0.9 (P < 0.001) with 2D and 3D systems for task 2; and 10.5 and 6.9 (P < 0.001) with 2D and 3D systems for task 3, respectively. Each score for all tasks performed under 3D was significantly better compared to those with the 2D system except for mean path length in task 1. Although there was no significant difference in mean path length in task 1 between 2D and 3D according to the statistical analysis, the score for the 3D system was shorter than that for the 2D system.

Fig. 3
figure 3

A Comparison between 2D and 3D systems regarding average scores (operative times, path length of forceps, technical errors) of all participants. B Learning curves of 2D start group and 3D start group. During early phase of training, the performance of the 3D start group was superior to the 2D start group. However, during the later phase of training, the performance of the 3D start group became worse than the 2D start group

We also investigated the average scores of 2D start group and 3D start group with only early phase (from the first to fifth attempts) to work out whether novice trained under a 3D monitor will learn faster than under a 2D monitor or not. The operative time during early phase among 3D group was shorter than that among 2D group only in task 2 [median operative time, 2D vs. 3D (s): task 1, 25.3 vs. 23.7, P = 0.350; task 2, 26.3 vs. 23.2, P = 0.001; and task 3, 40.7 vs. 39.1, P = 0.374]. The path lengths during early phase among 3D group was shorter than that among 2D group in tasks 2 and 3 [median path lengths, 2D vs. 3D (mm): task 1, 1511.6 vs. 1401.5, P = 0.068; task 2, 1195.7 vs. 1059.3, P < 0.001; and task 3, 2054.9 vs. 1730.8, P = 0.008]. The technical errors during early phase among 3D group was shorter than that among 2D group in all tasks [median technical errors, 2D vs. 3D (times): task 1, 1.4 vs. 1.0, P = 0.047; task 2, 3.8 vs. 1.1, P < 0.001; and task 3, 12.8 vs. 8.6, P < 0.001). Each score for all tasks performed under 3D was relatively better compared to those with the 2D system. During early phase, the performance under 3D was significantly better than under 2D for all tasks, especially in technical error.

Analysis of learning curves

Figure 3B shows the learning curves for the 2D and 3D start groups throughout all attempts. During the first set (first to fifth attempts) of tasks, the 3D start group had shorter operative times and path lengths and fewer errors than the 2D start group. On the other hand, during the later set (sixth to tenth attempts) of tasks, the 3D start group had longer operative times and path lengths and more frequent errors after switching to the 2D system as opposed to performance of the first set.

Table 1 shows the comparison of scores between the first to third attempts and the eighth to tenth attempts at each task to determine whether performance was improved from the early phase to later phases by repetition. Each task 1 score in both groups was significantly improved from the early phase to the later phase of repetition, except that technical errors for the 3D start group in the later phase were similar to those in the early phase. In task 2, all scores in the 2D start group were significantly improved from the early phase to the later phase in contrast to opposite findings for the 3D start group. In the 3D start group, operative times and path lengths were not improved by repetition, and technical errors in the later phase were significantly worse than in the early phase. In task 3, all scores for the 2D start group were significantly improved from the early phase to the later phase, whereas all scores for the 3D start group were similar to those in the early phase.

Table 1 Comparison between the first to third attempts and the eighth to tenth attempts

Table 2 shows the scores for the fourth to fifth attempts and the sixth to seventh attempts by the 2D start group and 3D start group to reveal whether performance was improved after switching to the opposite monitor.

Table 2 Comparison between the fourth to fifth attempts and the sixth to seventh attempts

In task 1, mean operative times and mean path lengths in both groups did not differ significantly after switching to the opposite system. However, technical errors by the 2D start group were significantly improved after switching to the 3D system, whereas the scores for the 3D start group were similar after switching to the 2D system. In task 2, all 2D start group scores were significantly improved just after switching to the 3D system as opposed to the significantly worse scores for the 3D start group after switching to the 2D system. For task 3, mean operative times and mean path lengths in both groups did not differ significantly after switching to the opposite system. However, technical errors by the 2D start group were significantly improved after switching to the 3D system, whereas technical errors were significantly worse in the 3D start group after switching to the 2D system.

Discussion

Several studies have proposed that stereoscopic vision using 3D results in benefits in laparoscopic performance by trainees [812]. The present study also showed that the use of a 3D monitor for training improved laparoscopic surgical performance by novices. Blavier et al. [13] reported a shorter operative time by novices who trained in the 2D environment after switching to a 3D environment and a longer operative time for novices who trained in the 3D environment after switching to a 2D environment. We also showed that the use of a 2D monitor impaired the performance of trainees after training using 3D indicated by a longer operative time due to longer path lengths and more frequent errors. Saseem et al. [14] reported similar findings, especially in complicated tasks. Our results showed that all scores for performance of task 1 (simple task) (median operative time, path length, and technical errors) in the 3D start group did not differ significantly between just before and after switching to a 2D monitor. However, in more complicated tasks, the scores for the 3D start group obviously worsened just after switching monitors, especially the score for technical errors. This inaccurate performance by novices after the loss of depth perception using 3D may be the major reason for the longer operative time under 2D vision.

We considered that there would be differences in the learning effect among trainees depending on their degree of proficiency as surgeons. Therefore, under the same setting, the differences in the learning effect among experienced surgeons might be shorter compared with that among novices. We previously revealed that scores for performance of laparoscopic sutures in the training box by moderately experienced surgeons became significantly worse after switching monitors from 3D to 2D [15]. However, in the task of straight rod transfer through a single loop (which was the same setting as task 2 in this study), the scores for performance by moderately experienced surgeons did not worsen after switching monitors from 3D to 2D. On the other hand, in the present study, the performance of task 2 by novices became significantly worse after switching monitors from 3D to 2D. This suggests that the deleterious effect of loss with depth perception using 3D is dependent not only on the difficulty of the task but also on the lack of experience of the surgeon. When a novice who is trained only using a 3D monitor participates in laparoscopic surgery using 2D for the first time, the surgical team should pay attention to the possibility of more frequent technical errors and longer operative times.

There are several limitations in this study. First, we did not take into consideration the unsuitability of each participant for stereo vision. Several studies about 3D indicated that there are some people who have disadvantages of stereo vision, such as stereo blindness [16, 17], or the side effects (feeling discomfort, headache, or sickness) using a 3D monitor [18]. However, each participant in this study improved their performance and did not feel any discomfort or sickness as side effects by 3D during the trial. We considered that the reason why the participants had no side effects might be stabilizing stereoscope not to move the background displayed on the monitor, or quite short operative time for each subject compared to actual surgery. Second, tasks in this study were simply designed just for novice. Therefore, the statistical difference of scores in each task might not explain the clinical importance by itself and it is unclear whether our performance data from novices using the training box can be extrapolated to actual laparoscopic surgery. However, in actual surgery, there are countless situations that request to perform simple operation such like task 1 and accumulation of each slight difference might become quite a large difference. Therefore, it remains unclear whether novices should start laparoscopic training or experience their first surgery using 3D or 2D. However, considering the obvious better performance of novices using 3D compared with 2D, laparoscopic surgery using 3D is recommended for novices as far as the environment permits. The more frequent errors by novices under 2D compared with 3D in the training box indicate that with poorly experienced surgeons more frequent accidents such as bleeding or injury could occur during actual surgery when switching from 3D to 2D.

In conclusion, for novices, stereoscopic images improved their laparoscopic surgical techniques from the early phase of training. However, it is important to understand the learning effect showing that performance of novices trained only by 3D worsened after changing to the 2D environment even for a simple task.