Background

Today it is generally accepted that the traditional apprentice-tutor model is no longer valid for training all skills necessary for laparoscopic surgery [1]. This agreement is based upon the recognition that, in contrast with open surgery, laparoscopic surgery demands surgical skills and psychomotor skills that not necessarily should be trained together. Indeed, increasing evidences strongly suggests that psychomotor skills must be trained earlier and outside the operating room, and several models have been proposed for this aim [2,3,4,5,6,7].

Among these validated training models is the Laparoscopic Skills Training and Testing (LASTT) model, developed by The European Academy of Gynaecological Surgery, suitable for training basic laparoscopic psychomotor skills, such as laparoscopic camera navigation (LCN), hand-eye coordination (HEC), and bimanual coordination (BMC) [1,9,10,11,12,, 813].

Several studies in these models, including the LASTT model, have sufficiently proved that training improves laparoscopic skills [8,9,10, 14], which also applies specifically to training in box models as recently reported in a meta-analysis [15]. The majority of the studies base this conclusion upon measurements performed at two or very few points (before and after training). The effect of training however can be better appreciated if several points are taken into consideration, allowing tracking the improvement in performance over time, which is defined as a learning curve [16]. Although learning curves have been observed for many health technologies [17], only recently, they have become regularly used and reported for laparoscopic procedures [10,19,20,21,22,, 1823].

Following the first system (few measurements before and after training), we have demonstrated in a randomized controlled trial (RCT) performed in a population of residents and specialist in OB&GYN that HEC training with both the dominant hand (DH) and non-dominant hand (NDH) facilitates the acquisition [9] and retention [24] of more complex laparoscopic tasks, such as intra-corporeal knot tying (LICK). The present study was performed to evaluate in detail the learning curves of LICK after different pre-training conditions (no HEC training, HEC training with the DH only, and HEC training with both the DH and the NDH) from non-reported data of the same RCT mentioned above [9].

Methods

Participants and venue

The study was carried out in the Centro Médico La Costa in Asunción, Paraguay, and included 60 specialists/residents in OB&GYN with experience in open surgery but with no experience in laparoscopic surgery.

Instruments, materials, and laparoscopic tasks

The tasks were performed in the LASTT model inserted in the Szabo trainer box with standard laparoscopic instruments (Karl Storz, Tuttlingen, Germany).

Task 1 (hand-eye coordination)

The ability to grasp and transport six objects to six specific targets with both the DH and NDH, while navigating a camera was evaluated in a validated model, as described previously [9]. Briefly, with forceps held with the hand being evaluated and the camera with the contra-lateral hand, the six different objects were grasped and transported to their targets in a fixed order. The time for each repetition was limited to 600 s. The task finished either when the last object was transported to its target or when the time limit expired. The task executed with the DH (task 1a) was scored separately than the task executed with the NDH (task 1b).

Task 2 (laparoscopic intra-corporeal knot tying)

The ability to perform a LICK was evaluated in a validated model, as described previously [9]. A soft pad with two pre-mounted sutures (vicryl 2-0, 20 cm length), 1 cm between entry and exit sites, and tails equally distributed at both sites was fitted in the Szabo trainer box in a horizontal position. The optic was introduced through a midline port and the needle holders through lower and lateral ports. With a camera fixed at a distance that allowed the visualization of the entire operating field and the needle holders held with the DH and NDH, the tip of the thread was grasped and the thread was pulled through the pad, leaving a 2 cm tail on the opposite side. Then, a double counter-clockwise knot was made, followed by a single clockwise knot, and finally, by a single counter-clockwise knot. The time for each repetition was limited to 600 s. The task finished either when the participant considered he/she completed the knot or when the time limit had expired. Then, the tutor performed a quality control, and only the flat and square knots were considered correctly performed.

Scoring system

The measurements were based on the time taken for task completion system [6, 15, 21, 25]. Thus, if the task was successfully accomplished within the time limit, the score was the time actually used to execute the task, ranging from 1 to 600. However, if the task was not successfully accomplished within the time limit, a penalty score of 1200 was given.

Experimental design

The data presented in this study were collected but not reported at the time of an already published RCT [9]. Participants were randomly allocated to three different groups (G1, G2, and G3; n = 20 per group). Within each group, they worked in fixed pairs throughout the study. Working sessions of 1–2 h were performed 2–3 times a week in order to optimize the results, as reported by other authors [26]. A supervisor was present at the working station in all sessions to ascertain the set up was correctly ensemble and to score the tasks. The study was carried out in five phases.

  • Phase 1. All participants received full explanation and video demonstrations of the different tasks and then performed a test (T 1) (three repetitions of each task) to evaluate the baseline skills before any training.

  • Phase 2. Participants performed different training programs for HEC, according to the group they belong to. G1 trained both the DH and the NDH (60 repetitions of each task in alternating order). G2 trained the DH only (60 repetitions). G3 did not train HEC at all.

  • Phase 3. All participants performed a second test (T 2), in the same manner than at T 1, to evaluate the skills acquired after HEC training but before LICK training.

  • Phase 4. All participants performed a standard training program for LICK (60 repetitions).

  • Phase 5. All participants performed a third test (T 3), in the same manner than at T 1 and T 2, to evaluate the post-training skills.

Statistics and curve fitting

All statistical comparisons were performed using the GraphPad Prism 6 (GraphPad Software, San Diego, California, USA).

Intergroup differences in age were evaluated with one-way ANOVA, whereas differences in gender, DH side and training status with chi-square tests.

The scores registered at all points were plotted to produce the learning curves for each student (individual learning curves) and for each group (group learning curves). Nonlinear regression models were used to fit the data to the one- and two-phase exponential decay models.

The one-phase exponential decay model is expressed as Y = (Y 0 − Plateau) * exp (−K*X) + Plateau. The two-phase exponential decay model is expressed as Y = Plateau + SpanFast * exp (−KFast*X) + SpanSlow * exp (−KSlow*X), where SpanFast = (Y 0 − Plateau) * PercentFast * .01, and SpanSlow = (Y 0 − Plateau) * (100 − PercentFast) * .01. Y is a dependent variable (score), and X is an independent variable (number of the repetition). Y 0 is the Y value when X is zero (the starting point before any training). Plateau is the Y value at infinite times, expressed in the same units as Y (the theoretical best score that a subject could achieve with infinite practice). K, KFast, and KSlow are rate constant, expressed in reciprocal of the X units and which measures the steepness of the curve (higher values of K indicates faster learning). Span is the difference between Y 0 and Plateau, expressed in the same units as Y values. PercentFast is the percentage of the Span accounted for by the faster of the two components. For LICK, the Y 3, which represents the Y extrapolated value from X 3 (the first point of the curve immediately after HEC training/before LICK training), was also calculated.

The extra sum-of-squares F test was used to evaluate curve fitting (one phase vs. two phase) and if one single curve adequately fits for all groups. The curve parameters (continuous variable normally distributed) are presented as means ± SEM, and parametric test were used for statistical comparisons. For HEC, differences in the DH learning curves between G1 and G2 were evaluated with unpaired t test (two groups), whereas differences between DH and NDH in G1 were evaluated with paired t test (one group with two curves). For LICK, differences in the learning curves between G1, G2, and G3 were evaluated with one-way ANOVA with Tukey’s Multiple Comparison post-test (three groups). A two-tailed p value of <.05 was considered statistically significant.

Results

The demographics were already reported at the time of the first publication of this RCT [9]. The median age of the participants was 29 years (range 26–45 years), and gender was evenly distributed (50% males, 50% females, n = 30 each). The number of specialists (n = 20, 40%) was less than the number of residents (n = 40, 60%). As expected, the number of right-handed participants (n = 55, 92%) was greater than left-handed participants (n = 5, 8%). The demographics of the three groups are reported in Table 1. No intergroup differences were detected for any of the parameters.

Table 1 Participants’ demographics

For both HEC and LICK, the scores registered by each group at T 1, T 2, and T 3 were already reported in a previous study [9]. For the aims of the present study, the scores registered by each participant at all 69 repetitions (R0–R68) were plotted to evaluate the individual and the group learning curves.

HEC learning curves

The learning curves for the DH were evaluated in G1 and G2, whereas the learning curves for the NDH were evaluated in G1 only.

Most individual learning curves fitted better the one-phase model, whereas few of them fitted better to two-phase model or were ambiguous (did not fit to any model) (Fig. 1).

Fig. 1
figure 1

Hand-eye coordination (HEC). Individual learning curves. Participants performed 69 consecutive repetitions (R0–R68) of the task (G1, with both the dominant hand and the non-dominant hand; G2, with the dominant hand only; G3, none). The scores were plotted and individual learning curves were observed, fitting to one- or two-phase exponential decay model according to participants’ performance

The group learning curves (G1-DH, G1-NDH, G2-DH) fitted better to the two-phase exponential decay model (p < .0001 for all comparisons). However, one single type of curve did not adequately fits for G1-DH and G2-DH (p < .0001), neither for G1-DH and G1-NDH (p < .0001). For the DH, G1 and G2 started from a similar Y 0 (NS), but G1 reached a lower Plateau (p = .04), with a higher PercentFast (p = .01) and lower KFast (p = .02) and KSlow (p = .01). In G1, the DH curve started from a lower Y 0 (p < .0001) than the NDH curve, but both curves reached a similar Plateau with comparable PercentFast (NS), KFast (NS), and KSlow (NS) (Fig. 2 and Table 2).

Fig. 2
figure 2

Hand-eye coordination (HEC). Group learning curves. Participants performed 69 consecutive repetitions (R0–R68) of the task (G1, with both the dominant hand and the non-dominant hand; G2 with the dominant hand only; G3, none) during phase 1 (P1): R0–R2, phase 2 (P2): R3–R62, phase 3 (P3): R63–R65, phase 4 (P4): none, and phase 5 (P5): R66–R68. The scores were plotted and group learning curves were calculated. In all groups, the two-phase exponential decay model was the best fitting model

Table 2 Hand-eye coordination (HEC). Parameters of the learning curves

LICK learning curves

Most individual learning curves fitted better the one-phase model, whereas few of them fitted better to two-phase model or were ambiguous (did not fit to any model) (Fig. 3).

Fig. 3
figure 3

Laparoscopic intra-corporeal knot tying (LICK). Individual learning curves. Participants of G1, G2, and G3 performed 69 consecutive repetitions (R0–R68) of the task. The scores were plotted and individual learning curves were observed, fitting to one- or two-phase exponential decay model according to participants’ performance

The group learning curves fitted better to a two-phase exponential decay model (p < .0001 for all groups) (Fig. 4 and Table 3). However, one single type of curve did not adequately fit for all groups (p < .0001). All groups started from a similar Y 0 (NS) and reached a similar Plateau (NS), but the curve decays were different. Indeed, as soon as at Y 3, which represents the extrapolated value from X 3 (the first point of the curve immediately after HEC training/before LICK training), the curve values were already significantly different, G1 scoring lower than G2 (p < .05) and G3 (p < .05). The PercentFast of G1 was higher than of G2 (p < .05) and G3 (p < .05), but the differences in KFast and KSlow were not statistically different (Table 3 and Fig. 4).

Fig. 4
figure 4

Group laparoscopic intra-corporeal knot tying (LICK) learning curves. Participants of G1, G2, and G3 performed 69 consecutive repetitions (R0–R68) of the task during phase 1 (P1): R0–R2, phase 2 (P2): none, phase 3 (P3): R3–R62, phase 4 (P4): R63–R 65, and phase 5 (P5): R66–R68. The scores were plotted, and group learning curves were calculated. In all groups, the two-phase exponential decay model was the best fitting model

Table 3 Laparoscopic intra-corporeal knot tying (LICK). Parameters of the learning curves

Discussion

This study was performed for insight assessment of the data gathered in the frame of an already published RCT [9] in which changes in the performance of HEC and LICK at three different time points were evaluated. Indeed, in that study, the baseline scores before HEC training (T 1), after HEC training/before LICK training (T 2), and after LICK training (T 3) were evaluated, disregarding the scores registered at each of the 69 points of the study. In this study, the entire dataset was evaluated in order to characterize the learning curves of both HEC and LICK and, more specifically, to determine if pre-training HEC has an influence in the LICK learning curve.

For each task, the real scores of each individual participant were plotted, and obvious individual and group learning curves were observed, which were fitted to the one- and two-phase exponential decay models. An exponential decay equation models many chemical and biological processes. The one-phase model is used whenever the rate at which something happens is proportional to the amount that is left. The two-phase model is used when the outcome measured is the result of the sum of a fast and slow exponential decay, which is also called a double exponential decay. From these curves, the Y 0 (the starting point before any training), the Plateau (the theoretical best score that a subject could achieve with infinite practice), and the Span (the difference between Y 0 and the Plateau) were calculated. From the curves fitted to the one-phase exponential decay model the learning constant (K) was also calculated. From the curves fitted to the two-phase exponential decay model the learning constants (KFast and KSlow) and the PercentFast (the proportion of the Span accounting for the faster component of the decay) were also calculated.

The individual curves denoted a lot of variability between surgeons, specifically at the beginning of the curves, reflecting the natural heterogeneity in the population (Figs. 1 and 3). The variability, however, decreased significantly at the end of the curves, indicating the positive influence of training regardless the personal characteristics. For both HEC and LICK, some individual curves fitted better to the one-phase exponential decay model, whereas others fitted better to the two-phase exponential decay model regardless the training program.

The learning curves of HEC were characterized for the use of the DH in G1 and G2 and of the NDH in G1. All group curves fitted better to the two-phase exponential decay model. Differences between the DH and the NDH learning curves were evaluated in G1. Although no statistical significant differences were detected in the Plateau, the PercentFast and the learning constants, the NDH curve started from a higher Y 0. This is consistent with our previous report comparing the scores at three specific points, in which the DH scores were better (lower) than the NDH scores before any training (T 1), after HEC training/before LICK training (T 2), and after LICK training (T 3) [9]. We, therefore, hypothesized that the DH curve would decay faster than the NDH curve. Surprisingly, however, the difference detected in this study was observed at the beginning of the curve and not at the Plateau, as would be expected, indicating that appropriate training counteracts the initial differences and that the NDH can achieve skills comparable than the DH. Differences in the DH learning curves of G1 and G2 were also evaluated. Although both curves started from a comparable Y 0, G1 reached a lower Plateau, with a higher PercentFast and lower KFast and KSlow. These better results in G1 can be explained by the fact that the training of the DH and the NDH were performed in alternate and not consecutive order, which could possibly influence positively the learning curve of the DH. Moreover, this can also be explained by some experimental evidences saying that early in training when information about the movement was still spatially encoded and motor programs had not yet been formed, monkeys were able to transfer motor tasks learned with one limb to the opposite limb [27].

The learning curve of LICK was characterized in the control group with no previous training (G3), but because the most important aim of this study was to evaluate the effect of different pre-training conditions, the learning curve was also evaluated in the group that trained HEC with both DH and NDH (G1) and in the group that trained HEC with the DH only (G2). The curves of the three groups fitted better to the two-phase exponential decay model. As expected, all groups had comparable starting points (Y 0). All of them improved their scores at Y 3, which represents the calculated Y value at X 3 and which was included to evaluate specifically the impact of the previous HEC training. This improvement was observed in G3, but it was more pronounced in G2 and even more important in G1. In spite of these differences at the beginning of the curve, all groups reached a similar Plateau but again G1 depicted a faster decay, as demonstrated by its significantly higher PercentFast. Since G3 did not train HEC at all, the influence of repetition only cannot be neglected. Since G2 trained HEC with the DH only, the effect of this training is evident. However, the curve characteristics in G1 indicate the relevance of training HEC with both the DH and the NDH in order not only to start the LICK training from a better point but also to achieve proficiency sooner. It is also important to consider an alternative hypothesis: the shorter learning curve observed in G1 was due to the different training volume (G1 performed 120 repetitions in total) and not necessarily due to the training of both hands. In order to define the cause of the positive effect, we would need another group training the DH only for 120 repetitions, but unfortunately, we did not consider such group in the study design.

Our data about HEC learning curves are consistent with previous studies about laparoscopic psychomotor skills in general. Indeed, we have described learning curves after 30 repetitions of HEC with the DH using the same model but with a different scoring system (i.e., number of objects transported in 2 min) in 14 novices and 10 experts [10]. In that study, we reported that experts performed better than novices from the beginning till the end and that after some 20 repetitions, the scores remained similar, but the different parameters of a learning curve were not calculated. Brunner et al. have also described learning curves for 12 basic tasks using a virtual reality model in 12 medical students who performed 30 repetitions of each task in order to define how many repetitions would be necessary to reach the plateau. They reported their data fitted better to a spline model and that a lengthy learning curve existed for novices, which may be seen throughout 30 repetitions and possibly beyond [18].

Our data about LICK learning curves are also consistent with previous studies. Vossen et al. have reported in 29 trainee learning curves with one- or two-phase exponential decay model, the latter fitting their experimental points only marginally better [20]. Zhou et al. [28] and Thiyagarajan et al. [29] have also reported in 20 trainee learning curves with an exponential decay shape. Consistent with our results, the duration of the first knot varied with the previous laparoscopic experience being lower in more experienced trainees [20].

There are few studies evaluating the effect of previous HEC training upon LICK. Consistent with our study, Stefanidis et al. [30] demonstrated in 20 novices that training basic laparoscopic skills (bean drop, running string, block move, checkerboard, and endostitch), all of them representing different tasks for HEC, shortened the learning curve of a more complex laparoscopic task-like suturing. After completing basic skills training, this group achieved proficiency in laparoscopic suturing and knot tying considerably faster and after fewer repetitions (21 ± 8 repetitions) compared with the group with no previous training (50 ± 16 repetitions). They have also claimed the additional benefit of substantial cost savings because the trained group required significantly less active instruction and less overall costs of the suture material. In spite that learning curves were not reported, Fried et al. have also demonstrated in 215 surgeons that training a basic task (i.e., pegboard transfer), which is also a task for HEC, improves significantly the performance of LICK [31].

Although in this study, we did not evaluate effect of surgeon characteristics (age, gender, training status, and DH side) upon the results of the learning curves, in our previous study, we failed to demonstrate any influence of those factors upon the changes in scoring between T 1 and T 2, T 2 and T 3, and T 1 and T 3 [9], which is consistent with other studies showing that the learning curves are not substantially affected by previous exposure to surgery, either by assisting or by watching laparoscopic interventions, nor by personal characteristics, such as leisure activities, eye dysfunction, eye correction, dominant hand, personality, and gender [20, 22, 31]. For gender, however, Thorson et al. have claimed that among medical students, women had a worse performance than men [32], which might be explained by their smaller sample size than in our study (n = 32 vs. 60 participants).

It can be argued that one limitation of our study was the scoring system, which was based upon the widely used time taken for task completion system [6, 15, 21, 25]. We have to admit that time alone is not necessarily an accurate assessment of surgical skills and that accuracy and precision should be incorporated into the scoring system. In our system, however, these factors were implicitly incorporated because only objectives correctly achieved were scored. In relation to the basic tasks, this was obvious for both participant and tutor. In relation to LICK, however, knot’s quality could be debatable and differences in participant and tutor validation should be considered. For the aim of this study, only tutor evaluation was considered valid. Unfortunately, we did not correlate both measurements to determine whether students’ assessments improve over time.

On the other hand, we believe that the strength of this study is the measurement of each individual point during the entire training process, which have allowed us to evaluate the learning curves of both basic and advanced laparoscopic tasks. The characteristics of skills acquisition, reported in this study, and of skills retention, reported earlier [24] is consistent with other motor skills acquisition and retention characteristics. Indeed, compelling behavioral and neuro-imaging data suggest that the retention and perfection of skills reflects long-lasting experience-driven changes in the brain’s organization (neural plasticity) [33]. Moreover, extensive motor skill training induces reorganization of movement representations and synaptogenesis within adult motor cortex [34]. Behavioral, functional imaging, electrophysiological, and cellular/molecular studies provide evidence that motor skill learning is a staged process [35]. From the neurological point of view, different mechanisms appear to be active at different times. During training, there is sequential demand for different circuitry. The acquisition phase is characterized by fast (within session) and slow learning (between sessions). Consolidation (i.e., stabilization of novel motor memory) occurs both during and after training. Task complexity may be an important determinant of how “staged” or segregated the process is. Complex motor tasks require several training sessions interspersed with periods of rest and sleep. For these tasks, acquisition and consolidation processes are interlocked, forming a complex sequence of events.

Conclusions

In conclusion, our study confirms that training improves both basic and advanced laparoscopic skills and demonstrates that the improvement (the decay of the curve) is different according to the individual characteristics, the task complexity, and the training program. This indicates that pre-training of HEC facilitates the acquisition of LICK skills and, moreover, that pre-training of HEC with both the DH and the NDH shortens the LICK learning curve. It remains to be elucidated the potential effect of continues tutoring during training, as suggested by some authors [36], and, moreover, the impact of all these factors upon real surgery in humans.