Background

With the widespread use of minimally invasive surgeries such as endoscopy and laparoscopy, limitations to training as a result of working-hour regulations, and the ethical consideration of patient safety, simulation training is now considered an important part of surgical education. In the urological field, ureterorenoscopy is one of the most frequently performed minimally invasive procedures to manage urinary stones or upper urinary tract cancer. The lack of haptic feedback, limited visual field, and sheer quantity of instruments such as rigid/flexible scopes, guide-wires, access sheaths, and baskets that the surgeon needs to be familiar with, means ureterorenoscopy is one of the most technically difficult and stressful procedures especially for novice operators. The associated stress may increase fatigue and have a negative impact on surgical performance.

Various studies of simulation-based ureterorenoscopy training have been reported, with most focusing on the validation of an individual training model or technical skill improvement [1,2,3,4,5,6]. There is a paucity of literature analyzing the impact of repeated simulation training on changes in the mental workload of novice operators. However, such studies could offer valuable information to promote patient safety and develop an effective educational training curriculum. The aim of the present study was to investigate the effect of repeated simulation-based ureterorenoscopy training in a high-fidelity setting on the mental workload and the skill achievements of novice operators.

Methods

Participants

Nineteen medical students, with no prior ureterorenoscopy training, voluntary participated in the present simulation training and provided written informed consent. After obtaining background information including the age, sex, clinical experience (medical school year), dominant hand, and experience of urological endoscopy, all participants initially received video-based didactic training on the basics of cystoscopy and ureterorenoscopy, stone treatment, devices such as guide-wires, access sheaths, baskets, and on how to perform the following two tasks. After this training, each participant engaged in the first training session (1st session). Subsequently, participants underwent a minimum of five additional training sessions, with an inter-session interval of at least 5 h.

Training task

During the training sessions, the case scenario described “a 73-year-old male without any significant past medical history who is about to undergo the removal of a stone located in the right renal upper calyx”. Table 1 shows each step the participants were required to perform. Task 1 involved training in the use of a rigid cystoscope and preparation for stone removal using a flexible ureterorenoscopy. Task 2 involved training in flexible ureterorenoscopy and stone removal with a basket. After completing the WHO checklist and assembling their devices, participants were required to perform thorough bladder mucosa observation using a rigid cystoscopy (22.5 Fr, Olympus, Japan), and place a guide-wire (PTFE guide-wire, 0.035, 150 cm, Olympus, Japan) and an access sheath (UroPass®, 12/14 Fr, 38 cm, Olympus, Japan) into the right renal orifice. They were then required to perform a systemic examination of the right renal calices using a flexible ureterorenoscopy (DUR-8®, Gyrus ACMI, USA) and extract the stone located in the upper renal calyx using a basket (Ultra-Catch NT®, 1.8 Fr, 115 cm, Olympus, Japan).

Table 1 Summary of the required steps in each task

Training was performed in a mock operating theater, “Igloo” (Imperial College London, UK), a portable inflatable simulated operating room, with the two collaborators in the roles of an anesthetist (TA) or a scrub nurse (FD) (Fig. 1). The Uro-Scopic Trainer® (Limbs and Thighs Ltd., Bristol, UK) was used in task 1, and the Scopic Trainer® (Mediskills Ltd., Edinburgh, UK) in task 2. Before task 2, the Uro-Scopic Trainer® was replaced with the Scopic Trainer® with an access sheath in situ and a stone fragment in the upper renal calyx. During the session, participants were expected to perform each procedure independently, although one of the authors (TA, an experienced urologist) gave appropriate guidance according to the situation during the training, and conducted debriefing after each training session. The scrub nurse (FD) acted as a passive surgical assistant, performing tasks such as holding the guide-wire or operating the basket according to participants’ instructions. A maximum of 30 min was allowed for each task, and performances including both endoscopic and external views were video-recorded for subsequent analyses.

Fig. 1
figure 1

View of ureteroscopy training in a virtual theater

Outcome measures

The time taken to complete each task was recorded. A procedural checklist, described in Table 1, was used to assess performance by both authors (TA and FD). The checklist score was calculated by awarding one point for each step performed correctly and independently. There was a total score of 13 points for task 1, and 8 points for task 2. Technical skills were assessed by one of the authors (TA) onsite using the Objective Structured Assessment of Technical Skills (OSATS) score [7]. Following the completion of each task, the mental workload was subjectively evaluated by each participant. The National Aeronautics and Space Administration Task Load Index (NASA-TLX), the most extensively utilized subjective questionnaire for mental workload assessment, was used [8]. The NASA-TLX includes six subscales: mental, physical, and temporal task demands, effort, frustration, and perceived performance, and the questionnaire uses a 20-point visual analogue scale to measure the mental workload based on the 6 above-mentioned subscales. Participants also completed a self-assessment-based questionnaire on difficulty and confidence after each training session (Additional file 1: Table S1). Figure 2 shows the flow of the present study. To evaluate assessment reliability, 30 videos were selected (1st, 3rd, and 6th session movies of 10 random participants), and assessed by a blinded expert ureteroscopist (PA) using the OSATS score.

Fig. 2
figure 2

Flow of the present study

Statistical methods

Differences in the NASA-TLX score between tasks 1 and 2 were assessed using a paired t-test. A MANOVA model was used to analyze differences in task times and scores over training sessions. Pearson correlation coefficients were assessed between the onsite and blinded OSATS scores. All statistical analyses were performed using JMP® Pro12.01 (SAS). P-values < 0.05 were considered significant.

Results

After the 1st training session, two students withdrew, and the remaining seventeen students who had completed a minimum of 6 training sessions were included in the present analyses. Table 2 shows a summary of participants’ characteristics. The median age was 22 years (range, 18–35), and ten were male. All participants were right-handed and five had experience of observing urological endoscopic procedures.

Table 2 Summary of participants’ characteristics

Table 3 shows the mean time ± standard deviation to complete each task in the 1st, 3rd, and 6th sessions. The total operative time decreased over the course of training. However, when the total time for task 2 was divided into two parts (before and after initiation of the basket procedure) there was no significant improvement in the time needed to catch the stone in the basket during the 6 sessions. Figure 3 shows OSATS scores over the 6 sessions. Scores improved over the 6 sessions in both tasks with evidence of plateauing. Inter-rater reliability of the OSATS scores was high (Additional file 2: Figure S1 r = 0.8776, p < 0.0001). Checklist scores also improved over the sessions with similar plateauing in score progression (Fig. 4). All the steps except “Insert the basket and catch the stone with it (two-minute time limit)” were accomplished over the training sessions, as shown in Fig. 5.

Table 3 Summary of time to complete each task in 1st, 3rd, and 6th sessions
Fig. 3
figure 3

OSATS scores over the 6 sessions. The OSATS scores significantly improved over the 6 sessions for both tasks

Fig. 4
figure 4

Checklist scores over the 6 sessions. Checklist scores also improved over the sessions

Fig. 5
figure 5

Pass rate of each surgical step over the sessions. All the steps except for “Insert the basket and catch the stone with it (two-minute time limit)” were accomplished over the training sessions

Fig. 6 shows NASA-TLX scores during the training sessions. The mental workload was significantly higher in task 2 than in task 1 in every training session (paired t-test, p < 0.05). During the training sessions, there was a continual improvement in the mental workload in both tasks without plateauing (MANOVA model, task 1: p = 0.0005, task 2: p = 0.0028). The self-assessment scores for difficulty and confidence were consistent with the results of NASA-TLX. Participants showed improved self-confidence after repeated training (Additional file 3: Figure S2).

Fig. 6
figure 6

NASA-TLX scores over the training sessions. The mental workload was significantly higher in task 2 than task 1 in every training session (paired t-test, p < 0.05). During the training sessions, there was a steady decline in the mental workload in both tasks

Discussion

Regarding the effect of simulation training on the operator’s workload, Hu et al. observed a decreased mental workload in three basic laparoscopic tasks: ring transfer, precision cutting, and intracorporeal suturing, based on the Fundamentals of Laparosocpic Surgery (FLS) curriculum [9], after simulation training. Following one-hour training, medical students demonstrated improved technical performance, and significant reductions in NASA-TLX scores [10]. Yurko et al., in their study including 28 novices, also observed a reduced mental workload in laparoscopic suturing and knot tying after subjects had achieved a previously published expert-level performance score [11]. In the present study, the 17 participants underwent simulation training in cystoscopy and subsequent ureterorenoscopy in a high-fidelity setting multiple times. Our study demonstrates the effectiveness of structured, high-fidelity scenario training with appropriate guidance and debriefing from expert urologic surgeons. Repeated training was beneficial for both technical skill acquisition and reducing the mental workload.

In the present study, we prepared a simple clinical scenario of stone treatment, and participants were required to complete the predetermined surgical steps independently. As shown in Fig. 5, all the steps except for “Insert the basket and catch the stone with it (two-minute time limit)” were accomplished over the training sessions. We consider that dividing a training task into required routine steps and assessing their accomplishments over training sessions is helpful to build an effective training program. Based on our observation that OSATS scores reached a plateau in the 5th and 6th sessions, we consider that five repeats of scenario-training of stone treatment after the introduction session would be sufficient to learn the core tasks of stone surgery regarding technical aspects, and training in the basket part could be conducted separately according to each trainee’s level of skill achievement. However, in terms of the mental workload, NASA-TLX scores steadily decreased over the sessions, without reaching a plateau. This study highlights that even though technical performance may plateau during simulation training, reductions in the mental workload continue to be realized. This both emphasizes the benefits of repeated, structured simulation training and highlights the importance of simulation even when participants and trainers may feel a satisfactory technical proficiency has been achieved.

The main limitation of the present study is that, whilst all efforts were made to create a realistic, high-fidelity environment, it is reasonable to consider that some of the simulation training effect will be lost when participants are confronted their initial cases in the operating theater. The lack of a real clinical section wherein we can confirm the transferability of the simulation training effect to real clinical practice is a flaw of this study, because the present subjects were medical students from different years and it was difficult to perform such a real clinical section. Furthermore, in real clinical practice, surgeons have to cope with multiple distractions simultaneously, such as staff entering the operating theater, case-irrelevant conversations, telephone calls, and equipment-related problems. In terms of operating room distractions, Wheelock et al. reported an interesting observational study. They observed ninety general surgery cases in real time and assessed them using 4 validated tools: OR Distraction Assessment Form, the Observational Teamwork Assessment for Surgery tool, NASA-TLX, and short form of the State Trait Anxiety Inventory. They observed that the most prevalent distractions were those caused by external staff, followed by case-irrelevant conversations. Case-irrelevant conversations were associated with poor team performance. Equipment-related distractions were correlated with higher stress levels and poorer teamwork in nurses [12]. It is likely that novice trainees are more susceptible to those distractions. Our group previously reported that after the simulation-based curriculum for the rigid ureteroscope, including the teaching of non-technical skills, the intervention cohort showed improvements both in technical and nontechnical aspects, compared with the control group [13]. In addition, Stefanidis et al. recently reported an interesting mental skill curriculum specific to surgery. Nine surgical novices completed their curriculum, and they showed significant improvements in their laparoscopic simulator-based performance and mental skills. All participants also completed the procedure using a porcine model without a significant change in their perceived stress [14]. In order to improve patient safety, we must continue efforts to develop an integrated surgical training program, including nontechnical skill training.

Another limitation is that we cannot comment on the possibility of changes in their performance outcomes or mental workload on using different scenarios such as a left renal stone during data collection, because participants performed training based on the same “right renal stone” scenario. If we invite novice trainees to participate in the present study, their scores might be different from the current observation in medical students, as novice trainees have real operative experience in the theater. Nevertheless, we believe that our study showed the potential benefit of mental workload reduction facilitated by simulation training before real clinical practice. Our study provided a valid scenario training model and researchers are able to examine influences of other potential stressors within the “mock” theater, using different scenarios. Further studies of surgical simulations are warranted to develop an integrated training program.

Conclusions

Under repeated simulation training in ureterorenoscopy in a high-fidelity setting, participants showed a continual decrease of the mental workload, while the improvement of technical skills reached a plateau over 6 sessions. Our study showed the important benefit of simulation training to reduce the mental workload by repeated scenario training before actual clinical practice.