Introduction

Subspecialization in Endocrine Surgery, as in other surgical disciplines, aims to yield surgeons with an adequate level of proficiency and safety in performing various procedures independently and competently following a proper learning curve [1,2,3,4,5,6,7]. As performance tends to improve with experience, a trainee’s volume or annual caseload becomes the cornerstone of any given surgical educational program [1, 3,4,5,6,7,8,9]. It has also been demonstrated that trainees considerably benefit from surgical simulation [9,10,11]. The essence of surgical simulation is to expose trainees to real-time in-theater experiences without compromising patient safety.

“Primum non nocere” or “first do no harm” has always been the motto in Medicine. The rapid evolution in surgical disciplines and the advent of contemporary procedures has rendered the Halstedian model of apprenticeship “see one, do one, teach one” inadequate [12]. Therefore, there is a continuous need for methods that allow for sufficient mentored-hands-on training without compromising patient safety. Surgical simulation provides the opportunity of supervised learning for trainees, allowing full mastering of technical skills prior to actual practice on patients following the pedagogical principles of Kirkpatrik’s pyramid of training evaluation (Fig. 1) [10, 11, 13, 14] For this reason, the Anatomy, Biomechanics and Simulation Laboratory (ABS Lab) of Poitiers’ University developed a hyper-realistic training model for specialized surgical procedures. The innovative training model was named “SimLife” [15, 16]. It uses a pulsated revascularized and reventilated cadaver donated for surgical education.

Fig. 1
figure 1

Kirkpatrick’s pyramid of training evaluation

The purpose of this study is to validate SimLife as a training model in highly specialized endocrine surgical procedures. As well as its reliability as a tool for assessing performance and its progression.

Materials and methods

The training model (SimLife)

The innovative training model was developed at the ABS lab of Poitiers’ University [15, 16]. The model consists of a human cadaver prepared for surgical simulation. It is dynamized by pulsatile vascularization with simulated blood warmed to 37 °C and is ventilated. Bodies donated to science are retrieved from the body donation center of Poitiers’ University. The body typically arrives within 48 h following demise. However, arrival time has been reduced to less than 24 h as per SimLife protocol to achieve timing optimality [16]. The body is then assigned a traceable number but remains anonymous. Any human body with one of the following criteria is excluded from SimLife: body mass index (BMI) ≥ 25 kg/m2, previous major surgery (cervical, thoracic and/or abdominal), and possible contamination including HIV, HBV, HCV, Creutzfeldt-Jacob, Tuberculosis and most recently the COVID-19 [15,16,17].

Once all inclusion criteria are fulfilled, the cadaver is technically prepared as a training model. Vascular cleansing is performed with water at low pressure (0.8 bar) at a maximum temperature of 30° C. This is achieved by means of cannulas placed in both femoral arteries (input) and both femoral veins (output) (Fig. 2). Subsequently, external body cleaning and disinfection is performed, and the corpus is freezed at 22 °C in a negative pressure cold room [15, 16]. Upon scheduling a training session, the corpus is defrosted at 16 °C, over several days. Ventilation is achieved by means of orotracheal intubation and the stomach is emptied by a nasogastric tube. The model is then tested for physiological data.

Fig. 2
figure 2

Body’s preparation to become a SimLife model

The specific technical module P4P (Pulse for Practice, patent no. 1000318748 deposited by Poitiers’ University, INSERM and CNRS international extension PCT/EP2016/075819 published on 2017/05/11, WO 2017/076717 A1) animates the living human body. It is perfused by blood-mimicking fluid (patent L18217) circulating in the arterial system in a pulsating manner, recoloring and warming internal organs to 37 °C, and restoring venous turgor. Physiological hemodynamic data are computer monitored continuously and adapted as needed, with the vital signs fluctuating as per real in-theater possible scenarios. The P4P module is a specifically dedicated software that is responsive to potential adverse intraoperative scenarios mimicking real-life situations such as hemorrhage, … etc.

The learning platform on the cadaveric model has been approved by the French Ministry of Health Ethics Committee (protocol number DC-2008-137). Data collection and analysis were performed according to the institutional guidelines and the ethical standards of the Helsinki Declaration.

Evaluation of learners’ performance

Simulation is part of the method and practice of academic surgical teaching. This mandates the presence of an objective scale for evaluating the acquisition of knowledge and skills. A scoring scale that allows for evaluating performance was designed and validated by six French national experts in endocrine surgery following the Downing method for the assessment of competency [18]. The pre-scale was created using the nominal group method. It was then tested on six senior surgeons, nationally recognized as experts in endocrine surgery. The final scoring scale was psychometrically assessed through evaluating a population of trainees of variable levels of experience. To construct a uniform objective scale, each surgical procedure was broken down into essential executional steps. A scoring system was then applied evaluating task fulfillment. The points given ranged from − 3 to 1. Item specific scoring gives a trainee 1 point when the task is executed correctly, and penalties the trainee (− 1 to − 3 points) in case of a surgical fault. Specific scoring scales were designed for each of the specified procedures (adrenal surgery, thyroid surgery, and neck dissection). The scale for each procedure varies according to its essential executional steps. Figure 3 demonstrates the application of the scale for right transperitoneal adrenalectomy as an example.

Fig. 3
figure 3

Performance evaluation scale (right anterior laparoscopic adrenalectomy)

The evaluation was performed by two external observers to determine both the scale’s validity (consistency) and reliability (interobserver reproducibility).

This method allowed teachers to objectively evaluate single-surgeon performance and progression of performance in specified endocrine procedures.

Evaluation of learners’ satisfaction

At the end of each training session, trainees were asked to anonymously complete a survey indicating their degree of satisfaction by a Likert Scale from 0 to 10 on the following 4 items: the ease of learning a specific surgical procedure using SimLife, the accuracy of anatomic landmarks in SimLife, the degree of realism of SimLife, and overall satisfaction with the training SimLife.

Statistical analysis

Statistical analysis was performed by means of SAS 9.3 software. Values are reported as means and Standard deviations (SD).

Results

A total of 32 training sessions using the SimLife simulator model were hosted by the ABS lab of the medical school of Poitiers’ University. The training sessions included 24 adrenal surgeries, 4 thyroid surgeries, and 4 therapeutic functional lateral compartment neck dissections. Adrenal surgeries included conventional (open) adrenalectomy (n = 8), laparoscopic anterior and posterior adrenalectomy (n = 8 each). While thyroid surgeries were in the form of a hemithyroidectomy with a concomitant therapeutic functional lateral compartment neck dissection. Artificial lymph node metastases were created on SimLife by injecting specific material (data not disclosed, material under patent demand) and was identifiable on preoperative neck ultrasound (Figs. 4, 5). Hands-on training was preceded by an interactive theoretical session that introduced SimLife and educational material on the specified endocrine procedures. As performance scoring scales are procedure specific, results are provided in Table 1 for each procedure. For open adrenalectomy, scores ranged between 22 and 28 out of 37. For posterior laparoscopic adrenalectomy, scores ranged between 24 and 29 out of 34. Whereas for anterior laparoscopic adrenalectomy scoring scales were also side dependent; ranging between 17 and 20 out of 25 for right sided procedures, and 17–22 out of 30 for left sided procedures. Hemithyroidectomy scores ranged between 10 and 12 out of 15 (Fig. 6). Finally, lateral compartment neck dissection score was out of 15 and ranged between 8 and 11 among trainees (Fig. 7). No major events were reported for the adrenalectomies, while only one jugular vein injury occurred during neck dissection. Satisfaction scores for the specified four items ranged between 8.43 (SD 0.87) and 8.89 (SD 0.96) (Table 2).

Fig. 4
figure 4

Artificial lymph-node metastasis

Fig. 5
figure 5

Ultrasound appearance of artificial lymph nodes metastases

Table 1 Performance scores for specific procedures
Fig. 6
figure 6

Thyroid lobectomy: recurrent laryngeal nerve and parathyroids

Fig. 7
figure 7

Left Neck dissection. End of procedure

Table 2 Trainee’s answers (n = 20) to evaluation questionnaire (Likert Scale 0 to 10) regarding SimLife model

Discussion

As in other disciplines of surgery, endocrine surgical training aims to yield surgeons that are competent in the field. However, this goal is not always achieved. A recent analysis of endocrine surgical training across 22 European countries demonstrated below-expected scores of performances for thyroid surgery and lateral neck dissection [1]. The expected median values for thyroidectomy and neck dissection were 10 and 2, respectively. While the actual values obtained were disappointingly 0 and 0.5, respectively. Adrenal surgery training was difficult to assess as only 3 centers across Europe responded, reporting a median of 5 open and 5 minimally invasive procedures dedicated for surgeons in training per annum. Two surveys conducted by the American Association of Endocrine Surgeons (AAES) reported that surgical residents performed 2 neck dissection procedures prior to their graduation, while clinical fellows in endocrine surgery performed 6–15 procedures during their 2-year fellowship program with 12 procedures being the cut-off number of procedures required to obtain competency [3, 4]. Mihai et al. recently analyzed the current practice in adrenal surgery worldwide [5]. The results reported were far from being encouraging. Only few centers reported a consistent annual caseload, whereas most centers performed less than 5 procedures per year. These results were consistent with the findings of Palazzo et al., who analyzed the annual practice of adrenal surgery in the UK in 2013–2014 [6]. They reported that only 16% of surgeons performed ≥ 6 adrenalectomies per year, whereas the remainder performed a median of one adrenalectomy only. Collectively, these data reflect an insufficient annual caseload of adrenal surgery during specialized training. Furthermore, the literature demonstrates that competency in minimally invasive adrenal surgery is achieved at roughly, 30 procedures. While 20 and 15 procedures are required to attain proficiency in performing thyroid surgery and neck dissection, respectively [1, 3, 4]. In the context of these figures, the need emerges for novel resources that help achieve the purpose of any subspecializing training in surgery. Simulation could potentially serve as a core component of the modern surgical apprentice [10, 11, 13, 19]. It aims to ameliorate the technical skills of trainees enhancing their overall performance as reported by Kirkpatrik’s pyramid of training evaluation [14]. Simulation has the advantage of allowing for the repetition of technical gestures without compromising the safety of a living human being. This in turn allows trainees to detect and analyze errors and familiarizes them with the way of solving critical intraoperative situations [10, 11, 13, 19].

Surgical simulators could be broadly classified into two categories: synthetic and organic [10, 19]. The former includes plastic, rubber or latex-based simulators, virtual reality, and computer-based simulators. The main drawback of synthetic simulators is that they lack realistic resemblance to living beings [10]. Organic simulators, on the other hand, are subclassified into animal- and human-based models. These provide high-fidelity working environments. However, are potentially subject to restrictive measures that vary from one country to another [10, 19]. Human cadaver-based models, the oldest models available, are considered the best models for surgical training [10, 19,20,21,22,23,24,25,26,27,28,29,30,31]. Nevertheless, their major drawback is that they are static and do not provide real-time dynamic in-theater clinical scenarios [10, 19,20,21,22,23,24,25,26]. To overcome this issue, a perfused cadaveric model that offers a hyper-realistic working environment have been introduced [27,28,29,30,31]. SimLife is an organic dynamic simulator that provides the pedagogical items of surgical simulation with the highest possible degree of resemblance to reality, but, away from the patient. Our results indicate that trainees performed well in terms of skill acquisition. Furthermore, a good level of trainee satisfaction was obtained. This certainly implies the potential for further progress in performance.

There are some limitations of SimLife that need to be highlighted. First, the model was set for senior trainees. Junior trainees would benefit more from simpler types of models. Second, the number of annual sessions made available relies on the availability of cadavers. Therefore, any limitation in resources would reflect on annual caseload. Third, the revascularized model lacks hemostatic mechanisms. Therefore, intraoperative hemostasis cannot be achieved in contrast to real life. Finally, the model is costly. The mean cost per human body used, independent from the type of procedure, is about 2000 euros. Moreover, a specific dedicated budget needs to be dedicated for the P4P power module that runs the SimLife system (www.simedys.com).

Conclusion

SimLife is a hyper-realistic training model that allows for the satisfactory acquisition of skills and the evaluation of performance progression. It has the potential to overcome the pitfalls of subspecialized surgical training programs and become a core component of trainee’s curriculum.