Background

The notion that spatial cognition is a salient factor for predicting and influencing surgeons’ skill acquisition and performance in minimally invasive surgery (MIS) has already been well documented [1]. Spatial cognition however is not a unitary function but consists of a set(s) of multimodal, independent and interchangeable processes. One of such spatial cognitive abilities is Visuo-spatial ability (VSA). VSA refers to a set of abilities that allow an individual to ‘form internal mental representations of visual patterns, and use such representations to solve spatial and complex problems’ [2]. In practical terms, each of these individual and independent processes allow one to retain, retrieve and transform visual and spatial information, as according to their spatial locus. In the context of MIS, these processes govern the surgeon’s capability to, for example, a) pre-operatively position the patient according to the procedure and entry point, b) insert the trocars safely and efficiently, c) judge the spatial relation between the tip of the instruments and the adjacent organ or structure, d) apply appropriate force to the instrument, and finally, e) to successfully perform technically demanding tasks such as suturing etc.

Thus, although the impact and influence of VSA on MIS learning and performance has already been well acknowledged [3], the existing literature appears to have merely focused on investigating the role of VSA as a unitary concept. Thus consequently, despite the existence of many individual studies, it is currently difficult to pinpoint which aspect(s) of spatial cognition are most important for MIS leaning and performance, respectively. We argue that when attempting to draw conclusion about how cognitive processes impact performance, it would be advisable to start distinguishing spatial abilities from each other, as it is unwarranted to assume that they all carry the same weight of impact. Psychological literature has shown that cognitive abilities are highly mouldable by experience, presenting a convincing case that separating spatial abilities from each other could provide us with a fresh new insight into which spatial abilities form a proficient MIS surgeon. Although the findings have been largely inconsistent, most researchers seem to agree that spatial visualization (understanding three dimensional (3D) objects through two dimensional (2D) representations by creating mental representations of these objects), spatial orientation (understanding one’s own spatial relation to and with objects in space, whilst imagine what an object looks like from different perspectives) and mental rotation (mentally manipulating and rotating objects) may play a prominent role. What remains to be determined is to what extent and how these specific abilities influence learning and performance in MIS. Consequently, while the influential nature of spatial cognition in MIS is generally accepted, the knowledge regarding which of the more specific abilities are responsible for the observed influence, and exactly which aspect of performance and learning they appear to influence, remains unclear. Various studies have investigated the role of the visuospatial abilities with MIS but no prior research has attempted to summarize the evidence to identify the impact of the individual spatial abilities on performance and skills acquisition in MIS. What seems to be missing is a concise and detailed systematic analysis of the literature assessing spatial cognitive ability in MIS, in an aim to identify which specific set of abilities are important in MIS.

In this contribution, we report a systematic review of the current evidence base in aim to assess which specific spatial cognitive abilities are important in both surgical education and performance, whilst also assessing the impact that MIS related technology have on cognition. Identifying exactly which individual spatial abilities impact performance and acquisition of MIS technical skills is thus deemed fundamental for many reasons. First, the identification of specific spatial cognitive processes will allow us to devise a training program that would enable a successful outcome for novices of all cognitive levels. Such knowledge could thus have a considerable impact on the way we approach the design and implementation of the MIS curriculum. Second, the newly gained insight would also allow us to better understand the psychological attributes and profiles that underlie MIS skill learning and performance. Third, the technology continues to change and shape the surgical practice we know today. Identification of specific cognitive processes would also allow us to better understand, and even predict, which specific spatial abilities must be accounted for and trained, to allow introducing new medical technologies (surgical simulators or surgical robots) into the operative environments with minimal disruption. Fourth, a deeper understanding of the nature of the spatial abilities involved in MIS would facilitate simulator-based medical education.

Methods

Search strategy

We conducted the systematic review in accordance with PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines for systematic reviews [4]. We searched (October–November 2016) a variety of medical, psychological, and overall scientific databases; MEDLINE PubMed, PsychINFO, ScienceDirect, Elsevier and Web of Science. Our first search focused on spatial cognition generally using the Boolean combination of (‘spatial cognition’) AND (‘minimally invasive surgery’) and (‘spatial cognitive abilities’) AND (‘minimally invasive surgery’). Then, specific spatial abilities in combination with MIS were searched for: Visuo-spatial abilities AND visual-spatial ability, Spatial Orientation, Mental Imagery, Perceptual-motor, Mental Rotation, Depth Perception, Spatial Perception and Spatial Memory, e.g. ‘Visuo-spatial ability’ AND ‘minimally invasive surgery’ etc. To search for the grey literature, such as dissertation theses, Google Scholar was searched using the exact same set of search terms and combinations as stated above. Last literature search was conducted November 28th,2016, using the Web of Science database.

Eligibility criteria

Studies were deemed eligible if they were written English and were published between 2006 and 2016. The 10-year timeframe was chosen, as we were interested in investigating developments in the literature, after Keehner et al. [5] published the noteworthy findings that spatial cognitive ability do in fact influence MIS performance exclusively. Only articles that did not directly investigated spatial cognition in MIS, or approached the topic from a purely technical perspective (engineering, for example), were excluded. To avoid publication bias and reduce positive result bias, all types of grey literature such as conference studies and theses/dissertation studies, which fitted the eligibility criterion, were included in the review. Due to the heterogeneous nature of the reviewed studies, a meta-analysis was not deemed appropriate.

Data extraction

The data were extracted by the first author and subsequently reviewed by the second author. For each article, the following information was extracted: General research information (First author name, year of publication, authors’ affiliations) field of study (psychology or medicine); which specific type of MIS procedure was explored (e.g. laparoscopic, DaVinci); participant characteristics (educational level or surgical level, number of participants, age of participants); Funding information; Methodology information (name of the simulator, simulator task, psychometric test used); Cognitive abilities studied; Statistics used and outcome information (Table 1). Outcomes sought after included; Operative errors, learning curves, cognitive improvement, performance outcomes and development and operative (task) time.

Table 1 Characteristics of the included studies

Study selection

Articles were screened from titles and abstracts by two independent reviewers (TV & HS). Studies, which were found to satisfy the criterion, were then screened for full-text in greater depth. Studies that carried a heavy engineering focus (e.g. testing how a software can track spatial misalignment) or investigated spatial cognition as a secondary factor (not directly testing or exploring spatial cognition in MIS performance and/or learning) were excluded from the review. Further articles were excluded if they had duplicates, incomplete data or were not available in full text.

Results

A total of 6450 articles were identified through databases, and 1900 articles through Google Scholar. After carefully screening and removing the duplicates from the databases, the remaining 5695 articles were screened based on their titles and abstracts. Through this process, 661 articles were screened against the eligibility criterion, of which 567 full-text articles were identified. Out of the 567 full-text articles, 94 articles were again evaluated against the inclusion and exclusion criteria (stated above). Sixty-eight identified articles were excluded as 43 articles carried a heavy engineering focus (e.g. describing guidance system using CT scans to improve the visual-spatial information processing intra-operatively), whilst the rest of the 25 articles were excluded as they did not explore or investigate spatial cognition, or specific spatial cognitive abilities (i.e. depth perception) in direct relationship to MIS performance and/or learning. Finally, mere 26 articles were found to meet the eligibility and inclusion criterion and were included in the review (Fig. 1). To identify any relevant grey literature on Google Scholar, the list of excluded and included articles from the databases were used to search for new and relevant articles. Through this process, five grey literature articles were identified, of which one bachelor thesis [6] was found to meet the eligibility and inclusion criterion, and was included in the review.

Fig. 1
figure 1

Flowchart showing study selection process

Included studies

Of the 26 included studies, 22 were correlational studies exploring relationships between spatial cognition/abilities and either performance and/or learning outcomes in MIS whilst the remaining four studies were comparative studies, aiming to compare the effect of/on spatial cognition/abilities on specific technical characteristics in MIS (e.g., comparing the effects of spatial disorientation on MIS performance by manipulating the visibility of the endoscopic tip on the monitor [7,8,9]. Of the 26 reviewed studies, four studies focused primarily on investigated visuo-spatial ability [3, 10,11,12], two investigated spatial orientation [7, 13], three investigated mental imagery/practice [14,15,16], three investigated perceptual and visuo-motor ability [8, 17, 18], one investigated mental rotation [19], one depth perception [20], one spatial perception [21] and another visual working memory [22]. Further seven studies investigated a combination of one or more of the above-mentioned spatial abilities, or investigated ‘spatial cognition’ in its general term [23,24,25,26]. The heterogeneous nature of the reviewed articles is further illustrated in Table 1. Overall 1214 participants were included in the reviewed literature, out of which 673 were undergraduate university students (enrolled in either psychology of translational medicine course), or medical students, 353 surgical residents (largely in the first year of training) and 13 qualified surgeons (attending or consultant).

Main findings

VSA as a predictor of simulator performance

All studies in this review investigating the relationship between VSA and MIS performance and learning, found VSA to have an impact on operative simulator performance. These studies administrated the mental rotation test (MRT) to assess the individuals’ VSA. Nevertheless, they were inconsistent in regards to which element was found to be most prominent, the innate VSA [3, 10,11,12], or the mere practice gained on the simulator [27]. Only two studies attempted to link VSA with a specific aspect of simulated MIS who also reported inconsistent findings, with one study arguing that VSA only predicts damage and motion on the simulator [28] and other that VSA only predicted the task completion time [27]. The targeted population of these studies were mainly young students, or young resident surgeons with a maximum of 2 years of experience, with an average age of 24 years. A large majority of these studies targeted medical students [9,10,11, 21, 22, 25, 28], or even psychological undergraduate students [3, 6, 17, 20]. Two studies [10, 12] employed a between-subject design and divided students into two groups, depending on their levels of innate spatial abilities. Prior video game experience was found to positively impact performance and improve mental rotation skills [11] whilst additional haptic and auditory cues were found to negatively impact performance [9].

Mental practice and mental imagery impact MIS performance

All three studies who aimed to explore the impact of mental practice on MIS performance, both in simulated and real-life environment, found mental practice and its form of mental imagery to enhance surgical performance if used pre-operatively [14,15,16]. One study [14] reported that residents who practice mental imagery pre-operatively performed considerably better on a real cystoscopy procedure than residents who did not, whereas the other two [15, 16] found mental practice to improve mental imagery, which in turn improved performance, technical skill acquisition and better response to crisis and stress.

Impact of ergonomics on VSA and MIS performance

The VSA is influenced by the technical setup which is ultimately impact surgical performance. The impact of technical setup was measured in terms of angularity, where research attempted to identify an optimal angle of both the endoscopic camera and monitors. All studies agreed that increased degree of image rotation decreased performance, although the conclusion in regards to the optimal degree of angle varied. One study found endoscopic cameras should be held at 0° angle, with no rotation [18, 23], whilst another concluded that the combination of 90° camera angle and surgeons direct hand view on the display increased performance. Monitor displays were found to increase performance if they were located directly opposite the surgeon, at a 180° angle [23], as it aligned visual view with cognitive spatial motor movement [17]. Reduction in performance was reported with camera rotations of 270° angles [3] and between 15° to 90° angles, which lead to 10 to 30% increase in error due to decreased directional spatial-motor ability [19]. Conrad et al. [19] was the only study that tested the degree of angularity on experienced surgeons and residents who already have experience with real-life laparoscopy procedure. Perceptual-motor disruptions were reported to be a result of decreased depth information in spatial mappings, reporting that the loss of depth information inhibits the eye-hand coordination leading to stress [8]. One study proposed that having a visible endoscopic camera tip going in the same direction as the hand, significantly increased performance by decreasing the cognitive load (the amount of mental effort required to perform a task) [7].

Discussion

Spatial cognition appears to be a prominent constraint for the acquisition and performance of MIS skills, although the role of specific cognitive parameters remains debatable. Thus, what seemed to be missing was a concise and detailed systematic analysis of both medical and psychological literature assessing spatial cognitive ability in MIS. The aim was to better understand what role spatial cognition, and the specific underlying processes, plays in influencing, supporting and even predicting surgeon’s performance and skill acquisition in MIS techniques.

Our review of the existing research literature on the topic of spatial cognition in MIS indicates that VSA plays a significant role in the acquisition of MIS technical skills in a simulated environment, but mainly in inexperienced novices. The review also highlighted that novices with stronger VSA, or more specifically a stronger mental rotation (MR) ability, can acquire the required technical skills at a faster rate. Additionally, mental practice and mental imagery were also identified to play a significant role of enhancing performance of more experience surgeons when preoperative practice is carried out. This review has also identified the importance of considering both internal and external factors that may influence and interact with performance such as ergonomic set up. Surgeon’s own spatial cognitive abilities appear to be negatively impacted by ergonomic factors, such as angular discrepancies in technical setup of the monitors, which would further reduce the novice’s cognitive capacity and thus directly hinder their learning and performance.

The current literature, however yielded no substantial insight into which specific VSA processes appear to be most important, mostly due to extensive use of the MR test. These findings further highlight the greater need for a more holistic research approach, where the impacts of external and internal factors on the surgeon are more carefully considered. Furthermore, literature demonstrated that both physical and virtual surgical simulators are effective methods of acquiring MIS related technical and cognitive skills. Although research has shown technical skills to be transferable from the simulator into the OR, the same cannot be said for the cognitive skills. Considering the influential role of visuo-spatial processes, or more specifically the mental rotation in the acquisition of MIS skills, establishing the transferability from the simulators into the real life OR remains critical and to be proven.

Considering the role of MR alone does not seem to provide an explanation as to why mental practice influences intraoperative performance, or why MR diminishes in importance with practice [10]. Most importantly, by focusing on specific cognitive processes, such as MR and VSA alone, which carry high individual and environmental variabilities, we directly hinder our efforts in informing and designing MIS training curriculums or intraoperative system to suit individual training needs. This is especially critical when considering the environment in which these surgical skills ought to be acquired, as we know that external influences also carry further cognitive and behavioural consequences [29]. Thus, if wishing to use the knowledge of cognition to actually benefit the present and future generation of clinicians, we must take a holistic and broader new perspective. This can be achieved by more closely considering the functioning of a much larger cognitive network, the working memory (WM), or more specifically the visual working memory (VWM). We will provide a theoretical narrative of this new perspective, and its direct clinical implications, below. Before that, we discuss a number of shortcomings that ought to be considered when interpreting the findings of the existing studies, such as lack of expert data, inconsistent experience classifications, underrepresentation of the over 40 years age group, and unrealistic experimental simulator settings.

Methodological limitations of the reviewed studies

Firstly, out of 1214 participants from all 26 reviewed studies, only 13 of them were experienced surgeons (attending or consultant surgeons), which either acted as a comparison group or were used as a method of judging performance. Given the extremely small sample of the expert data, it is simply impossible to infer any insight into what the performance of novices ought to be like. This poses a particular challenge for current and future studies on the topic, as we currently do not have enough empirical evidence to allow us to understand what actually makes a proficient MIS surgeon and how this informs surgical education.

Secondly, labels such as ‘novice’ and ‘experienced’ to describe individuals’ expertise levels were used heterogeneously throughout the literature. For example, Haveran et al. [23] placed medical students under ‘novice’, Arora et al. [15] places surgical trainees, who assisted in ORS but have not done the procedure alone under ‘novice’ whilst Hassan et al. [21] provided no account on the experience level of the participants, except that they all had no previous simulator experience. This is a crucial factor that must be kept in mind when making sense of the novice data, as there seem to be variations of experience levels within that category. This also calls for future research to design a classification criterion where specific factors of experience are accounted for.

Thirdly, considering that a large majority of studies investigated cognition in the context of education, it is unsurprising that the average age of all participants in this review is 24. This trend was also seen in the experienced surgeon’s group, where the oldest participant 38 years old. This is particularly astonishing, as research on spatial cognition has clearly identified age as being an important factor to predict surgical performance [27]. Medical professionals were found to have generally low self-assessment capabilities, suggesting that they have limited self-awareness capabilities in recognizing their own cognitive decline [30].

Finally, the transferability of technical motor skills from a simulator has been documented [31], but the extent to which cognitive abilities are transferred remains unanswered. Thus, crucially, the transferability of the cognitive skills from the simulator in the OR remains unknown. In most studies in this review, individuals were trained on a simulator alone without any guidance and lack of feedback. This provides a student with an unrealistic experience, as performing surgery is much more of a team effort than an individualistic mission. This is especially the case in MIS, where it is common to have up to three supporting surgeons operating at the same time. Researchers are encouraged to take advantage of physical simulators as means to study the impact on cognition if team effort is required. Ambient intelligence, for example, has the means to design a much more realistic virtual environment, through the use of sensors, to train surgical trainees in a more realistic manner.

A new approach to surgical education research

Current literature on spatial cognition in MIS surgical training/performance shows VSA to have a clear impact on MIS learning and performance. Nonetheless, due to several identified methodological limitation in the existing literature, it is currently difficult to infer how to best utilize such knowledge to support MIS skill learning and promote the efficient intraoperative performance. Thus perhaps, it may be worthwhile to take a more holistic approach and consider the functioning of a much larger cognitive system, which is directly responsible for mediating all of the identified cognitive processes. This system is working memory, with a specific focus on visual working memory. We argue that by considering a more central role of WM as a whole, we could make better sense of the fragmentary findings in the literature. In this section, we will provide an argument as to how theoretical knowledge of the WM could help us answer these persisting questions, and advance our efforts for promoting efficient intraoperative learning and performance.

First and foremost, the concept of WM is used to describe an active ‘mental workspace’, which allows us to temporarily process all incoming stimuli encountered in the environment, helps us to maintain focus on what matters, helps to block out unnecessary information and finally, delegates the activation of specific cognitive processes required for the execution of a specific task [32]. The component of WM responsible for processing and manipulating visuospatial stimuli is referred to as VWM. As one example, just think of a novice surgeon who is learning the laparoscopic technique. In order for the surgeon to perform the technique safely and effectively, a step-by-step intraoperative procedure (e.g. positioning the patient, correct insertion of the trocars etc.) and associated knowledge, must be retained and maintained in the WM. When navigating to the targeted lesion inside the body, the surgeon must selectively and attentively attend to the task-specific visual and spatial information of the structures and tissues, whilst mentally manipulating the 2D image seen into the actual 3D representation of the human anatomy. The surgeon must then actively retrieve important operative knowledge (both patient and procedure specific, for example) from his short and long-term memory, whilst maintaining all of the currently relevant information. Finally, the surgeon must then also continue to track and monitor the coordination of his own instruments in hand in relation to the instruments of his assistants and the camera view, in an aim to perform a specific operative task. All of these individual activities and processes require simultaneous processing in the already limited ‘workspace’ within the WM, whilst appropriately mediating and recruiting relevant cognitive processes to allow for efficient task-specific behavior [33]. The impact of WM on operative performance has mostly been measured through eye-gaze analysis. Most of these differences lay in the gaze fixation, with expert surgeons fixating and maintaining a direct gaze of the operative field and novices showing a repeated gaze switch between the target and the movement of the instruments [29]. Through this continued gaze switch between the monitor and the instruments, novices take in significantly more perceptual information within the environment, placing a higher demand on their attentional resources. This inevitably leads to further reduction of their overall WM capacity, leading to cognitive overload [29, 32]. The concept of cognitive load refers to reduced WM capacity during learning, where through the inappropriate allocation of attention of various internal (thought process) and external information (environment), the novice attempts to process and retain multiple incoming cues and stimuli’s. Such notion can be described through the following formula: The more difficult or unfamiliar the task is, the more attention and cognitive resources must be attended to thus the greater the demand on the WM is [29].

Clinical implication

In this article, we propose a new approach to study the role of spatial cognition in MIS learning and performance. We argue that by assuming a more central role of the entire WM system, we could better understand the role of individual cognitive processes in relation to surgeon’s behavior and training outcome. Such an approach is deemed particularly useful if we wish to advance in the field of surgical education, as we know that learning surgery relies heavily on individual cognitive processes and domain-specific knowledge, and not merely on general knowledge and technical skills. Consequently, by supporting both technical and cognitive skill acquisition, we could potentially influence the rate and duration of learning, decreasing the learning curve for MIS. To put this idea into a practical context, one such cognitive training approach would be using a learning strategy called ‘chunking’. From a theoretical perspective, such a learning strategy involves breaking down a task(s) into multiple sequences/steps, by drawing on the individual own pre-existing knowledge and abilities. This is achievable through a ‘step-by-step’ process, by either breaking down large components of the task(s) or by focusing on breaking down a procedure in terms of a specific order and actions. Additionally, the same principle could be used to promote intraoperative learning, by, for example, teaching the resident how to ‘chunk’ visual information and teach them how to ‘fixate’ (through gaze training) on the only most important landmarks and cues on the screen. Such approach would in return teach the residents to appropriately allocate their attention resources, and thus in return increase the VWM capacity for information processing and storage, leading to more available capacity for decision-making, for example [34]. Yet another example of how to increase the capacity of VWM is through ‘cognitive rehearsal’, whereby the resident is asked to verbally describe each step of his action (say what they are thinking and doing), a strategy we called “think-aloud”. Through such method, the trainer would have a better understanding of the mental reasoning of the resident and could thus in return better understand the outcome behavior. We argue that such a holistic approach would be an effective research and training tool, as the residents would be taught to employ cognitive strategies that we know are used by expert surgeons [35]. Thus, considering that 97% of technical skill errors in resident surgeons are a result of disturbances in cognitive processing [36], one could argue that such a holistic approach could potentially accelerate the surgeons learning curve and promote competency-based training, all within the natural intra-operative environment.

Conclusions

Spatial cognition was found to play an important role in MIS, with mental rotation showing significant influence over MIS learning and operative performance. Future research is not only encouraged to expand on this theoretical foundation but also to test its validity in practice. Acknowledging the role of the VWM in MIS potentially could considerably facilitate surgical education, as it would allow us to better understand how we ought to present visual information to the novice, using VWM aiding cues, in order to reduce cognitive load and increase skill acquisition costs. Finally, it should be noted that these conclusions are based on studies employing simulators. To which degree MR and VWM influence the surgeons’ abilities in the actual operating room remains to be determined. Future research is encouraged to further confirm and quantify these findings in the clinical settings. In the light of our findings, a closer exploration of how training the overall WM network could potentially improve and accelerate MIS learning should also be further explored.