Background

Endonasal transsphenoidal approaches to the skull base are emerging as the first-line approach for resecting the majority of pituitary adenomas which require surgical intervention [1,2,3]. However, there is variation in the ways in which these operations are performed, largely based on surgeon preference and training, which may result in differing surgical outcomes [4,5,6,7]. These operations are technically demanding, relatively low volume, with steep learning curves—culminating in the frequent requirement for dedicated fellowships to achieve procedure-specific competency [8,9,10,11].

Surgical workflow analysis seeks to systematically break down surgical procedures into defined tasks and errors [12, 13]. In this hierarchical process, procedures are broken down into phases which contain a series of steps, generating a dedicated workflow [13]. During each step (e.g. suturing), surgical instruments (e.g. forceps) are used to perform manoeuvres (e.g. knot tying) via a series of gestures (e.g. grasping and pulling suture) [14]. Similarly, at each step, there is the potential for technical errors (lapses in surgical technique) and adverse events (an event that may lead to adverse outcomes or postoperative complications) [12].

These workflows may be used for the training (for example, creation of simulations), objective assessment of procedure-specific surgical skills and evaluation of novel surgical technologies or techniques [12, 15,16,17]. By creating a complimentary nurse and anaesthetic workflow analysis, operating room efficiency may be improved by orchestrating the surgical team [15]. The principal limitation to workflow analysis is the labelling and segmentation of operations into constituent phases, steps and errors, however this process can be automated (or semi-automated) using machine learning techniques [18,19,20]. The effectiveness of such automation is dependent on the generation of a comprehensive and exhaustive workflow to train deep neural networks to recognise the phases, steps, instruments and errors of an operation.

Consensus processes involving subject experts have been used in order to generate a comprehensive and standardised workflow for named operations [15, 21, 22]. The Delphi technique allows for the generation of group consensus through iterative surveys, interspersed with feedback [23]. Questions nested within surveys can be qualitative or quantitative (often using ordinal scales). If quantitative metrics are used, simplified scales (e.g. 3-point) may translate more clearly into clinical practice with greater test–retest reliability [24]. With an engaged group of experts and the use of digital technologies, the process can be achieved in an accelerated fashion (a matter of weeks) [25]. The management of pituitary adenomas has benefitted from consensus statements, with groups such as the Pituitary Society producing a number of guidelines through its multidisciplinary specialist network [26,27,28,29,30,31,32]. However, there is no consensus on the operative workflow for endonasal transsphenoidal approaches (TSA) to pituitary adenomas.

We, therefore, sought to generate a surgical workflow for endoscopic TSA resection of pituitary adenomas, via an expert consensus process nested within the Pituitary Society.

Methods

Overview

This process aimed to generate a surgical workflow that captured the range of ways the operation is performed in contemporary practice. The aim of the process was not to decide on the optimal set of surgical phases, steps or instruments—this will be explored in subsequent studies. In order to create this exhaustive workflow, expert input was derived through an iterative, mixed-methods consensus process (Fig. 1). The components of the workflow analysis and associated definitions are listed in Table 1 [13, 33]. The beginning of the operation was taken at entry of the endoscope endonasally with the use of surgical instruments, reflecting the American College of Surgeons definition of surgery—“structurally altering the human body by the incision or destruction of tissues” [34].

Fig. 1
figure 1

Schematic diagram of Delphi process – highlighting the generation of a surgical workflow through iterative consensus from Pituitary Society expert members

Table 1 Definitions of operative workflow terminology per domain

Modified Delphi process & sampling

Literature review

The process (Fig. 1) began with a brief literature review of neurosurgical textbooks and articles (PubMed or EMBASE). Keywords “endoscopic transsphenoidal”, “pituitary adenoma” and “operative technique” were used. From the relevant resources found, an initial operative workflow was generated [5,6,7, 35, 36].

Consensus round 1

The initial, literature-based workflow was discussed with a small group (n = 7) of experts—UK and Ireland based consultant neurosurgeon members of the Pituitary Society. Each expert reviewed the workflow individually—via computerised document (Microsoft Word, Version 16.4, Microsoft, Washington, USA)—with the definitions of phases, steps, instruments, technical errors and adverse events as above. Each expert was asked a series of questions (via e-mail), seeking to assess the completeness and accuracy of the workflow (“Appendix A” section). Any additional suggestions were reviewed and added to the workflow matrix if (i) in-scope, (ii) not duplicate. According to the Delphi technique, circulation and iterative revision of the workflow was repeated until data saturation was achieved, that is, all experts were satisfied that the workflow was complete and accurate. Resultantly, round 1 was repeated three times, occurring over 12 weeks (October 2020–Jan 2021).

Consensus round 2

The refined workflow was then sent to a larger group (n = 11)—international members of the Pituitary Society that are recognised experts in the field and nominated by the Physician Education Committee. Again, individuals were asked to assess the workflow (“Appendix A” section), and expand the defined domains (steps, instruments, technical errors and adverse events) to cover possible global variations in practice. As in Round 1, any additional suggestions were reviewed and added to the workflow matrix if (i) in-scope, (ii) not duplicate. This round was completed until (i) all experts agreed that the workflow captures the operative practice they have observed and (ii) there were no additional suggestions for the workflow from the participant group. Round 2 was repeated twice, occurring over 8 weeks (January 2021–March 2021).

Administration

Invitations to participate in the Delphi process were via direct email only. Workflow documents were presented using Microsoft Word (Version 16.4, Microsoft, Washington, USA) in both rounds and supported by Google Forms (Google LLC, California, USA) in Round 2.

Data collection and analysis

Participant demographics collected included training grade and country of practice. The collected data regarding the surgical workflow were quantitative (whether participants agree it is complete and accurate) and qualitative (additional suggestions or comments). Summary statistics (e.g. frequencies) were generated for participants demographics. Content analysis was used to analyse free-text responses—to remove out-of-scope suggestions, group similar suggestions together and compare them to existing data points in the workflow. Data analysis and workflow updates were performed in duplicate by two independent analysers (HJM, DZK).

Ethics

No identifiable data were collected about participants in the Delphi process. This study was independent of national health services and did not require ethical approval (interrogated via online Health Research Authority decision tool—“Appendix B” section) [37].

Results

General

There was a 100% response rate and no attrition across both Delphi rounds. Across both rounds, 18 panel members participated, representing seven countries: United Kingdom (n = 6), United States of America (n = 7), Australia (n = 1), Colombia (n = 1), Germany (n = 1), Italy (n = 1) and Republic of Ireland (n = 1).

Final surgical workflow

Four distinct operative phases were delineated on discussion—nasal, sphenoid, sellar and closure. The component steps within each phase were defined as core (necessary) or optional (case and/or surgeon dependent) and were agreed upon by 100% of panel members across rounds. Pre-operative set-up and post-operative protocols were judged as important but not included as per the defined study scope.

Nasal phase

This phase was composed of 10 steps (4 core, 6 optional), from the identification of pertinent nasal anatomy until entry into the sphenoid sinus (Table 2). Amongst our panel, this phase was performed both with otorhinolaryngologists or by neurosurgeons alone.

Table 2 The nasal phase with constituent steps, errors and adverse events

Sphenoid phase

This phase was the shortest in terms of the number of steps, composed of 4 steps (3 core, 1 optional) as detailed in Table 3.

Table 3 The sphenoid phase with constituent steps, errors and adverse events

Sellar phase

The sellar phase was composed of 12 steps (7 core, 5 optional) representing entry into the intracranial space and tumour (macroadenoma or microadenoma) resection (Table 4).

Table 4 The sellar phase with constituent steps, errors and adverse events

Closure phase

The closure phase was composed of 14 steps (3 core, 11 optional), consisting of haemostasis and repair of the skull base (when appropriate) (Table 5). This phase had the largest number of optional steps, reflecting the acknowledged heterogeneity in the various methods of skull base repair that may be used.

Table 5 The closure phase with constituent steps, errors and adverse events

Discussion

Principal findings

Firstly, a workflow for the performance of endoscopic transsphenoidal pituitary adenoma resection has been generated, using Delphi methodology based on an international expert consensus agreement. The agreed “core” steps can be used for education (e.g. operative video annotation), surgical skills assessment, and the development of models and simulators [13, 19, 22, 38]. Similarly, the agreed “optional” steps highlight areas of heterogeneity of practice that will benefit from further research—most notably in skull base reconstruction (closure phase) and surgical exposure (nasal, sphenoid, sellar phases) [2, 3, 5, 7, 39]. This workflow also captures the instruments, errors and adverse events for each step and is the first of its kind in neurosurgery.

Furthermore, ensuring that the workflow captured a breadth of operative practice, in a structured fashion with consistent terminology, was a challenge and required multiple iterations across multiple rounds. For example, the presence of “optional” steps reflects differences between the practice of individual surgeons (e.g. choice of reparative material) and adaptation to case-specific factors (e.g. tumour extension) [5, 7, 40]. Resultantly, delineation of whether these steps were core or optional and the content of these steps (particularly instrument use) was an area of the workflow which required significant revisions. Similarly, another area that required significant iterative changes was distinguishing errors from adverse events and complications. Definitions of each of these components were therefore presented repeatedly, throughout each round. Adverse events were linked in line to particular technical errors and were limited to intra-operative consequences (as opposed to post-operative complications which occur later and more likely to be multifactorial) [33]. Many adverse events linked to particular technical errors were related to the damage of distinct anatomical structures (e.g. carotid artery) which often overlapped across adverse events with a step. Driven by consensus, the terminology was often broadened (e.g. “neurovascular injury, e.g. carotid artery injury”) to capture a breadth of events whilst decreasing repetition within steps and improving the readability of the workflow.

Findings in the context of existing literature

This Delphi consensus methodology has been used in various surgical specialities to generate similar surgical workflows, with demonstrated utility as a method to consolidate complex opinions into practical workflows [15, 17, 21, 22]. For example, a workflow for steps and errors in laparoscopic surgery by Bonrath et al. focussed on the need for standardised steps and errors for education and structured assessment of trainees [33]. Kaijser et al. explored the steps of laparoscopic bypass and sleeve gastrectomy in detail, deconstructing them further into constituent tasks in order to develop advanced simulators and training curricula [21]. Previous studies have tailored the workflow analysis to different levels of learners, for example, Dharamsi et al. highlighted the need and utility of a consensus-driven workflow for bougie-assisted cricothyroidotomy aimed specifically at novices [22]. A more in-depth analysis is occasionally performed to task or gesture level (which together make up a surgical step), and this level of granularity has been achieved through similar Delphi consensus techniques [41]. Notably, the terminology for the operative workflow hierarchy (e.g. phases, steps, tasks, gestures, motions) is not used in a standardised fashion (e.g. often task and step are used interchangeably) and alignment of future studies to a common language will be important as this field expands [13].

There are many applications of surgical workflows—including education and training; surgical assessment; research; and technology development. In relation to education, highlighting the core components of operations is a useful learning resource for training surgeons and has been used to develop educational curricula, courses and simulators [13, 38]. Similarly, these workflows can be used to inform objective assessment instruments specific to particular operations, for example, Knight et al. combined a consensus-driven surgical steps workflow for laparoscopic hysterectomy with an established skills assessment form (Objective structured assessment of technical skill or OSATS) to generate a reliable and specific measure of procedural proficiency [42]. Augmented assessment and training is particularly pertinent in low-volume surgeries, with steep learning curves and a unique set of surgical skills—such as pituitary surgery [8,9,10]. Resultantly, proficiency in such procedures requires dedicated fellowships and competency-based assessments, with services providing these operations becoming increasingly consolidated into centres of excellence [10, 26]. Operative workflows may facilitate this through standardisation of terminology, providing a platform to build education materials and specific skills assessments, and highlighting acceptable variations in contemporary practice [13].

A complimentary and related process to surgical workflow analysis is the segmentation of operative videos [13]. For example, focussing on laparoscopic colorectal surgery, Dijkstra et al. distilled the key operative steps—intending to use this information to segment operative videos into component steps [15]. These segmented videos are integrated into the intra-operative environment, to guide and assess trainee surgeons in a uniform fashion [15]. Indeed, such segmentation and procedure-specific analysis has been presented in live operations in animals, displaying an ability to improve the efficiency of tasks and reduce operative times [17]. A disadvantage of operative video segmentation is its labour-intensive nature, however, this process can be automated (or semi-automated) using machine learning techniques [18,19,20]. Indeed, in the context of the COVID-19 pandemic, where operative caseload is reduced (therefore maximising learning from each case is important) and waiting list backlog is at its highest (therefore more efficient surgery is important), these technologies may be particularly useful [43,44,45].

Strengths and limitations

There are several limitations to this study that are important to highlight. Whilst the Delphi method is useful for capturing and refining the opinions of various stakeholders, attention to expert panel selection will naturally influence process output [46]. In our study, our expert panel was international and multicentre. As expected, multicentre consensus processes are capable of identifying a broader and more granular workflow than single centre analyses [21, 47]. However, only one (of 18) expert panel members represented a low or middle-income country and thus our results may not reflect a global operative workflow for this procedure. Moreover, rating regarding the utility or rationale for operative steps (particularly optional steps) was not characterised in this study and this is certainly a point for further study. Finally, pre-operative set-up (e.g. nasal preparation and patient positioning) and post-operative strategies (e.g. placement of a nasogastric tube) were excluded for practical and scope purposes, and this again is an area that requires further study to characterise heterogeneity and explore comparative effectiveness.

Conclusions

Through an international expert panel consensus, a workflow for the performance of endoscopic transsphenoidal pituitary adenoma resection has been generated. This workflow captures a wide range of contemporary operative practice. The agreed “core” steps will serve as a foundation for education, training, assessment and technological development (e.g. models and simulators). The “optional” steps highlight areas of heterogeneity of practice that will benefit from further research (e.g. methods of skull base repair). Further adjustments could be made to increase applicability around the world.