1 Introduction

The insertion of an external ventricular drain (EVD) is a widely performed neurosurgical procedure for measuring and treating increased intracerebral pressure (Dey et al. 2012). The current standard is freehand insertion based on anatomical landmarks. The drain is usually inserted through Kocher’s point (1–2 cm anterior to the coronal suture and 1–2 cm lateral to the cranial anteroposterior midline) and aimed at an imaginary intersection formed by transverse and sagittal planes passing through the external auditory meatus and the medial cantus, respectively (Huyette et al. 2008). Successful placement is then verified by the loss of resistance as the ventricular wall is punctured and the subsequent flow of cerebrospinal fluid through the EVD (Huyette et al. 2008). The procedure is performed blindly without visualizing subsurface structures and catheter misplacements are common (Clifton et al. 2019; Cho et al. 2020). Misplacement is reported in 10–40% of cases and is associated with irreversible tissue damage, hematomas, and unfavorable outcomes (Sarrafzadeh et al. 2014; Kakarla et al. 2008).

Several technological solutions are available to assist the surgeon in EVD placement (Aljoghaiman et al. 2022; Stuart et al. 2021). Intraoperative computed tomography (CT), 3D ultrasound, endoscopic navigation, fluoroscopic guidance and stereotactic neuro-navigation have all been shown to increase insertion accuracy (Aljoghaiman et al. 2022; Stuart et al. 2021). However, the different solutions all carry their own sets of limitations, which may make them useful in some settings and not in others. Ideally, a universal guidance solution for EVD-placement should be low cost, self-sufficient, easily portable between operating rooms (ORs) or intensive care units (ICUs), quick to set up, easy to use and provide accurate three-dimensional guidance with an intuitive interface. EVD placements are commonly performed in emergency settings and more complex and bulky systems such as neuro-navigation, intraoperative CT or Cone Beam CT may not be easily available. The presentation of 3D structures in 2D on separate screens is associated with attention shift that may lead to fatigue, prolonged operating time, and a potential increase in errors due to task switching (Cho et al. 2020; AlAzri et al. 2017).

Augmented Reality (AR) is an emerging method for neuro-navigation where computer-generated 3D images are overlayed onto real-world objects, significantly reducing attention shifts during procedures (Cho et al. 2020; Besharati Tabrizi and Mahvash 2015; Iop et al. 2022). AR has been shown to improve accuracy in the surgical treatment of intracranial pathologies such as aneurysms, gliomas, meningiomas, and acoustic neuromas (Cabrilo et al. 2014; Louis et al. 2021). However, research on the utilization of AR for EVD insertion is limited (Aljoghaiman et al. 2022) and largely experimental (Li et al. 2018; Skyrman et al. 2021; Chiou et al. 2022; 2023; Schneider et al. 2021; 2023; Kunz et al. 2021; Lai et al. 2022). This study aimed to review the current knowledge on the use of AR guidance for the insertion of EVDs, to evaluate and discuss potential benefits and challenges. A special focus was placed on reported accuracies.

2 Methods and material

Before conducting the analysis, a comprehensive protocol outlining the objectives and strategies for data collection and analysis was developed. This protocol adhered to the Preferred Reporting Items for Systematic Review and Meta-analysis (PRISMA) guidelines (Page et al. 2021).

2.1 Search strategy and selection process

Literature on AR guided EVD insertions published until August 2023 was reviewed using MEDLINE, EMBASE and Web of Science. Potential studies were identified using the following key-terms (including MeSH terms, relevant abbreviations, synonyms, variations due to plurality and spelling): “ventriculostomy”, “external ventricular drain”, “accuracy”, “guidance” and “augmented reality”. The detailed search strategy is provided in the supplementary file (Online Resource 1). Following the primary search, duplicates were removed. Studies were then further selected based on title, abstract and full text. Studies where AR was used to visualize the insertion of an EVD using the freehand technique were included for analysis. The reference lists of the studies included were screened for possible relevant publications. Other systematic reviews, virtual simulation studies, and studies where accuracy was not reported were excluded.

2.2 Data extraction

Variables of interest included study characteristics (country, design, and year of publication), location where the procedure was performed (bedside, simulated operation room, or operation room), surgeon experience level, type of AR interface, type of subject (patients, cadaver, or anthropomorphic phantom), follow-up method, and outcome measures. The primary outcome was the accuracy of inserting an EVD using AR.

2.3 Assessment of bias and certainty of evidence

The Newcastle-Ottawa quality Assessment Scale (NOS) checklist (Gierisch et al. 2014) for cohort studies was used to assess the quality of reporting in the analyzed studies. Studies were rated as good quality if they had 3 or 4 stars in the selection domain, 1 or 2 stars in the comparability domain and 3 or 4 stars in the outcome domain. Studies were of fair quality if they scored 2 stars in the selection domain, 1 or 2 stars in the comparability domain, and 2 or 3 stars in the outcome domain. Studies were classified as poor if they scored 0 or 1 star in the selection domain, no stars in the comparability domain and 0 or 1 star in the outcome domain. Bias was assessed using the bias domains of the Cochrane Risk of Bias Tool for Randomized Controlled Trials (Higgins et al. 2011). The tool has the following domains: random sequence bias, allocation concealment, blinding of participants, blinding of outcome assessment, incomplete outcome data, selective reporting, and other sources of bias. The Grading of Recommendations, Assessment, Development, and Evaluation (GRADE) approach was employed to rate the body of evidence supporting the review’s key findings (Schünemann et al. 2008). A GRADE summary of findings table assembled using the GRADEpro Guideline Development Tool is provided (“GRADEpro GDT: GRADEpro Guideline Development Tool [Software]. McMaster University 2020 (Developed by Evidence Prime, Inc.). Available from Gradepro.Org.,” n.d.).

2.4 Statistical analysis

Continuous variables were summarized using means, while categorical variables were presented as counts and percentages. The accuracy of freehand insertion using AR guidance was reported either as means or percentages, extracted directly from the studies or calculated based on reported outcomes. To visualize the average accuracy of each study forest plots were used, indicating the range where 95% of the observed accuracies lie (two standard deviations from the mean). The Kruskal-Wallis test was used to analyze the variance of accuracy for the different devices. A p value lower than 0.05 was deemed significant. All statistical analyses were performed using the statistical software R (R Foundation for Statistical Computing, Vienna, Austria).

3 Results

The search through the included databases provided 497 studies, of which 199 were duplicates. After removing duplicates, 298 studies remained and were screened based on titles and abstracts, yielding 40 studies for full text analysis. Of the 40 studies, 14 studies were identified to analyze the use of AR for EVD insertion (Fig. 1). The most common reason for exclusion upon full-text analysis was that non-AR navigational systems had been investigated. Moreover, one specific study was excluded as it employed AR solely as a method for a senior neurosurgeon to supervise a junior colleague.

Fig. 1
figure 1

PRISMA flowchart detailing the systematic screening process

3.1 Study characteristics

All 14 included studies were prospectively designed and published between 2019 and 2023. These studies were conducted in the United States of America (29%), Germany (21%), Taiwan (14%), China (7%), Sweden (7%), Italy (7%), the Netherlands (7%), and Belgium (7%) (Table 2).

Most studies (79%) performed insertions on anthropomorphic phantoms. One study performed subsequent experiments using cadaveric subjects. Two studies also corroborated their findings on patients. Experiments on anthropomorphic phantoms were done in an OR setting. Two studies (14%) were performed on patients, and in one of these studies (7%), the insertions were performed in a bedside setting. One study (7%) performed insertions solely on cadaveric subjects. The number of insertions for each study ranged from 10 to 150.

In most experiments, insertions were conducted by experienced neurosurgeons (79%). Two studies included only medical students (14%). One study included students and employees at the study center. The employees’ clinical experience was not reported. One study featured a heterogeneous group of participants, including medical students, engineers in augmented reality (AR), neurosurgeons, surgical residents, and interventional radiologists.

The AR-solutions all contained three core components. First, creating the virtual 3D-model of the cranium and subsurface structures using preoperative imaging (CT or MRI). Second, the registration of the 3D-reconstruction on real anatomy in real-time, using markers to match virtual and real spaces. Both registration and navigation were frameless. Third, a visualization device was used to combine virtual and real environments. Once the registration was performed, the ventricles were visible through the visualization device. Devices were either head mounted displays (HMD) or external monitors (Table 3).

Regarding the AR interface, all experiments used frameless navigation. The primary device used for subsurface structure visualization was the Microsoft HoloLens HMD (79%). Other devices included external monitors (Samsung Galaxy Tab S5e (7%), Microsoft Surface Pro 7 tablet (7%), GCX screen (7%)) and a hybrid operating room-based augmented reality surgical navigational system (ARSN) (7%).

The assessment of accuracy was based on postoperative CT (93%), or CBCT (7%) scans. The definition of accuracy varied among studies: 11 studies (79%) quantified accuracy as the distance, in millimeters, by which the catheter tip deviated from the preoperatively planned target. Two studies assessed accuracy according to the Kakarla grading system (Table 1) (Kakarla et al. 2008). One study reported accuracy based on both the deviation from the planned path and the Kakarla grading system. Four studies also (28%) performed freehand insertions without AR guidance, as a reference for accuracy assessment (Tables 2 and 3).

Table 1 Kakarla grading system for external ventricular drain insertion (Kakarla et al. 2008)
Table 2 Summary of study characteristics
Table 3 Reported accuracies and navigation set-up from included studies

3.2 Risk of bias and certainty of evidence

Based on the Cochrane Risk of Bias tool, 11 studies (79%) were categorized as “poor,” and three studies (21%) as "good" (Online Resource 2). The predominant sources of bias were the absence of randomization and a lack of allocation concealment, both leading to selection bias. Following the assessment of the studies’ quality according to NOS, most studies were classified as fair (67%), two studies were classified as good (16%) and two as poor (16%) (Online Resource 3). Using the GRADE approach, the existing body of evidence exhibited a low level of certainty regarding both the accuracy of AR-guided EVD insertions and the superiority of AR over the freehand approach (Online Resource 4).

3.3 Accuracy of EVD insertion using AR

The 11 Studies reporting the deviation from the planned path revealed an average accuracy ranging between 0.7 to 11.9 mm (Fig. 2). Insertions performed by neurosurgeons consistently exhibited higher accuracy with deviations below 5.2 mm. There were no significant differences (p = 0.755) in accuracy for the different interfaces: Microsoft HoloLens (HMD), ARSN (External monitor), and tablet screens. The type of subject experimented on did not yield significant variations in accuracy. The study by Li et al. (2018) including neurosurgeons and a control group, investigating bedside placement of EVD on patients, demonstrated a significant improvement in accuracy. The average deviation from target decreased from 11.3 to 4.3 mm.

Fig. 2
figure 2

Reported average accuracy from each study with respective 95% interval. Studies that did not report accuracy in means were excluded in this plot. Dotted line marks 5 mm. Grey circles: average accuracy of AR-guided EVD insertion. Black squares: accuracy achieved by control groups performing freehand insertions

The studies by Van Gestel et al. (2021) and Eom et al. (2022), involved medical students using Microsoft HoloLens. Both studies had control groups using the freehand technique. Compared to these control groups, the use of Microsoft HoloLens improved accuracy. The deviations decreased from 19.9 to 11.9 mm, and from 13.5 to 7.4 mm, in the respective study.

The study by Benmahdjoub et al. (2023) that included students and employees achieved high accuracy for both AR devices (Xiaomi tablet and Microsoft HoloLens), both exhibiting an average deviation of 2 mm. The study by Bounajem et al. (2023) which included medical students, clinicians, and engineers using Microsoft HoloLens, reported a low average deviation of 1.9 mm.

Studies that used the Kakarla grading system showed AR accuracies, between 82 and 96% (Fig. 3). Among these studies, two conducted comparative analyses between AR-assisted and the freehand technique. In these control groups, the accuracy of EVD insertion was 72% and 73%, respectively. Notably, the study conducted by Schneider et al. (2023) initially indicated that the freehand technique (73%) yielded higher accuracy than AR (68%). However, after undergoing three sessions of training with AR navigation, accuracy significantly improved to 95%. In all three studies, EVD insertions were carried out by neurosurgeons using Microsoft HoloLens. The study conducted by Kunz et al. (2021) was performed on patients, and the other two on anthropomorphic phantoms.

Fig. 3
figure 3

Percentage of accurately placed drains according to the Kakarla grading system. The rate of accurate AR insertion is presented in black. The accuracy reported by control groups using the freehand technique is presented in grey. The study by Umana et al. (2021) lacks a control group

4 Discussion

The conventional freehand technique, relying on anatomical landmarks, remains the main method for EVD insertion (Huyette et al. 2008). However, due to the lack of subsurface guidance, misplacement rates are high, even in experienced hands (Frisk et al. 2022; Munzer et al. 2019). This review summarizes the current evidence for the use of AR for EVD insertion (Li et al. 2018; Skyrman et al. 2021; Chiou et al. 2022; 2023; Schneider et al. 2021; 2023; Kunz et al. 2021; Benmahdjoub et al. 2023; Bounajem et al. 2023; Gibby et al. 2021), in an effort to provide insights into the benefits and challenges of implementing AR navigation for EVD placement.

4.1 Benefits

Current literature indicates that AR may serve as an accurate guidance system for EVD insertion. When performed by neurosurgeons, AR guided EVD insertions exhibited 5 mm deviation from the planned target (Li et al. 2018; Skyrman et al. 2021; Chiou et al. 2022; 2023; Schneider et al. 2021; 2023; Kunz et al. 2021; Benmahdjoub et al. 2023; Bounajem et al. 2023; Gibby et al. 2021). In that context, the typical width of a normal ventricle is within the range of 9.6 to 18.3 mm (Toma et al. 2009). With a 5 mm deviation, the catheter tip would still be correctly placed in the ipsilateral ventricle in most cases. The inbound error of most conventional navigation systems is in the range of 2–3 mm (Frisk et al. 2022). AR studies using the Kakarla grading system (Kakarla et al. 2008) also showed that 82–96% of the catheters were correctly placed in the ventricle (Kakarla grade I). Compared to the freehand technique, AR navigation excels in accuracy.

In clinical practice, EVD insertions are often performed by junior residents (Guha et al. 2017), and it is of interest to note that insertions performed by non-specialists were more accurate than freehand. In this review, four of the studies included non-specialists. While Eom et al. (2022) and Van Gestel et al. (2021) reported deviations exceeding 7 mm, Benmahdjoub et al. (2023) and Bounajem et al. (2023) found deviations lower than 2 mm. Additionally, Van Gestel et al. (2021) and Bounajem et al. (2023) discovered that non-specialists positively influenced the learning curve, resulting in superior performance among untrained individuals compared to trained ones.

EVD insertions are often urgent procedures, and the use of conventional navigational technologies may conflict with the need for speed (AlAzri et al. 2017). In this context, AR presents a fast and user-friendly alternative requiring less bulky hardware at a competitive cost (Chiou et al. 2023; Khoshnevisan and Allahabadi 2012; Mikhail et al. 2019). Importantly, AR-navigation systems based on HMD-technology are sufficiently mobile to be used outside of the OR, such as in emergency bedside procedures (Karmonik et al. 2018).

4.2 Challenges

The benefits of adding AR-navigation to the relatively simple procedure of EVD-insertion are arguably greatest in novice hands (Li et al. 2018; Skyrman et al. 2021; Schneider et al. 2021; Kunz et al. 2021; Van Gestel et al. 2021; Eom et al. 2022). However, proficiency requires familiarity and training with the systems. Neither the novice nor the expert can expect to perform AR-navigated procedures with ease, if the systems are used only in extreme cases when their benefit is obvious. Apprehensions of the neurosurgeons, that a relatively straightforward procedure becomes overly complex need to be addressed if the technology is to become adopted (Skyrman et al. 2021).

The adoption of AR for EVD insertion introduces some technological challenges. One is the choice of interface. Optic see-through devices can impair depth perception due to accommodation-divergence, where the eye must continuously refocus between the nearby glass display and the distant real object. In contrast, semi-transparent HMDs and external displays reduce perceptual discrepancies by offering control of both virtual and real environments. Schneider et al. and Kunz et al. (2021) discussed that most HMDs are designed for domestic use and may not perform as well in an OR. External monitors provide navigation without the need for eye tracking, but their placement in the OR may be challenging. Additionally, insufficient data processing and transfer speeds may introduce delays in remapping the virtual environment to the real world and cause disturbing visual lag (Dixon et al. 2014).

Another challenge associated with all types of navigation is brain shift (Munzer et al. 2019; Yeh and Wickens 2001). This may be remedied by using intraoperative imaging to update the AR view (Mikhail et al. 2019). However, none of the included studies have addressed this issue.

The use of AR guidance may lead to “inattentional blindness”, where the surgeon becomes distracted by information added to the visual field. Solid overlays may be replaced by wire-mesh and inverse-realism overlay techniques to reduce inattentional blindness (Lai et al. 2022; Dixon et al. 2014; Yeh and Wickens 2001; Marcus et al. 2015). None of the studies reviewed provided a cost–benefit analysis. If the cost is too high, it could hinder adoption of the technology in smaller centers or in low- and middle-income settings, where it may provide the greatest benefit (Meola et al. 2017).

4.3 Limitations of current studies

The majority of the studies were conducted on anthropomorphic phantoms. While these phantoms are designed to be realistic, they do not fully replicate a clinical situation (Chiou et al. 2022). Unfortunately, the studies on patients had small sample sizes. Not all studies incorporated control groups, which hinders the assessment of improved accuracy. Meta-analysis of accuracies could not be performed since outcome metrics were heterogenous. Most studies did not report the number of insertion attempts required for successful placement, limiting the reliability of the accuracy data. A high number of attempts is associated with an elevated risk of infection, tissue damage, and hemorrhage (Yuen et al. 2018). Most studies exhibited a high risk of selection bias and suboptimal quality according to the NOS, decreasing the overall quality of the evidence.

4.4 Future studies

Randomized controlled trials are needed to compare the efficiency of AR navigation to conventional navigation in a clinical setting. Important variables are the choice of interface, the ease of use, efficacy, accuracy, solutions to inattentional blindness and visual lag. Cost-effectiveness of AR navigation in relation to surgical volumes must be investigated. It is imperative that AR interfaces are developed to seamlessly integrate with the demands of an operating room environment and come at a competitive cost.

5 Conclusion

Current evidence demonstrates that AR is more accurate than free-hand technique for EVD insertion. However, studies are few, the technology developing and there is a need for further studies on patients in relevant clinical settings.