Introduction

Due to the proximity of pivotal neural and vascular structures, pedicle screw placement can be associated with significant complication rates [1, 2]. To ensure safe pedicle screw placement, assistive methods such as utilizing anatomic landmarks and fluoroscopic or computed tomography-guided imaging have been employed, with exposure to notable radiation for both the patient and theatre staff in the process. Other methods (e.g. mechanical drilling, 3D-printed preoperative planning) have been implemented in an attempt to reduce radiation exposure [2, 3]. However, these methods can prove expensive and prolong preoperative planning. Additionally, rates of accurate pedicle screw placement still vary considerably, thus questioning their overall efficacy.

More recently, there has been notable excitement around developments in the realm of augmented reality and potential applications for the field of spine surgery. Augmented reality intraoperative navigation (ARSN) improves visualization of the surgical field with 3D intraoperative imaging produced via optical cameras incorporated into a ceiling-mounted C-arm (Fig. 1), aided by adhesive skin markers that facilitate tracking of patient position [3,4,5]. This allows surgeons to view an ideal trajectory path for pedicle screw placement superimposed on subjective visual fields through a headset, while simultaneously viewing the surgical field [1,2,3]. Furthermore, there is no need for pre-registration due to an integrated system. A recent systematic review by Vavra et al. [1] reports an accuracy range of 1–5 mm for augmented reality systems, highlighting its safety and efficacy across a variety of surgical procedures. However, little is known regarding the collective efficacy of ARSN for pedicle screw placement in spine surgery.

Fig. 1
figure 1

A and B: augmented reality with optical camera-based navigation and tracking highlighting the trajectory path from an oblique view at different time points, adapted from Elmi-Terander et al. [5] (2016) under the creative commons attribution-non-commercial license 4.0 (CCBY-NC), where it is permissible to download, share, remix, transform and build-up the work provided it is properly cited and not used for commercial purposes

Methods

Two independent reviewers (J.M.M and S.Y) performed a literature search per Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [6]. In the case of disagreement, a senior author was consulted (J.S.B). A comprehensive search was performed for eligible articles using the Medline/PubMed, Cochrane and Embase database to include studies up to, and including July 1st 2022. Search terms included “augmented reality” and “spine surgery” and “pedicle screw placement”. Inclusion criteria were predefined as comparative (i) clinical or cadaveric comparative studies, (ii) non-comparative studies and (iii) phantom or printed models. Exclusion criteria were articles not available in English. Bibliographies of retrieved, full-text articles were screened for further studies meeting the inclusion criteria.

Relevant data extracted were results of screw placement accuracy and breaches, as defined by the Gertzbein–Robbins classification; grade 0 (screw within the pedicle without cortical breach), grade 1 (0–2 mm breach, minor perforation including cortical encroachment), grade 2 (2–4 mm breach, moderate breach), grade 3 (more than 4 mm breach, severe displacement), grade 4 (4–6 mm breach) and grade 5 (> 6 mm breach). Accuracy was defined as the summation of grade 0 and grade 1 events per the Gertzbein–Robbins classification [7]. Additionally, events of lateral and medial breaches and mean deviation from preoperative plans were also collected. All extracted data was collated and compared for augmented reality-assisted pedicle screw placement.

Results

The initial literature search produced 104 results. After the removal of duplicates, 93 remained. Seventy-two articles were excluded after a review of the title and abstract. Thus, 27 studies were included for full-text review, with 23 studies subject to qualitative synthesis (Fig. 2).

Fig. 2
figure 2

Preferred reporting items for systematic review and meta-analyses (PRISMA) study selection flow diagram

There was a total of 74 cadavers (including 2 pig cadavers), 118 patients, 61 phantom models and 80 sawbone (agar-based) models across all studies. In terms of study design, 14 studies were cadaver, four mixed methods clinical approach, three phantom, one sawbone models in agar-based gel and one prospective case series. Overall, 2232 ARSN screws were placed. The basic characteristics of each study are outlined in Table 1. Of note, one study was omitted from further analysis in terms of collectively evaluating overall accuracy and respective breaches as the ARSN patient cohort was included in an additional study [18, 26]. Thus, only one of the two studies was included for analysis.

Table 1 Characteristics of studies

Clinically accurate placement

Twenty studies reported clinically accurate placed screws (Gertzbein and Robbins Grade 0+1). The range of ARSN clinically accurate placed screws was 26.3–100%. Out of the 2232 total ARSN screws that were placed, 2095 (93.1%) were deemed clinically accurate (Table 2). Comparatively, cadaver (1155/1237; 93.4%) and clinical studies (777/827; 93.4%) showed improved accuracy over phantom/sawbones models (123/168; 73.2%).

Table 2 Results of “clinically” accurate pedicle screw placement

Grade 0

Seventeen studies reported grade 0 breaches. nine studies were cadaver, four were clinical and one was phantom. Grade 0 breaches ranged from 0 to 100%. A total of 2088 screws were placed, of which 1395 were reported as grade 0 breaches (66.8%) (Table 3).

Table 3 Results of grade 0 pedicle screw placement

Grade 1 breaches

Seventeen studies indicated grade 1 breaches. The range of grade 1 breaches was 0.0–95.6%. Overall, there were 595/2088 (28.5%) grade 1 breaches out of all screws placed (Table 4).

Table 4 Results of grade 1 pedicle screw placement

Grade 2 breaches

Grade 2 breaches were described in seventeen studies. Grade 2 breaches ranged from 0.0–45%, with a collective 5.4% of screws placed (112/2088) representing grade 2 breaches (Table 5).

Table 5 Results of grade 2 pedicle screw placement

Grade 3 breaches

Similarly, grade 3 breaches were reported in seventeen studies. Three, with a range of 0.0–6.3%. A total of 2088 were placed, of which 33 (1.6%) were grade 3 breaches (Table 6).

Table 6 Results of grade 3 pedicle screw placement

Grades 4 and 5 breaches

Only two studies reported grades 4 and 5 breaches [11, 19]. Molina et al. [19] reported one grade 4 (1/120; 0.8%) and three grade 5 breaches (3/120; 2.5%), while Farshad et al. [11] reported one (1/80; 1.3%) grade 4 breach.

Medial breaches

Nine studies reported medial breaches, with a range of 0.0–52.6% of screws placed. From the 926 screws placed, there were 29 medial breaches (3.1%), as outlined in Table 7.

Table 7 Results of medial breaches pedicle screw placement

Lateral breaches

Nine studies included lateral breaches. Incidence ranged from 0.0 to 20.8% across respective studies, which represented 2.3% (21/926) of screws placed (Table 8).

Table 8 Results of lateral breaches pedicle screw placement

Mean linear and mean trajectory/angular deviation

Mean linear deviation was described in four studies, with a range of 1.3–5.99 mm. Mean angular/trajectory deviation was reported in six studies. The range of deviation across studies was 1.6°–5.88°. Results of individual studies are outlined in Table 9.

Table 9 Results of mean linear deviation and mean trajectory/angular deviation

Discussion

Since the inception of virtual reality by Robert Mann in 1965, augmented reality has been adapted across various sectors and recently, healthcare. To date, studies depict the successful application of AR navigation systems across several surgical fields [27], with neurosurgery particularly innovative regarding the early adoption of augmented reality into clinical practice. For example, Skyrman et al. [28] demonstrated that the mean accuracy for percutaneous cranial biopsy needle insertions (n = 30) was 0.8 mm ± 0.43 mm, deemed an acceptable discrepancy from pre-planned trajectories. Demonstrated accuracy is further complimented by its efficiency in terms of workflow, with implementation of AR resulting in similar operative times to that of conventional methods [28]. Furthermore, the versatility of AR application is depicted in the literature as with Cercenelli et al. [29] who report a comparable success rate (100% vs 97%) in task-related osteo-myocutaneous fibular flap reconstructive surgery on 3D-printed leg phantoms (HoloLens vs tablet), with the AR model showing potential to minimize registration error in clinical practice [29].

In terms of orthopaedics, the first VR orthopaedic application was a knee arthroscopy simulator introduced in the 1990s. Since then, AR has been employed in numerous educational settings [30], in addition to pre-clinical diagnostic and therapeutic procedures [31, 32]. Tsukada et al. [33] demonstrated in a sawbones model that an AR-KNEE system was efficacious as CT measurements for global alignment in total knee arthroplasty (TKA) models. The absolute differences between the values displayed on the smartphone screen and CT measurement values for varus/valgus, posterior slope, internal/external rotation angles and thickness of the resected bone were 0.5° ± 0.2° (less than 1°), 0.8° ± 0.9° (less than 1°), 1.8° ± 1.5° (< 2°) and 0.6 mm ± 0.7 mm, respectively. Thus, this study highlights the reliable and reproducible accuracy of the AR system for coronal, sagittal and rotational alignment in tibial bone resection during TKA [33]. Additionally, Cho et al. [34] demonstrated in a porcine femur model of template introduced tumours, that the AR system demonstrated improved accuracy (p < 0.05) of resection compared to that of conventional resection. One hundred and sixty-four tumours were resected in 82 femurs in the AR group, with a mean error of 1.71 mm (range 0–6 mm). Comparably, 82 tumours were resected in 41 femurs in the conventional resection group, with a mean error of 2.64 mm (range 0–11 mm), with the AR group more probable to attain a 10 mm surgical margin compared to the conventional group (90.2% vs 70.7%) [34].

In spine surgery, AR has been used to assist pedicle screw insertion in phantom, pre-clinical and recently clinical models. ARSN improves visualization of the surgical field with 3D intraoperative imaging produced via optical cameras incorporated into a ceiling-mounted C-arm, aided by adhesive skin markers that facilitate tracking of patient position. A video-based system providing augmented reality surgical navigation ARSN with intraoperative 3D imaging proves the latest advancement in computer-assisted navigation. Furthermore, it has been shown to be accurate, safe and effective for pedicle screw placement [26]. Nevertheless, it remains in its infancy, with only five clinical studies and seven comparative studies to conventional techniques noted in the literature to date. Peh et al. [21] report in a cadaveric model that AR placed lumbar pedicle screws showed an improved comparative accuracy to that of fluoroscopy-guided screw placement (64/68; 94.2% vs 60/68; 88.2%), with reproducible accuracy across studies in both open and percutaneous/minimally invasive approaches. Pre-clinical findings are corroborated by recent clinical studies as demonstrated by Gu et al. 2020 [23], who in a mixed methods study of 50 patients undergoing lumbar pedicle screw placement, show improved accuracy for AR-guided screws (136/142; 95.77%) compared to conventional freehand technique (123/138; 89.13%). In addition to accuracy and avoidance of breaching the anterior cortex of vertebral body, the importance on accuracy in the axial plane and mitigation of lateral and medical breaches is integral. Three studies (all pre-clinical) report on lateral breaches for AR vs conventional technique; Elmi-Terander et al. [26] (5/23; 21.7% vs 15/33; 45.5%), Peh [21] (2/29; 6.7% vs 7/21; 33.3%), Urakov [15] (3/18; 16.7% vs 3/10; 30%), all showing reduced rates of lateral breaches for AR technology. In contrast, the same studies higher rates of medial breaches with ARSN; Elmi-Terander et al. [26] (2/23; 8.7% vs 2/33; 6.1%), Peh [21] (2/29; 6.7% vs 1/21; 4.7%), Urakov [25] (10/18; 55.5% vs 3/10; 30%). Such findings highlight certain obstacles to further clinical implementation of AR technology of thoracolumbar screws, particularly if there are reduced occurrences of medial breaches, due to concerns of the narrow spinal canal in the thoracic region. However, a reduced degree of lateral breaches could provide encouragement to the novel application for AR-assisted pedicle screw placement in the cervical region, and mitigate concerns due to the close proximity of vertebral arteries to cervical pedicles, with similar accuracy rates to novel intraoperative navigation systems employed for similar purposes [35, 36]. However, ARSN has certain advantages such as absence of intraoperative radiation experienced with intraoperative C-arm, and a reference field framed around the surgical site rather than one particular point of reference that may need to located a number of vertebrae away from the vertebrae level of concern. This can prove vital as shown that the further the distance from the vertebrae level of interest that the reference is located, the greater potential for inaccuracy [36].

Nevertheless, there are some obstacles to the widespread clinical adoption of AR-assisted spine surgery, in addition to the one aforementioned. It is currently unclear the degree of the learning curve associated with AR technology. The learning curve is a commonly used term in surgery to describe the process of demonstrating proficiency in a consistent manner. Several ARSN models exist on the market, whose comparative characteristics are not within the scope of this article. None of the studies included in this study report on the associated learning curve with AR-assisted pedicle screw placement. Furthermore, many of the studies are inclusive of experienced surgeons. As such, formal studies are needed to conduct and evaluate the learning curve for AR-assisted pedicle screw placement across varied levels of experience, with formal assessment measures such as objective structured assessment of surgical skills (OSATS) [36]. Secondly, institutions must aware of their capacity to afford the initial capital costs associated with erecting an AR-assisted spine surgery program. However, it is often considered a low-cost alternative requiring minimum infrastructure compared to certain intraoperative navigation systems and can be employed as a supplement to existing equipment for procedures other than pedicle screw placement, such as operating microscopes [37]. Other limitations include micromovement of fiducial markers which can affect accuracy, inadequate brightness or inattention blindness from natural light and battery life of AR head-packs [38, 39]. However, this is expected to be addressed and rectified as models advance, and AR may serve a cost effective alternative to navigation or robotic systems for smaller institutions wishing to provide modern surgical techniques for improved radiological and clinical outcomes. The mitigation of such concerns will further compliment technical advantages such as negating the need for surgeons to switch their eyeline from a 3D surgical field to a 2D bedside navigation system, and the absence of a reference frame in the surgical field, a current feature of certain navigation systems.

Computerised spine navigation using robotics and augmented reality is becoming more frequently used and is appealing to surgeons for both open and minimally invasive procedures. Comparing accuracy of different methods of pedicle screw placement yields interesting results [41]. Fatima et al. examined the differences of pedicle screw placement between robotic-assisted placement using augmented reality and freehand surgery. They found that patients undergoing robot-assisted screw placement had 1.68-fold greater likelihood of achieving ‘perfect’ accuracy, defined as when the screws were completely situated within the pedicle [41]. The meta-analysis by Fatima et al. also compared robotic-guided and fluoro-guided instrumentation placement in minimally invasive surgery. It found that there was a significantly higher risk of complications among patients with fluoro-guided surgery (OR 12.2; 95% CI 4.1–36.3) [41]. Importantly, this study declined to conclude that robotic guidance is superior to freehand techniques based on the quality of the data [41]. Comparing pedicle screw placement accuracy between freehand and computer-aided placement using augmented reality, Amiot et al. found that in screws placed between T5 and S1, freehand placement had an error rate of 15.3% from 544 cases, whereas there was a 5.4% error rate from 294 cases when computer-assistance was used [42]. Yu et al. had similar findings when looking at lumbar pedicle screw placement, with 4.6% of ARSN placed screws having malposition versus a 16% malposition rate for freehand placement [43].

Advantages to conventional freehand techniques over robotic guidance and ARSN include the robustness and commonly practiced nature of the technique, the ability to experience tactile feedback when placing screws which can influence real-time decision making, reduced costs and the ability to expediently adjust screw placement if required [41]. Furthermore, to our knowledge, there are currently no prospective studies available which show a significant difference between ARSN and robotic-guided procedures in terms of efficacy and mortality. Advantages of the robot guidance are the reduced ergonomic strain of screw placement, mechanical precision, the use of minimally invasive access, execution of preprogrammed screw trajectories and the use of mechanical components to improve surgical dexterity, such as tremor filters, motion scaling and directional locks to ensure accurate pedicle screw trajectories. Robot-assisted surgery is also known to have a lower operating time and radiation exposure compared with freehand placement [44, 45]. These factors may reduce fatigue or user-dependent errors. Drawbacks of robotic systems include the possibility of mechanical failure, the additional space requirement for robotic units and imperfect sensory feedback to the user console [44]. Advantages of augmented reality platforms include the use of three-dimensional imaging for visualisation of spatial parameters to improve accuracy [44]. Furthermore, the ability to reduce the surgeons ‘extrinsic’ cognitive load, by providing the navigational information within their view of the surgical site, eliminating the need to switch attention from a screen display to the surgical site [45]. Allowing integration of preoperative imaging and the intraoperative surface may also reduce the requirement for intraoperative imaging and thereby improve operational workflow while also reducing radiation exposure [45]. Additionally, ARSN systems have enhanced portability and reduced costs when compared with bulky and expensive surgical robotic systems [45]. Combining ARSN with robotic guidance, however, has obvious benefits in terms of safer, more accurate procedures with improved outcomes, lower complication rates, reduced levels of radiation exposure and shorter operating time, particularly in the setting of minimally invasive surgery. Combining these methods has the capacity to build on the benefits of each system, while eliminating many of the possible drawbacks to each method.

Thus, it is envisioned that further clinical application of AR-assisted spine surgery may be inevitable. However, there is a need for more robust prospective clinical studies for pedicle screw placement, in addition to pre-clinical versatility and proof-of-concept studies such as cervical pedicle screw placement, tumour resection, comparison to robotic navigated systems and efficacy for percutaneous non-surgical procedures such as lumbar facet joint injections. Patient cohorts should be matched accordingly for comparative studies. Only two out of five overall clinical studies which exist in the literature compare ARSN to conventional methods [14, 26]. However, different patient populations does not allow for accurate comparison between studies. Furthermore, AR could be employed to improve the consent process by demonstrating individualised surgical approaches in a manner retainable for surgical patients, improving several aspects of spine surgery both for the patient and surgeon. Nevertheless, from our study in particular, ARSN appears a safe, accurate and efficacious method of assisting in thoracolumbar pedicle screw placement.

Conclusion

ARSN pedicle screw placement had shown its consistent accuracy across a variety of models. However, its adoption into clinical spine surgery remains in its infancy, with little known regarding the learning curve. Further robust prospective studies are needed to accurately compare to conventional methods of placement.