Introduction

Achieving optimal clinical and functional outcomes after total knee arthroplasty (TKA) depends on a number of factors such as leg alignment, orientation of the components, as well as adequate soft tissue balancing [2, 9, 21, 24]. Assessing soft tissue balance is challenging for the surgeon, as the perceived “feeling for balance” is influenced by several elements such as surgical experience, patients’ body mass index (BMI), gender, soft tissue laxity, the degree of joint contracture, as well as the fitness level of the surgeon [14]. Consequently, this might lead to an unbalanced knee as well as further revisions, which both reduce the patients’ quality of life [11]. Hence, the development of new methods to assess intraoperative ligament laxity and balancing is important [28]. One of the current debated methods is the use of intraoperative sensor technology, which allows surgeons to dynamically quantify in vivo joint laxity and perform a more objective soft tissue balancing [26]. These sensors enable the surgeon to receive objective intraoperative digital feedback [17]. Intraoperative pressure sensors may play an important role in reducing dissatisfaction related to knee stiffness or instability [11]. Despite their ability to improve soft tissue balance, it is not clear whether the use of intraoperative sensors improve clinical and functional outcomes in TKA. Several studies have shown improved early outcomes with the use of sensors [8, 11, 12], whereas others have failed to demonstrate any clinical benefit [18, 25]. Although an optimal balance of the medial and lateral joint gaps in flexion and extension has been considered an important determinant of clinical and functional outcomes, improved outcomes when compared with TKAs that are not balanced, have not yet been proven [9, 17]. One of the commercially available intraoperative sensors is the Verasense sensor [17]. Due to its widespread use, and to the fact that the majority of the studies, which report on clinical and functional outcomes following sensor-assisted TKA, are using Verasense, the present study focuses on its use. The aim of this systematic review was to investigate the clinical and functional knee outcomes after Verasense sensor-assisted TKA (VA TKA) and to compare these outcomes where possible with those from manually balanced TKA (MB TKA). The hypothesis was VA TKA does yield satisfactory postoperative clinical and functional outcomes, but not superior to those after MB TKA.

Materials and methods

A systematic literature search was conducted on PubMed, Embase, Medline and Scopus to identify potentially relevant articles published between June 2012 and June 2022. Terms including “intraoperative sensor”, “sensor technology”, “sensor assisted”, “Verasense”, “pressure sensor”, “force sensor”, “sensor balancing”, “total knee arthroplasty”, “knee revision”, “knee replacement”, “revision arthroplasty”, “TKA” and “TKR” were searched for in both title and abstract. Detailed information regarding the used search strategy can be found in “Additional file 1”.

Inclusion criteria were full text English- or German-language clinical studies, published in peer-reviewed journals that assessed the clinical and functional outcomes following VA TKA. Only original research articles were considered. Preprints, abstract-only studies, protocols, reviews, expert opinion studies, book chapters, surgical technique studies, and studies pertaining only to unicondylar knee arthroplasty (UKA) or patellofemoral arthroplasty (PFA), were excluded. Only articles reporting clinical and functional knee outcomes after VA TKA were included.

After collecting all articles and removing duplicates, two authors screened the studies by title and abstract analysis for inclusion. In a second step, selected articles were checked for their eligibility by full text analysis. In case of uncertainty regarding inclusion a third author was consulted. A further manual screening of the reference lists of included articles was done for additional studies. For this review, only studies reporting clinical and functional outcomes were included. Endpoints included various postoperative patient-reported outcome measures (PROMs) such as Knee Society Score (KSS), Oxford Knee Score (OKS), Knee Injury and Osteoarthritis Outcome Score—4 Subscales (KOOS4), Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) and Physical Function—Computerised Adaptive Testing (PF˗CAT), as well as range of motion (ROM), rate of arthrofibrosis and reoperation rate.

Quality assessment

The methodological quality of the included comparative and non-comparative non-randomised clinical studies was assessed by two raters independently, using the Methodological Index for Non-Randomised Studies (MINORS). The global ideal score is 16 for non-comparative studies, and 24 for comparative studies. For the assessment of included randomised control trials (RCTs), the Jadad Scale was used, with the maximum score being 5. The level of evidence of the included studies was also reported.

Data extraction

From the selected publications, title, author, year of publication, study design, level of evidence, number of knees, follow-up time, patient demographics, clinical outcome scores and functional outcomes were extracted.

Statistical analysis

Continuous variables were described with mean and standard deviation or median and range. Categorical variables were reported with absolute and relative frequencies. A p < 0.05 was considered statistically significant.

Results

Search results and characteristics of included studies

The literature search identified 243 scientific articles in the initial screening. A total of 11 met all the inclusion criteria following the selection process shown in Fig. 1 (Additional file 2). Median MINORS for non-comparative studies value was 11.5 (11–12), and for comparative studies 18.2 (13–21). Median Jadad Scale score was 3.6 (2–5). Results from a total number of 3633 patients were evaluated (median age at surgery 68.5 years, range 32–88 years). The reported median body mass index (BMI) was 30.8 (17–45). Several scores (KSS, OKS, WOMAC, KOOS4, PF˗CAT), alongside postoperative ROM, reoperation rates and the rate of manipulation under anaesthesia (MUA), were used to evaluate clinical and functional outcomes. The detailed characteristics of the included studies are presented in Table 1 (Additional file 3).

Clinical outcomes

ROM was evaluated in five studies (Table 1). They reported no statistically significant differences between the VA and MB groups [5, 7, 19, 25, 29]. Mean follow-up time ranged from 3 to 24 months.

Table 1 Summary of reported flexion

Nine studies examined the rate of MUA as an assessment of postoperative arthrofibrosis (Table 2). Geller et al. reported that the rate of MUA in the MB TKA group was higher than in the VA TKA group, and that the period between TKA and MUA was on average 13.5 weeks and 7.5 weeks, respectively [15]. Some of the other studies also showed higher rates of MUA in the MB TKA group, but with no significant differences. The mean follow-up time ranged from 6 to 24 months [1, 3, 5, 7, 16, 18, 19, 25, 29].

Table 2 Summary of reported rate of MUA

Livermore et al. and MacDessi et al. reported the reoperation rate at a mean follow-up of 12 months and 24 months, respectively, with no significant differences [16, 18].

Patient-reported outcome measures

Four studies have evaluated pre- and postoperative KSS (Table 3). These studies showed improved postoperative KSS, for both groups (VA TKA and MB TKA), but with no significant differences between the analysed groups at 6 and 12 months [3, 5, 25].

Table 3 Summary of reported KSS scores

Three studies evaluated pre- and postoperative OKS (Table 4). Although the 6-month OKS scores were higher in the VA TKA group (p = 0.01), no significant differences were found between the two groups at 1- and 2-year follow-ups [3, 5, 29].

Table 4 Summary of reported OKS scores

Three studies have reported preoperative and 1- and 2-year postoperative KOOS and WOMAC after VA and MB TKA [16, 19, 25]. They described improved postoperative scores in both groups, but with no significant differences between them. Detailed information can be found in Table 5.

Table 5 Summary of other reported PROMs

Discussions

The main findings of this study are that both VA TKA and MB TKA showed improved clinical and functional scores, but that there is no statistically significant difference in between VA TKA and MB TKA [5, 7, 19, 25, 29]. Similar findings were shown for PROMs (OKS, KSS, WOMAC, KOOS4 and PF-CAT) [5, 16, 18, 19, 25, 29]. There is only one exception, which has been also reported by Chow et al. who described a statistically significant difference (p < 0.025) in OKS at 6-month follow-up between VA TKA and MB TKA groups [3]. The VA TKA group scored higher (Δ17 vs Δ13). A similar difference for the KSS (Δ63 vs. Δ52) has also been shown by Chow et al. [3]. However, the findings although significant do not represent clinically relevant differences. In addition, in this study, only the mean change between preoperative OKS/KSS and 6-month postoperative OKS/KSS and not the absolute values were presented.

The most interesting finding of the present review is the fact that when no pressure sensors were used for TKA, the knee was more likely to undergo a manual manipulation [1, 3, 7, 16, 17, 19, 29]. However, although there was clear difference for a higher need for MUA in the conventional groups, the findings were not always statistically significant. This might be due to the laxity differences in the included knees. Many studies have just recently highlighted the importance of the envelope of laxity [9, 10]. In line with the ongoing knee phenotype discussion, with regards to alignment parameters, the identification of different knee phenotypes for the individual envelope of laxity has just begun [20, 27]. When having identified different knee laxity types, the question arises how to measure these pre-, intra- and postoperatively. Pre- and postoperatively stress radiographs might play a major role. Intraoperative sensor technology, navigational or robotic devices might also be helpful. However, all technologies available are still in the early childhood and needs to mature to make a real clinical impact in TKA. The biggest obstacle besides availability is questionable inter- and intra-observer reliability of all such tools, when measuring stress or pressure in the different joint compartments.

The findings of this review are in line with the findings of Sun et al. in their metanalysis [26]. The number of included studies was lower than in ours. Since then, many more studies have been published. Another difference lies in the availability and the language of the studies included. In the present review, no articles published in other languages aside English and German were included. In addition, the articles had to possess identification numbers (i.e. PID, DOI), or these had to be accessible (e.g. in online medical libraries [i.e. PubMed, Scopus, Embase, Medline, Google Scholar] and/or peer-reviewed journals).

This study presents a number of limitations. The low number of RCTs analysed in our review (i.e. three) combined with the inferior quality of some non-randomised clinical studies limits the level of evidence. The low number of enrolled patients in the majority of analysed studies could also play a major role as the ability to identify a statistically significant difference between study groups is heavily linked to the sample size. Although all present completed studies pertaining to this subject have been included, more RCTs and prospective studies are currently in different stages of completion [4, 23]. Their results are yet to be reported; thus, there is a possibility their future findings might not concur with ours. The variability between included studies regarding surgical techniques, type of prosthesis (i.e. cruciate retaining [CR] or posterior stabilised [PS]), endpoints, outcomes, follow-up periods and size of cohorts adds to the heterogeneity of the study samples. In addition, the follow-up period for these studies is relatively short, with no study analysing long-term outcomes (clinical or functional). In the end, this systematic review only assessed studies which included a certain type of intraoperative sensors (Verasense). Several other sensors are available on the market (e.g. eLibra, Omnibot), to which our findings may not apply [6, 15]. The number of available studies reporting on outcomes following these sensors is low though, and there is none comparing those outcomes to the ones following MB TKA. Therefore, inclusion of these studies in the review, and a subsequent comprehensive analysis of their findings, has not been performed.

Conclusion

The use of Verasense pressure sensors in TKA leads to no significant improvement in ROM, reoperation rate or functional outcomes, when compared to the standard manually balancing technique. However, lower rates of MUA have been described in the VA TKA group.