Introduction

Osteoarthritis (OA) of the knee can be a painful and debilitating condition and total knee replacement (TKR) is an effective treatment to alleviate pain and restore function for patients with OA [1]. Alignment and balance of the components in TKR is critical to the kinematics of the knee; often subtle changes in balance and alignment are only perceived by more active patients or whilst doing more functionally demanding tasks. The introduction of computer navigation improved the accuracy of alignment in the coronal and sagittal planes [2] but this has not translated into improvement in functional outcome or satisfaction [3, 4] and failed to address rotational alignment [5]. With manual instrumentation, there are two techniques commonly used to determine the rotational alignment of the components in TKR: gap balancing (GB) and measured resection (MR). Which technique is superior has been the subject of numerous studies [6], however, there is still continued controversy regarding the best method to determine accurate femoral component rotation.

It has been widely accepted the MR does not restore natural knee kinematics and does not necessarily recreate the natural joint line obliquity, joint height [7], femoral rotation or joint laxity. Accepting this compromise, MR TKR can provide relief of pain, a correction of deformity and a stable predictable joint. MR with Mechanical Alignment (MA) has been the predominant technique for TKR over the last 50 years [8] and it uses independent bone cuts and ligament releases to balance the joint, it also decouples the relationship between the femoral and tibial components in flexion and extension, in particular femoral rotation and flexion. With MR MA alignment is prioritised over balance, there is now substantial literature suggesting that balance may be as important, if not more so [6].

GB uses the patient’s soft tissue envelope to dictate the bone cuts and to achieve this in extension and flexion requires specific instrumentation, navigation or robotics. The compromise with GB is a sacrifice of the historical alignment goals [9].

The subject of TKR surgical philosophy and defining alignment goals remains topical [10] and techniques such as kinematic, natural or anatomical alignment have gained traction over the last few years; leaving patients with preoperative constitutional varus in mild postoperative varus has been associated with improved functional and clinical outcomes [11]. Results from studies utilising alternative alignment techniques have been promising but are not yet clearly inferring long term benefit [12,13,14,15].

The Gap balancer (GB) used in this trial (Unity Knee™ EquiBalance™) is a variation on the classic GB technique. It was designed to provide intra-operative confirmation of medio-lateral and flexion–extension soft tissue balance, from which the distal and posterior femoral resections are made, without the need to make ligamentous releases. Where this deviates from previous GB methodology is that the GB is designed to accommodate anatomic alignment principles in extension as well as flexion. The EquiBalance™ GB is unique in that in extension it accommodates up to 3° alignment correction, thereby allowing a balanced cut to be made in extension and provides the opportunity to preserve the natural tibial joint obliquity (0 ± 3°). In this study the tibial cut was made in MA and the distal femoral was made in extension followed by flexion, with the use of the GB.

The aim of this study was to compare the GB technique in individualised alignment based off a MA tibial cut with a standard MR in MA and the effect on functional outcome scores and alignment.

Patients and methods

Patients were recruited from the waiting list of four consultant knee surgeons (ADT, VIM, KSE, JRP) between June2015 and August 2018. The inclusion criteria were: all skeletally mature patients listed for a primary total knee replacement, who were able to consent for the study participation and could comply with the study protocol follow-up.

Patients were excluded if they had one of the following: a coronal deformity greater than 15°, a fixed flexion contracture greater than 20°, a confirmed OA in the contralateral knee that was anticipated to require surgery within 12 months, prospects for a recovery to independent mobility that would have been compromised by a known co-existent, medical problem. Additionally, any patient known to be a substance abuser or had a psychological disorder that could have affected follow-up care or treatment outcomes were excluded.

A flow diagram of the patients in the study, according to the CONSORT guidelines is shown in Fig. 1. A total of 108 patients fulfilled the criteria and were recruited to the study. Of these, 94 (87%) underwent surgery. Fourteen patients withdrew prior to the operation. This was either due their care being transferred to a different facility because of bed pressures, or because they failed their anaesthetic assessment. A total of 9 patients had incomplete follow-up. 44 patients in the GB group and 41 in the MR had complete follow up. Post-operatively one patient in the MR group had a fall and ruptured their patella tendon requiring repair and had to be withdrawn from the functional analysis as they could not complete the tasks.

Fig. 1
figure 1

Flow diagram of recruitment and follow up

Randomisation utilised a true random number generator program and sealed envelopes. All patients were reviewed at a pre-operative clinic and at 6 weeks, 3, 6 and 12 months post-operatively by a blinded research physiotherapist.

The primary outcome measure was peak quadriceps torque measured on a digital myometer (MIE Medical Research Ltd, Leeds, United Kingdom). Function was further assessed using the Knee Injury and Osteoarthritis Outcome Score (KOOS) [16], the Oxford knee Score (OKS) [17], and the EuroQol EQ-5D [18], which were collected at each visit. A blinded research physiotherapist assessed the timed get up go (TUG) [19], 6-min walk [20], timed up and down stairs [21], Wii Fit Balance [22], as well as measuring the range of movement (ROM) with a goniometer.

Alignment parameters were measured both pre- and 6 weeks post-operatively on CT and long leg Hip Knee Ankle (HKA) radiographs, that were independently analysed by blinded assessors (HBW and RW) who were not involved in the surgical procedure. As an assessment of reproducibility of these measurements, the interobserver correlations were calculated using Pearson’s r correlation (0.97; p < 0.001).

TKRs were performed either with the use of the GB to achieve individualised alignment or with MR MA. A medial parapatellar approach, without the use of a tourniquet, was used and the patella was resurfaced in all cases. The post-operative protocol was identical for both groups. The operations were performed by one of the four consultant knee surgeons who were familiar with both techniques. The trial was registered (NCT02145455) and ethical approval for this study was obtained from the National Research Ethics Service.

The implant used in this randomised clinical trial is the Unity Knee™ (Corin Group Ltd, Cirencester, UK) Cruciate Retaining (CR) or Posterior Stabilised (PS) implant and it has instrumentation sets that allow both a MR MA philosophy as well as a GB non-MA philosophy. The Unity knee was introduced in 2013 as part of the Beyond Compliance initiative to promote responsible introduction of new prostheses [23]. The femoral component follows a single radius philosophy and is implanted with cement. The GB technique (EquiBalance™, Corin Group Ltd, Cirencester, UK) uses a manual ligament balancer that allows equal medial–lateral tension in extension through altering the position of the femoral cutting block (Fig. 2).

Fig. 2
figure 2

The GB technique (EquiBalance™) illustrating the 3° varus/valgus cut correction that can be made in extension

The GB workflow starts with a tibial cut made in MA using an extra-medullary guide with 3° of posterior slope. A 5 mm distal femoral cut is made off a femoral rod at 5° to allow for stable platform for the ligament balancer instrumentation. The extension balance is assessed and the remainder of the distal femoral cut (making up to 9 mm in total) is made with an adjustment of up to 3° in a varus-valgus plane to achieve a parallel balanced gap. The flexion cut is then made balanced off the tibial cut to achieve medial–lateral balance at the same tension as was achieved in extension. The rotation of the femoral component is not fixed but is dictated by the balance.

The patella was resurfaced using a standard technique to restore the pre-existing patella height. The ability of the extension balancer to adjust the overall leg alignment through an altered distal femoral cut infers that the overall alignment achieved does not aim for a mechanical axis of 0° and can therefore be best termed non-mechanical or individualised.

The MR workflow starts with a tibial cut made in an identical way. The distal femoral cut is made using a 9 mm 5° valgus cut off an intramedullary rod. The femoral flexion cut is made in 3° of external rotation based off the posterior femoral condyles. The patella is resurfaced using the same technique as above. Appropriate soft tissue releases were performed to ensure symmetrical and balanced flexion and extension gaps.

Statistical analysis

The primary endpoint for the study was a comparison of the power of knee extension via peak torque of quads measured on digital myometer between the two treatment groups. The sample size was powered based on the study by Gomez-Barrena et al. [24] with a difference between the two groups of 8.1 N (SD 13.0), power to 80%, α = 0.05 and lost-to-follow up based upon 95% compliance, the sample size was calculated to be 40 in each group.

Functional assessment and patient reported outcome scores are presented as means with 95% confidence intervals (CI). The scores at different visits were compared between treatment groups by means of Analysis of Variance (ANOVA) and for change from baseline by means of Analysis of Covariance (ANCOVA), adjusting for the baseline value, using mixed models. For the ANCOVA we present adjusted means (least square means) with 95% CIs. Differences in alignment between treatment groups were analysed using unpaired t-tests. A p-value ≤ 0.05 was defined clinically significant.

Results

The two groups of patients did not differ with regards to baseline characteristics (Table 1) and baseline values (Table 2). At 1 year, the primary outcome measure of mean quadriceps peak torque (Nm) measured 72.48 (SD 34.57) for GB and 69.13 (SD 23.42) for MR group with no significant difference between the two groups (p = 0.61) on ANOVA (Table 2) or ANCOVA (Table 3) testing.

Table 1 Baseline characteristics
Table 2 Results of functional tests
Table 3 Results of functional tests and KOOS pain score
Table 4 Patient reported outcome measures

There was a significant improvement in the change in ROM seen at 3 months in the GB group in comparison to the MR group (12.97° vs 6.39° p = 0.028) (Table 3.). The functional test of Wii Fit Balance showed statistically significant improvement at 6 weeks (p = 0.03) in the GB group in comparison to the MR group (Table 3). These differences were no longer statistically significant at longer follow-up.

There was no significant difference at any of the time points between patient groups when assessing their quadriceps and hamstring torque, 6-min walk test, timed stair test EQ-5D and OKS.

The pain component of the KOOS score demonstrated a significantly greater improvement for the GB group, compared to the MR group, at all time-points post-operatively, reaching what is considered the Minimally Clinically Important Change of 8–10 points [25] (Tables 3 and 4). At one year the mean difference in KOOS pain between the two groups was 10.54 (CI − 18.18 to 2.90 p = 0.008). There was no significant difference in the other subsections of the KOOS score.

Pre-operative assessment of alignment demonstrated an overall alignment mean of 5.2 (SD 5.82) degrees varus with no significant difference in overall alignment between the two groups (p = 0.77). The distribution of pre-operative alignment is shown in Fig. 3. Post-operative alignment in the MR group demonstrated an average overall HKA angle of 0.3(SD 3.4) degrees valgus compared to 2 (SD 3.6) degrees varus in the GB group, this reached significance (p = 0.05) (Fig. 4). The mean post-operative Medial Proximal Tibial Angle (MPTA) was 90.1 (SD 1.8) degrees in the MR group and 89.6 (SD1.8) degrees in the GB group (p = 0.44). The post-operative Lateral Distal Femoral Angle (LDFA) was 89.8 (SD 3.1) degrees in the MR group and of 91.0 (SD 2.9) degrees in the GB group (p = 0.22). Figures 5 and 6 illustrate the pre and post-operative change in alignment in the GB and MR groups, and demonstrate less of an overall alignment change in the GB group.

Fig. 3
figure 3

Pre-operative alignment distribution

Fig. 4
figure 4

Post-operative alignment in the GB and MR groups

Fig. 5
figure 5

GB group change in alignment from pre- (blue) to post-operatively (red)

Fig. 6
figure 6

MR group change in alignment from pre- (blue) to post-operatively (red)

Discussion

Individualised alignment philosophy utilising a GB technique did not demonstrate an improvement in the primary outcome measure of quadriceps peak torque, however, we found some improvement in early outcomes favouring the GB technique in certain other parameters. Notably the KOOS pain score, was better at 6 weeks, 3, 6 and 12 months in the GB group, although there was no difference in other KOOS subsections. No significant difference at any of the time points was found between patient groups in the 6-min walk test, timed stair test EQ-5D and OKS. ROM and balance were also improved in the GB group post-operatively but this was not maintained at 1 year. The loss of significant difference at 1 year in functional outcomes is similar to other knee trials that have looked at alternative alignment techniques [12].

Our findings suggest that implementation of an individualised alignment strategy using a gap balancing technique may infer some benefit in early functional recovery in patients undergoing TKR. The improvement in KOOS pain score has to be taken in the context of a number of the other functional and PROMs not demonstrating a difference.

Meta-analysis has previously demonstrated that GB may provide better radiographic and clinical outcomes then MR [6], hypothesizing that GB may reflect better anatomical sitting of the components. Previous studies have used GB techniques in flexion, one of the key functions and point of difference of the gap balancer in this trial is that it enables the surgeon to balance the knee in extension by adjusting the angle of the distal femoral cut. The objective intraoperative feedback on alignment enables on-table adjustments to bone resection and implant positioning to achieve balanced and equal flexion and extension gaps without having to perform extensive soft-tissue releases. Direct comparison with previous studies is difficult because of the difference in GB technique and variety of outcome scores that have historically been used. Commonly the Knee Society Score (KSS) and the Knee Society Function Scores have previously been utilized [6], but these are surgeon subjective and have now largely been superseded.

The outcome measures in this study were extensive and incorporated a variety of objective functional outcomes, along with generic and joint specific PROMs. Using a variety of scores has the advantage of being more sensitive in detecting subtle differences in outcome, but could also lead to type 1 errors. The primary outcome in this study (peak quadriceps torque) did not demonstrate any difference between the two groups. We can only hypothesize why specifically the KOOS pain score was significantly improved. No additional soft tissue releases for knee balancing were performed in the GB group in this study. This is in contrast to the MR group where appropriate soft tissue releases were performed to ensure symmetrical and balanced flexion and extension gaps. In the varus knees this included a greater medial release to the posteromedial corner if necessary. In valgus knees this included a posterolateral capsule release and pie crusting of the iliotibial band if necessary. Reduced soft-tissue trauma may have helped reduce post-operative pain as experienced in the GB group.

National Joint Registry of England and Wales data has previously shown that persistent pain following TKR is the strongest predictor of patient dissatisfaction and reduced functional outcomes including the Oxford Knee Score [26] and that pain is the most important prognostic indicator for long-term dissatisfaction following TKA [27]. Interesting, although the KOOS pain score was significant improved in the GB group, this was not reflected in the overall OKS. This likely reflects that the KOOS pain score is more specific to pain with 9 dedicated pain questions, whereas with the OKS, only 5 of the 12 questions are specifically related to pain. Reduced pain scores have been reported in papers using robotic surgery when the technique similarly preserves the soft tissue envelope [28].

The post-operative alignment analysis would support the assertion that the soft tissue envelope was preserved in the GB group. In both groups of patients, the philosophy of the tibial cut was to achieve alignment perpendicular to the mechanical axis of the tibia for insertion of the tibial component. The perpendicular tibial cut in the GB group was to address concerns that a varus tibial resection may predispose to tibial loosening and early failure [29]. There was no significant difference in the LDFA between the two groups although the mean LDFA was slightly more varus (91.0°) in the GB group in comparison to the MR group (89.8°). This was as expected, as the extension gap with the GB technique was not aiming for natural joint line obliquity. The mean pre-operative alignment for both groups was moderate varus (5.2°) and this would be in keeping with a standard knee constitutional varus [30] population, increased by medial OA. The results would suggest that with the GB the constitutional varus alignment has been preserved by maintaining the soft tissue envelope but not at the expense of placing obliquity on the implant. This has the advantages of individualised alignment without the potential risks [29] of the true natural or kinematic alignment philosophy associated with placing the tibial in varus in a limb that is overall in varus.

It should be noted that the outcomes in both groups of patients (OKS > 39 Table 4), are extremely good and compare favourably to other studies [31], this is likely to accentuate any ceiling effect. Looking closely at the results in Table 3, it was the change from baseline results that yielded significant differences in the Wii fit balance at 6 weeks, and it could be hypothesized that proprioception was improved by maintaining a more natural alignment in the GB group helping with balance in the early post-operative period, but as time progressed the patients in the MR group became more accustomed to their new alignment and so relatively improved their balance. The improvement in ROM was only significant in the GB at 3 months, but close analysis of Table 3 demonstrates that at all time points ROM was better in the GB group but not statistically significant.

There are limitations of this study that need to be considered when interpreting the findings. The number of patients in each group was adequate for the power calculation but may have resulted in a type II error. The follow up assessment was time consuming and onerous for the patients and may have led to some loss to follow up. The reported functional outcome measures were not correlated to long-term clinical outcomes beyond 1 year or implant survivorship. By using a large number of tests and a p value of p < 0.05 to denote significance, this may have led to type 1 error for some of the results. However, it should be noted that even if a p value of p < 0.01 was used there would still have been a significant difference in KOOS pain score at 12 months. A Bonferroni correction was not performed but if the p-value was multiplied with the number of tests performed then if would be difficult to demonstrate any significance.

Individualised alignment philosophy utilising a GB technique did not demonstrate an improvement in the primary outcome measure quadriceps peak torque. Improvement was seen in the GB group in PROM pain scores that was significant, both statistically and clinically, out to at least 1 year. Gains that were seen in functional assessment with GB, although significant at some time points, were no longer significant at 1 year and no difference was seen in quads strength. Compared to a MR technique, the individualised GB technique appears to confer some improvement in pain, ROM and some functional tests following TKR in the short-term.