Introduction

Long-bone non-union continues to be one of the most common post-fracture fixation complication with an incidence reported to range between 5 and 10% [1,2,3]. Its impact in terms of resource allocation and utilization can be substantial [4,5,6]. Currently, non-union according to the Food and Drugs Administration (FDA) agency is defined as a fracture being over nine months old with no signs of progression toward healing for three months [7]. However, assessing the risk of developing non-union early, within 12 weeks after fracture fixation, could be helpful for planning to intervene early and could lead not only to faster patient recovery but also to a reduction of utilization of clinical resources [8]. Previous studies have examined the value of biomarkers for predicting non-union, but up to now, whilst this work is promising, this approach has not been effective [9]. Consequently, the availability of a clinical decision rule, which could provide the orthopaedic surgeons with the extent of the risk of developing non-union, would assist them to formulate an early re-intervention plan, contrary to the usual established ‘wait and see policy’.

The aim of this case–control study was to explore the development and effectiveness of a clinical decision rule, which would provide valuable feedback in terms of the risk of developing non-union within 12 weeks after initial fixation of a fracture of the femoral or tibial shaft. This clinical decision rule would define a high-risk group for developing an impaired fracture healing response.

Materials and methods

Initially, we undertook a comprehensive evaluation of the literature in order to identify risk factors contributing to non-union of the femur and tibia [10]. Overall, ten risk factors with large effect sizes and good evidence were used to build a scoring model. The relative effects on the fracture of these factors were assessed by the use of a case–control approach.

Consequently, a retrospective case–control study was conducted at two level I trauma centres: Leeds General Infirmary, Leeds, UK, and Ospedale Policlinico San Martino, Genoa, Italy. Institutional board review approval was obtained from both institutions (LTH - 911 and OPSM-140REG2016) for the study. Using the orthopaedic trauma registries of the two departments, all the femoral and tibial non-unions treated between June 2009 and September 2016 were identified retrospectively. From the same period of time, randomly selected patients with femoral and tibial fractures that progressed uneventfully to union formed the control group of the study.

Study participants

Inclusion criteria required patients to be adults (≥ 18 years of age) having sustained a tibial and/or femoral shaft non-union following surgical stabilization. We considered as femoral shaft the segment between the lower edge of the lesser trochanter and the metaphyseal zone at least 5 cm above the femoral condyles. The tibial shaft was assigned as the segment between below the tibial tubercle and at least 3 cm above the tibial plafond distally. Non-unions that occurred in other segments of the femur and tibia and other long bones were excluded as well as fractures not surgically treated, pathological fractures due to tumours, open fractures associated with significant bone loss (defects), periprosthetic fractures, and patients with such conditions as osteogenesis imperfecta and Paget disease. In addition, segmental fractures were excluded as there are inherent difficulties in a standardized evaluation due to the variable length and location of the intermediate segment.

As surgical treatment, we consider only the three standard definitive methods of treatment including intramedullary nail (IM), plate, and circular external fixator. Patients managed definitively with axial external fixators were also excluded.

Collection of data

We developed a proforma for the accurate and consistent documentation of data. Both radiographs and clinical records of each patient were carefully evaluated, and all risk factors contributing to the development of non-union were documented. Radiographs were assessed for the state of fixation and state of mechanical stability according to the type of implant used. The fracture pattern and the location of non-union were also recorded. In addition, the initial degree of fracture displacement was documented as well as the degree of any post-fixation surgical fracture gap if present. From the clinical records, such details were extracted as the initial state of the soft tissue envelope at presentation, method of reduction during surgery, patient smoking habit, development of infection, diabetes, peripheral vascular disease, and other relevant medical details. As our institutions have stopped using non-steroidal anti-inflammatory drugs (NSAIDs) post-operatively for analgesia since 2007 due to their negative effects on bone healing, this parameter was not considered. Patients with incomplete documentation, either for radiographs or for medical records, were excluded. Patients were not matched on either demographics or clinical characteristics, but both cases and controls were selected from the same time period (2009–2016).

Study size

Up to ten risk factors were considered based on the strongest evidence from the literature review [10]. To robustly assess ten variables in a logistic regression, it is necessary to have 100 events (10 times the number of variables [11]), and consequently, 100 cases and 100 controls were recruited to the study.

Variables—definitions and measurements

Non-union

Non-union was defined as a fracture not able to heal without any further intervention within nine months of the initial surgical treatment, evidenced by no progression from three consecutive months of serial radiographs [7].

Ten risk factors for non-union

  1. i.

    Displacement. Initial deformity of the bone after a fracture, expressed as the degree of dislocation of each bone fragment over another one. We considered displacement as minimal or moderate when smaller than 75% of the shaft width and marked when greater than 75% of the shaft width [12]. The degree of displacement was evaluated on the first radiograph taken, at the highest point of the deformity regardless of the projection (anterior–posterior (ap) or lateral). In cases where due to application of a temporarily external fixator the displacement of the fragments increased compared to the original position at presentation, then this later degree of displacement was considered.

  2. ii.

    Anatomical location of fracture. Fracture that occurred in the tibial shaft rather than in the femoral shaft was considered as a location with a greater risk of non-union [2].

  3. iii.

    Fracture location according to the vascularization of the bone. The femoral and tibial shafts were divided into sections of thirds according to the degree of vascularization previously described [13]. Accordingly, for the tibia, the upper third has a rich vascularization, the middle third has a moderate vascularization, and the lower third has a deprived vascularization; For the femur, the upper third has a moderate vascularization, the middle third a rich vascularization, and the lower third a poor vascularization. Particular emphasis was given in fractures and subsequent non-unions which occurred in areas of low vascularization.

  4. iv.

    Infection. Infection being either superficial or deep was defined according to the Centers for Disease Control criteria [14]. The presence of superficial or deep infection was given the same gravitas, as many superficial infections not infrequently represent and/or become deep infections.

  5. v.

    Method of reduction. The type of reduction being closed or opened was given a different degree of importance. Opened reduction rather than closed is related with a higher incidence of non-union [15]. When the surgeon used an aid in order to achieve the best possible reduction utilizing a minimal opening at the fracture site (i.e. by using a pointed reduction forceps or other surgical instruments), then, in these cases, the method of reduction was considered as closed. For Gustilo grade II and III [16] open fractures, the method of reduction was considered as opened.

  6. vi.

    Mechanical stability. It relates to the type of surgical technique and implant used to stabilize the fracture (intramedullary nailing, plating, and external fixators) [17]. We standardized the way to determine the degree of mechanical stability of each fracture fixation technique in order to be able to compare them. Table 1 (A, B, C) specifies the standardization process in order to assess the degree of fracture stability in each patient on the basis of the type of surgery performed. Two levels of stability were assigned for each surgical technique, either optimum or not optimum stability. Optimum stability was defined if all the three rules defined from the outset were respected; otherwise, the degree of stability was defined as not optimum. The degree of mechanical stability present was assessed by the evaluation of post-operative radiographs [18,19,20,21,22,23,24,25,26,27,28].

  7. vii.

    Presence of post-fixation fracture gap. It was defined as the existing residual gap between the bony fragments after surgical fixation [29]. Evaluation was based on the post-operative radiographs by measuring the gap size at the point of the greatest gap regardless of the projection (AP or lateral). Patients were stratified based on whether their gap was less than or equal to 4 mm or greater than 4 mm.

  8. viii.

    Smoking habit. Smoking has been associated with a higher incidence of non-union [30]. Smoking was assessed from the medical records and where necessary by interviewing the patients in the clinics. We considered as smokers patients smoking more than five cigarettes per day [31].

  9. ix.

    Soft tissue damage. Both closed and open fractures were considered for this variable. Regarding closed fractures, soft tissues were graded as damaged in case of internal degloving, a condition that occurs in association with fractures with a degree of displacement greater than 100% of the shaft width or in case of type C complex fractures. Internal degloving in closed fractures was evaluated on the first radiograph taken. On the other hand, opened fractures were graded as having inherent soft tissue damage from the outset irrespectively of the grade of injury [32].

  10. x.

    Type of fracture. According to the AO classification [33] of the diaphyseal fractures, we contemplated three main fracture types: simple (type A), wedge (type B), and complex (type C). Wedge or complex types of fracture present a higher risk of non-union than simple fracture types. Type of fracture was evaluated on the first radiograph taken.

Table 1 Diagram for evaluation of mechanical stability

Bias

Cases were selected from consecutive records of the registries. Controls were randomly selected from the same time period. In this way, selection bias was minimized. There was no concern regarding recall bias since all factors were collected from patient notes/records. Records were only included if there was complete data available, so it is possible that the study patients were not representative of all patients in the registries. The requirement for full records was the same for both the study and the control groups respectively.

Statistical analyses

The risk factors for non-union as previously stated were established from the clinical literature. The task here was to determine the relative effect sizes when all were considered concurrently. This was achieved with a multivariable logistic regression.

Tabulation of the outcome (non-union/union) against each risk factor in turn (univariable analysis) was carried out. Adjusted effect sizes were determined using a multivariable logistic regression with all ten factors concurrently. Adjusted odds ratios and confidence intervals were derived using Wald’s method.

Having established the most important risk factors, a non-union risk index was proposed as the count of the risk factors present in each patient. This follows the approach taken by Clegg et al. [34] in the development and validation of a frailty index and is supported in the statistical literature by, for example, Dawes [35]. The aim was to develop a robust index which can be quickly calculated in routine clinical care.

To validate the non-union risk index, the Hosmer–Lemeshow test was applied. The performance of the usual logistic regression weighting was also assessed to demonstrate that the index loses little discriminatory power.

All statistical analysis was undertaken with R statistical software version 3.3.2 [36], and the Hosmer–Lemeshow test used the R library 0.2–6 [37].

The resulting clinical decision support rule was call the Leeds–Genoa Non-Union Index: LEG-NUI.

Results

A total of 338 study cases and 254 randomly selected controls were identified from the hospital registries. Of these, 238 cases and 154 controls were excluded due to non-eligibility based on the inclusion–exclusion criteria or to having incomplete data. Consequently, 100 eligible cases and 100 eligible controls were identified. Thus, in total, 200 participants were included for statistical analysis and formed the basis of this study (Fig. 1).

Fig. 1
figure 1

Participant flow diagram

Demographic and risk factors for the study participants are tabulated in Table 2, complete with a Pearson’s chi-square test to permit easier comparison between the union and non-union groups.

Table 2 Tabulation of demographic and risk factors by non-union outcome

The results from fitting a logistic regression with the ten risk factors are given in Table 3, which provides both adjusted and unadjusted odds ratios and 95% confidence intervals. No demographic factors were included in the model since they did not substantially alter the model.

Table 3 Unadjusted and adjusted odds ratios for 10 risk factors associated with non-union

A receiver–operator curve was calculated and the area under the curve was 0.938. At the optimal cut point, the sensitivity was 91% and the specificity 82%. It is clear, however, that the evidence from smoking status and vascularization area was limited. So, these factors were dropped and the fitted model became that as expressed in Table 4.

Table 4 Unadjusted and adjusted odds ratios for 8 risk factors associated with non-union

Note that the area under the curve was reduced only very slightly to 0.937, and at the optimal cut point, the sensitivity was 87% and specificity 86%. These eight risk factors all had large effect sizes and good statistical significance. The ROC curve for the logistic regression on eight factors is shown in Fig. 2. The Hosmer–Lemeshow test with ten groups yielded a chi-square value of 4.37 on eight degrees of freedom, p = 0.822, confirming a good fit.

Fig. 2
figure 2

ROC curve for rule derived from 8-factor logistic regression

The non-union index, which was the sum of the risk factors for each patient, performed almost as well. The ROC curve is shown in Fig. 3, area under the curve 0.924, sensitivity 91%, specificity 77%—see Tables 5 and 6. For the non-union index, the Hosmer–Lemeshow test with ten groups yielded chi-square = 1.26, p = 0.996.

Fig. 3
figure 3

ROC curve for the non-union index derived the sum of the risk factors

Table 5 Classification table for non-union index
Table 6 Non-union index by fracture outcome

In order to demonstrate the utilization of the non-union index, three cases have been selected. The first case refers to a tibial non-union in a male patient 39 years of age who sustained an open tibial fracture whilst playing football (Fig. 4). The non-union index score was 7 (Table 7). The second case refers to a 63-year-old female patient who sustained a distal 1/3 femoral fracture after a fall (Fig. 5). Here, the non-union index was 5 (Table 8). The third patient, a 44-year-old female who sustained a closed tibial fracture after a fall (Fig. 6), computed a non-union index score of 1 (Table 9).

Fig. 4
figure 4

Patient 1: a Preoperative x-rays of distal tibial fracture. b Post-operative x-rays with intramedullary nail fixation. c 8-month follow-up films showing distal tibial non-union. The score calculated was 7 points (see Table 7)

Table 7 Non-union index evaluation for H.C.L. patient
Fig. 5
figure 5

Patient 2: a Preoperative x-rays of distal third femoral diaphyseal fracture. b Post-operative x-rays with a locking plate with screws and a cerclage wire. c 9-month follow-up x-rays showing distal femoral non-union with implant breakage. The score calculated was 5 points (see Table 8)

Table 8 Non-union index evaluation for S.M. patient
Fig. 6
figure 6

Patient 3: a Preoperative x-rays of distal third tibial diaphyseal fracture. b Post-operative x-rays with an intramedullary nail. c 7-month follow-up x-rays showing distal tibial bone healing. The score calculated was 1 point (see Table 9)

Table 9 Non-union index evaluation for B.M. patient

Discussion

Ten factors being post-surgical fracture gap > 4 mm, presence of infection superficial or deep, suboptimum mechanical stability, initial displacement of more than 75% of bone shaft width, fracture of the tibia rather than the femur, soft tissue damage, open method of fracture reduction, wedge or comminuted type of fracture, smoking habit, and fracture location in an area of low vascularization of the bone were evaluated. The risk factors of smoking and area of vascularization were dropped, leaving 8 risk factors that comprised an index on non-union.

The non-union index (NUI) predicts union for 0–4 risk factors and non-union for 5–8 risk factors. In practice though, early intervention might be strongly motivated when there are at least six risk factors present. All risk factors will be observable during the early outpatient clinic follow-ups following discharge, so that appropriate intervention can be undertaken three months post-operation rather than ‘wait and see’ for nine months.

Had early intervention been taken for the 47 of 200 patients included in this study, then benefit would have been seen for 46 of the 47. The other would have undergone an unnecessary operation, but may have also benefitted from increased speed of union.

The risk factors are either properties of the fracture or features of the primary surgery, apart from infection which may be modifiable through treatment.

Consequently, LEG-NUI has the great advantage of being simpler to incorporate and use in the clinical setting. Moreover, it would allow the development of a mobile APP, thus simplifying further its use during patient consultation in the clinic environment.

Effectively, a clinical decision rule or clinical decision support system (CDSS) is a system that is mainly conceived to assist physicians in taking care of patients especially helping them with clinical decision-making [38]. Although some systematic reviews have not found a real benefit in the use of CDSS [39, 40], other studies have found tangible benefits related to the use of them. For instance, Garg et al. [41] conducted a systematic review in order to assess the effectiveness of CDSS in 100 randomized and nonrandomized controlled trials. They found that the use of CDSS improved health professionals’ performance in 64% of the analysed studies. Similarly, Kawamoto et al. [42] also conducted a systematic review analysing the benefits brought by the use of CDSS in 70 randomized controlled trials. They were able to show that CDSS improved patient care and clinical practice in a high percentage (68%) of studies taken into account. Authors concluded that an important benefit of having a CDSS is that it does not provide only assessments but practical recommendations for caring of patients. In addition, the CDSS may influence decision-making since it constitutes an effective aid during the moment of a patient’s care, rather than before or after it; another important benefit of having CDSS at disposal during clinical decision-making is an improvement in the adherence to the guidelines in a long-term period [43]. Evans et al. [44] demonstrated that the use of a CDSS reduced the quantity of drugs given to the patients and their mean therapy duration time. Moreover, they found CDSS having an effective and significant impact in decreasing of costs, in terms of cost of the administered drugs and of the hospital costs, even expressed with a reduction of the mean duration of the hospital stay [44].

These findings can be applied to non-union of long-bone fractures and to the whole diagnostic and therapeutic process related to this disease. This highlights how the use of a CDSS to early diagnose and to prevent the onset of the non-union disease can be proactive to help patients to minimize the incapacity and disability-related problems and reduce health service costs.

The impact of non-union on the health-related quality of life of the patients affected by this disease is devastating. Two different recent studies [45, 46] evaluated this parameter on a big sample of patients (237 and more than 800 respectively), and they found that patients would have been disposed to trade, on average, 32% and 37% of their remaining life, in return for good health. Brinker et al. [45] found quality of life associated with tibial non-union to be comparable to the end-stage hip osteoarthritis one, whilst Schottel et al. [46] found quality of life associated with non-union as worse than the one associated with different major medical affections such as stroke, myocardial infarct, and AIDS.

Further, as suggested by Mundi et al. [47] in a commentary on Schottel et al. [46] work, the devastating economic and social consequences related to non-union are also given by the fact that most of those who develop this disease are patients at the height of their productive age. Therefore, early recognition of patients with non-union risk factors would be extremely useful for orthopaedic surgeons in order to implement close monitoring and an eventual prompt intervention to minimize risk of non-union. This kind of preventive approach would completely revolutionize the course of such a relevant and disabling disease.

It is noteworthy that such an approach has not been feasible until now not only because such an identification tool has not been available but also due to the fact that a lack of consensus in defining non-union exists [48] [49].

However, the big innovation brought by this proposed CDSS comes from the fact that it would make the preventive approach widely achievable and affordable by every orthopaedic surgeon. This does not mean that it will replace or undermine each doctor’s own clinical judgement independence, but it rather would constitute an aid, an available and ready-to-use weapon at the disposal of every orthopaedic surgeon.

Note that the assessment of the decision rule was not restricted to comparing the area under the curve, which can be insensitive [50]. Rather, evaluation considered misclassification tables, sensitivity, and specificity and balanced these against robustness and ease of use.

The risk factors considered in this study were strongly supported by the clinical literature, which was comprehensively reviewed [10, 51,52,53,54]. This study then sought to code these risk factors simply so that a simple decision rule could be derived. It is possible that more detailed consideration of each factor could contribute more information and improve the decision rule. On the other hand, the decision rule would become more complicated and therefore less applicable in clinical settings for which a simple clear guide is required.

Only records with complete details of the ten factors and the outcome of fracture healing were included in the study. These patients may have had fuller records due to concerns over their healing and therefore all at greater risk of complications such as non-union. In some respects, this potential bias strengthens our findings regarding sensitivity but may have resulted in specificity lower than would have been seen in a more representative sample (where some patients had incomplete records). Missing data though may not have been ‘missing at random’ and subsequent analysis far more complex. At the stage of deriving a suitable decision rule, as in this study, we believe that this bias is acceptable.

Two hospital sites were included for recruitment purposes. To seek more generalizability and further assessment of this decision rule, a study with a much larger sample size from multiple centres is recommended.

The fact that the CDR/LEG-NUI is based on the simple count of risk factors and is so easy to apply in the clinical environment constitutes a great strength of this study. It promises to be a powerful aid for assessing fracture fixation outcome and to support early intervention. Prediction of such an important clinical condition is envisaged to bring benefits for both commissioners and for users of health services.