Introduction

Spine surgery involves a wide range of techniques and procedures with varying degrees of complexity when compared with, for example, hip or knee arthroplasty, where the procedures are more standardized and can be compared more easily [1]. There are large international and regional variations in the treatment of lumbar spinal disorders regarding the indications for surgery and extent of surgical procedures used [2, 3].

The “invasiveness” (magnitude and complexity) of a spinal surgical procedure has a significant influence on other characteristics of the operation [4]. For example, it has been shown that in patients undergoing lumbar decompression and arthrodesis, there is a significant increase in blood loss with an increasing number of segments operated on [5, 6]. Intraoperative blood loss has been shown to be the most significant risk factor for early perioperative complications [7]. The degree of technical difficulty, the number of segments operated on and the use of different approaches not only influence blood loss, but also the duration of the operation. Nuttall et al. [6] showed an increase of operative time per level fused of 114 ± 75 min. Rates for surgical site infections and complications in general are significantly higher with increased operative time [8, 9].

As the aforementioned studies show, there is a significant correlation between the invasiveness of spinal procedures and the surgical outcome of an operation (i.e. blood loss, operative time and complication rate). Being able to quantify the invasiveness of an operation and its possible risks in terms of complications is important in the decision-making process for patients and surgeons. Having a valid and applicable index is also valuable for comparing outcomes and complication rates across studies and in benchmarking activities, such as those typically carried out in registries [10, 11, 12].

Mirza et al. [4] developed and validated an invasiveness index for spinal surgery which was published in 2008. The method counts a maximum of six possible interventions (anterior decompression, anterior fusion, anterior instrumentation, posterior decompression, posterior fusion, posterior instrumentation) on each vertebral level, and the invasiveness index is given by the sum of all interventions on all addressed levels. The authors evaluated the association between the index score and blood loss and operating time in 1723 cases. They found an increase in blood loss of additional 11.5% (calculated by natural log transformation) and an increase of operating time of 12.8 min for each unit increase in the index.

To the best of our knowledge, the index has not been validated in any other datasets. A large international spine registry, such as the Spine Tango registry of EUROSPINE, is ideal for evaluating the validity of such an invasiveness index. It is a registry with detailed information regarding the clinical history, pathology, surgical measures, surgical outcomes and adverse events/complications during the hospital stay. The data are recorded in a systematic fashion, using standardized forms, by the operating surgeons.

The aim of this study is to evaluate the validity of a modified version of the Mirza invasiveness index (mMII) using surgical data from the international Spine Tango registry.

Methods

Patients

The study data were extracted from the Spine Tango registry of EUROSPINE [13]. This international registry contains patient and physician-based data, collected prospectively, from 53 clinics in 17 countries. The main contributing countries (with more than 1000 submitted cases each) were: the UK, Switzerland, Germany and Belgium. Ethics Committee approval was obtained for the use of the registry data for the study (BASEC-Nr. 2018–01033).

Figure 1 shows the flow of patients through the study forming the final study group. The registry was searched for Spine Tango version 2011 Surgery forms, which were officially in use from January 2012 to July 2017 (n = 57,045). All patients with surgery of the lumbar spine (Th12-S1) for degenerative disease were included (n = 38,400). Procedures with dynamic stabilization (n = 4412) were excluded, as they were in the original paper [4]. Forms with missing data for the essential predictor variables (see below) were also excluded (n = 10,907). If a patient had multiple surgery records, only the first record available was included and the remainder (n = 1436) were discarded. Forms with obviously false data, such as the number of operated segments being clearly too high for the short operative time documented, were excluded (n = 11). The total number of patients included was 21,634.

Fig. 1
figure 1

Selection of the study population

Spine Tango documentation forms

The Spine Tango 2011 Surgery form, completed by the operating surgeon, contained detailed information about the surgery, documented using standardized fields.

To calculate the mMII (see later), data from the following Tango Surgery fields were used: anterior or posterior access; surgical measures (decompression, fusion, rigid stabilization); the specific surgical measures used (for example, discectomy, laminectomy, interbody fusion {TLIF}, interbody stabilization with cage); and number of segments operated on. Table 1 shows the manner in which the specific procedures on the Spine Tango Surgery form were attributed to one of the six surgical components.

Table 1 Summary showing how the specific procedures listed on the Spine Tango Surgery form were attributed to the six surgical components defined by Mirza et al. [1]

As dependent variables to be predicted by the mMII, data from the fields operative time in hours (categories of < 1, 1–2, 3–4, 4–5, 5–6, 6–8, 8–10, > 10 h) and blood loss in millilitres (categories of < 100, 100–500, 500–1000, 1000–2000, > 2000 ml) were used. Operative time and blood loss were selected as dependent variables, because they are easily obtainable for every operation and were expected to be closely linked to the procedure and its invasiveness [6,7,8,9]. Intraoperative and postoperative complications (grouped as either surgical or general medical) were additional dependent variables to be predicted. The following surgical complications were documented on the Tango Surgery form as either present or absent: nerve root damage, spinal cord damage, dural tear, vascular injury, fracture of vertebral structures, epidural hematoma, other hematoma, radiculopathy, CSF leak, motor dysfunction, sensory dysfunction, bowel/bladder dysfunction, wound infection, implant malposition, implant failure, wrong level, other surgical complication. Dural tear, the most common surgical complication, was also investigated separately. The following general medical complications were similarly documented as present or absent: anaesthesiological, cardiovascular, pulmonary, thromboembolism, death, cerebral, kidney/urinary, liver/GI, other general medical complication.

Data on potential confounders were also extracted from the Tango Surgery form to be used as control variables in the statistical models (see below). These included: age, gender, body mass index (BMI; categories of < 20, 20–25, 26–30, 31–35, > 35 kg/m2), comorbidity (American Society of Anaesthesiologists Physical Status Score; ASA categories 1–5) and number of previous spine surgeries (categories of 0, 1, 2, 3, 4, 5, > 5).

Calculation of the mMII

The original formula used by Mirza et al. [4] to calculate the invasiveness index was as follows:

“A value of 1 unit was given for each surgical component performed at each vertebra: anterior decompression, anterior fusion, anterior instrumentation, posterior decompression, posterior fusion, posterior instrumentation. The score for each surgical component indicated the total number of vertebrae at which that procedural element was performed. The index was defined as the sum of the six surgical component scores. Spinal procedures that did not involve decompression, fusion, or instrumentation, such as irrigation and debridement, were assigned an index value of 0.”

Table 1 shows how the surgical procedures are defined.

The Spine Tango Surgery form (version 2011) does not specify what procedure was performed on how many segments. We therefore had to assume that the answer in the field “number of segments” applied to all documented procedures (decompression, fusion, stabilization) even though this was unlikely to be the case in all patients. Because of this difference, we refer to our index as a modified version of that proposed by Mirza et al. [4].

Statistical analysis

Descriptive data are presented as means ± standard deviation (SD) or percentages of the respective patient group. Multiple regression analyses were performed to investigate the influence of the mMII (independent variable; see above) on firstly, operative time and secondly, blood loss (dependent variables) while controlling for possible confounders (independent variables). All variables were entered into the model simultaneously. Since the operative time is recorded on the Spine Tango form as a range (e.g. 1–2 h), for the statistical analysis, the mean of the range (e.g. 1.5 h for the category 1–2 h) was used, to allow the regression coefficients to be interpreted in real units of time. For operative time > 10 h, we used the value 11 h. The same was done for blood loss, also recorded as a range that was converted to the mean of the range (e.g. the category 100–500 millilitres resulted in the mean of 300 millilitres) for statistical analysis. For blood loss > 2000 millilitres, we used the value 3000 millilitres.

A natural log transformation was used for the blood loss values, as was also done by Mirza et al. [4], because the distribution of values was skewed. The results of the multiple regression analyses are presented as regression coefficients (B) with 95% confidence intervals (95%CI) and corresponding p values. The regression coefficients in the regression models with log-transformed variables can be interpreted as percentage change of the dependent variable with one unit change in one independent variable, while other independent variables are held constant (e.g. 1 unit increase in the mMII {B = 0.128} leads to 12.8% increased blood loss when the other independent variables are constant). The regression coefficients for blood loss cannot be interpreted directly as change of blood loss in millilitres due to the log transformation.

Logistic regression models were built to investigate the association between the mMII and perioperative complications (surgical and general medical). The occurrence of complications was the dependent variable, and the mMII, patient age, number of previous spine surgeries, BMI, ASA score and gender were independent variables.

Statistical significance was accepted at p < 0.05. Statistical analysis was carried out using SPSS version 25 (IBM Corp., USA, 2017).

Results

Study population

The baseline characteristics of the final study population consisting of 21,634 patients are shown in Table 2. Approximately one-third of the patients had a disc herniation, one-third spinal stenosis and the rest had other degenerative diseases (Table 2). The mean mMII for all patients was 3.9 (SD 5.0) with a range from 0 to 40 (Fig. 2).

Table 2 Study population with baseline characteristics
Fig. 2
figure 2

Distribution of mMII in study population. Note: less frequent mMII values (< 1% of the study population) were summed in “Other”

mMII versus blood loss

The mean blood loss was 207 (SD 307) ml with a range from 50 to 3000 ml, and the median blood loss was 50 ml. Table 2 shows the distribution of category answers for this variable. The multiple regression model showed a coefficient of determination (R2) of 0.43, meaning 43% of the variance in blood loss was accounted for by the model. The mMII itself explained a significant 34% of the variance in blood loss. An increase of 1 point in the mMII was associated with an additional blood loss of 12.8% (95% CI 12.6–13.0; p < 0.001) (Table 3). Due to the log transformation of the blood loss values, the regression coefficients are interpreted as additional blood loss in percent change.

Table 3 Output from multiple regression in all patients, with blood loss (natural log transformed) as dependent variable

mMII versus operative time

The mean operative time was 101 (SD 80) min with a range from 30 to 660 min. Table 2 shows the distribution of category answers for operative time. The multiple regression model showed a R2 of 0.47, meaning 47% of the variance in operative time was accounted for by the model. The mMII uniquely explained a significant 38% of the variance in operative time. An increase of 1 point in the mMII was associated with an increase of operative time of 10.4 min (95% CI 10.20–10.53; p < 0.001) (Table 4).

Table 4 Output from multiple regression in all patients, with operative time as dependent variable

mMII versus complications

Perioperative surgical complications occurred in 2117 (9.8%) and general medical complications in 563 (2.6%) patients. Simultaneous surgical and general medical complications occurred in 138 (0.6%) patients. Logistic regression models confirmed that the mMII was a predictor of general medical but not surgical complications (Tables 5, 6, respectively). The odds ratio for surgical complications associated with the mMII was 1.002 (95% CI 0.994–1.011; p = 0.63) and for general medical complications, 1.046 (95% CI 1.032–1.060; p < 0.001). This means that with every additional index point, the odds of a general medical complication increase by 4.6%. The mMII was also a significant predictor of any complication (of either a surgical or general medical nature, as shown in Table 7) with an odds ratio of 1.013 (95% CI 1.005–1.021; p = 0.001).

Table 5 Logistic regression for general medical complications
Table 6 Logistic regression for surgical complications
Table 7 Logistic regression for any complications (surgical and general medical combined)

Discussion

The present study showed that the mMII can easily be calculated from the information documented in the Spine Tango registry. The mMII showed a significant association with parameters that indirectly reflect the invasiveness of an operation such as blood loss and operative time. Furthermore, it was associated with perioperative complications. We therefore believe it is a valid measure of surgical invasiveness that can be applied for use in future registry studies.

Our study showed an overall mean mMII of 3.9 (SD 5.0) with a range from 0 to 40. Mirza et al. [4] had two much smaller study groups with younger patients and a wider range of pathologies of the whole spine, including degeneration, trauma, neoplasm, infection and revision surgery. They reported a mean original Mirza invasiveness index of 10.5 (in an active surveillance group, n = 873) and 5.4 (in a passive surveillance group, n = 850), with a range from 0 to 48. The difference in the two surveillance groups was explained by the fact that the high-volume surgeons in the active surveillance group performed more complex procedures in contrast to the low-volume surgeons in the passive surveillance group. In terms of the procedures performed and their respective invasiveness, our study group was closer to Mirza et al. [4] passive surveillance group.

Comparing the studies in regard to the prediction of blood loss, Mirza et al. [4] showed a R2 for their model of 0.54, while the R2 in our study was 0.43. This indicates that 43% of the variance in blood loss was accounted for by the variables included in our multivariate model. In a univariate model, Mirza et al. [4] reported a R2 of 0.44 for the index alone, and in our study, a comparable value of 0.41 was found. An increase of 1 point in the index was associated with an additional blood loss of 12.8% (95%CI, 12.6–13.0) in our study and 11.5% (95%CI, 10.8–12.3%) in the study of Mirza et al. [4]. In relation to the prediction of operative time, Mirza et al. [4] showed an R2 of 0.53, while ours was 0.47; in univariate regression, Mirza et al. [4] reported a R2 of 0.52 for the index alone, which compared well with our R2 of 0.45. The beta coefficients in the two studies were very similar: an increase of 1 point in the mMII was associated with an increase of operative time of 10.4 min (95% CI, 10.2–10.5) in our study versus 12.8 min (95% CI, 11.8–13.7) in Mirza et al. [4] The lower R2 values in our study might be caused by a more heterogeneous study population because of the data originating from 53 different clinics and 17 different countries versus only 2 clinics in the original publication. However, the differences between the studies in R2 for operative time and in the change of operative time per unit increase in mMII are rather small and of questionable clinical relevance.

Mirza et al. [4] did not evaluate the association between the invasiveness index and perioperative complications. Campbell et al. [14] recently showed that both the number of segments fused and a more invasive approach (combined antero-posterior vs. only anterior or only posterior approach) correlated with a higher rate of perioperative complications. Lee et al. [15] investigated the postoperative occurrence of medical complications after spine surgery. They used the original Mirza invasiveness index to account for the invasiveness of the procedure. Their study showed a strong dose response of the invasiveness index with a postoperative medical adverse event, and their reported odds ratio of 1.063 (95% CI 1.045–1.082) [15] compared well with ours (1.046 {95% CI 1.032–1.060}). These odds ratios indicate that a unit increase in the mMII of one point increases the odds of a complication by 5–6%. This is a medium effect size for a 1-point increase in the mMII, but with an increase of multiple points in the mMII, the effect size grows. However, complications are dependent on a multitude of factors, do not occur with every surgery and are hence much more difficult to predict than, for example, blood loss and operative time. Also, the reporting of complications might be subject to variability within the registry as “what constitutes a complication” leaves some room for interpretation.

Our study was performed retrospectively with a large sample of patient data from a registry. This resulted in a number of limitations. The data in the registry were collected in 17 countries and 53 clinics. Operative procedures and techniques can vary considerably from country to country and even from clinic to clinic. For example, a posterior lumbar stabilization can be performed freehand, guided by fluoroscopy or with image-guided navigation [16]. Because of the setup and the technical aspects, this might result in considerably different operating time and blood loss for the same number of segments stabilized. With regard to this fact, it is surprising that our results are so close to those published by Mirza et al. [4] who included all operations carried out in just two clinics affiliated with one university. The large number of clinics in our study comes with a large number of operating surgeons with different levels of training and experience. In a future study, one could investigate the difference in the treatment for a specific pathology regarding invasiveness between countries or clinics within the registry.

The Spine Tango Surgery form (version 2011) is very detailed, but not designed specifically to calculate the original Mirza invasiveness index. For example, one documents whether a decompression, fusion or stabilization was performed, whether it was anterior or posterior, and the total number of segments operated on for the whole surgery, but it is not specified what procedure was performed on how many segments. The indicated procedures were multiplied by the total number of segments operated on for calculation of the mMII, assuming that each procedure was done on each operated level. However, in reality, this was unlikely to be the case. Because of this, our calculated mMII probably overestimates the true invasiveness and may have introduced some nonlinearity, since the effect would be expected to be more marked for the multilevel surgeries. The result would be that the effective R2 for blood loss and operative time would have been higher than calculated, and so too would be the increase of operative time and blood loss per unit mMII. The realization of this shortcoming led to improvements in the Spine Tango surgery form in its recent iteration (version 2017). The form is now more detailed, and for every category of procedure (decompression, fusion, stabilization), the exact vertebral level(s) on which it was performed is recorded. However, it still leaves some uncertainty if more than one specific type of procedure within a given category is indicated (e.g. both “interbody stabilization with cage” and “pedicle screws” within the “stabilization rigid” category). To ensure reliable results when calculating the mMII in future studies, it would be advisable to encourage documentation of exactly which procedures were performed on each vertebral level. To simplify the calculation of the mMII in the present study, we assumed that the lumbar interbody fusions (PLIF, TLIF, XLIF) were always performed with anterior and posterior fusion, whereas in reality, this might not have been the case and some may have been done without posterior fusion.

The Spine Tango Surgery form accounts for blood loss in categories (< 100 ml, 100–500 ml, 500–1000 ml, 1000–2000 ml, > 2000 ml). For the statistical analysis, the mean values of each category were used to allow the regression coefficients to be interpreted in real units of volume, e.g. 300 ml for the category 100–500 ml. The same was done for operative time, which was also recorded as a range (e.g. 2–3 h) but converted to the mean of the range (i.e. 2.5 h) for statistical analysis. It was assumed that the real values of blood loss and operative time are normally distributed within those categories. If they are not, and they are either negatively or positively skewed, then there will be a bias by taking the mean.

To the best of our knowledge, the original Mirza invasiveness index has not previously been validated in any external data sets. It is important to confirm a new model or index in an external data set to show that it performs well outside of the development set [17]. Our study indicates that a version of the index, adapted for use with Spine Tango registry data, is a valid measure of the invasiveness of common degenerative spine surgery procedures. The index should be of use in future registry and benchmarking studies to control for surgical complexity. It may also constitute a valuable predictor variable in the development of comprehensive predictor models for complications, length of stay, revisions and possibly even patient outcomes.

Conclusion

The mMII showed similar strength in predicting important perioperative parameters such as blood loss and operative time to that reported in the original publication of Mirza et al. in 2008 [4]. As our study showed that the mMII can be used for a broader spectrum of data collected internationally, we consider it valid and generalizable. In future studies, the more detailed information documented on the new Spine Tango Surgery form (2017) can be used to investigate whether the variance (R2) in blood loss and operative time explained by the mMII can be improved.