The challenge of measuring spinopelvic parameters: inter-rater reliability before and after minimally invasive lumbar spondylodesis

Hohenhaus, Marc; Volz, Florian; Merz, Yorn; Watzlawick, Ralf; Scholz, Christoph; Hubbe, Ulrich; Klingler, Jan-Helge

doi:10.1186/s12891-022-05055-9

The challenge of measuring spinopelvic parameters: inter-rater reliability before and after minimally invasive lumbar spondylodesis

Research
Open access
Published: 31 January 2022

Volume 23, article number 104, (2022)
Cite this article

Download PDF

You have full access to this open access article

BMC Musculoskeletal Disorders Aims and scope Submit manuscript

The challenge of measuring spinopelvic parameters: inter-rater reliability before and after minimally invasive lumbar spondylodesis

Download PDF

Marc Hohenhaus ORCID: orcid.org/0000-0002-8743-3161¹,
Florian Volz¹,
Yorn Merz¹,
Ralf Watzlawick¹,
Christoph Scholz¹,
Ulrich Hubbe¹ &
…
Jan-Helge Klingler¹

1086 Accesses
2 Citations
Explore all metrics

Abstract

Background

The common manual measurement technique of spinal sagittal alignment on X-rays is susceptible to rater-dependent variability, which has not been adequately considered in previous publications. This study investigates the effect of those variations in the characterization of patients receiving lumbar spondylodesis.

Methods

General alignment parameters on pre- and postoperative X-rays were evaluated by four raters in 43 prospectively sampled patients undergoing monolevel spondylodesis. The Intra-class Correlation Coefficient (ICC) for each rater pair and all raters together was calculated for inter-rater reliability. For the operation-induced change of the sagittal alignment in every patient the Wilcoxon test was applied to compare for each rater separately.

Results

The ICCs were “good” (>0.75) to “excellent” (>0.9) for all raters together and for 45 of the 48 single rater pairs (93.75%). All revealed a significant increase of the addressed segmental lordosis and disc height and no significant change for spinopelvic parameters and sagittal vertical axis from pre- to postoperative. The lumbar lordosis showed a significant increase through the operation of +2.5° (p = 0.014) and +3.7° (p = 0.015) in two raters and no difference for the other ones (+2.1°, p = 0.171; -2.2°, p = 0.522).

Conclusions

The pre- to postoperative change of lumbar lordosis revealed different significance levels for different raters, although the ICCs were formally good. Accordingly, the evaluation by only one rater would lead to different conclusions. Due to this susceptibility of alignment measurements to rater-dependent variability, the exact evaluation process should be described in every publication and the consistency of significant results be validated through multiple raters.

Trials registration

The trial was approved by the local ethics committee and listed at the national clinical trials register (DRKS00004514, date of registration: 08/11/2012).

View this article's peer review reports

Radiographic Classification for Degenerative Spondylolisthesis of the Lumbar Spine Based on Sagittal Balance: A Reliability Study

Article 01 July 2018

Moderate Interrater and Substantial Intrarater Reproducibility of the Roussouly Classification System in Patients With Adult Spinal Deformity

Article 01 March 2019

Reproducibility of thoracic kyphosis measurements in patients with adolescent idiopathic scoliosis

Article Open access 21 February 2017

Background

The evaluation of sagittal spinal alignment parameters in lumbar degenerative spine surgery is of extensive interest. Pathologic alterations are on the one hand partially responsible for the development of degenerative diseases and on the other hand relevant for treatment planning to reach an optimal outcome of affected patients [1,2,3]. Diverse surgical strategies imply variable modification options of the spinal alignment, so that a preoperative evaluation is clearly recommended [2, 4]. The extent taking these parameters into account for the individual treatment strategy is discussed controversial.

The radiological assessment is usually still done manually on plane X-ray images, which is prone to rater-dependent variation [5]. The dimension depends on the parameter and is reported up to an average of 10° for spinopelvic and lumbar lordosis angles [6,7,8]. For thoracic and cervical parameters, variations of more than 10° are described and dependent on the body position [9]. Causative seem to be the subjective measurement technique itself as well as the image quality, the individual position and configuration of the patient [5]. There have been several imaging standardization attempts, e.g. by the use of whole spinal imaging systems, like the EOS^® system, with reported increases of image quality and reduced radiation exposure [10, 11]. Some studies reported advantages for software-based parameter evaluations, whereas a fully automated assessment still does not exist [6, 12, 13]. There is no “gold standard” for detecting the real value of the sagittal alignment and no clear recommendation for standardized evaluation of spinal alignment yet.

An impact of such an inter-rater variation on study outcomes can be expected. The discussion of this relevant bias within previous as well as recent publications is heterogeneous. Many authors refer no information on this issue or report data of single raters [14,15,16,17,18,19,20,21,22,23,24,25,26]. Only few publications include a more detailed statement of the number of raters and evaluation procedure but rarely with a clear statement of inter-rater reliabilities [6, 27,28,29]. The quality of each publication is narrowed through an absent detailed statement concerning the evaluation procedure and the associated variability. Nevertheless, the precise impact of these variations seems to be unclear, because the reported effect strength is often diminutive.

The purpose of this study is to evaluate the effect of inter-rater variations within the measurement of sagittal alignment parameters in pre- to postoperative characterization of patients with monolevel lumbar spondylodesis. Thus, the relevance of the Intra-class Correlation Coefficient (ICC) should be clarified.

Methods

The aim of the study was to compare pre- and postoperative lumbar sagittal alignment parameters measured by different raters. We assumed that the variability of subjective measurements has a relevant impact on the significance levels of evaluated differences.

Study population

Patient data were sampled within a prospective, single-center, single-arm cohort study [30]. The trial was approved by the local ethics committee and listed at the national clinical trials register (DRKS00004514, date of registration: 08/11/2012). The study was carried out in accordance with relevant guidelines and regulations. In total 50 patients were included in this prospective trial after giving informed consent to participate. Seven patients were excluded for our radiographic evaluation because of the treatment of two lumbar levels, resulting in 43 patients receiving a monolevel, minimally invasive transforaminal lumbar interbody fusion (TLIF) due to a degenerative disease.

Radiographic Evaluation

All patients received directly preoperative and one year postoperative plane X-ray images of the whole spine to evaluate sagittal alignment. X-rays were performed in standardized patient comfortable standing position.

For evaluation of the sagittal lumbar alignment, the following parameters were measured: segmental lordosis (SL) as angle between superior endplate of the upper vertebral body and inferior endplate of the lower vertebral body of the addressed segment; ventral (vDH) and dorsal (dDH) disc height as distances of the ventral and dorsal edge of the treated vertebral disk; lumbar lordosis (LL) as angle between superior plate of L1 and S1; pelvic incidence (PI) as angle between the line of the center of the femoral heads to the center of the S1 endplate and the line orthogonal to the S1 endplate; pelvic tilt (PT) as angle between the line of the center of the femoral heads to the center of the S1 endplate and the reference vertical line; sacral slope (SS) as angle between S1 endplate and the reference horizontal line [7]. For global spinal balance the sagittal vertical axis (SVA) was measured as distance from the posterior superior corner of the S1 endplate to a vertical plumb line dropped from the center of C7 [2]. All parameters are depicted in Fig. 1.

Outcome measurements

Four raters evaluated each alignment parameter: one spine surgeon with more than 30 years of experience (rater A), one senior physician with more than ten years of experience (rater B), one resident (rater C) and one postgraduate student who was instructed in the measurement (rater D). The measurements were done manually on digitized X-rays within IMPAX EE R20 (Agfa HealthCare©) and every rater was blinded to the results of the other ones. For the determination of the inter-rater reliability the ICC was computed for all possible rater pairs (“A/B”, “A/C”, “A/D”, “B/C”, “B/D”, “C/D”) and for all four raters together (“ALL”), resulting in seven ICC values for each spinal parameter (see Fig. 2).

As second step, we divided our measurement results associated to the treatment of all patients into pre- and postoperative values, respectively. The differences induced by the operation were compared for every parameter in each rater separately.

Statistics

Data processing and statistical analysis were performed using IBM SPSS Statistics 25 and the R Project for Statistical Computing. Normal distribution for each variable was assessed by Shapiro-Wilk-Test. The inter-rater reliability for the radiographic outcome parameters was calculated using the ICC for absolute scales with multiple raters [31]. We calculated a two-way mixed-effects model, with single measures and absolute data agreement [32]. All ICC values were qualified according to Koo et al. [32] with an additional color-coding in Table 2: <0.50 = poor (red), 0.50 – 0.75 = moderate (yellow), 0.75 – 0.90 = good (green), >0.90 = excellent (blue).

For comparison of the single rater measurements pre- and postoperatively additionally to the ICC calculation, we added a one-way ANOVA calculation. Homogeneity of variances was asserted using Levene’s Test and revealed consistently equal variances (p > 0.05). Post-hoc-analysis was conducted using Tukey’s test.

We compared pre- to postoperative changes of each spinal alignment value within each rater and the mean of all raters together using the Wilcoxon test for paired samples. P values <0.05 were considered to be statistical significant.

Results

Baseline characteristics

Median age at the time of surgery of all 43 included patients was 57 years (interquartile range - IQR 48 - 69), 18 (41.9%) were male and 25 (58.1%) female. A spondylolisthesis was present in 42 patients with Meyerding grade I in 51.2% (n = 22) or grade II in 46.5% (n = 20). One patient showed no spondylolisthesis (2.3%). Primarily addressed level was L5/S1 (n = 22, 51.2%), followed by L4/5 (n = 17, 39.5%) and L3/4 (n = 4, 9.3%). Median Body-Mass-Index was 26.3 kg/m² (IQR 23.1 - 28.7). All baseline characteristics are summed up in Table 1.

Table 1 Baseline characteristics of all 43 patients, ^aMedian (IQR)

Full size table

Radiographic characteristics

Patients received pre- and postoperative sagittal standing X-rays, resulting in 86 measurements for each parameter. The median time between surgery and follow-up X-ray was 366 days (IQR 365 - 379). In two patients the postoperative X-rays were insufficient for measurement of the spinopelvic parameters and additional three patients had only lumbar standing X-rays after surgery. Therefore the postoperative evaluation of PI, PT and SS was possible in only 41/43 and SVA in 38/43 patients.

Inter-rater reliability

All ICC values are shown in Table 2. The inter-rater reliability was “excellent” (>0.9) for all measurements taking all raters together, except for the dDH with an almost “good” result (0.833, CI 0.713 – 0.899). The SVA showed the strongest agreement (0.995, CI 0.991 – 0.997), followed by PT (0.992, CI 0.989 – 0.995) and PI (0.977, CI 0.966 – 0.984).

For the single rater comparisons, the majority of ICCs were “excellent” too (37/48, 77.1%). Nine ICCs showed a “good” result (18.8%), two a “moderate” (4.2%) and only one a “poor” correlation (2.1%). In all single inter-rater comparisons, the dDH was the worst parameter, whereas the SVA showed the best correlations.

Within the ANOVA comparison of the measured values of all four raters pre- and postoperatively, we found a statistically significant difference only for the dDH pre- and postoperatively (p < 0.001 and p = 0.003). The other alignment parameters showed no significant differences. All values are shown in Supplement 1. The post-hoc-analysis showed an isolated significant difference between rater A and B preoperatively (difference -1.488° (CI -0.456° - -2.521°), p = 0.001) as well as between the following rater pairs postoperatively: A/B (-3.835°, CI -5.176° - -2.494°, p < 0.001), A/C (-3.709°, CI -5.050° - -2.369°, p < 0.001), A/D (-2.428°, CI -3.769° - -1.087°, p < 0.001) and B/D (1.407°, CI 0.066° - 2.748°, p = 0.036). This matches to the lowest ICC values for the dDH.

Table 2 ICC calculations (95% CI) for all raters together and each rater pair. SL = segmental lordosis, vDH = ventral disc height, dDH = dorsal disc height, LL = lumbar lordosis, PI = pelvic incidence, PT = pelvic tilt, SS = sacral slope, SVA = sagittal vertical axis

Full size table

Comparison of pre- and postoperative parameters

The differences induced by the operation were compared for each rater separately (Table 3). All raters showed consistently significant higher SL angles as well as vDH and dDH. For spinopelvic angles (PI, PT, SS) as well as for the SVA no significant changes were detected. Interestingly, we found different statistically relevant values for the LL, resulting in a significant increased postoperative angle in two raters (A: +2.5°, p = 0.014; C: +3.7°, p = 0.015) and no significant difference within the other two raters (B: +2.1°, p = 0.171; D: -2.2°, p = 0.522).

Table 3 Pre- to postoperative differences of the sagittal alignment parameters for each rater separately. ^aMedian (IQR), SL = segmental lordosis, vDH = ventral disc height, dDH = dorsal disc height, LL = lumbar lordosis, PI = pelvic incidence, PT = pelvic tilt, SS = sacral slope, SVA = sagittal vertical axis. Significant values (p < 0.05) are marked in bold type

Full size table

Finally, we calculated the mean values out of all four raters for each alignment parameter and compared the pre- to postoperative changes again, resulting in a significant increase of the LL of +1.4° (p = 0.035), whereas the other parameters showed similar significant tendencies like in all single rater evaluations (see Table 4).

Table 4 Pre- to postoperative differences of the sagittal alignment parameters for the mean values of all four raters together. ^aMedian (IQR), SL = segmental lordosis, vDH = ventral disc height, dDH = dorsal disc height, LL = lumbar lordosis, PI = pelvic incidence, PT = pelvic tilt, SS = sacral slope, SVA = sagittal vertical axis. Significant values (p < 0.05) are marked in bold type

Full size table

Discussion

In our cohort, the inter-rater reliability represented by the ICC was predominantly “good” or “excellent” between all raters for the measurement of sagittal spinal alignment parameters. In spite of the generally good agreement of all raters, there were different significance levels for the change of the LL from pre- to postoperative in patients receiving a monolevel, minimally invasive TLIF, which in summary leads to uncertainty concerning the estimation of these results.

The major problem is that we do not know the true value, because we refer to subjective measurements, which additionally depends on the heterogeneous image quality. There is no “gold standard” for determining the true value. Many publications only offer one rater for sagittal alignment measurements [14,15,16,17,18,19,20,21,22,23,24,25,26]. This seems to be critical because our evaluation showed that the results are rater-dependent and this can change the significance level, so that different raters could come to different conclusions. Taken only the results of rater A or C into account, we might postulate that the LL gets significantly increased through the minimally invasive TLIF, which could be classified as preferable result after this surgery technique. On the other hand, taken the rater B and D into account, no significant change and therefore benefit for the LL through the operation would be postulated.

Some prior studies report inter-rater variations, with mostly “good” or “excellent” ICC values [6, 29]. However, those values, formally reflecting a distinguished inter-rater reliability, may lead to a false sense of security. We could show that even with “good” or “excellent” ICCs in some cases the pre- to postoperative comparisons of each single rater showed different significance levels. This must be taken into account for the interpretation of previous as well as for future studies in the topic of sagittal alignment.

The variability of measurements seems to be independent from the formal “rater expertise”. Our evaluation showed that both senior raters (A and B) came to different results, as well as the two less experienced raters (C and D). Within the clinical practice the parameters for precise surgery planning are predominantly utilized through the experienced surgeons, respectively. But the rater experience in pervious published studies remains often unclear. We could not find a distinct difference concerning the experience as influencing factor in our evaluation.

But how to find the true value?

In our opinion, the data quality can be improved if several raters work on the same data set. It has to be postulated that the higher the number of raters, the better the reliability. To find out how many raters are adequate to reach an acceptable certainty for every parameter remains a statistical challenge. This seems not only to be important for the evaluation of sagittal alignment parameters, but might also be relevant for other manually medical measurements. To determine the minimum count of raters is a future challenge for the statistics and under investigation now. A specific statistical procedure should be developed also taking the magnitude of the effect of the addressed parameter into account. Another solution would be the development of a fully automated evaluation software. Unfortunately there are no such software solutions for sagittal balance parameters yet. There are many software-supported calculations, like the KEOPS®, mediCAD Spine® or Spineview® software, but the key points like S1 endplate or the femoral heads have to be marked manually, which leads back to the problem of rater-dependent variations.

If the ICC calculation of several raters shows “good” to “excellent” results, the measurements generally seem to be reproducible. But our evaluation shows that it is not adequate to rely on this information. As second step when calculating differences between two samples, the statistical evaluation should be done additionally for every rater separately. If the significance levels are reproducible too, the reliability on the accuracy of the results is strengthened. If the significance levels differ, the results have to be handled with caution. Within our cohort, all raters showed similar significant differences for SL, vDH and dDH from pre- to postoperative. Therefore, we can assume that there is probably a underlying effect of the operation. From the spine surgeon point of view this result is perspicuous because of the implanted intervertebral spacer that elevates the disk space and the SL because of the intraoperative dorsal compression. The spinopelvic parameters as well as the SVA showed no significant changes consistent through all raters. This seems to be plausible too, because of monolevel, minimally invasive TLIF procedures have been performed without the goal of a significant alteration of the whole spinal sagittal balance. Crucial seems to be the LL because of different significance levels through different raters. According to that, the statement of a change of the LL through the minimally invasive TLIF must be handled with caution.

To calculate the mean values of the measurements of all raters for the comparison pre- to postoperative might be another solution to increase reliability of the findings by manual measurements (Table 4). This could increase precision, the more raters have participated. For our evaluation we have to postulate, that for the mean of all four raters the minimally invasive TLIF significantly increased the LL about +1.4° (p = 0.035). The other parameters showed consistent significance levels for the mean of all raters like in every single rater separately (Table 4).

A major limitation when evaluating alignment parameters is the heterogeneity of the single X-ray examinations resulting in a different image quality. This depends on the examiner, the position and the individual anatomy of each patient. A comparison with standardized whole spine X-rays, like the EOS® system, would be interesting, but was not part of this evaluation. Additionally a comparison to software-supported measurement methods would be interesting too. Unfortunately, there is no full-automatic computed evaluation program. Furthermore our study is an exclusively radiographic evaluation without associated clinically effects of our measurements.

Conclusions

There was a “good” to “excellent” inter-rater reliability for the most sagittal alignment parameters. All raters detected a consistently significant increase of SL, vDH and dDH and no significant change of the PI, PT, SS and SVA after the operation. Nevertheless the pre- to postoperative change of the LL revealed different significance levels for different raters, although the ICCs were formally sufficient. Accordingly, the evaluation by only one rater would lead to different conclusions: Rater A and C would postulate a significant increase in LL following minimally invasive TLIF, while rater B and D would not detect any significant change.

We conclude that due to the susceptibility of spinopelvic parameter measurements to rater-dependent variability, several actions are recommended to increase reliability when evaluating significant changes: The measurements should be performed by several raters and the agreement should be statistically determined by ICC calculation. Even if the reproducibility is formally excellent, a validation of the consistency of the results in each rater should be included. In case of inconsistent levels of significance, the results should be handled with caution. Further investigations in statistical procedures are needed for the evaluation of subjective measured sagittal alignment parameters.

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Abbreviations

C7PL:: C7 plumb line
dDH:: dorsal disc height
ICC:: Intra-class Correlation Coefficient
IQR:: interquartile range
LL:: lumbar lordosis
PI:: pelvic incidence
PT:: pelvic tilt
SL:: segmental lordosis
SS:: sacral slope
SVA:: sagittal vertical axis
TLIF:: transforaminal lumbar interbody fusion
vDH:: ventral disc height

References

Faundez A, Roussouly P, Le Huec JC. Sagittal balance of the spine: a therapeutic revolution. Rev Med Suisse. 2011;7(322):2470–4.
CAS PubMed Google Scholar
Ames CP, Smith JS, Scheer JK, et al. Impact of spinopelvic alignment on decision making in deformity surgery in adults. J Neurosurg Spine. 2012;16(6):547–64. https://doi.org/10.3171/2012.2.SPINE11320.
Article PubMed Google Scholar
Le Huec JC, Thompson W, Mohsinaly Y, et al. Sagittal balance of the spine. Eur Spine J Off Publ Eur Spine Soc Eur Spinal Deform Soc Eur Sect Cerv Spine Res Soc. 2019;28(9):1889–905. https://doi.org/10.1007/s00586-019-06083-1.
Article Google Scholar
Mehta VA, Amin A, Omeis I, et al. Implications of Spinopelvic Alignment for the Spine Surgeon. Neurosurgery. 2015;76(suppl_1):S42–56. https://doi.org/10.1227/01.neu.0000462077.50830.1a.
Article Google Scholar
Pumberger M, Schmidt H, Putzier M. Spinal Deformity Surgery: A Critical Review of Alignment and Balance. Asian Spine J. 2018;12(4):775–83. https://doi.org/10.31616/asj.2018.12.4.775.
Article PubMed PubMed Central Google Scholar
Maillot C, Ferrero E, Fort D, et al. Reproducibility and repeatability of a new computerized software for sagittal spinopelvic and scoliosis curvature radiologic measurements: Keops®. Eur Spine J. 2015;24(7):1574–81. https://doi.org/10.1007/s00586-015-3817-1.
Article CAS PubMed Google Scholar
Vrtovec T, Janssen MMA, Likar B, et al. A review of methods for evaluating the quantitative parameters of sagittal pelvic alignment. Spine J Off J North Am Spine Soc. 2012;12(5):433–46. https://doi.org/10.1016/j.spinee.2012.02.013.
Article Google Scholar
Polly DW, Kilkelly FX, McHale KA, et al. Measurement of lumbar lordosis. Evaluation of intraobserver, interobserver, and technique variability. Spine. 1996;21(13):1530-1535; discussion 1535-1536. doi:https://doi.org/10.1097/00007632-199607010-00008
Hey HWD, Lau ET-C, Wong GC, et al. Cervical Alignment Variations in Different Postures and Predictors of Normal Cervical Kyphosis: A New Understanding. Spine. 2017;42(21):1614–21. https://doi.org/10.1097/BRS.0000000000002160.
Article PubMed Google Scholar
Dubousset J, Charpak G, Dorion I, et al. A new 2D and 3D imaging approach to musculoskeletal physiology and pathology with low-dose radiation and the standing position: the EOS system. Bull Acad Natl Med. 2005;189(2):287–97 discussion 297-300.
PubMed Google Scholar
Deschênes S, Charron G, Beaudoin G, et al. Diagnostic imaging of spinal deformities: reducing patients radiation dose with a new slot-scanning X-ray imager. Spine. 2010;35(9):989–94. https://doi.org/10.1097/BRS.0b013e3181bdcaa4.
Article PubMed Google Scholar
Berthonnaud E, Labelle H, Roussouly P, et al. A Variability Study of Computerized Sagittal Spinopelvic Radiologic Measurements of Trunk Balance. J Spinal Disord Tech. 2005;18(1):66–71. https://doi.org/10.1097/01.bsd.0000128345.32521.43.
Article CAS PubMed Google Scholar
Vialle R, Ilharreborde B, Dauzac C, et al. Intra and inter-observer reliability of determining degree of pelvic incidence in high-grade spondylolisthesis using a computer assisted method. Eur Spine J. 2006;15(10):1449–53. https://doi.org/10.1007/s00586-006-0096-x.
Article PubMed Google Scholar
Hresko MT, Hirschfeld R, Buerk AA, et al. The Effect of Reduction and Instrumentation of Spondylolisthesis on Spinopelvic Sagittal Alignment. J Pediatr Orthop. 2009;29(2):157–62. https://doi.org/10.1097/BPO.0b013e3181977de8.
Article PubMed Google Scholar
Le Huec JC, Leijssen P, Duarte M, et al. Thoracolumbar imbalance analysis for osteotomy planification using a new method: FBI technique. Eur Spine J. 2011;20(S5):669–80. https://doi.org/10.1007/s00586-011-1935-y.
Article PubMed PubMed Central Google Scholar
Acosta FL, Liu J, Slimack N, et al. Changes in coronal and sagittal plane alignment following minimally invasive direct lateral interbody fusion for the treatment of degenerative lumbar disease in adults: a radiographic study: Clinical article. J Neurosurg Spine. 2011;15(1):92–6. https://doi.org/10.3171/2011.3.SPINE10425.
Article PubMed Google Scholar
Barbagallo GMV, Piccini M, Alobaid A, et al. Bilateral tubular minimally invasive surgery for low-dysplastic lumbosacral lytic spondylolisthesis (LDLLS): analysis of a series focusing on postoperative sagittal balance and review of the literature. Eur Spine J. 2014;23(S6):705–13. https://doi.org/10.1007/s00586-014-3543-0.
Article PubMed Google Scholar
Feng Y, Chen L, Gu Y, et al. Restoration of the spinopelvic sagittal balance in isthmic spondylolisthesis: posterior lumbar interbody fusion may be better than posterolateral fusion. Spine J Off J North Am Spine Soc. 2015;15(7):1527–35. https://doi.org/10.1016/j.spinee.2015.02.036.
Article Google Scholar
Hara M, Nishimura Y, Nakajima Y, et al. Transforaminal Lumbar Interbody Fusion for Lumbar Degenerative Disorders: Mini-open TLIF and Corrective TLIF. Neurol Med Chir (Tokyo). 2015;55(7):547–56. https://doi.org/10.2176/nmc.oa.2014-0402.
Article Google Scholar
Alentado VJ, Lubelski D, Healy AT, et al. Predisposing Characteristics of Adjacent Segment Disease After Lumbar Fusion: SPINE. 2016;41(14):1167–1172. doi:https://doi.org/10.1097/BRS.0000000000001493
Article PubMed Google Scholar
Kong L-D, Zhang Y-Z, Wang F, et al. Radiographic Restoration of Sagittal Spinopelvic Alignment After Posterior Lumbar Interbody Fusion in Degenerative Spondylolisthesis. Clin Spine Surg. 2016;29(2):E87–92. https://doi.org/10.1097/BSD.0000000000000104.
Article Google Scholar
Tessitore E, Molliqaj G, Schaller K, et al. Extreme lateral interbody fusion (XLIF): A single-center clinical and radiological follow-up study of 20 patients. J Clin Neurosci Off J Neurosurg Soc Australas. 2017;36:76–9. https://doi.org/10.1016/j.jocn.2016.10.001.
Article Google Scholar
Kim CH, Chung CK, Park SB, et al. A Change in Lumbar Sagittal Alignment After Single-level Anterior Lumbar Interbody Fusion for Lumbar Degenerative Spondylolisthesis With Normal Sagittal Balance. Clin Spine Surg. 2017;30(7):291–6. https://doi.org/10.1097/BSD.0000000000000179.
Article PubMed Google Scholar
Buckland AJ, Ramchandran S, Day L, et al. Radiological lumbar stenosis severity predicts worsening sagittal malalignment on full-body standing stereoradiographs. Spine J. 2017;17(11):1601–10. https://doi.org/10.1016/j.spinee.2017.05.021.
Article PubMed Google Scholar
Chang HS. Effect of Sagittal Spinal Balance on the Outcome of Decompression Surgery for Lumbar Canal Stenosis. World Neurosurg. 2018;119:e200–8. https://doi.org/10.1016/j.wneu.2018.07.104.
Article PubMed Google Scholar
Nakashima H, Kanemura T, Satake K, et al. Changes in Sagittal Alignment Following Short-Level Lumbar Interbody Fusion: Comparison between Posterior and Lateral Lumbar Interbody Fusions. Asian Spine J. 2019;13(6):904–12. https://doi.org/10.31616/asj.2019.0011.
Article PubMed Central Google Scholar
Bourghli A, Aunoble S, Reebye O, et al. Correlation of clinical outcome and spinopelvic sagittal alignment after surgical treatment of low-grade isthmic spondylolisthesis. Eur Spine J. 2011;20(S5):663–8. https://doi.org/10.1007/s00586-011-1934-z.
Article PubMed PubMed Central Google Scholar
Kim MK, Lee S-H, Kim E-S, et al. The impact of sagittal balance on clinical results after posterior interbody fusion for patients with degenerative spondylolisthesis: a pilot study. BMC Musculoskelet Disord. 2011;12:69. https://doi.org/10.1186/1471-2474-12-69.
Article PubMed PubMed Central Google Scholar
Fujii K, Kawamura N, Ikegami M, et al. Radiological improvements in global sagittal alignment after lumbar decompression without fusion. Spine. 2015;40(10):703–9. https://doi.org/10.1097/BRS.0000000000000708.
Article PubMed Google Scholar
Hubbe U, Sircar R, Scheiwe C, et al. Surgeon, staff, and patient radiation exposure in minimally invasive transforaminal lumbar interbody fusion: impact of 3D fluoroscopy-based navigation partially replacing conventional fluoroscopy: study protocol for a randomized controlled trial. Trials. 2015;16(1):142. https://doi.org/10.1186/s13063-015-0690-5.
Article PubMed PubMed Central Google Scholar
Hallgren KA. Computing Inter-Rater Reliability for Observational Data: An Overview and Tutorial. Tutor Quant Methods Psychol. 2012;8(1):23–34. https://doi.org/10.20982/tqmp.08.1.p023.
Article PubMed PubMed Central Google Scholar
Koo TK, Li MY. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J Chiropr Med. 2016;15(2):155–63. https://doi.org/10.1016/j.jcm.2016.02.012.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Special thanks go to Moritz Hess, Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center - University of Freiburg, for the statistical support.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Department of Neurosurgery, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Breisacher Str. 64, 79106, Freiburg, Germany
Marc Hohenhaus, Florian Volz, Yorn Merz, Ralf Watzlawick, Christoph Scholz, Ulrich Hubbe & Jan-Helge Klingler

Authors

Marc Hohenhaus
View author publications
You can also search for this author in PubMed Google Scholar
Florian Volz
View author publications
You can also search for this author in PubMed Google Scholar
Yorn Merz
View author publications
You can also search for this author in PubMed Google Scholar
Ralf Watzlawick
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Scholz
View author publications
You can also search for this author in PubMed Google Scholar
Ulrich Hubbe
View author publications
You can also search for this author in PubMed Google Scholar
Jan-Helge Klingler
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Substantial contributions to the conception: all authors. Design of the work: MH, FV, UH, JHK. Acquisition and analysis of data: all authors. Interpretation of data: MH, FV, UH, JHK. Drafting the work or substantively revising it: MH, FV, CS, UH, JHK. All authors approved the submitted version.

Corresponding author

Correspondence to Marc Hohenhaus.

Ethics declarations

Ethics approval and consent to participate

The trial was approved by the local ethics committee (Freiburg) and listed at the national clinical trials register (DRKS00004514, date of registration: 08/11/2012). All study participants provided written informed consent.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Hohenhaus, M., Volz, F., Merz, Y. et al. The challenge of measuring spinopelvic parameters: inter-rater reliability before and after minimally invasive lumbar spondylodesis. BMC Musculoskelet Disord 23, 104 (2022). https://doi.org/10.1186/s12891-022-05055-9

Download citation

Received: 15 October 2021
Accepted: 17 January 2022
Published: 31 January 2022
DOI: https://doi.org/10.1186/s12891-022-05055-9

The challenge of measuring spinopelvic parameters: inter-rater reliability before and after minimally invasive lumbar spondylodesis

Abstract

Background

Methods

Results

Conclusions

Trials registration

Similar content being viewed by others

Radiographic Classification for Degenerative Spondylolisthesis of the Lumbar Spine Based on Sagittal Balance: A Reliability Study

Moderate Interrater and Substantial Intrarater Reproducibility of the Roussouly Classification System in Patients With Adult Spinal Deformity

Reproducibility of thoracic kyphosis measurements in patients with adolescent idiopathic scoliosis

Background

Methods

Study population

Radiographic Evaluation

Outcome measurements

Statistics

Results

Baseline characteristics

Radiographic characteristics

Inter-rater reliability

Comparison of pre- and postoperative parameters

Discussion

But how to find the true value?

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary Information

Additional file 1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation