Introduction

Roentgen stereophotogrammetric analysis (RSA) is used to make accurate assessments of the relative position and orientation of bone structures and implants in vivo. Application of the technique allows the detection of clinically relevant motion, or the lack thereof. In this technique, two X-ray images are taken simultaneously from different directions in order to perform a three-dimensional rigid body movement analysis [1]. Clinical applications of RSA in the spinal context have included verification of the rigidity of spinal fusions and evaluation of dynamic stabilization of the cervical, thoracic and lumbar spine [2,3,4,5,6,7,8,9,10,11,12]. In hip and knee arthroplasty, RSA has been widely used to analyze migration of implants (relative motion of bone and implant), due to its high precision and accuracy. The accuracy of marker-based RSA for rotational measurements is 0.18° (with a precision of 0.5°) for hip stems [13]; and 0.12° (with a precision of 0.02°) for knee femoral components [14].

For movement analysis in the area of the lumbar spine, the X-rays must pass through a greater volume of soft tissue than is the case for the major joints of the extremities. This occurrence could reduce the precision and accuracy of the technique. The voluminous soft tissue surrounding the spine introduces scattering of the beam, which attenuates image contrast and leads to a loss in analysis quality [15, 16]. Nonetheless, due to difficulties in representing soft-tissue effects realistically in phantom studies, previous in vitro studies of RSA accuracy for the spine did not focus on the simulation of soft tissue effects [14].

Although double examinations are recommended as a validation of RSA accuracy in clinical applications (ISO 16,087-2013), it must be considered that only precision (repeatability)—and not accuracy—can be determined by this method, since the known true value for comparison is not available [17, 18]. It should be noted that some literature published before ratification of the RSA standard ISO 16,087-2013 makes different use of the terms precision and accuracy then defined in the standard. Thus, care should be taken when interpreting the “accuracy” reported in various publications. The translational and rotational “accuracy” reported by Johnsson et al. [12] in a clinical assessment of spinal motion would better be describes as precision according to ISO 16,087-2013. Even in later literature, differentiation between precision and accuracy is not always respected. An example of this is evident in the review by Humadi et al., where some historic studies are cited as reporting accuracy, whereas the data were originally reported as a measure of precision with respect to the ISO standard [19].

The indisputable benefit of double examinations in clinical applications is ethically controversial due to the subjection of the patients to additional radiation exposure [20]. This is particularly true for X-ray images of the trunk, which expose many vital organs. For this reason along with the possibility of being able to determine accuracy and not only precision, an experimental phantom approach was chosen for our investigation.

Since the accuracy of the RSA method for applications in the spine has not been sufficiently characterized to date, the first goal of this study was to examine the precision and accuracy of RSA in a spinal motion-segment analysis, taking the surrounding soft tissue into account using a substitute material (nylon 6).

Another potential source of RSA measurement error is the size of the X-ray source focal spot, which has not been investigated in this context, prior to this study. Statements by authors such as H. Koehnle and Göran Selvik, one of the creators of RSA, imply that the use of the smallest possible focal spot could potentially improve RSA accuracy [1, 21]. Reducing the size of the focal spot minimizes geometrical blur resulting from the partial shade introduced by a finite small X-ray emission area (Fig. 1) [1, 16, 21]. Nonetheless, in clinical applications of spinal RSA, a larger focal spot has commonly been used to reduce thermal load on the X-ray tubes [22]. However, the low current–time product of RSA image acquisition falls well below the capabilities of modern tubes.

Fig. 1
figure 1

The extent of partial shade (bottom) caused by a tantalum bead (Ta), as a function of size of the FSV in the X-ray tube (top). The height of the black bar is proportional to the shadow of the tantalum bead with partial shades on both sides. The sizes of the FSV, tantalum beads and shade are approximately to scale, but exaggerated with respect to the actual distances. The proportion of distances reflects clinical practice

Therefore, the second objective of this in-vitro study was to compare the precision and accuracy of RSA using different nominal focal spot values (FSVs), to investigate whether clinical RSA could benefit from the use of a smaller focal spot.

Methods

To assess the precision and accuracy of RSA, the relative motions of two human macerated lumbar vertebrae (L3 and L4) were analyzed. The vertebrae were mounted onto a multi-axis micromanipulator (M-37 with SM13, Newport, Darmstadt, Germany). This allowed the vertebrae to be tilted, in order to simulate physiological flexion–extension, lateral bending and axial rotations (Fig. 2).

Fig. 2
figure 2

Experimental setup for lateral bending. Top left: Fixation of the spine segment to the carbon tubes. Top right: The manipulator observed from one side: ‘A’ denotes the center of rotation in the manipulator. The axis of rotation runs through this point parallel to the carbon tubes. Main image: Two X-ray tubes in front of the phantom with the spine segment inside

Flexion–extension can be regarded as the most relevant axis of motion in spine diagnostics [5, 6, 23]. The rotational accuracy of the micromanipulator is 0.0001° (SD 0.007°) [24]. The vertebrae were attached to the micromanipulator by means of square carbon tubes to reduce the effects of bending and obscuring in the X-ray images to a minimum. Different carbon mounts were designed for each axis of rotation, to ensure that the center of rotation of the vertebrae would be situated in its physiological position during every motion [25, 26]. The center of rotation was placed based on the work of Xia et al. incorporating an approximated scaling factor of 1.13 to account for the actual size of the used vertebrae (length: L3: 84.5 mm; L4: 82.9) [26]. In the anterior–posterior dimension, the center of rotation is located 23 mm dorsal of the anterior edge of L3 for flexion/extension, and 36 mm ventral for internal rotation. For lateral bending, the center of rotation was placed in the center of the top surface of the vertebral body. Five tantalum markers, each of 1 mm diameter, were attached to each vertebra's surface using bone cement (Palacos LV, Heraeus Medical GmbH, Wehrheim, Germany), according to a typical clinical distribution: one marker on the spinous process and four on the lamina (Fig. 3) [5]. The cement was also used to fix the vertebrae on the carbon mounts.

Fig. 3
figure 3

Marker distribution of the tantalum markers, similar to the clinical distribution of Daentzer et al. [5]

Nylon phantom

To simulate the soft tissue´s attenuation and scattering of X-rays during clinical RSA procedures, a custom-made phantom made of nylon 6 was designed. The phantom was cylindrical in form with an opening for the vertebral segment; it represents the soft tissue surrounding the lumbar spine. Nylon 6 was chosen, since Mann et al. found this to be a good simulation material for soft tissue exposed to X-rays in the diagnostic photon energy range [27].

RSA-Setup

Two X-ray tubes (SRO3310 ROT 360, Philips, Amsterdam, The Netherlands) were arranged in accordance with a typical clinical RSA-setup at a distance of 1.6 m from the picture plane, facing the vertebrae at an angle of 20° to the normal of the image plane [3,4,5, 28]. The calibration box (CarbonBoxLeiden10, Medis Medical Imaging Systems bv., Leiden, The Netherlands) was situated between the image plane and the vertebrae, along with the calibration and fiducial markers necessary for the three-dimensional reconstruction of the vertebral poses. The images were recorded on cassettes (35 × 43 cm IP Cassette Type CC, Fuji, Tokio, Japan) and digitized with a resolution of 125 dpi (PCR Eleva Corado, Philips, Amsterdam, The Netherlands). Every scene, consisting of two simultaneous X-rays, was captured at 20 mAs and 125 kV, from a lateral perspective consistent with that used in a the clinical setting of Daentzer et al. [5]. The coordinate system was oriented such that rotation about the x-axis corresponds to flexion–extension, rotation about the y-axis corresponds to axial rotation and rotation about the z-axis corresponds to lateral bending. The flexion–extension-axis (x-axis) is perpendicular to the image plane; the other axes (y and z) are oriented parallel to the image plane.

The Model-based RSA software (MBRSA) (Version 3.31, Medis specials bv., Leiden, The Netherlands) was used for analysis of the stereoscopic X-ray data. Every micromanipulator adjustment and every RSA evaluation were conducted exclusively by the same individual.

For each axis of rotation and for each focal spot value FSV (0.6 mm and 1.2 mm), six repetitions of three conditions were executed. The three conditions were: the reference scene, the zero-relative-motion scene (ZRM, no movement), and the relative-motion scene (RLM). The latter scene was obtained by tilting one vertebra through 2.62° in axial rotation and through rotations of 5.23° in flexion–extension and lateral bending around the center of rotation of the spinal segment. These odd numbers resulted from applying 10 mm of translation on the micrometer drives.

Each zero-relative-motion scene was analyzed with reference to its preceding reference scene. This procedure is the equivalent of a clinical double examination and the true motion is always zero. However, the observed motion between these scenes will not be exactly zero and repeated measures allow the calculation of the precision of the procedure.

For determination of the accuracy of the technique, the RLM scenes were compared to their corresponding reference scene. Here the micromanipulator functioned as the highly accurate standard, and the accuracy was determined from the average difference between the analyzed rotation and the respective micrometer setting.

The data were analyzed using the MBRSA software to determine rotation relative to the calibration box´s coordinate system. Since the executed movement did not necessarily rotate exclusively about one single axis at a time, the resulting total angle of rotation was calculated from the three rotational components, for all 72 cases which were considered. This calculation allowed a direct comparison of the measured angle to that set by the micromanipulator.

RSA evaluation parameters (fiducial marker error, control marker errors, rigid-body-error (RBE), rigid body fitting error and condition number) were recorded according to recommendations [20, 28].

Precision was calculated by: \(\mathrm{precision}=t*\sigma * 1/\sqrt{n}\), with t as a value from the t-distribution for a 95% confidence interval (CI) and n-1 degrees of freedom, \(\sigma\) the standard deviation and n the number of observations (n = 6) [29, 30]. Accuracy was defined as the mean difference between the true value as obtained from the manipulators and the measured value from RSA analysis [31].

Results

The RSA analysis was successful for each of the captured scenes. Thus, for each measurement (n = 72 in total), it was possible to compare the respective rotations measured with RSA and with the micromanipulator setting, both in the ZRM and RLM protocols.

The precision of flexion–extension was higher when using the 0.6 mm FSV, when compared to the 1.2 mm FSV (± 0.056°; ± 0.153°, respectively, Table 1), and the accuracy was slightly better for the 0.6 mm compared to the 1.2 mm FSV (− 0.137°; − 0.170°, respectively, Table 2).

Table 1 Precision of zero-relative-motion (ZRM) analysis: motion between two reference scenes is calculated. True motion is zero in all directions. All values represent rotations and are therefore given in degrees
Table 2 Accuracy of relative-motion (RLM) analysis: Movement is calculated between one reference scene and one moved scene. True motion is 5.23° for flexion–extension and lateral bending and 2.62° for axial rotation, respectively. All values represent rotations and are therefore given in degrees

When comparing results for the respective axes of rotation, the best precision was obtained for lateral bending, both for an FSV of 0.6 mm and for an FSV of 1.2 mm (± 0.019° and ± 0.015°, respectively, Table 1). The best overall accuracy was also achieved for lateral bending (FSV 0.6 mm: − 0.041°; FSV 1.2 mm: − 0.035°; Table 2). The worst accuracy appeared for axial rotation using the 1.2 mm FSV (0.174°) (Fig. 4). No clear difference in either the precision or accuracy could be detected between rotation about a perpendicular-to-image-plane axis and the in-plane-axes.

Fig. 4
figure 4

Observed rotation for zero-relative-motion (ZRM) (top row) and deviation from expected motion for relative-motion (RLM) (bottom row), using an FSV of 0.6 mm and an FSV of 1.2 mm, respectively, shown for flexion–extension, lateral bending and axial rotation

The contrast between markers and the background was better in the images with an FSV of 0.6 mm, compared to an FSV of 1.2 mm (Fig. 5).

Fig. 5
figure 5

Contrast between markers and the background depending on FSV: 1.2 mm (A), 0.6 mm (B)

Concerning the RSA evaluation parameters, the fiducial marker error was < 0.2 mm and the control marker errors were found to be ≤ 1 mm. The RBE was between 0.001 mm and 0.117 mm and the mean rigid body fitting error was 0.04 mm. The condition number, representing the quality of spatial marker distribution, ranged from 29.1 to 35.6.

Discussion

The focus of this study was to assess the precision and accuracy of RSA in a spinal application, whilst taking the soft tissue surrounding the lumbar spine into account by integrating a soft tissue phantom. We found a precision ranging from ± 0.019° to ± 0.153° and an accuracy ranging from 0.012° to − 0.17°, among all of the movements investigated using the ZRM and RLM protocols, respectively. We also observed an effect of FSV, which suggests that variations in this parameter can be used to increase the accuracy of angle measurements in clinical studies of spinal kinematics.

The RSA evaluation parameters were similar to those of clinical lateral lumbar roentgen stereophotogrammetry images from a previous study [5] and met all relevant thresholds from recommendations (RBE < 0.5 mm [28], rigid body fitting error < 0.35 mm [20] and condition number < 100 [20, 32]).

The effect of the improved contrast obtained with an FSV of 0.6 mm compared to an FSV of 1.2 mm can be observed in the improved precision and accuracy obtained for this FSV, especially in flexion–extension. In lateral bending, results for the two FSVs are comparable and in axial rotation a better accuracy is observed for the smaller FSV. However, even the lower accuracy obtained with the larger FSV should still be sufficient for most clinical questions, since the accuracies for both FSVs are better than 0.2°.

Since there are a number of clinical studies using RSA for the determination of movements in the spine, the precision of rotations was determined repeatedly by double examinations (Table 3 ). The results range from 1.5° to 4.2° in flexion–extension, from 0.8° to 2° in lateral bending and from 0.4° to 3.9° in axial rotation, respectively. This is considerably higher than the precision values in our study which are all below 0.2°. This huge difference might be explained by the fact that during a real double examination in a clinical context, the position of the spinal segment is not as stable and, especially the marker distribution is not as ideal (median condition number in this study: 30.6, while for usual clinical cases, e.g., Daentzer et al. it was 70.3) as in our context.

Table 3 Precision for rotational movements as originally determined by double examination in clinical RSA studies. All values represent rotations and are therefore given in degrees. CI means confidence interval

There are a number of previous in-vitro RSA studies reporting precision and accuracy. Lee et al. performed an in-vitro study assessing the accuracy of RSA in spinal examination before the advent of fully computerized image processing [33]. They found the accuracy as defined as root-mean-squared error to be 0.14° for flexion–extension and 0.08° for axial rotation, which is in close agreement with our results. Results of an employed soft tissue model were not separately reported and the range of motions was up to 20°, which present methodological differences to the presented study.

Seehaus et al. compared the precision and accuracy of marker- and model-based RSA in a phantom model study of knee and hip implants [24]. Using the same rotational micrometers as in our setup, they conducted rotations of 1.19° about the anterior–posterior axis, (perpendicular-to-plane rotation, corresponding to the flexion–extension axis in the present study) and of 1.31° about the other axes. The surrounding bones and soft tissues were simulated by two hollow cylinders of polymethylmethacrylate. For the marker-based RSA of hip implants, Seehaus et al. found a precision of ± 0.050° (for perpendicular-to-plane rotation) with a worst case (around the superior-inferior axis) of ± 0.230°, which is comparable regarding the perpendicular-to-plane rotation, but inferior to the other axes in the presented study (0.6 mm FSV: ± 0.056° for perpendicular-to-plane rotation, ± 0.019° and ± 0.090° for other axes). This difference in precision is likely attributed to the longitudinal marker distribution along the superior-inferior axis in hip RSA, which is not present in spinal RSA. Regarding accuracy, Seehaus et al. found values between − 0.013 mm and 0.030 mm for marker-based analysis of hip implants, which is superior to the accuracy in the presented study (− 0.174° to 0.012°) [24]. This can be attributed to the smaller soft tissue model in the study of Seehaus et al. which accurately reflects the situation in the hip region as seen in the anterior–posterior direction.

The influence of marker number and spatial distribution on the precision and accuracy of RSA of tibial plateau fractures were examined by Solomon et al. in a plastic tibia model [14]. The experimental setup did not include a soft tissue phantom. Solomon et al. analyzed a marker distribution that they had found to be sufficient and applicable (6 markers defining proximal tibia and fractured fragment each). They found the rotational accuracy to be better than 0.123° (in the sagittal plane), which is slightly superior to the results from our study (accuracy better than − 0.174°). This can once again be attributed to the presence of a soft tissue phantom in the presented study. The precision was measured to be ± 0.024°, which is again superior to our results. In the in vivo validation analyses of 12 tibial plateau fracture patients, precision values of ± 0.331°, ± 0.260° and ± 0.165° were measured for x- (lateral), y-(distal), and z-(posterior) axes, respectively [14]. This highlights the difference between the idealized in-vitro setting and the clinical reality, even when only small soft tissue effects should be present as around the tibia.

Bojan et al. examined the precision and accuracy of RSA in the context of screw cut-out in trochanteric hip fractures, in vitro with a plastic femur model [13]. Five tantalum markers were attached to the femoral head, the shaft fragment carried eight such markers, and the lag screw and intramedullary nail carried four tantalum markers each. They did not include a soft tissue phantom and no dynamic double examinations were conducted. The precision was found to be better than 0.5°, which is inferior to the results of the presented study (precision better than ± 0.153°), and the accuracy for rotations of the femoral head was found to be better than 0.18°, which is comparable to the results of the presented study (accuracy better than − 0.170°) [13].

Several limitations regarding the current study should be mentioned, none of which substantially affected the results and conclusions.

First, all patients have individual marker distributions which can have an effect on analysis quality, especially in the examination of rotational movement [13, 14, 20, 32, 34,35,36]. In the current study, only one marker distribution was analyzed. While this marker distribution was superior to the usual clinical situation (median condition number for this study was 30.6, while e.g., for Daentzer et al. it was 70.3), the range of motion (flexion extension: this study: 5.23°, Daentzer et al.: 2.9°–7.1°) and image quality closely reflect the situation found in practice [5].

Second, the human body consists of different soft tissues which attenuate X-rays differently, and the tissues in each body are shaped individually [15]. The nylon phantom used in this study represents only the average X-ray attenuation characteristics of average soft tissue. The sample itself does not contain soft tissues such as a vertebral disk. This is not likely to influence the results, since the markers would not be affected by the added x-ray attenuation.

Third, only a single segment was analyzed in this study. However, the technical challenge to incorporate the correct centers of rotation was deemed disproportional given the fact that many clinical RSA studies focus on one single treated segment [5, 7,8,9,10,11].

Fourth, despite efforts to minimize the bending of the carbon tubes of the manipulator, they displayed a maximum deflection under their own weight and the weight of the vertebrae of 0.13 mm in the worst case (flexion extension). Since both tubes were affected likewise, neither the location of the center of rotation between the vertebrae nor the rotation itself was biased considerably.

Fifth, the setup for the examination of axial rotation was moved between the experimental procedure conducted for an FSV of 0.6 mm and the procedure conducted for an FSV of 1.2 mm, since they could not be conducted in succession on the same day due to clinical processes. This led to a slight change in axis of rotation and spatial test arrangement between the test series. However, the effect of this misalignment on the results obtained is regarded as minimal.

Sixth, in order to apply the results of this study to the clinical use of RSA, it must be considered that no movement occurred during the X-ray exposure time in the experimental in vitro setting. A patient´s body will execute motion during examination, although this effect is minimal in a stable sideways position of the patient.

In summary, the results show a precision of ± 0.153° and an accuracy of 0.170° for an FSV of 1.2 mm, suggesting RSA to be a highly precise method for analyzing motion of the lumbar spine. The determination of accuracy and precision of RSA measurements of spinal motion will allow researchers to differentiate between clinically relevant spine motion and measurement error. In particular, this is relevant for the accuracy, which cannot be determined by double examinations. Nevertheless, the idealized circumstances of this in-vitro test should always be considered. The findings for the larger FSV are of more relevance, since it is the one commonly used in the clinical field. Our results show that it has an only slightly inferior precision and accuracy, compared to the smaller FSV [22].