Introduction

Classification systems of spinal deformities in adolescent idiopathic scoliosis (AIS) have been proposed to guide surgical decision-making [1, 2]. Two dimensional (2D) spinal radiographs remain the mainstay of the clinical evaluation in AIS, thus the classifications remain based on the sagittal and coronal characteristics of the curves. Yet, scoliosis is known to be a three dimensional deformity. Several studies have shown that the three dimensional (3D) characteristics of the curve play an important role in surgical decision-making and surgical outcomes prediction [3,4,5,6].

The 3D characteristics of the scoliotic curves, using mathematical and statistical methods, have been evaluated [4, 7]. Such quantitative analyses have identified different 3D curve patterns in AIS Lenke subtypes [4, 7]. However, the additional sophisticated image post-processing and mathematical analysis that are required to identify these different 3D variants have hampered clinical application of such methods.

Previously, we developed a statistical 3D classification of right thoracic AIS using 3D models of the spine [4]. Subsequently, we identified characteristics of these subtypes in the coronal and sagittal views (posterior-anterior (PA) and lateral radiographs) that could be identified on the 2D spinal radiographs to infer the axial view. As this classification proved to provide meaningful clinical guidelines for surgical planning and surgical outcome risk stratification [5], we aimed to assess the reliability of identifying these 3D subtypes using two-view 2D radiographs (PA and lateral) of the spine.

To this end, we evaluated the inter- and intra-observer reliability of our 3D classification system of Lenke1 AIS using the 2D characteristics of the curves. We particularly focused on this subtype of scoliosis as identifying the 3D subtypes is shown to be beneficial in selecting the fusion levels [5]. It was hypothesized that trained raters can reliably identify the 3D subtypes of the right thoracic AIS using the descriptive characteristics of the curve determined on the 2D radiographs.

Methods

Cohort specifications

After obtaining institutional review board approval, we retrospectively obtained a sample of a total of 40 consecutive pre-surgical Lenke 1AIS patients, 12–17 years old. All patients had two view orthogonal calibrated stereoradiography images (EOS imaging, Paris, France)- posterior-anterior and lateral spinal radiographs. The bender films were used to determine lumbar curve flexibility. We included male and female patients with a main thoracic Cobb angle greater than 45° and lumbar Cobb angle that corrects to below 25° on the bending films. Per Scoliosis Research Society definition of the right thoracic curve, the apex of thoracic curve was above the T11-T12 intervertebral disc. Patients with prior spinal or hip surgery, neuromuscular conditions, spondylosis and spondylolisthesis, vertebrae congenital abnormalities including supernumerary were excluded.

3D Scoliotic subtypes characteristics

The spinal characteristics of the scoliotic subtypes in the PA and sagittal views were determined from a previous study [4]. This study described five subtypes in a cohort of right thoracic AIS by determining a series of qualitative specifications of the spinal curves in the coronal, sagittal, and axial (determined from the pattern of the pedicles’ orientation in the PA radiographs) views (Fig. 1). The spinal curve patterns in these 5 subtypes are:

  • Type 1: Sagittal view: hyperkyphotic/normal [2] sagittal profile and negative sagittal vertical axis (SVA)- C7 behind the posterior aspect of the sacrum. The kyphotic section of the spine is longer than the lordotic section of the spine, i.e., the kyphosis was extended to thoracolumbar spine. PA view: A compensatory curve above the main thoracic curve is observed thus T1 is leveled or tilted to the right side (in posterior view). Axial view: The direction of the vertebral rotation changes in the thoracolumbar region. The axial projection of the curve appears as lemniscate (figure-eight) shape.

  • Type 2: Sagittal view: Hypokyphotic [2] with no proximal kyphosis and negative SVA. The lordotic section is longer than the kyphotic section of the spine, i.e., the thoracolumbar section is lordotic. PA view: no proximal curve above the main thoracic curve thus T1 is leveled or tilted to the left in posterir view. Axial view: The direction of the vertebral rotation changes in the lower lumbar region or does not change at all below the caudal neutral vertebra – the neutral vertebra located below the main thoracic apex- The axial projection of the curve appears as loop shaped.

  • Type 3: Sagittal view: hypokyphotic at T5-T10 levels, close to zero SVA. PA view: frontal imbalance and trunk shift, T1 is leveled or tilted to the right due to a compensatory curve in proximal thoracic. Axial view: The direction of the vertebral rotation changes in the thoracolumbar region (lemniscate axial view).

  • Type 4: Sagittal view: Flat (no substantial kyphosis/lordosis) or lordotic sagittal profile with positive SVA, high (above midpoint of the spinal curve) sagittal inflection point thus the lordotic section of the spine is longer than the kyphotic section. PA view: no proximal thoracic curve, T1 is leveled or tilted to the left as the main thoracic curve continues to the proximal thoracic. Axial view: The vertebral rotation changes direction in the lower lumbar region or does not rotate below the caudal neutral vertebra (loop shaped axial view).

  • Type 5: Sagittal view: hypokyphotic and forward trunk shift, positive SVA with low inflection point (below midpoint of the spine) and a proximal kyphosis curve. PA view: T1 is leveled or tilted to the right. Axial view: the direction of the vertebral rotation changes in the thoracolumbar region (lemniscate axial view).

Fig. 1
figure 1

Five subtypes of Lenke1 AIS. The spinal curvatures are shown in the sagittal, frontal (posterior-anterior), and axial (Top-down) views. The vertebra at which the direction of the vertebral location changes in the thoracolumbar section is shown with a green circle

Manual classification

Figure 2 shows the three steps that were used in manual classification by the five raters. A flowchart was created based on the curves characteristics in the three anatomical planes to guide the raters through the classification process (Fig. 2a). The schematic of the sagittal profiles that shows the kyphotic/lordotic sections of the spine and the position of the inflection point or the section of the spine without a curve (inflection region) are shown in Fig. 2b.

Fig. 2
figure 2

a Three steps classification flowchart, using the 2D radiographs, is shown. The raters used frontal, sagittal and again frontal radiographs (when case 3 or 5 were selected) to determine the 3D subtypes. b Schematics of the sagittal views. The length of the spine over which the sign of the sagittal curve changes (kyphosis to lordosis) is shown by two small lines

Raters and classification process

A total of five raters, two pediatric orthopedic spine surgeons, two pediatric radiologists, and one research engineer performed the classification. To assess interobserver reliability, the four physicians repeated the classification for a subset of randomly selected 20 patients in the same cohort. Raters were trained in person to use the flowchart (Fig. 2). Raters accessed the medical images independently and recorded their rating in Research Electronic Data Capture (REDCap). Raters also determined the kyphosis and lumbar modifiers based on the Lenke classification for each patient [2].

Image processing

The 3D reconstructions of the spines were generated in SterEOS 2D/3D software (EOS imaging, Paris, France) for all the 40 subjects. The vertebral bodies centroids were used to generate the spinal centerline using a previously described method [8]. In addition, the 3D spinal centerlines of the five subtypes, determined in a previous study, were included [4]. These five 3D curves (spinal centerlines) were used to determine the similarity between the new cases (each of the 40 patients) to one of the original five 3D curves, as shown in Fig. 1.

Statistical analysis

To determine the subtype of the new data points (40 patients) we created a dissimilarity matrix between the five original subtypes of 3D spinal curves (Fig. 1) and each of the 40 new patients by calculating the Euclidean distances between each pair. The dissimilarity matrix, as the basis of many clustering methods, determines a quantitative measure of dissimilarity between the cluster centers and each new patients and determines to which of the existing clusters the new patient belongs. As such, the dissimilarity between the subjects within each cluster is significantly smaller than the dissimilarity between the subjects from two different clusters. The smallest dissimilarity between the patients’ spinal 3D curve and the existing 5 cluster’s centerlines determined the new patients’ subtype (Types 1 to 5). If the dissimilarity between the new patient’ 3D spinal curve and the five clusters were larger than the dissimilarity between the five clusters, that patient was excluded from the inter- and intra-observer reliability analysis.

The inter- and intra-observer reliabilities were evaluated between and within the raters using Fleiss’s Kappa. The rating reliabilities were evaluated separately for the 3D classification (identifying subtypes 1 to 5) and the axial classification (loop and lemniscate subtypes) (Fig. 1- axial view). We also calculated the accuracy of the visual ratings by each observer, considering the numerical classification as the true observation (gold standard), using the number of true positive classifications in the confusion matrix.

Results

A total of 4 out of 40 patients were excluded from the analysis based on the distances calculated in the dissimilarity matrices (Table 1). The dissimilarity between the 3D spinal curves of these patients and all the five existing cluster centers was larger than the between cluster dissimilarities (Table 1). Table 1 shows the dissimilarity matrix for the five clusters and these four cases. Figure 3 shows the PA and lateral spinal radiographs of these cases. Only one of these patients was in the cohort that was selected randomly for interobserver reliability assessment. Table 2 summarizes the average and standard deviation of the frontal Cobb angles and sagittal parameters of the remaining 36 patients in the 5 subtypes based on the numerical (true) clustering.

Table 1 The cluster assignment based on the dissimilarity matrix. The minimum Euclidian distance for each patient, determined the type. For example patient 1 was assigned to Type 1 as the calculated Euclidean distance between the Type 1 and Patient 1 was smaller than the distance between the patient 1 and other Types. Four patients were excluded as the dissimilarity between these cases and the five types exceeded the Euclidian distance between the five types as shown in the shaded section of the table
Fig. 3
figure 3

Presentation of four cases with a large dissimilarity to the five subtypes of Lenke 1 (per our previous classification). The PA and lateral radiographs, the axial projection, and the 3D model of the spine are shown. The end vertebrae are colored in the 3D images. The position of the T1 and L5 vertebrae in the axial is shown

Table 2 Patients’ clinical parameters. The type specification was based on the numerical class assignment using the dissimilarity matrices

For the 3D classification of the curves, the interobserver reliability (for 19 patients and 4 observers) was moderate [9], κ = 0.69. The intraobserver reliability for 3D classification (for 36 patients and 5 observers) was moderate, κ = 0.61. The intraobserver reliability for identifying kyphosis modifier was moderate, κ = 0.78 and for lumbar modifier was strong, κ = 0.86.

For the axial classification, the interobserver reliability was almost perfect, κ = 0.91 and the intraobserver reliability was strong, κ = 0.80.

The balanced accuracy value (the proportion of the correct prediction for each subtype) for raters 1 to 5 are shown in Table 3. The classification accuracy was the lowest for Type 2, 0.63, 95% CI = [0.57,0.67] and the highest accuracy for Types 4, 0.76, 95% CI = [0.73,0.79] among all raters.

Table 3 Classification accuracy (number of correct observations divided by the total number of observations) of the five raters and for each subtype

Discussion

Two-dimensional classification systems of the scoliotic spine are developed to assist with surgical planning [2, 10, 11]. Yet, variations in the surgical outcomes exist partially due to the fact that the 3D shapes of the scoliotic spines are not incorporated in these classification systems [12,13,14,15,16]. We developed a true 3D classification of the right thoracic scoliosis and attempted to apply the classification system via 2D images to identify the characteristics of these 3D subtypes on 2D radiographs. This study determined the reliability of this 3D classification method based on 2D radiographs to be moderate to strong among the raters suggesting that this 3D classification system has the potential to be used in orthopedic clinics without a need for excessive image post-processing.

The reliability of the 3D classification system proved to be within the range of the Lenke modifier kappa values in the cohort of our raters and superior to other classification system commonly used in the field of orthopedics. For example the kappa value for tibial plateau fractures classifications was 0.476 based on Schatzker classification [17]. Similarly the kappa for pediatric supracondylar fractures is reported at 0.475 for Wilkins-modified Gartland classification [18]. Our intraobserver reliability for the 3D classification and axial classification were at κ = 0.61 and 0.80 respectively suggesting an acceptable range for clinical applications. The intraobsever reliability varied among the five raters; however as the number of raters is small we did not make any relationship between the raters (surgeon versus radiologist, versus engineer) and the reliability scores.

Surgical planning in AIS aims to stabilize the spine while minimizing the number of the fused vertebrae [19, 20]. In doing that, short fusion or variations in the upper and lower instrumented vertebrae (UIV and LIV) may result in compensatory curve progression, postural compensation, and subsequently a need for revision surgery [5, 20]. Our classification primarily focuses on identifying the true 3D characteristics of these curves by utilizing axial information to augment our understanding of the curve in right thoracic AIS [4, 21]. Our previous results demonstrated that the vertebral level below the thoracic apex at which the direction of the vertebral rotation changes can identify the true number of the 3D curves and the axial subtypes in right thoracic AIS [4, 21]. The lemniscate axial type (with a rather sudden change in the direction of the vertebral rotation in the thoracolumbar region) responds better to shorter fusion whereas loop shaped axial types (with a long sweeping curve and change in the direction of the vertebral rotation in the lower lumbar region or no rotation at all below the apex) require an extension of the fusion to the lumbar spine [5] as the spine is comprised of only one long 3D curve. Yet, detailed surgical planning based on this classification remains to be further established. The criterion based on the vertebral rotation is close to the concept of the neutral vertebra (NV), however current definition tries to draw attention to whether the least rotated vertebra is located between two curves (if the vertebral rotates to the opposite direction below NV i.e., lemniscate axial group) or part of the structural curve (if the vertebral rotation does not change below the NV, i.e., loop shaped axial group) [5]. A previous risk stratification analysis based on the 3D classification that was presented here showed that the subtypes with loop shape axial projection benefit from longer fusion that includes the entire 3D curve whereas fusion of one of the 3D curves in subtypes with lemniscate shaped axial projection can improve the rate of spontaneous lumbar Cobb angle correction [5]. The raters in our study could identify the axial subtypes with excellent reliability only by considering the pattern of the vertebrae rotation in the thoracolumbar/lumbar sections. This axial classification of the right thoracic AIS, as described in this study, can provide a better understanding of the nature of the compensatory curves in right thoracic AIS and assist with surgical decision-making.

As our 3D classification includes the sagittal alignment of the spine, attention should be paid to natural patient positioning during radiograph acquisition [22, 23]. As the importance of sagittal alignment in clinical evaluation of scoliosis is emphasized [21, 24, 25], patient positioning methods that do not change the postural alignment are critical for sagittal evaluation of the spine. Considering the sagittal profile in natural standing position, Types 1 and 2 have a negative SVA thus excessive transition of the UIV anteriorly, while imparting kyphosis, can result in proximal junctional kyphosis due to over-working of the posterior elements [5, 26]. Types 4 and 5 have a positive SVA. Imparting large kyphosis may result in disturbing the harmonious spino-pelvic alignment and developing compensatory mechanism [26]. Finally, the changes in the position of UIV are of greater importance in the curve types without a proximal kyphosis i.e., Types 2 and 4 as shown in a previous analysis [5]. This shows the importance of the proximal kyphosis which was not considered in previous sagittal classification of the spine [27, 28].

All raters primarily used the flowchart to perform the classifications (Fig. 2a). A total of 4 subjects were excluded from the current analysis because a large dissimilarity between the 3D spinal curvature of these patients and the original cluster centers as determined in our previous work [4, 5] was observed (Fig. 3 and Table 1). When visually evaluated, these patients had one or more characteristics that did not fit in the 3D description of any of the clusters (Figs. 1 and 2). For example, case 1 presented with a hypokyphosis and flat sagittal profile, as seen in Type 4, but with slight rotation of the thoracolumbar curve to the opposite direction of the thoracic curve, similar to what is seen in Types 1, 3, and 5 (Fig. 3-a). Case 2 had a sagittal profile similar to Type2 but slight vertebral rotation in the thoracolumbar region (Types 1, 3, or 5) was observed. Case 3 had a sagittal profile similar to Type 3 or 5 however no vertebral rotation in the thoracolumbar region was observed, suggesting a Type 2 or 4. Finally, case 4 had Type 4’s sagittal profile but the frontal curve, based on the vertebral rotation below the neutral vertebra, was classified as Types 1, 3, or 5. Analysis of a larger database can determine whether these cases should be considered as additional subgroups of Lenke1 AIS. Extra attntion needs to be paid during the radiograph acquisation and avoid patient positionings that alter the natural sagittal profile [23].

Advancements in deep learning in medical imaging can further facilitate this classification method. Such automated methods can determine new subgroups if the similarity between the patients’ subtypes increased and form new groups using an online processing algorithm. This is the subject of our future work. Using this 3D classifications algorithm [29] while including true 3D parameters of the spine [30], method needs to be expanded to other Lenke types and validated. Finally, an external validation of this classification is required to show whether this classification system can be used reliably in other hospitals and research centers.

Conclusion

Although the 3D reconstruction of the spine can contribute significantly to our understanding of the scoliosis spine, the proposed rule-based classification using 2D radiographs can provide valuable and readily accessible information to orthopedic clinics using the technology already utilized in the clinical setting.