Automated Clustering Technique (ACT) for Early Onset Scoliosis: A preliminary report

Viraraghavan, Girish; Cahill, Patrick J.; Vitale, Michael G.; Williams, Brendan A.; Balasubramanian, Sriram

doi:10.1007/s43390-022-00634-1

Automated Clustering Technique (ACT) for Early Onset Scoliosis: A preliminary report

Case Series
Open access
Published: 26 January 2023

Volume 11, pages 723–731, (2023)
Cite this article

Download PDF

You have full access to this open access article

Spine Deformity Aims and scope Submit manuscript

Automated Clustering Technique (ACT) for Early Onset Scoliosis: A preliminary report

Download PDF

Girish Viraraghavan¹,
Patrick J. Cahill²,
Michael G. Vitale³,
Brendan A. Williams²,
Pediatric Spine Study Group &
…
Sriram Balasubramanian ORCID: orcid.org/0000-0002-7886-6184¹

1511 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

Purpose

While the C-EOS system helps organize and classify Early Onset Scoliosis (EOS) pathology, it is not data-driven and does not help achieve consensus for surgical treatment. The current study aims to create an automated method to cluster EOS patients based on pre-operative clinical indices.

Methods

A total of 1114 EOS patients were used for the study, with the following distribution by etiology: congenital (240), idiopathic (217), neuromuscular (417), syndromic (240). Pre-operative clinical indices used for clustering were age, major curve (Cobb) angle, kyphosis, number of levels involved in a major curve (Cobb angle) and kyphosis along with deformity index (defined as the ratio of major Cobb angle and kyphosis). Fuzzy C-means clustering was performed for each etiology individually, with one-way ANOVA performed to assess statistical significance (p < 0.05).

Results

The automated clustering method resulted in three clusters per etiology as the optimal number based on the highest average membership values. Statistical analyses showed that the clusters were significantly different for all the clinical indices within and between etiologies. Link to the ACT-EOS web application: https://biomed.drexel.edu/labs/obl/toolkits/act-eos-application.

Conclusion

An automated method to cluster EOS patients based on pre-operative clinical indices was developed identifying three unique, data-driven subgroups for each C-EOS etiology category. Adoption of such an automated clustering framework can help improve the standardization of clinical decision-making for EOS.

Towards a new 3D classification for adolescent idiopathic scoliosis

Article 05 February 2020

Data-driven Classification of the 3D Spinal Curve in Adolescent Idiopathic Scoliosis with an Applications in Surgical Outcome Prediction

Article Open access 02 November 2018

A cluster analysis describing spine and torso shape in Lenke type 1 adolescent idiopathic scoliosis

Article Open access 06 October 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Early onset scoliosis (EOS) is a progressive spine deformity defined by an onset of spinal curvature greater than 10 degrees in children under 10 years of age [1, 2]. EOS is believed to account for nearly 10% of all pediatric scoliosis cases, with the true prevalence unknown [3, 4]. While EOS encompasses different etiologies with a variable natural history, the resulting progressive spine and rib cage deformity that can very likely lead to thoracic insufficiency syndrome which is characterized by reduced lung development and growth [4,5,6,7]. To help better understand clinical presentations in the heterogeneous EOS patient population, the Classification of EOS (C-EOS) was developed by an expert committee of spine surgeons to group patients using pre-operative clinical indices such as age, etiology (congenital, idiopathic, neuromuscular, syndromic), major curve (Cobb) angle (< 20°, 20–50°, > 50- < 90°, > 90°) and kyphosis (< 20°, 20–50°, > 50°) [8]. The C-EOS system helps assign a patient to 1 of 48 subgroups based on the aforementioned clinical indices. In 12 of these subgroups, patients have < 20° of major curve (Cobb) angle group and rarely need surgical intervention due to the low extent of the deformity.

C-EOS is the only available classification system to group EOS patients based on the pre-operative clinical indices. While the system is both reliable and accurate, the cut-offs for major curve (Cobb) angle and kyphosis are not based on a data-driven approach [9]. In addition, the 48 subgroups in C-EOS limit meaningful analysis due to a small number of patients in each subgroup, thereby making it difficult to correlate interventions with outcomes [10]. While C-EOS has helped establish a standardized method to communicate different aspects of EOS deformity, it is not widely used like the Lenke classification system for Adolescent Idiopathic Scoliosis (AIS) to help guide surgical treatment [11, 12].

Currently, there is limited consensus among surgeons on treatment selection with variations in the timing of surgery, selection of treatment modality, and levels involved in the instrumentation [13,14,15,16]. Management decisions are guided more by a clinician’s experience and training, complicating outcome comparisons between institutions. This is in part due to C-EOS system not providing guidance on treatment decision-making. Clustering methods have been used to detect clinically relevant curve patterns for the AIS population using pre-operative clinical indices and a few studies have even used a data-driven approach to cluster such patients [17,18,19,20,21]. Such automated clustering methods and analysis have yet to be attempted for EOS.

Due to the need for data-driven grouping of the heterogenous EOS patient population, the objective of this study is to use machine learning-based clustering methods to generate a limited number of automated, meaningful subgroups based on pre-operative clinical indices of EOS patients. Such a framework can be used along with the existing C-EOS system as the foundation to build a novel automated classification system informed by patient data to provide standardized guidance for surgical interventions for EOS.

Methods

Patient data

After institutional review board approval, EOS patients were retrospectively selected from the prospectively maintained database of the Pediatric Spine Study Group (PSSG) over a 20-year period (1995–2015). For all subjects, pre-operative PA, and lateral radiographs along with measurement of pre-operative clinical indices were obtained. For patients with multiple clinical visits, only the clinical indices from the first visit were considered. Subjects with missing clinical indices were excluded, which resulted in a total of 1170 subjects (congenital (n = 259), idiopathic (n = 224), neuromuscular (n = 440), and syndromic (n = 247)) being included for the automated clustering analyses.

Clinical indices measurement and automated clustering method

Pre-operative clinical indices used for clustering were age, major curve (Cobb) angle, kyphosis, number of levels involved in a major curve (Cobb angle) and kyphosis along with deformity index (defined as the ratio of major Cobb angle and kyphosis.

Fuzzy C-means clustering algorithm is an automated method to group data into a specific number of clusters provided by the user. The algorithm works by initially finding cluster centroids based on the data, after which every datapoint is assigned a membership value that denotes how close it is to the cluster centroid. Based on these membership values, datapoints are clustered together [22]. In the current study, a Fuzzy C-means clustering algorithm was used to cluster EOS patients for each etiology based on age, major curve (Cobb) angle, kyphosis, number of levels involved in the major curve, number of levels involved in kyphosis and deformity index. Hence, four cluster analyses were performed for each of the four EOS etiologies namely congenital, idiopathic, neuromuscular, and syndromic. One of the advantages of the fuzzy c-means clustering algorithm is that it considers the heterogeneity of the dataset and is able to identify outliers using the membership values.

For this study, the optimal number of clusters for each etiology was decided based on the average membership values of the clusters. Different number of clusters were used to perform the clustering analysis, and the one that resulted in the highest average membership value was selected as the optimal number.

Clustering evaluation metric

The membership values calculated during the fuzzy C-means analyses describe how close a datapoint is to the calculated cluster centroid, hence offering the basis to detect outliers. These membership values help quantify the certainty (i.e. accuracy) of a data point belonging to a particular cluster. Since these values provide accuracy on a data point and its cluster center, these can also be used a metric to determine outliers by defining a lower bound. For the current study, a membership value of 0.6 was used as the lower bound with values below this deemed as outliers. Hence, subjects who had clinical indices with membership values lower than 0.6 were considered as not belonging to a cluster and were excluded from the clustering analyses.

Statistical analysis

MATLAB (v2020b, The MathWorks Inc, Natick, MA) was used to perform statistical analyses. Average and standard deviation values for each cluster within an etiology were computed and compared with each other using a one-way ANOVA (< 0.01), to detect differences between the clusters. Furthermore, Tukey’s post-hoc analysis was performed to detect differences between the clusters of all the etiologies.

Results

For fuzzy-C-means, based on the average membership values, three clusters per etiology resulted in the highest membership values. Out of 1170 eligible subjects, a total of 56 subjects (congenital (n = 19 out of 259), idiopathic (n = 7 out of 224), neuromuscular (n = 23 out of 440), and syndromic (n = 7 out of 247)) were excluded from the analyses due to their low membership values (i.e. < 0.6), resulting in a total of 1114 subjects being included for the final analyses.

Tables 1, 2, 3, 4 show the average and standard deviation values of clinical indices for each etiology along with exemplar radiographic images. In each table, clusters 1–3 are arranged in ascending order of average magnitude of major curve (Cobb) angle. To best visualize the clusters, scatter plots of kyphosis versus major curve (Cobb) for each etiology are shown in Figs. 1, 2, 3, 4, respectively. For each etiology, one-way ANOVA showed that the clinical indices for any cluster were significantly different (p < 0.01) from other clusters. Link to the ACT-EOS web application: https://biomed.drexel.edu/labs/obl/toolkits/act-eos-application.

Table 1 Average and standard deviations values of clinical indices along with exemplar radiographic images for Congenital EOS clusters

Full size table

Table 2 Average and standard deviations values of clinical indices along with exemplar radiographic images for Idiopathic EOS clusters

Full size table

Table 3 Average and standard deviations values of clinical indices along with exemplar radiographic images for Neuromuscular EOS clusters

Full size table

Table 4 Average and standard deviations values of clinical indices along with exemplar radiographic images for Syndromic EOS clusters

Full size table

Discussion

This is the first study to perform automated fuzzy C-means clustering of EOS subjects based on pre-operative clinical indices, complementing the existing C-EOS classification system. While the C-EOS system uses Cobb angle and kyphosis, which were based on expert review informed by normative values derived from the literature, the current clustering method uses a data-driven approach, by using the highest average membership values to determine subject assignments to respective clusters, to create the subgroups. The smaller number (3) of subgroups per etiology generated by this clustering analysis could better focus research initiatives for the rare and heterogenous population of EOS patients to improve and standardize clinical decision-making. In contrast, as shown in Tables 1, 2, 3, 4, subjects in each cluster would be classified under multiple C-EOS groups which would not be conducive to clinical decision making [23]. This is supported by previous studies that have reported limitations of large EOS subgroups to aid in clinical outcome determination [10].

In this study, we identified unique clusters within each EOS etiology that were significantly different from each other based on all the clinical indices. Although one cluster in the idiopathic and congenital groups, respectively, had severe kyphotic deformities, these clusters still had a deformity index greater than one (i.e. greater major curve Cobb angle as compared to kyphosis). In contrast, both neuromuscular and syndromic groups had one cluster each with a deformity index less than one indicative of greater kyphosis as compared to the major curve Cobb angle. Although further validation is required for these methods to be widely adopted, the novel preliminary findings reported in the current study may point to differences in general deformity patterns observed between EOS etiologies, which are not immediately discernible with the C-EOS system. Additionally, while post-hoc testing showed significant differences across all clusters for all etiologies, we may not fully capture the heterogeneity as these are solely based on radiographic measurements.

Limited previous studies for automated clustering of patients with spine deformity have used techniques such as ISOData and K-means + + [20, 24]. While these methods were able to create clusters based on patient deformity characteristics, they do not provide a quantitative basis to assign a patient to a particular cluster, and also lack the ability to assess whether or not a patient is an outlier. On the other hand, Fuzzy C-means, which is an automated unsupervised clustering algorithm that performs well on heterogenous datasets, uses membership values to provide a probabilistic estimate of a patient belonging to a particular cluster. Higher membership values (> 0.6) would indicate a greater probability of a patient’s assignment to a cluster, and thereby also help identify and exclude any outliers.

The current study is not without limitations. First, to better address the heterogeneity of EOS subjects, the current clustering method can be further improved. For example, a small number of subjects (n = 56 out of 1170), were excluded in any cluster due to low membership values (i.e. < 0.6) of the limited radiographic measurements. Inclusion of additional structural (such as thoracic deformity parameters), functional (such as pulmonary functional parameters) and time-based measures (Annual Progression Modifier (APM)) may help better address the variable presentations of EOS deformity. Secondly, we used a multi-institutional and multi-modal dataset with clinical indices measured made by multiple observers. Variations associated with such datasets would impact any classification method equally. However, the fuzzy C-means algorithm would be less affected by such variations due to the automated selection that is unbiased by preset cut-offs for clinical indices.

Conclusion

An automated framework to cluster EOS patients based on pre-operative clinical indices was developed identifying three unique, data-driven subgroups for each C-EOS etiology category. Adoption of this clustering framework with further validation may assist with clinical applications such as surgical planning, optimization of intervention type and timing to improve clinical outcomes and reduce complications.

Link to the ACT-EOS web application: https://biomed.drexel.edu/labs/obl/toolkits/act-eos-application.

Data availability

All data are available with the Pediatric Spine Study Group.

References

Skaggs D, Akbarnia B, Flynn J et al (2014) Chest wall and spine deformity study group; growing spine study group; pediatric orthopaedic society of North America; scoliosis research society growing spine study committee. A classification of growth friendly spine implants. J Pediatr Orthop 34(3):260–274
Article PubMed Google Scholar
El-Hawary R, Akbarnia BA (2015) Early onset scoliosis - time for consensus. Spine Deform 3(2):105–106
Article PubMed Google Scholar
Riseborough EJ, Wynne-Davies R (1973) A genetic survey of idiopathic scoliosis in Boston, Massachusetts. JBJS 55(5):974–982
Article CAS Google Scholar
Yang S, Andras LM, Redding GJ et al (2016) Early-onset scoliosis: a review of history, current treatment, and future directions. Pediatrics 137(1):e20150709
Article Google Scholar
Akbarnia BA, Yazici M, Thompson GH (2010) The growing spine: management of spinal disorders in young children. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85207-0
Book Google Scholar
Skaggs DL, Akbarnia BA, Flynn JM et al (2014) A classification of growth friendly spine implants. J Pediatr Orthop 34(3):260–274
Article PubMed Google Scholar
Campbell RM Jr, Smith MD, Mayes TC et al (2003) The characteristics of thoracic insufficiency syndrome associated with fused ribs and congenital scoliosis. JBJS 85(3):399–408
Article Google Scholar
Williams BA, Matsumoto H, McCalla DJ et al (2014) Development and initial validation of the classification of early-onset scoliosis (C-EOS). J Bone Joint Surg Am 96(16):1359–1367
Article PubMed Google Scholar
Dragsted C, Ohrt-Nissen S, Hallager DW et al (2020) Reproducibility of the classification of early onset scoliosis (C-EOS). Spine Deform 8(2):285–293
Article PubMed Google Scholar
Russo C, Trupia E, Campbell M et al (2019) The association between the classification of early-onset scoliosis and smith complications after initiation of growth-friendly spine surgery: a preliminary study. J Pediatr Orthop 39(10):e737–e741
Article PubMed Google Scholar
Lenke LG, Betz RR, Harms J et al (2001) Adolescent idiopathic scoliosis: a new classification to determine extent of spinal arthrodesis. JBJS 83(8):1169–1181
Article CAS Google Scholar
Garbuz DS, Masri BA, Esdaile J et al (2002) Classification systems in orthopaedics. JAAOS - J Am Acad Orthop Surg 10(4):290–297
Article PubMed Google Scholar
Vitale MG, Gomez JA, Matsumoto H et al (2011) Variability of expert opinion in treatment of early-onset scoliosis. Clin Orthop Relat Res 469(5):1317–1322
Article PubMed Google Scholar
Corona J, Miller DJ, Downs J et al (2013) Evaluating the extent of clinical uncertainty among treatment options for patients with early-onset scoliosis. JBJS 95(10):e67
Article Google Scholar
Hughes MS, Swarup I, Makarewich CA et al (2020) Expert consensus for early onset scoliosis surgery. J Pediatr Orthop 40(7):e621–e628
Article PubMed Google Scholar
Williams BA, Asghar J, Matsumoto H et al (2013) More experienced surgeons less likely to fuse: a focus group review of 315 hypothetical EOS cases. J Pediatr Orthop 33(1):68–74
Article PubMed Google Scholar
Sangole AP, Aubin C-E, Labelle H et al (2009) Three-dimensional classification of thoracic scoliotic curves. Spine 34(1):91–99
Article PubMed Google Scholar
Stokes IA, Sangole AP, Aubin C-E (2009) Classification of scoliosis deformity 3-D spinal shape by cluster analysis. Spine 34(6):584
Article PubMed PubMed Central Google Scholar
Donzelli S, Poma S, Balzarini L et al (2015) State of the art of current 3-D scoliosis classifications: a systematic review from a clinical perspective. J Neuroeng Rehabil 12(1):1–11
Article Google Scholar
Thong W, Parent S, Wu J et al (2016) Three-dimensional morphology study of surgical adolescent idiopathic scoliosis patient from encoded geometric models. Eur Spine J 25(10):3104–3113
Article PubMed Google Scholar
Kadoury S, Labelle H (2012) Classification of three-dimensional thoracic deformities in adolescent idiopathic scoliosis from a multivariate analysis. Eur Spine J 21(1):40–49
Article PubMed Google Scholar
Bezdek J (1981) Pattern recognition with fuzzy objective function algorithms. Plenum, New York
Book Google Scholar
Shen J, Parent S, Wu J et al (2020) Towards a new 3D classification for adolescent idiopathic scoliosis. Spine Deform 8(3):387–396
Article PubMed Google Scholar
Ball GH (1965) Isodata-An iterative method of multivariate analysis and pattern classification. Proc IFIPS Congr

Download references

Acknowledgements

This project was partly supported by funding from Drexel University, and the Wyss-Campbell Center for Thoracic Insufficiency Syndrome at the Children’s Hospital of Philadelphia.

Funding

Wyss-Campbell Center for Thoracic Insufficiency Syndrome.

Author information

Authors and Affiliations

School of Biomedical Engineering, Science and Health Systems, Drexel University, 3141 Chestnut Street, Bossone 718, Philadelphia, PA, 19104, USA
Girish Viraraghavan & Sriram Balasubramanian
Division of Orthopaedics, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
Patrick J. Cahill & Brendan A. Williams
Division of Pediatric Orthopaedic Surgery, Department of Orthopaedic Surgery, Morgan Stanley Children’s Hospital of New York-Presbyterian, Columbia University Medical Center, Columbia University, New York, NY, USA
Michael G. Vitale

Authors

Girish Viraraghavan
View author publications
You can also search for this author in PubMed Google Scholar
Patrick J. Cahill
View author publications
You can also search for this author in PubMed Google Scholar
Michael G. Vitale
View author publications
You can also search for this author in PubMed Google Scholar
Brendan A. Williams
View author publications
You can also search for this author in PubMed Google Scholar
Sriram Balasubramanian
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

Pediatric Spine Study Group

Contributions

GV: Conception, data collection, data curation and analysis, methodology, data visualization, data analysis and interpretation, data visualization, writing—original draft and editing, agree to be accountable for the work, approval of the final version of the manuscript. PJC: Methodology, data curation and analysis, resources, formal analysis, writing-critical review and editing, approval of the final version of the manuscript, agree to be accountable for the work. MGV: Methodology, data curation and analysis, resources, formal analysis, writing -critical review and editing, approval of the final version of the manuscript, agree to be accountable for the work. BAW: Formal analysis, data curation and analysis, writing—critical review and editing, approval of the final version of the manuscript, agree to be accountable for the work. Pediatric Spine Study Group: Data curation and analysis, formal analysis, writing—critical review and editing, approval of the final version of the manuscript, agree to be accountable for the work. SB: Conceptualization, methodology, formal analysis, resources, data collection, data curation and analysis, writing—original draft, visualization, supervision, project administration, funding acquisition, approval of the final version of the manuscript, agree to be accountable for the work.

Corresponding author

Correspondence to Sriram Balasubramanian.

Ethics declarations

Conflict of interest

The authors declare no conflicts of interest.

Ethical approval

This study was given Ethical Approval by the Institutional Review Board prior to the conduct of this study. Informed consent and/or a waiver of consent was obtained.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Viraraghavan, G., Cahill, P.J., Vitale, M.G. et al. Automated Clustering Technique (ACT) for Early Onset Scoliosis: A preliminary report. Spine Deform 11, 723–731 (2023). https://doi.org/10.1007/s43390-022-00634-1

Download citation

Received: 25 March 2022
Accepted: 10 December 2022
Published: 26 January 2023
Issue Date: May 2023
DOI: https://doi.org/10.1007/s43390-022-00634-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Automated Clustering Technique (ACT) for Early Onset Scoliosis: A preliminary report