Abstract
Introduction
A novel three-dimensional classification to comprehensively describe degenerative arthritis of the shoulder (DAS) was recently published by our group. The purpose of the present work was to investigate intra- and interobserver agreement as well as validity for the three-dimensional classification.
Materials and methods
Preoperative computed tomography (CT) scans of 100 patients who had undergone shoulder arthroplasty for DAS were randomly selected. Four observers independently classified the CT scans twice, with an interval of 4 weeks, after prior three-dimensional reconstruction of the scapula plane using a clinical image viewing software. Shoulders were classified according to biplanar humeroscapular alignment as posterior, centered or anterior (> 20% posterior, centered, > 5% anterior subluxation of humeral head radius) and superior, centered or inferior (> 5% inferior, centered, > 20% superior subluxation of humeral head radius). Glenoid erosion was graded 1–3. Gold-standard values based on precise measurements from the primary study were used for validity calculations. Observers timed themselves during classification. Cohen’s weighted κ was employed for agreement analysis.
Results
Intraobserver agreement was substantial (κ = 0.71). Interobserver agreement was moderate with a mean κ of 0.46. When the additional descriptors extra-posterior and extra-superior were included, agreement did not change substantially (κ = 0.44). When agreement for biplanar alignment alone was analyzed, κ was 0.55. The validity analysis reached moderate agreement (κ = 0.48). Observers took on average 2 min and 47 s (range 45 s to 4 min and 1 s) per CT for classification.
Conclusions
The three-dimensional classification for DAS is valid. Despite being more comprehensive, the classification shows intra- and interobserver agreement comparable to previously established classifications for DAS. Being quantifiable, this has potential for improvement with automated algorithm-based software analysis in the future. The classification can be applied in under 5 min and thus can be used in clinical practice.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Degenerative arthritis of the shoulder (DAS) has traditionally been classified as either primary osteoarthritis (OA) or cuff tear arthropathy (CTA). Anteroposterior subluxation and glenoid morphology in the axial plane for OA is to date commonly described using the modified Walch classification [1, 2]. Superoinferior subluxation and glenoid erosion for CTA has commonly been described in the coronal plane (anteroposterior X-ray) according to the Hamada or Visotsky-Seebauer and Favard classifications, respectively [3,4,5]. These classifications were originally developed for describing X-ray findings and in the case of the modified Walch classification were later adapted for axial computed tomography (CT).
In recent years, however, several authors described biplanar eccentricity in glenoid erosion patterns [6,7,8,9,10]. OA patients often develop rotator cuff insufficiency as the disease progresses, or the disease may initially be influenced by rotator cuff degeneration [11,12,13]. Especially in early disease, axial plane imaging of a lying patient is more sensitive for superior subluxation than standing X-ray [14].
Therefore, a new three-dimensional (3D) classification, which categorizes anteroposterior (A-P) and superoinferior (S-I) alignment with erosion for DAS, was developed and recently published [15].
Whilst this novel classification allows for a more comprehensive description of DAS, it has not yet been validated. The higher number of categories compared to previous classifications may reduce reproducibility [1, 3, 4]. However, alignment is quantifiable in this novel classification, making this less subjective and possibly ameliorating the effect of more categories on reproducibility. Furthermore, the complex nature of the 3D classification and need for CT reconstruction could make it too time consuming for clinical practice, which requires investigation.
The purpose of this study was to validate the 3D classification for patients with DAS. The hypothesis was that validity as well as interobserver reliability would be moderate and, therefore, comparable to values for previous classifications from the literature (Table 1) [16,17,18,19,20].
Materials and methods
Patient population
For this validation study, a previously investigated cohort of patients with DAS who underwent primary shoulder arthroplasty (total shoulder arthroplasty, hemiarthroplasty or reverse shoulder arthroplasty) at the Department of Orthopaedic Sports Medicine of the University Hospital Rechts der Isar in Munich between 2009 and 2020 were identified [15]. 299 shoulder arthroplasty cases were performed in this period for DAS. 135 of these had preoperative CT scans taken according to a standardized in-house protocol (pitch, 0.39; slice thickness, 0.9 mm; tube voltage, 120 kV; tube current, 82 mA [range, 50–115 mA]) available for analysis. CT scans had been taken no more than 6 months prior to surgery. Five of these patients were excluded from analysis: in two cases, the scapula was not adequately exposed, one due to movement artefact and two due to severe erosion, due to which landmarks for measurement could not be reliably placed. Of the remaining 130 CTs, 100 were selected at random.
Demographically the mean patient age was 70 years (range 38–88 years) and 49 were male (49%). 59 patients had a preoperative diagnosis of CTA and 41 of OA, as documented in the operative reports.
CT classification
A clinical image viewing software capable of 3D reconstruction (IDS7 Workstation Version 22.2; Sectra) was used to classify CT images according to the previously described 3D classification for DAS [15]. First, the scapular plane was reconstructed in 3D using two-dimensional orthogonal planes (axial, coronal, and sagittal): The glenoid center, trigonum and inferior angle of the scapula were aligned in one plane (Fig. 1). Following this, anteroposterior alignment was classified as posterior, centered or anterior; superoinferior alignment was classified as superior, centered or inferior and combined with an erosion grade (1–3) (Fig. 2) [15]. Subluxation of the humeral head center from the scapular axis (line passing from trigonum scapulae through glenoid center) was assessed relative to the radius of the humeral head (Fig. 3):
Centered alignment was defined as: Between 20% posterior and 5% anterior subluxation in the anteroposterior direction; between 5% inferior and 20% superior subluxation in the superoinferior direction, as previously described [15]. Additionally, alignment could be described as extra-posterior if posterior subluxation was > 60% of the humeral head radius (> 80% of the diameter), or extra-superior if static acetabularization was present [15]. Where alignment seemed to be obvious, observers were not required to perform measurements. In borderline cases the scapula axis, humeral circumference with center point, radius and subluxation of the center of the humeral head from the scapula axis was determined to quantify the classification (Fig. 4).
After a training seminar of 2 h, four orthopedic residents with experience in shoulder surgery (none of which were involved in measurements for the primary study) independently classified all 100 CT scans. An instructional handout for the 3D reconstruction and classification method, including a schematic representation of the 3D classification (Fig. 2) was provided. Four weeks after the first analysis the same four observers classified the 100 CT scans for a second time. Observers were blinded to the results of their previous attempt. The time needed to apply the new classification was measured for each observer. To simulate time available in clinical practice, observers were instructed to aim to take < 5 min per CT.
The humeroscapular alignment of all 100 shoulder CTs had previously been precisely measured and classified for the primary study by BDK and MH, with almost perfect interobserver agreement for alignment measurements [15]. Erosion was classified in consensus between the two observers. These existing classification values (shown in Table 2) were taken as the gold-standard against which the present values of the Observers were compared for validation.
Statistical analysis
Statistical analysis was performed using SPSS Version 29.0 (IBM-SPSS, New York, USA) software. Classification values were recoded and entered as string variables with three ordinal categories. Intraobserver reliability, interobserver reliability and validity (compared to existing gold-standard values) were calculated using Cohen’s weighted κ with linear weighting. Mean averages of comparisons of both attempts from each observer (four comparisons for each individual intraobserver comparison) were calculated for all six individual interobserver comparisons between the four observers. A 95% confidence interval (CI) was determined for each value. Cohen’s categorization of agreement (≤ 0 indicating no agreement, 0.01–0.20 none to slight, 0.21–0.40 fair, 0.41– 0.60 moderate, 0.61–0.80 substantial and 0.81–1.00 almost perfect agreement) was taken for interpretation of κ values [21]. Tests were two-sided, with a significance level of 0.05.
Results
The intraobserver reliability of the 3D classification showed substantial agreement with a Cohen’s weighted κ of 0.71 (CI 0.63–0.79).
Interobserver agreement was moderate overall, ranging from fair to substantial (Table 3). When assessing agreement for biplanar alignment only (disregarding erosion grade), interobserver agreement improved in each comparison, ranging from moderate to substantial, although differences were not statistically significant (Table 3). Interobserver agreement did not change substantially when the additional descriptors of extra-posterior and extra-superior were included (κ = 0.44 [CI 0.33–0.55]).
The validity analysis, comparing the observers’ classifications to the gold-standard values, showed moderate agreement (Table 4). As observed in the interobserver analysis, when the additional descriptors of extra-posterior and extra-superior were included the agreement for validity did not change substantially (κ = 0.45 [CI 0.33–0.56]). When analyzing the quantifiable aspect (biplanar alignment) only, κ increased to 0.53 (CI 0.41–0.65).
The mean average for time taken for classification was 2 min and 47 s (range 45 s to 4 min 1 s).
Key findings
The most important finding of this study was that interobserver agreement (reproducibility) for the 3D classification for DAS was moderate (κ = 0.46). Despite being more comprehensive than these, agreement for the 3D classification is at least comparable to that reported for previous monoplanar (two-dimensional) classifications for CTA or OA of the shoulder (Table 1) [16,17,18,19,20]. This is likely to be due to the quantifiable nature of the alignment aspect of this classification, which allows for clear cut-off values. Furthermore, observers were able to apply the classification in < 5 min, which reflects time available in clinical practice.
Interpretation
The most recently published study to compare intra- and interobserver agreement for the modified Walch classification (modifications by Bercik and Iannotti) is the most comparable to the present study, as it also employed a 3D image viewing software for analysis of CT images [1, 16, 22]. This study also found moderate inter- and moderate to substantial intraobserver agreement, comparable to the results of the present study [16]. The authors did not find an improvement in agreement when considering the alignment groups only (without erosion subgrouping). Although there was a trend to improved agreement when disregarding erosion grading, this was also not statistically significant in the present analysis.
An equally recent study investigating the original Walch classification found substantial intraobserver but only fair interobserver agreement for both CT and MRI images [18]. Shukla et al. found substantial intra- and moderate interobserver agreement of the modified Walch classification on both X-ray and CT [17]. They suggest the use of automated computer-based analysis of CT scans to improve the reproducibility of the modified Walch classification further. Whilst the modifications to the Walch classification have improved interobserver agreement somewhat, it remains a two-dimensional classification with subjective intergroup cut-offs [1, 22]. The presently investigated 3D classification is quantifiable with precise cut-off values and may therefore be better suited to such an automated computer-based software algorithm. Being 3D, it is more aligned with current developments in imaging software, preoperative planning and instrumentation than previous two-dimensional classification systems [23,24,25]. Modern 3D planning software already comprise algorithms which calculate the anteroposterior subluxation and these merely need to be modified to include superoinferior alignment [24, 26].
Established classification systems for CTA are all based on AP X-ray images and show variable levels of intra- and interobserver agreement [19, 20]. Whilst the Visotsky-Seebauer and Hamada classifications showed substantial to almost perfect intraobserver agreement with moderate interobserver agreement, the Favard classification was found to have moderate intra- and none to slight interobserver agreement [19, 20]. Despite its higher complexity, the presently investigated 3D classification shows comparable intra- and interobserver agreement to those established CTA classifications with better agreement (Visotsky-Seebauer and Hamada). These seminal classifications helped lead to a greater understanding of the disease process, but are now based on outdated technology. To further differentialize diagnosis, treatment and outcomes for DAS for clinical application and in research, use of the 3D classification should be considered.
A validation analysis is new to the present study of the 3D classification for DAS. This can be performed as the alignment of the classification is quantifiable and, therefore, correct classification is possible. The erosion subgrouping aspect is, as in the previous classifications subject to interpretation. Statistically significant moderate agreement was found in this validity analysis, rendering it valid for the classification of DAS. Validity agreement was moderate both for alignment alone and for alignment with erosion grade. As agreement for alignment was only moderate when detailed measurements are not performed ubiquitously (simulation of clinical practice), this has potential to be improved using automated software algorithms in the future.
Limitations
The validity analysis of this study depends on gold-standard values, of which only the alignment aspect is quantifiable. Despite being determined by two orthopedic surgeons in consensus the erosion aspect is, as in previous classifications, subject to some interpretation. The analysis, therefore, gives a breakdown of these aspects. As the 3D classification and method to reconstruct the scapular plane was new to the four observers, the training provided may not have been sufficient to get the best results possible. However, this reflects clinical practice as future users of this classification will not receive training beyond reading the published information available. As all observers in the present study were orthopedic residents, a difference in performance for varying levels of expertise could not be investigated.
Generalizability
As a wide variety of classification types were examined in a large sample of patients with DAS by four independent observers, the results of this study can be applied widely to classification of DAS. Availability of an image viewer with the ability of 3D planar reconstruction is, however, a prerequisite for reliable use of this classification.
Conclusion
The 3D classification for DAS is valid. Despite being more comprehensive, the classification shows intra- and interobserver agreement comparable to previously established classifications for DAS. Being quantifiable, this has potential for improvement with automated algorithm-based software analysis in the future. The classification can be applied in under 5 min and thus can be used in clinical practice.
Data availability
The datasets generated and analyzed during the current study are not publicly available due to institutional data protection agreements but are available from the corresponding author on reasonable request.
Change history
02 August 2023
A Correction to this paper has been published: https://doi.org/10.1007/s00402-023-04995-8
References
Bercik MJ, Kruse K 2nd, Yalizis M, Gauci MO, Chaoui J, Walch G (2016) A modification to the Walch classification of the glenoid in primary glenohumeral osteoarthritis using three-dimensional imaging. J Shoulder Elbow Surg 25(10):1601–1606. https://doi.org/10.1016/j.jse.2016.03.010
Walch G, Badet R, Boulahia A, Khoury A (1999) Morphologic study of the Glenoid in primary glenohumeral osteoarthritis. J Arthroplasty 14(6):756–760. https://doi.org/10.1016/S0883-5403(99)90232-2
Favard L, Lautmann S, Clement P (1999) Osteoarthritis with Massive Rotator Cuff-Tear: The Limitation of its Current Definitions. In: Walch Gilles, Boileau Pascal (eds) Shoulder Arthroplasty. Springer Berlin Heidelberg, Berlin, Heidelberg, pp 261–265
Hamada K, Fukuda H, Mikasa M, Kobayashi Y (1990) Roentgenographic findings in massive rotator cuff tears. A long-term observation. Clin Orthop Relat Res 254:92–96
Visotsky JL, Basamania C, Seebauer L, Rockwood CA, Jensen KL (2004) Cuff tear arthropathy: pathogenesis, classification, and algorithm for treatment. J Bone Joint Surg Am 86(2):35–40
Terrier A, Ston J, Larrea X, Farron A (2014) Measurements of three-dimensional glenoid erosion when planning the prosthetic replacement of osteoarthritic shoulders. Bone Joint J 96(4):513–518. https://doi.org/10.1302/0301-620x.96b4.32641
Otto A, Scheiderer B, Murphy M, Savino A, Mehl J, Kia C, Obopilwe Me E, DiVenere J, Cote MP, Denard PJ, Romeo AA, Mazzocca AD (2021) Biconcave glenoids show three differently orientated posterior erosion patterns. J Shoulder Elbow Surg. https://doi.org/10.1016/j.jse.2021.04.028
Knowles NK, Keener JD, Ferreira LM, Athwal GS (2015) Quantification of the position, orientation, and surface area of bone loss in type B2 glenoids. J Shoulder Elbow Surg 24(4):503–510. https://doi.org/10.1016/j.jse.2014.08.021
Siebert MJ, Chalian M, Sharifi A, Pezeshk P, Xi Y, Lawson P, Chhabra A (2020) Correction to: qualitative and quantitative analysis of glenoid bone stock and glenoid version: inter-reader analysis and correlation with rotator cuff tendinopathy and atrophy in patients with shoulder osteoarthritis. Skeletal Radiol. https://doi.org/10.1007/s00256-020-03386-z
Jacxsens M, Van Tongel A, Henninger HB, De Coninck B, Mueller AM, De Wilde L (2016) A three-dimensional comparative study on the scapulohumeral relationship in normal and osteoarthritic shoulders. J Shoulder Elbow Surg 25(10):1607–1615. https://doi.org/10.1016/j.jse.2016.02.035
Hartwell MJ, Harold RE, Sweeney PT, Seitz AL, Marra G, Saltzman MD (2021) Imbalance in axial-plane rotator cuff fatty infiltration in posteriorly worn glenoids in primary glenohumeral osteoarthritis: an mri-based study. Clin Orthop Relat Res 479(11):2471–2479. https://doi.org/10.1097/corr.0000000000001798
Domos P, Checchia CS, Walch G (2018) Walch B0 glenoid: pre-osteoarthritic posterior subluxation of the humeral head. J Shoulder Elbow Surg 27(1):181–188. https://doi.org/10.1016/j.jse.2017.08.014
Walker KE, Simcock XC, Jun BJ, Iannotti JP, Ricchetti ET (2018) Progression of glenoid morphology in glenohumeral osteoarthritis. JBJS 100(1):49–56. https://doi.org/10.2106/jbjs.17.00064
Mirzayan R, Donohoe S, Batech M, Suh BD, Acevedo DC, Singh A (2020) Is there a difference in the acromiohumeral distances measured on radiographic and magnetic resonance images of the same shoulder with a massive rotator cuff tear? J Shoulder Elbow Surg 29(6):1145–1151. https://doi.org/10.1016/j.jse.2019.10.020
Kleim BD, Hinz M, Geyer S, Scheiderer B, Imhoff AB, Siebenlist S (2022) A 3-dimensional classification for degenerative glenohumeral arthritis based on humeroscapular alignment. Orthop J Sports Med 10(8):23259671221110510. https://doi.org/10.1177/23259671221110512
Ricchetti ET, Khazzam MS, Denard PJ, Dines DM, Bradley Edwards T, Entezari V, Friedman RJ, Garrigues GE, Gillespie RJ, Grawe BM, Green A, Hatzidakis AM, Gabriel Horneff J, Hsu JE, Jawa A, Jin Y, Johnston PS, Jun BJ, Keener JD, Kelly JD 2nd, Kwon YW, Miniaci A, Morris BJ, Namdari S, Spencer EE, Strnad G, Williams GR Jr, Iannotti JP (2021) Reliability of the modified Walch classification for advanced glenohumeral osteoarthritis using 3-dimensional computed tomography analysis: a study of the ASES B2 Glenoid Multicenter Research Group. J Shoulder Elbow Surg 30(4):736–746. https://doi.org/10.1016/j.jse.2020.07.013
Shukla DR, McLaughlin RJ, Lee J, Cofield RH, Sperling JW, Sánchez-Sotelo J (2019) Intraobserver and interobserver reliability of the modified Walch classification using radiographs and computed tomography. J Shoulder Elbow Surg 28(4):625–630. https://doi.org/10.1016/j.jse.2018.09.021
Hopkins CM, Azar FM, Mulligan RP, Hollins AM, Smith RA, Throckmorton TW (2021) Computed tomography and magnetic resonance imaging are similarly reliable in the assessment of glenohumeral arthritis and glenoid version. Arch Bone Jt Surg 9(1):64–69. https://doi.org/10.22038/abjs.2020.38922.2035
Kappe T, Cakir B, Reichel H, Elsharkawi M (2011) Reliability of radiologic classification for cuff tear arthropathy. J Shoulder Elbow Surg 20(4):543–547. https://doi.org/10.1016/j.jse.2011.01.012
Iannotti JP, McCarron J, Raymond CJ, Ricchetti ET, Abboud JA, Brems JJ, Williams GR (2010) Agreement study of radiographic classification of rotator cuff tear arthropathy. J Shoulder Elbow Surg 19(8):1243–1249. https://doi.org/10.1016/j.jse.2010.02.010
Cohen J (1960) A coefficient of agreement for nominal scales. Educ Psychol Measur 20(1):37–46. https://doi.org/10.1177/001316446002000104
Iannotti JP, Jun B-J, Patterson TE, Ricchetti ET (2017) Quantitative measurement of osseous pathology in advanced glenohumeral osteoarthritis. JBJS 99(17):1460–1468. https://doi.org/10.2106/jbjs.16.00869
Cabarcas BC, Cvetanovich GL, Gowd AK, Liu JN, Manderle BJ, Verma NN (2019) Accuracy of patient-specific instrumentation in shoulder arthroplasty: a systematic review and meta-analysis. JSES Open Access 3(3):117–129. https://doi.org/10.1016/j.jses.2019.07.002
Olaiya OR, Nadeem I, Horner NS, Bedi A, Leroux T, Alolabi B, Khan M (2020) Templating in shoulder arthroplasty - A comparison of 2D CT to 3D CT planning software: a systematic review. Shoulder Elbow 12(5):303–314. https://doi.org/10.1177/1758573219888780
Harding GT, Bois AJ, Bouliane MJ (2021) A novel method for localization of the maximum glenoid bone defect during reverse shoulder arthroplasty. JSES Int 5(4):667–672. https://doi.org/10.1016/j.jseint.2021.04.001
Olszewski A, Ramme AJ, Maerz T, Freehill MT, Warner JJP, Bedi A (2020) Vault perforation after eccentric glenoid reaming for deformity correction in anatomic total shoulder arthroplasty. J Shoulder Elbow Surg 29(7):1450–1459. https://doi.org/10.1016/j.jse.2019.11.011
Funding
Open Access funding enabled and organized by Projekt DEAL. No funding was sought or received to undertake this study.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
BDK, SL, PK, HD, LF: None. SS is a consultant for Arthrex, Medi, KLSM and MH is on the scientific editorial board of OH and his immediate family hold no relevant shares and receive no royalties from any commercial entity related to the subject of this article. MH: None.
Ethical approval
Ethical approval was sought and granted by the ethics commission of the medical faculty of the Technical University of Munich (reference number: 301/20 S) in advance of study commencement.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this article was revised: The first names and last names of the authors have been swapped.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kleim, B.D., Lappen, S., Kadantsev, P. et al. Validation of a novel 3-dimensional classification for degenerative arthritis of the shoulder. Arch Orthop Trauma Surg 143, 6159–6166 (2023). https://doi.org/10.1007/s00402-023-04890-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00402-023-04890-2