Machine learning in knee arthroplasty: specific data are key—a systematic review

Hinterwimmer, Florian; Lazic, Igor; Suren, Christian; Hirschmann, Michael T.; Pohlig, Florian; Rueckert, Daniel; Burgkart, Rainer; von Eisenhart-Rothe, Rüdiger

doi:10.1007/s00167-021-06848-6

Machine learning in knee arthroplasty: specific data are key—a systematic review

KNEE
Open access
Published: 10 January 2022

Volume 30, pages 376–388, (2022)
Cite this article

Download PDF

You have full access to this open access article

Knee Surgery, Sports Traumatology, Arthroscopy Aims and scope

Machine learning in knee arthroplasty: specific data are key—a systematic review

Download PDF

Florian Hinterwimmer ORCID: orcid.org/0000-0002-0273-0643^1,2^na1,
Igor Lazic¹^na1,
Christian Suren¹,
Michael T. Hirschmann³,
Florian Pohlig¹,
Daniel Rueckert²,
Rainer Burgkart¹ &
…
Rüdiger von Eisenhart-Rothe¹

5154 Accesses
31 Citations
2 Altmetric
Explore all metrics

Abstract

Purpose

Artificial intelligence (AI) in healthcare is rapidly growing and offers novel options of data analysis. Machine learning (ML) represents a distinct application of AI, which is capable of generating predictions and has already been tested in different medical specialties with various approaches such as diagnostic applications, cost predictions or identification of risk factors. In orthopaedics, this technology has only recently been introduced and the literature on ML in knee arthroplasty is scarce. In this review, we aim to investigate which predictions are already feasible using ML models in knee arthroplasty to identify prerequisites for the effective use of this novel approach. For this reason, we conducted a systematic review of ML algorithms for outcome prediction in knee arthroplasty.

Methods

A comprehensive search of PubMed, Medline database and the Cochrane Library was conducted to find ML applications for knee arthroplasty. All relevant articles were systematically retrieved and evaluated by an orthopaedic surgeon and a data scientist on the basis of the PRISMA statement. The search strategy yielded 225 articles of which 19 were finally assessed as eligible. A modified Coleman Methodology Score (mCMS) was applied to account for a methodological evaluation.

Results

The studies presented in this review demonstrated fair to good results (AUC median 0.76/range 0.57–0.98), while heterogeneous prediction models were analysed: complications (6), costs (4), functional outcome (3), revision (2), postoperative satisfaction (2), surgical technique (1) and biomechanical properties (1) were investigated. The median mCMS was 65 (range 40–80) points.

Conclusion

The prediction of distinct outcomes with ML models applying specific data is already feasible; however, the prediction of more complex outcomes is still inaccurate. Registry data on knee arthroplasty have not been fully analysed yet so that specific parameters have not been sufficiently evaluated. The inclusion of specific input data as well as the collaboration of orthopaedic surgeons and data scientists are essential prerequisites to fully utilize the capacity of ML in knee arthroplasty. Future studies should investigate prospective data with specific and longitudinally recorded parameters.

Level of evidence

III.

Artificial intelligence in total and unicompartmental knee arthroplasty

Article Open access 22 July 2024

Prediction of complications and surgery duration in primary TKA with high accuracy using machine learning with arthroplasty-specific data

Article Open access 08 April 2022

Can minimal clinically important differences in patient reported outcome measures be predicted by machine learning in patients with total knee or hip arthroplasty? A systematic review

Article Open access 20 January 2022

Find the latest articles, discoveries, and news in related topics.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Artificial intelligence (AI) has already led to tremendous advancements in the aviation and automobile industries and is now rapidly gaining importance in healthcare [1, 3, 18]. Machine learning (ML) represents a distinct application of AI which emerged from studies on pattern recognition and learning theory. ML describes algorithms for automatic and incremental function optimization which can be used to make predictions by detecting non-linear relationships in large data sets [1]. In a nutshell, ML algorithms are able to recognize correlations in data sets which subsequently permits predictions based on these “learned” patterns. The aspect of “learning” is achieved by automated weighting of distinct parameters in the mathematical model [1, 19]. Ever more data are being compiled digitally in healthcare while processing capacities are increasing rapidly so that various ML algorithms have already been tested in ophthalmology, dermatology, radiology, and cardiology [3, 5, 6, 18]. Especially in medical image analysis, ML algorithms were developed and validated which are capable to outperform medical specialists [17]. In orthopaedics, the number of published studies on machine learning has increased tenfold since 2010 [1]. ML has the potential to become a powerful tool for patient-specific decision making regarding surgical indications and predicting outcomes in orthopaedics. Although the application of ML is in a preliminary phase in orthopaedics, several studies report well-performing ML models for gait classification, fracture detection, exoprosthesis control, osteoarthritis detection, spine pathology detection and bone age assessment [19]. Extensive data are necessary for well-performing ML models. In this context, knee arthroplasty is highly suitable for ML analysis as multiple sources are available: national and international registries including follow-up data, patient characteristics, and patient-reported outcome measures after knee surgery from the last 40 years. At the same time, recently introduced enabling technologies such as navigated and robotic surgery offer novel, patient-specific data sets, and a vast amount of imaging data are available in digital form. Furthermore, various other clinical data may be suitable for ML models. Recently, an ML approach using registry and healthcare data sets to provide an adjusted payment model was demonstrated [23]. Overall, however, literature on ML in knee arthroplasty is heterogeneous and scarce.

ML comprises different theoretical approaches and represents a tool that must first be adapted for specific tasks in a given field such as knee arthroplasty. However, no standard approach for implementing ML algorithms in orthopaedics has been described nor established yet so that the question arises to which extent knee arthroplasty currently benefits from ML and which approaches might be promising. We hypothesized that successful ML algorithms in knee arthroplasty depend on arthroplasty-specific data and their reasonable use. Therefore, the collaboration between a data scientist and an orthopaedic surgeon is of utmost importance. Hence, we aim (1) to investigate which predictions are already feasible using ML models in knee arthroplasty and (2) to describe prerequisites for the effective use of this novel approach. For this reason, we conducted a systematic review of ML algorithms for outcome prediction in knee arthroplasty.

Materials and methods

A systematic review of the literature was performed to identify machine learning algorithms in knee arthroplasty on the basis of the PRISMA statement. Studies meeting the following criteria were included in this review:

Described methodology of a machine learning algorithm for data analysis in health or economic-related applications of knee arthroplasty.
At least one predicted outcome variable by a supervised machine learning algorithm using tabular data.
Written in English.

In March 2021, a literature search through PubMed, Medline database and Cochrane Library was conducted. For the systematic search, the following search terms were used: “total knee arthroplasty” AND (artificial intelligence OR machine learning), “TKA” AND (artificial intelligence OR machine learning) from 1990 to 2021. The studies were screened and evaluated by an orthopaedic surgeon (I.L.) and a data scientist (F.H.) at our institution using the aforementioned selection criteria. The results were summarized and duplicates were discarded. The selection procedure is presented in Fig. 1. All articles were initially screened for relevance by title and abstract to assess the inclusion criteria. The two authors independently performed a careful reading of the studies and extracted the data. The following information was extracted from each article: author, year of publication, study design, follow-up, number of patients/cases, ML algorithm, metric, data screening, fine-tuning, mathematical and medical interpretation. For quality assessment, the Coleman Methodology criteria, which assess methodology using ten different aspects, were modified for the systematic review of machine learning algorithms in knee arthroplasty (Table 1). A score of 100 indicates that the study largely avoids chance, bias and other confounding factors. Hence, the modified Coleman Methodology score (mCMS) can be defined as excellent (85–100 points), good (70–84 points), fair (50–69 points), and poor (< 50 points). Both the orthopaedic surgeon and the data scientist scored the included studies giving a total mCMS between 0 and 100. In the event of divergent results, the data were double-checked and a consensus was reached. The level of evidence is level III.

Table 1 Modified Coleman methodology score

Full size table

Statistical analysis

Continuous data were reported as median values with the respective range. All analyses were performed using the statistical software IBM SPSS Statistics (version 22.0, Armonk, NY, IBM Corp.).

Results

Selection and methodical characteristics

The initial search resulted in 255 references and 19 articles met the inclusion criteria (Fig. 1). Articles from the years 2018 to 2021 were included. All 19 included studies addressed total knee arthroplasty (TKA). Fifteen articles were retrospective study designs, one study evaluated retrospective and prospective data and one further study analysed prospective data only. Two studies analysed an ML approach for outcome prediction using an experimental data set. The data set volume consisting of patients or cases ranged from 24 to 1,049,160 with a median of 86. The median follow-up time was 4.0 (1–18) years. The data sources were indicated in all studies. 11 of 19 studies derived their data exclusively from their local or regional healthcare management system. Three of 19 studies obtained their data from two administrative databases (National Inpatient Sample administrative database, New York Statewide Planning and Research Cooperative System administrative database). One study complemented their local database with data from a national registry (Korean Society of Nephrology registry). One study derived all their data from a national registry (Danish Knee Arthroplasty Registry). Another study evaluated data generated by a mobile application. All studies specified a data screening. The studies included a median of 13.5 (8;52) input parameters in their analysis. Fifteen different ML algorithms were described and four studies applied multiple approaches with different algorithms. Random Forest (RF) was applied in five studies, Gradient Boosting in three studies, LASSO in three studies, and Artificial Neural Network (ANN) in two studies, respectively. All other algorithms were applied once. 18 studies presented an outcome metric. With 16 of 19 studies, the area under the curve (AUC) was reported most frequently. 13 of these 16 studies reported AUCs with poor to fair results (< 0.8). In total, seven studies indicated a task-specific fine-tuning of the algorithm, including methodology such as data pre-processing, loss weighting or hyperparameter search. Three studies demonstrated an interpretation of metrics and outcomes from a mathematical as well as a medical point of view. The mCMS was calculated for each of the studies reviewed. The median mCMS of all studies was 65 points, ranging from 40 to 80 points. There was no significant relationship between methodology scores and metrics. However, only 6 out of 19 studies had a good mCMS (> 70 points). Of these six studies, five studies yielded fair to good results [AUC > 0.7/mean squared error (MSE) < 0.3], with the exception of one study with an AUC of 0.56–0.6. Methodological deficiencies were the lack of prospective studies, missing necessary details to repeat the ML model, no fine-tuning of the ML model and modest elaborations of the metrics in the mathematical and medical context. All further results are summarized in Tables 2, 3 and 4.

Table 2 Outcome variables and characteristics of the included studies

Full size table

Table 3 Description of machine learning approaches of the included studies

Full size table

Table 4 Input variables of the included studies

Full size table

Prediction of complications

Six studies evaluated ML algorithms predicting complications after the implantation of TKA [9, 12, 14, 16, 20, 23]. Three studies focused on the prediction of the length of stay [16, 20, 23]. Of these three studies, Gradient Boosting, Naive Bayes and an ANN were applied, resulting in a mean AUC of 0.78 (0.74; 0.83). They evaluated 8, 13 and 14 input variables, respectively. A single study evaluated a Gradient Boosting with 18 input variables for the prediction of end-stage renal disease after TKA, resulting in an AUC of 0.89 [14]. Another study analysed a Gradient Boosting with eight input variables for the prediction of postoperative blood transfusions after TKA, resulting in an AUC of 0.88 [9]. A further study evaluated a Stochastic Gradient Boosting with 39 input variables for the prediction of postoperative opioid prescriptions after TKA, resulting in an AUC of 0.76 [12].

Prediction of costs

Four studies evaluated ML algorithms predicting the costs related to the implantation of TKA [8, 10, 20, 23]. Three studies evaluated the inpatient costs using MLP, DenseNet, Naive Bayes and an ANN with 13, 8 and 11 input variables, respectively. This resulted in a mean AUC of 0.79 (0.74; 0.81) [11, 20, 23]. Another study evaluated a Logic Forest algorithm with 11 input variables for the prediction of the most expensive 5% of health care users in the year following elective surgery (super-utiliser) [8]. TKA comprised 50% of 1,049,160 analysed surgeries. No metric was specified.

Prediction of functional outcome

Three studies evaluated ML algorithms predicting the functional outcome after the implantation of TKA [7, 13, 21]. One study prospectively evaluated a logistic regression and least absolute shrinkage and selection operator (LASSO) with 28 input variables for the prediction of the Knee Injury and Osteoarthritis Outcome Score (KOOS) one year after TKA, resulting in an AUC of 0.71 to 0.76 [7]. Another study analysed a decision tree with eight preoperative input variables provided by a gait sensor for the prediction of gait parameters after TKA, yielding an accuracy of 0.89 [13]. A further study evaluated four different ML approaches (XGBoost, RF, LASSO and SuperLearner) with 25 input variables for the prediction of severe walking limitation after TKA. XGBoost yielded the most promising results with an AUC of 0.7 [21].

Prediction of revisions

Two studies evaluated ML algorithms predicting revisions after the implantation of TKA [2, 25]. The first study evaluated four different ML approaches (LASSO, RF, Gradient Boosting, ANN) with 26 input variables for the prediction of revisions within two years after primary TKA, resulting in an AUC of 0.56–0.6 [2]. The second study analysed an RF with 52 input variables for the prediction of revision after irrigation and debridement for PJI in TKA, yielding an AUC of 0.74 [25].

Prediction of postoperative satisfaction

Two studies evaluated ML algorithms predicting the postoperative satisfaction after the implantation of TKA [4, 15]. The first study evaluated a TreeNet with 15 input variables for the prediction of postoperative satisfaction after TKA with minimum 1-year follow-up using a 5-point Likert scale. This approach resulted in an AUC of 0.81 [4]. The second study analysed an RF with 15 input variables for the prediction of postoperative satisfaction 2 years after TKA using a binary outcome. This approach yielded an AUC of 0.77 [15].

Prediction of optimal surgical technique

A single study evaluated the numerical data created in the surgical process to assess balance and alignment during TKA [26]. The authors suggested a decision process for an ideally balanced TKA. Three different ML approaches (RF, Support Vector Machine, ANN) were applied and resulted in AUCs of 0.75–0.81.

Discussion

The most important finding of the present review is that outcome prediction using ML models applying arthroplasty-specific data has already been successfully performed. Although ML applications for knee arthroplasty only gained particular popularity in the past years with all articles included in this review being from 2018 or more recent, several studies already demonstrated the high potential of ML. Exact study questions with well-described complications like renal failure or postoperative blood transfusions can already be particularly well-predicted [9, 14]. However, more complex and general predictions like revisions, which can have a variety of different causes, are more difficult to estimate [2]. Interestingly, registry data on knee arthroplasty were only scantly evaluated for ML analysis. In data science theory, the sheer quantity of patients and input parameters is of crucial importance. Registries include vast information with patient-related data and potentially specific patterns that are suitable for ML analyses. However, most studies (13/19) relied on local databases and only six studies applied registry data—including only two different medical registries. Multiple national registries on knee arthroplasty are being administered that contain relevant information from the last decades on this topic. Future studies for ML in knee arthroplasty should definitely investigate the amounts of data within knee arthroplasty registries.

A further finding of this review is that we could not demonstrate a correlation between the amount of input parameters or patients and the outcome metrics. From a data science perspective, the tabular form of clinical information is not well suited for ML analysis as it requires a tremendous amount of patients and various parameters to generate complex data sets. Six studies included more than 100,000 patients; however, ML is proficient to process substantially larger amounts of data. Image data contain a higher information density and is, therefore, more commonly applied to ML applications. However, the low data volume is only in part an explanation for poor outcomes as several studies yielded very accurate predictions, especially in confined, local data sets.

While the quantity of data remains critical to allow for repeating patterns, the data quality is likewise of high importance for ML algorithms: ML can only reveal patterns that can actually be derived from the data. While Karnuta et al. fed their algorithm with tabular data from 295,605 total knee arthroplasty and total hip arthroplasty patients, Rexwinkle et al. retrieved tissue samples from only 6 osteoarthritis patients in order to apply biomarker analysis as well as biomechanical testing [10, 24]. The latter study was based on much fewer cases, still the complexity of the study and specificity of the data allowed for ML application with promising results. We therefore assume that the inclusion of specific parameters is of paramount importance, especially if tabular data are applied for ML analysis.

Just as specific data are relevant, a multitude of irrelevant parameters may negatively impact the performance of ML models. Although an increased data width with the use of all accessible parameters may be beneficial in theory, such randomness may rather hinder the full potential of the ML algorithm within the limited scope of tabular data used so far for ML in knee arthroplasty. Especially in studies with a low proportion of subject-specific parameters, the input of parameters, which were chosen simply because they were available, may be overrated and confounding. Hence, the selection of data for ML applications is anything but trivial. In this review, only four studies included subject-specific parameters such as the administration of tranexamic acid for the prediction of postoperative blood transfusion after TKA. Therefore, further studies to investigate arthroplasty-specific parameters are encouraged.

Identifying and applying such specific parameters in knee arthroplasty is difficult, though. Currently, the parameter selection of ML approaches in orthopaedics is based on (1) known risk factors and clinical experience and (2)—to a significant part—on the accessibility of parameters in existing databases. Therefore, most studies apply administrative data including basic patient demographics. However, the data architectures of local healthcare management systems and registries were not constructed for ML evaluations. These data sources do not necessarily contain fully detailed and prospectively linked patient data, which, however, would be essential for predictions. In addition, most studies were retrospective so that the data architecture for the ML analysis had to be constructed post hoc. Hence, data screening and processing are inevitable if existing data sources should be utilized.

This retrospective approach requires the substantive discussion of the parameters to assure inclusion of relevant and exclusion misleading information, as well as the mere technical evaluation of the data with regard to incorrect inputs and missing data points. Further steps for enhancing the data set, such as data cleaning or completing a data set are recommended from a data science perspective. This data processing requires extensive human assessment and the coherent thinking of a medical professional and data scientist to identify relevant and applicable information. It has to be performed mostly by hand, which makes it time-consuming and futile. The work related to data screening and processing is not to be neglected: it dramatically slows down the potential progress that can be made using ML in knee arthroplasty. To avoid such an extensive data processing, we assume that prospective databases with an appropriate data architecture for ML analysis have to be established in the long term. Currently, we are unaware of a comprehensive and completed data integration of knee joint-specific data for AI applications. Besides the already discussed importance of the quality and amount of input data, the ML model itself has a significant impact on the outcome prediction. ML does not describe a single specific algorithm, but rather contains a variety of approaches which have to be modified to the addressed issue and data set. A wide range of algorithms was used in the studies discussed in this review. The most popular algorithm was an RF, which is closely related to decision trees and XGBoost. This may be attributed to the considerable easy implementation, application and stable results over different kinds of data sets. However, more sophisticated algorithms have been developed exploiting the potential of today’s data as well as hardware even more. With a considerable amount of data, even deep learning algorithms can be feasible. Ramkumar et al. demonstrated how to use an ANN for, e.g. length of stay prediction of 175,042 TKA cases using 15 preoperative variables [22]. To achieve the best results, the focus of most studies relied on testing various machine learning algorithms and picking the one with the highest metric value as final algorithm. However, an exclusive measurement of performance using only one metric can be misleading so that several metrics depending on the task at hand need to be examined. Metrics which weigh false-positive and false-negative predictions, such as sensitivity, specificity or Dice Score, are often more meaningful. Furthermore, prediction models based on ML algorithms are not explanatory models [7]. Therefore, the outcome metrics need to be elaborated to understand its use in the medical context. In this review, 16 of 19 studies did not extensively discuss the results in the mathematical or in the medical setting. For future investigations, we consider the cooperative analysis by a medical professional and a data scientist of ML models in knee arthroplasty as mandatory.

Only few studies managed to increase the performance of their ML models through regularization or random search of hyperparameters in order to obtain more meaningful results [2]. For deep learning algorithms, multiple techniques, i.e., regularization and data augmentation, loss weighting, hyperparameter search or data simulation, are commonly applied to boost performance. In machine learning, these techniques are used not as frequently while still being possible most of the time. A hyperparameter search for instance contains significant potential and is easily applicable with today’s hardware capacity. Another underrated example for performance increase are methods to tackle class imbalance such as loss weighting. While a hyperparameter search was applied once [2], none of the reviewed studies indicated the use of any kind of loss weighting. From data science perspective, numerous modifications of the ML models in knee arthroplasty have not been implemented and tested yet. Especially for limited data sets, fine-tuning of ML models is crucial to avoid overfitting. Figure 2 depicts an overview of the development of an ML model in knee joint surgery based on this discussion. This review has several limitations. All studies on ML in knee arthroplasty were not presumed to be included. Numerous studies from the field of medical informatics examined big data that included data sets related to knee arthroplasty. However, we evaluated specific studies which focused on outcome prediction in knee arthroplasty from the medical perspective. This may have disregarded other studies using promising ML approaches in orthopaedics with other purposes. Especially imaging data are suitable for ML models, but those are mostly utilized in ML models for diagnostic applications. We deliberately did not evaluate ML algorithms using imaging data for that reason.

Conclusion

In conclusion, the prediction of distinct outcomes with ML models applying specific data is already feasible in knee arthroplasty. However, the prediction of more complex and general outcomes is still inaccurate. Currently, the benefits for the clinical application may be small, but—from a data science perspective—the possibilities are not yet being fully utilized. To date, specific parameters of knee arthroplasty have not been adequately evaluated and large data volumes gathered in registries have not been fully considered nor analysed. The inclusion of specific input data as well as the collaborative development, modification and evaluation of ML models and its data by an orthopaedic surgeon and data scientist are essential prerequisites to fully utilize the capacity of ML in knee arthroplasty. Predictions suitable for clinical application must be built on solid data structures. We consider that a data architecture specifically for ML with prospective data is necessary to allow for more accurate and complex predictions. Future studies should, therefore, necessarily examine large-scale data with specific and longitudinally recorded parameters.

References

Cabitza F, Locoro A, Banfi G (2018) Machine learning in orthopedics: a literature review. Front Bioeng Biotechnol 6
El-Galaly A, Grazal C, Kappel A, Nielsen PT, Jensen SL, Forsberg JA (2020) Can machine-learning algorithms predict early revision TKA in the Danish knee arthroplasty registry? Clin Orthop Relat Res 478:2088–2101
Article PubMed PubMed Central Google Scholar
Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM et al (2017) Correction: corrigendum: dermatologist-level classification of skin cancer with deep neural networks. Nature 546:686–686
Article CAS PubMed Google Scholar
Farooq H, Deckard ER, Arnold NR, Meneghini RM (2021) Machine learning algorithms identify optimal sagittal component position in total knee arthroplasty. J Arthroplasty. https://doi.org/10.1016/j.arth.2021.02.063
Article PubMed Google Scholar
Gulshan V, Peng L, Coram M, Stumpe MC, Wu D, Narayanaswamy A et al (2016) Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316:2402–2410
Article PubMed Google Scholar
Hannun AY, Rajpurkar P, Haghpanahi M, Tison GH, Bourn C, Turakhia MP et al (2019) Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network. Nat Med 25:65–69
Article CAS PubMed PubMed Central Google Scholar
Harris AHS, Kuo AC, Bowe TR, Manfredi L, Lalani NF, Giori NJ (2021) Can machine learning methods produce accurate and easy-to-use preoperative prediction models of one-year improvements in pain and functioning after knee arthroplasty? J Arthroplasty 36:112-117.e116
Article PubMed Google Scholar
Hyer JM, Ejaz A, Tsilimigras DI, Paredes AZ, Mehta R, Pawlik TM (2019) Novel machine learning approach to identify preoperative risk factors associated with super-utilization of medicare expenditure following surgery. JAMA Surg 154:1014–1021
Article PubMed PubMed Central Google Scholar
Jo C, Ko S, Shin WC, Han HS, Lee MC, Ko T et al (2020) Transfusion after total knee arthroplasty can be predicted using the machine learning algorithm. Knee Surg Sports Traumatol Arthrosc 28:1757–1764
Article PubMed Google Scholar
Karnuta JM, Luu BC, Roth AL, Haeberle HS, Chen AF, Iorio R et al (2021) Artificial intelligence to identify arthroplasty implants from radiographs of the knee. J Arthroplasty 36:935–940
Article PubMed Google Scholar
Karnuta JM, Navarro SM, Haeberle HS, Helm JM, Kamath AF, Schaffer JL et al (2019) Predicting inpatient payments prior to lower extremity arthroplasty using deep learning: which model architecture is best? J Arthroplasty 34:2235-2241.e2231
Article PubMed Google Scholar
Katakam A, Karhade AV, Schwab JH, Chen AF, Bedair HS (2020) Development and validation of machine learning algorithms for postoperative opioid prescriptions after TKA. J Orthop 22:95–99
Article PubMed PubMed Central Google Scholar
Kluge F, Hannink J, Pasluosta C, Klucken J, Gaßner H, Gelse K et al (2018) Pre-operative sensor-based gait parameters predict functional outcome after total knee arthroplasty. Gait Posture 66:194–200
Article PubMed Google Scholar
Ko S, Jo C, Chang CB, Lee YS, Moon YW, Youm JW et al (2020) A web-based machine-learning algorithm predicting postoperative acute kidney injury after total knee arthroplasty. Knee Surg Sports Traumatol Arthrosc. https://doi.org/10.1007/s00167-020-06258-0
Article PubMed Google Scholar
Kunze KN, Polce EM, Sadauskas AJ, Levine BR (2020) Development of machine learning algorithms to predict patient dissatisfaction after primary total knee arthroplasty. J Arthroplasty 35:3117–3122
Article PubMed Google Scholar
Li H, Jiao J, Zhang S, Tang H, Qu X, Yue B (2020) Construction and comparison of predictive models for length of stay after total knee arthroplasty: regression model and machine learning analysis based on 1,826 cases in a single Singapore center. J Knee Surg. https://doi.org/10.1055/s-0040-1710573
Article PubMed Google Scholar
Litjens G, Kooi T, Bejnordi BE, Setio AAA, Ciompi F, Ghafoorian M et al (2017) A survey on deep learning in medical image analysis. Med Image Anal 42:60–88
Article PubMed Google Scholar
Martín Noguerol T, Paulano-Godino F, Martín-Valdivia MT, Menias CO, Luna A (2019) Strengths, weaknesses, opportunities, and threats analysis of artificial intelligence and machine learning applications in radiology. J Am Coll Rad 16:1239–1247
Article Google Scholar
Myers TG, Ramkumar PN, Ricciardi BF, Urish KL, Kipper J, Ketonis C (2020) Artificial intelligence and orthopaedics: an introduction for clinicians. J Bone Jt Surg 102:830–840
Article Google Scholar
Navarro SM, Wang EY, Haeberle HS, Mont MA, Krebs VE, Patterson BM et al (2018) Machine learning and primary total knee arthroplasty: patient forecasting for a patient-specific payment model. J Arthroplasty 33:3617–3623
Article PubMed Google Scholar
Pua YH, Kang H, Thumboo J, Clark RA, Chew ES, Poon CL et al (2020) Machine learning methods are comparable to logistic regression techniques in predicting severe walking limitation following total knee arthroplasty. Knee Surg Sports Traumatol Arthrosc 28:3207–3216
Article PubMed Google Scholar
Ramkumar PN, Haeberle HS, Ramanathan D, Cantrell WA, Navarro SM, Mont MA et al (2019) Remote patient monitoring using mobile health for total knee arthroplasty: validation of a wearable and machine learning-based surveillance platform. J Arthroplasty 34:2253–2259
Article PubMed Google Scholar
Ramkumar PN, Karnuta JM, Navarro SM, Haeberle HS, Scuderi GR, Mont MA et al (2019) Deep learning preoperatively predicts value metrics for primary total knee arthroplasty: development and validation of an artificial neural network model. J Arthroplasty 34:2220-2227.e2221
Article PubMed Google Scholar
Rexwinkle JT, Werner NC, Stoker AM, Salim M, Pfeiffer FM (2018) Investigating the relationship between proteomic, compositional, and histologic biomarkers and cartilage biomechanics using artificial neural networks. J Biomech 80:136–143
Article PubMed Google Scholar
Shohat N, Goswami K, Tan TL, Yayac M, Soriano A, Sousa R et al (2020) 2020 Frank Stinchfield Award: Identifying who will fail following irrigation and debridement for prosthetic joint infection. Bone Jt J 102:11–19
Article Google Scholar
Verstraete MA, Moore RE, Roche M, Conditt MA (2020) The application of machine learning to balance a total knee arthroplasty. Bone Jt Open 1:236–244
Article PubMed PubMed Central Google Scholar

Download references

Funding

Open Access funding enabled and organized by Projekt DEAL. Not applicable.

Author information

Florian Hinterwimmer and Igor Lazic share joint first authorship.

Authors and Affiliations

Department of Orthopaedics and Sports Orthopaedics, Klinikum Rechts Der Isar, School of Medicine, Technical University of Munich, Ismaninger Str. 22, 81675, München, Germany
Florian Hinterwimmer, Igor Lazic, Christian Suren, Florian Pohlig, Rainer Burgkart & Rüdiger von Eisenhart-Rothe
Institute for AI and Informatics in Medicine, Technical University of Munich, Munich, Germany
Florian Hinterwimmer & Daniel Rueckert
Department of Orthopaedic Surgery and Traumatology-Liestal, Kantonsspital Baselland, Bruderholz, Laufen, Switzerland
Michael T. Hirschmann

Authors

Florian Hinterwimmer
View author publications
You can also search for this author in PubMed Google Scholar
Igor Lazic
View author publications
You can also search for this author in PubMed Google Scholar
Christian Suren
View author publications
You can also search for this author in PubMed Google Scholar
Michael T. Hirschmann
View author publications
You can also search for this author in PubMed Google Scholar
Florian Pohlig
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Rueckert
View author publications
You can also search for this author in PubMed Google Scholar
Rainer Burgkart
View author publications
You can also search for this author in PubMed Google Scholar
Rüdiger von Eisenhart-Rothe
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All of the authors as well as the responsible authorities have approved the contents of this paper and have agreed to the KSSTA policies. Each named author has substantially contributed to conducting the underlying research and drafting this manuscript.

Corresponding authors

Correspondence to Florian Hinterwimmer or Igor Lazic.

Ethics declarations

Conflict of interest

The authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest (such as honoraria; educational grants; participation in speakers’ bureaus; membership, employment, consultancies, stock ownership, or other equity interest; and expert testimony or patent-licencing arrangements), or non-financial interest (such as personal or professional relationships, affiliations, knowledge or beliefs) in the subject matter or materials discussed in this manuscript.

Ethics approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hinterwimmer, F., Lazic, I., Suren, C. et al. Machine learning in knee arthroplasty: specific data are key—a systematic review. Knee Surg Sports Traumatol Arthrosc 30, 376–388 (2022). https://doi.org/10.1007/s00167-021-06848-6

Download citation

Received: 16 August 2021
Accepted: 16 December 2021
Published: 10 January 2022
Issue Date: February 2022
DOI: https://doi.org/10.1007/s00167-021-06848-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Machine learning in knee arthroplasty: specific data are key—a systematic review

Abstract

Purpose

Methods

Results

Conclusion

Level of evidence

Similar content being viewed by others

Artificial intelligence in total and unicompartmental knee arthroplasty

Prediction of complications and surgery duration in primary TKA with high accuracy using machine learning with arthroplasty-specific data

Can minimal clinically important differences in patient reported outcome measures be predicted by machine learning in patients with total knee or hip arthroplasty? A systematic review

Explore related subjects

Introduction

Materials and methods

Statistical analysis

Results

Selection and methodical characteristics

Prediction of complications

Prediction of costs

Prediction of functional outcome

Prediction of revisions

Prediction of postoperative satisfaction

Prediction of optimal surgical technique

Discussion

Conclusion

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflict of interest

Ethics approval

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation