The promise of machine learning applications in solid organ transplantation

Gotlieb, Neta; Azhie, Amirhossein; Sharma, Divya; Spann, Ashley; Suo, Nan-Ji; Tran, Jason; Orchanian-Cheff, Ani; Wang, Bo; Goldenberg, Anna; Chassé, Michael; Cardinal, Heloise; Cohen, Joseph Paul; Lodi, Andrea; Dieude, Melanie; Bhat, Mamatha

doi:10.1038/s41746-022-00637-2

The promise of machine learning applications in solid organ transplantation

Review Article
Open access
Published: 11 July 2022

Volume 5, article number 89, (2022)
Cite this article

Download PDF

You have full access to this open access article

npj Digital Medicine

The promise of machine learning applications in solid organ transplantation

Download PDF

Neta Gotlieb ORCID: orcid.org/0000-0003-3827-8262^1,2,
Amirhossein Azhie ORCID: orcid.org/0000-0001-6393-9064¹,
Divya Sharma³,
Ashley Spann ORCID: orcid.org/0000-0003-1278-7712⁴,
Nan-Ji Suo⁵,
Jason Tran¹,
Ani Orchanian-Cheff ORCID: orcid.org/0000-0002-9943-2692⁶,
Bo Wang⁷,
Anna Goldenberg⁷,
Michael Chassé^8,9,
Heloise Cardinal^9,10,
Joseph Paul Cohen^9,11,12,
Andrea Lodi^9,13,
Melanie Dieude^9,10,14,15 &
…
Mamatha Bhat^1,9,16

8717 Accesses
29 Citations
19 Altmetric
1 Mention
Explore all metrics

Abstract

Solid-organ transplantation is a life-saving treatment for end-stage organ disease in highly selected patients. Alongside the tremendous progress in the last several decades, new challenges have emerged. The growing disparity between organ demand and supply requires optimal patient/donor selection and matching. Improvements in long-term graft and patient survival require data-driven diagnosis and management of post-transplant complications. The growing abundance of clinical, genetic, radiologic, and metabolic data in transplantation has led to increasing interest in applying machine-learning (ML) tools that can uncover hidden patterns in large datasets. ML algorithms have been applied in predictive modeling of waitlist mortality, donor–recipient matching, survival prediction, post-transplant complications diagnosis, and prediction, aiming to optimize immunosuppression and management. In this review, we provide insight into the various applications of ML in transplant medicine, why these were used to evaluate a specific clinical question, and the potential of ML to transform the care of transplant recipients. 36 articles were selected after a comprehensive search of the following databases: Ovid MEDLINE; Ovid MEDLINE Epub Ahead of Print and In-Process & Other Non-Indexed Citations; Ovid Embase; Cochrane Database of Systematic Reviews (Ovid); and Cochrane Central Register of Controlled Trials (Ovid). In summary, these studies showed that ML techniques hold great potential to improve the outcome of transplant recipients. Future work is required to improve the interpretability of these algorithms, ensure generalizability through larger-scale external validation, and establishment of infrastructure to permit clinical integration.

Application of machine learning in liver transplantation: a review

Article 12 January 2022

Personalized prediction of delayed graft function for recipients of deceased donor kidney transplants with machine learning

Article Open access 27 October 2020

Development and validation of a risk index to predict kidney graft survival: the kidney transplant risk index

Article Open access 21 June 2021

Introduction

There has been tremendous progress in the outcomes of solid-organ transplantation in recent decades. Nonetheless, there remain challenges at various levels of the transplant journey. Organ allocation is a major limiting factor as there is constant increased demand while the donor organ supply is limited¹. In addition, the increasing medical complexity of transplant candidates with their advanced age, metabolic risk factors, and cardiovascular comorbidities is associated with higher risks of morbidity and mortality on the waiting list and after transplant, resulting in higher risk of infections, malignancy, and medication-induced side effects. As a result, allocating organs to appropriate recipients who will benefit the most from transplantation with the lowest possible risk is a significant challenge^2,3,4,5.

The allocation algorithms must consider transplant outcomes that depend on a complex combination of factors including patient demographics, comorbidities, genetics, graft quality, and more. Although short-term outcomes have markedly improved secondary to better surgical techniques, optimization of immunosuppressive therapy and post-operative management, complications due to graft rejection and secondary to long-term use of immunosuppressive medications can result in significant morbidity and mortality⁶. To overcome these issues, protocols for optimizing immunosuppression based on biomarkers are needed, but cannot provide patient-level predictions^7,8,9.

Machine learning (ML) is a branch of Artificial Intelligence (AI) in which a computer algorithm learns from examples to generate reproducible predictions and classifications on previously unseen data^10,11. Machine learning can be (1) supervised, referring to manually mapping an observation’s characteristics to a known outcome; (2) unsupervised, referring to the discovery of innate patterns using unlabeled data; or (3) reinforcement learning, referring to the training of ML models in an interactive environment to make a sequence of decisions by employing trial and error through ongoing feedback¹¹. ML can analyze large, complex and heterogeneous datasets, yielding sophisticated outcomes and predictive models. ML techniques have been applied in different fields in medicine where large datasets with complex data points exist, resulting in the generation of important predictive models with the potential to ameliorate clinical practice.

Novel applications of ML techniques in transplant medicine have emerged and are constantly evolving. The prediction of post-transplant outcomes is extremely complex and involves a large amount of clinical, laboratory, genetic, immunologic, and metabolic data. Beyond waitlist prioritization and organ allocation, other areas of application in transplant medicine include a better identification of potential organ donors, prediction of overall survival, short- and long-term complications, and pharmacokinetic analyses¹².

In this review, we aim to provide an overview of recent advances, potential power, and limitations of ML applications in transplant medicine. Waitlist prioritization, donor–recipient matching, and post-transplant outcomes are the three main sections of this review. An overview of ML applications in solid-organ transplantation is depicted in Fig. 1.

**Fig. 1: Applications of ML in solid-organ transplantation.**

Methods

A comprehensive search strategy was initially developed for Medline (Ovid) using a combination of database-specific subject headings and text words for the main concepts of solid-organ transplantation, donation, and machine learning. The search strategy was then customized for each of the other databases. The following databases were searched on September 2020 and limited to years 2015–2020: Ovid MEDLINE; Ovid MEDLINE Epub Ahead of Print and In-Process & Other Non-Indexed Citations; Ovid Embase; Cochrane Database of Systematic Reviews (Ovid); and Cochrane Central Register of Controlled Trials (Ovid).

The queries retrieved 155 papers for initial review. The citations were reviewed manually by NG, NJS, and AS. In all, 36 papers were included in this review according to clinical significance and relevance to machine learning, transplantation, and donation. The relevant articles which were included in the review are presented in Table 1. The flowchart of this process is illustrated in Fig. 2.

Table 1 Machine-learning applications studies in adult solid-organ transplant recipients.

Full size table

**Fig. 2: Flowchart of search strategy and selection of studies for inclusion.**

Organ allocation and predictive modeling for waitlist mortality

Many existing organ allocation policies hinge on a few criteria based on recipient need for transplant and donor–recipient matching. An optimal allocation system should incorporate important factors affecting waitlist mortality by expansion of the data types considered. ML may be a suitable tool for this task by identifying variables of importance from both recipients and donors and be utilized for the assessment of complex nonlinear interactions. Random Survival Forest (RSF) approach is an Ensemble tree method for analysis of right-censored survival data resulting in better survival prediction and variable selection through bagging of classification trees¹³. For example, the RSF approach was applied on a dataset of 33,069 patients waiting for heart transplant, and nine laboratory and hemodynamic variables affecting waitlist mortality were identified¹⁴. Two of these variables (eGFR and serum albumin) are not currently considered in the United Network for Organ Sharing (UNOS) allocation system (i.e., 6-tiered heart allocation system), and RSF further identified nonlinear relationships between these variables. Furthermore, RSF showed that the importance of each variable correlated with their effect on other variables. For example, in patients with eGFR >40 mL/min/1.73 m², RSF found sex differences as a predictor of waitlist mortality. The advantage of RSF is the identification of predictive variables for waitlist mortality without prior knowledge of parametric relationships. It can manage various interactions and significant missingness unlike conventional statistical methods such as Cox proportional hazards models. However, the model was trained on the Scientific Registry of Transplant Recipients (SRTR) database which only collects certain variables at specific timepoints and has a certain degree of missingness.

Other ML studies have merged both donor and recipient characteristics to optimize post-transplant outcomes rather than waitlist mortality. Artificial Neural Network (ANN) models are used for nonlinear modeling of the input features through a collection of neurons that take input and, in conjunction with information from other nodes, develop output without programmed rules. An ANN-based model was developed to predict waitlist mortality, post-transplant survival, and to simulate the heart allocation process. The model was trained on both donor and recipient data and was able to predict waitlist and post-transplant mortality with significant accuracy (AUROC = 89% and AUROC = 66%, respectively)¹⁵. This study showed that transplant allocation using ANNs was able to utilize 124 more available hearts compared to the cox regression model. Accordingly, ANNs have been shown to be efficient in organ allocation where prognosis depends on the complex interaction between multiple variables pertaining to both donor and recipient.

In summary, ANNs and RSF models provide better accuracy and deal well with nonlinear interactions in the data while predicting waitlist mortality as compared to other conventional approaches.

Optimization of donor–recipient matching

Donor identification and matching

Potential organ donor identification relies entirely on timely identification of these patients and their referral to Organ Procurement Organizations (OPOs). More optimal identification of potential organ donors may be feasible using ML. A recent study demonstrated that both ANN and logistic regression models trained on 105 distinct laboratory test variables from 19,717 ICU patients resulted in similar performance with an area under the curve (AUC) of 0.950 (95% CI 0.923–0.974) and 0.947 (95% CI 0.9169–0.9730), respectively¹⁶. However, in this study, the ANNs accuracy was more consistent across different subgroups compared to the logistic regression model which obtained lower AUC for non-referred potential organ donors (AUC for ANN: 0.95 versus LR: 0.78). ANNs benefit from automatically learning from high-dimensional data and detecting complex nonlinear relationships between input variables and ultimate outcome of interest¹⁷. However, ANNs require large data to be trained on in order to estimate their predictive parameters accurately. On the other hand, logistic regression method benefits from a small number of hyperparameters in its modeling but is not suitable to capture nonlinear relationships in its conventional form, wherein it assumes linearity between the dependent variable and the independent variables¹⁸.

Donor selection is a challenging and multifactorial decision influenced by both donor and recipient factors as well as match considerations. Risk models have been used in the heart, kidney, and liver transplantation, to better assess the interactions between donor and recipient factors and their overall impact on post-transplant outcomes. Accordingly, ML and discrete optimization have been used in the context of paired kidney exchange¹⁹. The aim of the kidney exchange program is to obtain the maximum number of donor–recipient matches in a pool of incompatible pairs¹⁹. For this purpose, a stochastic-based ML algorithm (an algorithm that can make use of randomness during learning) called Ant Lion Optimization (ALO) using the bio-inspired technique was proposed, having the advantage of requiring relatively small computing power in consideration for the typical resources accessible by hospitals and was successful in generating possible pair matches. Its performance was on par with deterministic-based models such as integer programming and outperformed other stochastic-based models including Genetic algorithm¹⁹. This algorithm was also able to give higher weight to patients who had lower chances of getting matched by conventional model¹⁹.

ML tools have been used in the integration of specific donor characteristics with recipient ones to produce matches with maximal post-transplant survival. In liver transplantation, a study of 822 donor–recipient pairs from King’s College Hospital (London, UK) successfully validated an ANN model that originally had been developed using multicentric Spanish cohorts²⁰. This ANN model was trained to help clinicians with donor–recipient matching to achieve maximum graft survival at 3 and 12 months after liver transplant²¹. The ultimate outcomes of interest were graft survival and non-survival at three (AUC 0.94 for ANN) and twelve months (AUC 0.78 for ANN). Finally, ANN achieved 20% higher AUC compared to Model For End-Stage Liver Disease (MELD) score in predicting 3-month graft survival²¹. The number of days on waitlist, underlying liver disease, and MELD score were among most predictive variables from recipient and cause of death, cold ischemic time, hypertension and AST were among top-ranked features from donor side for predicting 3-month graft survival. Although the ANN model performed even better using data from the English center compared to the original Spanish cohorts, it is still necessary to optimize and fine-tune the model when new data are available from other transplant units. Also, using the same ML model in countries with different health care systems may not achieve the same significant performance due to the heterogeneity and different variables collected in various programs²¹. In another study, an ANN-based algorithm trained on data from seven Spanish hospitals and one hospital from the UK correctly predicted liver transplant outcomes for 73% of the population. The ANN algorithm was trained to predict the risk of graft failure within 15 days, between 15 and 90 days, and over 90 days after transplant based on the features from both the donors and the recipients. Compared to ANN (with 73% correct prediction), logistic regression, and SVM models showed correct prediction only for 50 and 66% of the population, respectively²². However, this ordinal classification of post-transplant outcome resulted in imbalanced subgroups with more than 85% of recipients surviving more than 90 days after LT. The authors solved this imbalance challenge by giving dynamic weights to the minority classes by using a cost-sensitive evolutionary ordinal artificial neural network and an ordinal oversampling technique. This ANN algorithm can serve as a decision-support system beside MELD score and other clinical risk scores for transplant hepatologists to make more informed decisions based on both donor and recipient characteristics.

Decision-support tools for organ donation

While there is a high demand versus supply for donor organs, some organs are still discarded in the process. Marginal grafts have unpredictable acceptance rates which vary between centers but are offered to all centers based on need and regardless of their capabilities to perform transplant for hard-to-place organs. Decision-support tools that identify deceased donor kidneys which may experience placement difficulties and streamline this process may increase donor organ utilization. To better assist in accept/decline decisions for patients needing adult kidney donors, Natural Language Processing (NLP) methods that tap into free-text data beside structured data from donor information were studied²³. Using this method, both known and new key clinical terms holding significant predictive value were discovered, producing a model with a C-statistic of 0.75 for accept or decline decisions which is comparable with the performance of traditional indices including Reduced Probability of Delay or Discard (r-PODD, C-statistics = 0.80) and Kidney Donor Profile Index (KDPI, C-statistics = 0.77). Intravenous drug use as well as some other keywords pertaining to cardiovascular disease such as “stent”, “CHF”, and “cholesterol” in the free text were found among predictive words for discarding donor organs using the NLP method. However, these variables are not documented in structured data from OPTN. Accordingly, a combination of structured (from OPTN database) and unstructured data (from free text) can improve the performance of ML models for this purpose. Another study of 75,350 deceased donors from OPTN database comparing linear statistical models with ML algorithms in prediction of organ yield from a deceased donor. Organ yield was defined as the number of organs transplanted per donor. This study showed that Bayesian Additive Regression Trees (BART) had the highest yield with the lowest Mean Absolute Error (P < 0.001) and highest resolution out of 13 models in predicting deceased donor organ yield²⁴. BART is a nonparametric Bayesian regression approach that creates a binary tree by recursively splitting the data on the predictor values using a statistical model consisting of a prior and a likelihood. Advantages for BART include easier pre-processing and visualization of data and capturing nonlinear relationships during prediction whereas, disadvantages include high model complexity and instability in the tree in case of small changes in data. Accordingly, these studies suggest how ML can facilitate the identification/availability of potential donors.

Strengths and weaknesses of machine-learning algorithms in pre-transplant settings

ML models hold great potential in helping clinicians with the prediction of waitlist mortality, organ allocation, donor–recipient matching, and donor organ assessment. The main strength of these models is their capacity to work efficiently with large datasets, and to find complex hidden relationships between donor and recipient characteristics, leading to better performance compared to conventional statistical algorithms such as logistic regression. However, the main limitation of ML models is their dependence on the quality of input data especially in large data registries which are susceptible to human error in data documentation. Moreover, heterogeneity and variations in collected data between different transplant centers require clinicians to fine-tune ML models using the variables from their local data registry. Moreover, since these algorithms work better with large datasets, they also require systems with high computing power for data analysis which may not be accessible in every clinical setting.

Post-transplant outcomes

Prediction of post-transplant survival

Optimal decision-making and management rely on prediction of patient survival on the waiting list and after transplant, aiming to increase the number of successful transplants and improve overall outcome. Several survival models before and after transplant have been built using deep learning techniques.

The International Heart Transplantation Survival Algorithm (IHTSA) model was an ANN-based model derived and tested from a pool of 27,705 adult patients from the UNOS registry, utilizing both recipient and donor variables. The model outperformed a conventional logistic regression-based model (i.e., Index for Mortality Prediction After Cardiac Transplantation (IMPACT)) in accurate prediction of 1-year mortality (AUC 0.654 vs. 0.608, P = 0.004) and long-term survival (C-index 0.627 vs 0.584)²⁵. The added capabilities of deep learning in capturing nonlinear and hidden patterns resulted in error reductions by 12% in prediction of short-term mortality and by 10% in long-term mortality when compared with the traditional models. This survival model consists of a flexible nonlinear generalization of the standard Cox proportional hazard model, integrating ensembles of ANNs with a prediction capability of more than 1 year. Although the IHTSA model includes both donor and recipient variables compared to the IMPACT model which only includes recipient variables, IHTSA still showed significantly better performance even after training using similar variables with the IMPACT model. This also confirms that ANN-based models benefit from identifying new patterns among the same input variables compared to conventional models²⁵.

Similarly, a decision tree ML algorithm including 53 donor, recipient, and donor–recipient compatibility features for heart transplant candidates resulted in improved prediction of survival across all assessed time intervals of post-transplant survival evaluation compared to risk-stratification score (RSS)²⁶. The decision tree algorithm predicted 3-year survival correctly for 14% more patients compared to RSS after holding specificity at 80%. This ML model overcame the challenge of heterogeneity in this patient population by identifying clusters with similar features among the whole patient population and finding the specific predictive variables for survival for each cluster²⁶. This suggested that ML-derived methods may be better able to adapt and outperform conventional models despite changes in clinical practice over time. However, a separate study on survival prediction using pre-transplant variables did not appreciate any improvement in 1-year post-transplant survival prediction when comparing standard statistical methods (C-statistic = 0.65) to a myriad of ML methodologies (C-statistic = 0.66 for ANN)²⁷, despite adjusting for policy changes in allocation over time. The handling of missing data within the UNOS database differed between the two above studies, suggesting that data quality and management could potentially play a significant role in the resultant predictability of ML models.

In kidney transplantation, the use of a combination of Random Forest (RF) classification and Cox regression with least absolute shrinkage and selection operator (LASSO) for variable selection on 73 donor and recipient characteristics resulted in a model that outperformed the standard Estimated Post-Transplant Survival (EPTS) model for 5-year survival prediction (concordance index 0.724 vs. 0.697)²⁸. They trained two separate models for recipients below and above 50 years old and finally obtained two different sets of predictive variables for these two cohorts²⁸. Accordingly, this study showed that developing separate ML models for different cohorts of patients may improve their accuracy even further. LASSO regression technique uses regularization to avoid overfitting but leads to arbitrary dropping of the predictors when those are highly correlated²⁹. RF technique on the other hand, combines decisions from multiple decision trees, making it highly flexible and accurate while estimating nonlinear relationships in the data³⁰. These survival decision trees have also been used to predict kidney graft survival, outperforming conventional statistical methods (such as cox regression model and conventional decision tree model), as seen in a study of over 3000 renal transplant patients³¹. These models obtained a 3-month serum creatinine level as one of the most important features for predicting graft failure. The survival decision tree model was able to take advantage of censored patient data and to add interpretable clinical information using survival statistics³¹.

ML methodologies are perhaps the most well studied in liver transplant recipients. Two studies utilizing preoperative UNOS data from adult transplant recipients for the assessment of 90-day mortality separately highlight the importance of feature selection^32,33. Pre-selected variables of recipient age, BMI, MELD score, history of preoperative renal replacement therapy and diabetes were used to develop an ML-based scoring system with an AUC of 0.952 in identifying high-risk patients, defined as those with predicted probability of death greater than 10%³². Similar studies using ANNs and LASSO regression have been performed on smaller patient populations to effectively predict post-transplant survival^34,35,36. RF predicted post-operative graft failure in the immediate post-operative period up to 30 days with greater accuracy than Logistic Regression and scoring indices such as the Donor Risk Index (DRI) and the survival outcomes after liver transplantation (SOFT) score, with AUROC = 0.818 (95% CI: 0.812–0.824)³⁷. Furthermore, ANN algorithm trained on the top 15 important pre-transplant features outperformed RF (AUC: 0.835 vs 0.818), showing the importance of feature selection to improve the model performance³⁷.

Prediction of rejection

Several studies aimed to predict rejection and identify those at greatest risk. These studies however have been limited in their ability to accurately ascertain risk. Given the often-large number of contributory factors to rejection, ML may play a role in managing many data points, considering not only readily available clinical data variables but also genetic, metabolomic, and pathology-based variables that may not have been previously utilized for model development to provide better prediction capability. ANNs have consistently been proven to be highly accurate predictive tools for graft rejection in both renal and liver transplantation compared to standard modeling techniques^37,38,39.

In liver transplantation, ANNs outperformed logistic regression models in predicting the risk of acute rejection in 148 recipients using clinical and laboratory test data at 7 days after transplant, with 90% accuracy, 87% sensitivity, and 90% specificity³⁸.

In renal transplantation, sensitization defined as the formation of antibodies against human leukocyte antigens (HLA) makes successful transplant for highly sensitized candidates difficult. Many desensitization processes exist; however, patients often fail to respond to therapy, and it can be difficult to determine which factors make one patient an appropriate candidate for desensitization therapy. Through evaluation of various assays of immune and gene expression profiles of highly sensitized patients, a Support Vector Machine (SVM) model was able to identify a distinguishing pattern for those likely to respond to desensitization therapies⁴⁰. SVM classification is based on finding a hyperplane in a high-dimensional space representing the largest margin that distinctly classifies the data points of the two outcomes⁴¹. The dimension of the hyperplane depends upon the number of features, if two the hyperplane is just a line and if there are three features, then the hyperplane becomes a 2-D plane and so on. SVMs are robust to overfitting, however, do not scale well to large datasets. Utilization of ML processes was able to highlight variables of importance for classification and also allowed for the identification of important patterns that would have otherwise been difficult to ascertain with conventional methodology.

ML methodology can also be advantageous in allowing for boosted diagnostic performance through simultaneous assessment of varied data types. In a study combining data obtained from diffusion-weighted MRI images and clinical biomarkers such as creatinine clearance and serum plasma creatinine, a Convolutional Neural Network (CNN) was able to accurately identify 92.9% of rejected kidney grafts regardless of scanner type and differences in image collection protocols between patients⁴². CNNs are neural network models that are popular for image classification tasks and help in efficient feature extraction through convolution operation when studies consist of clinical imaging and radiomics features such as MRI data as input¹⁷. Through this approach, deep learning tools combining medical image analyses with clinical data highlight the opportunities for early, noninvasive detection of rejection in solid-organ transplantation as an alternative for graft biopsy.

The diagnosis of graft rejection is historically made by tissue histopathology assessment. However, graft biopsy assessment is usually limited by low reproducibility and inter-observer variability. In attempts to mitigate the variability in interpretation of biopsy results, ML has been utilized to provide more definitive, standardizable methods. Supervised learning methods have been used with endomyocardial biopsies of heart transplant recipients to predict rejection using microarray analysis⁴³. Accordingly, molecular classifiers enabled better molecular rejection prediction than histologic rejection (AUCs >0.87 compared to AUCs <0.78, respectively). The authors also utilized an automated RF model which was highly predictive of corresponding expert diagnoses based on the molecular reports suggesting that these algorithms could be utilized in lieu of pathologist assessment so as to increase efficiency in diagnosis⁴³. Similar microarray analyses have been utilized in lung transplant recipients for molecular phenotyping of rejection from both transbronchial and mucosal biopsies, for which unsupervised learning methods were able to identify molecular signatures correlating with patterns of rejection from samples that would have been previously deemed unusable^44,45.

Post-transplant complications

ML and kidney transplant complications

In the context of kidney transplantation, delayed graft function (DGF) is defined as the need for dialysis within the first week after transplant, is associated with a higher risk of graft loss in the long term, prolonged hospital stays, and thus costs. ML algorithms can uncover potentially useful prognostic indicators as well as targets for future diagnostic and therapeutic studies.

The RF approach was used to identify urine proteins with predictive value for DGF using targeted urine proteome assay⁴⁶. Data from 52 patients with intermediate, slow, and delayed graft function and urine samples were collected within 12–18 h post-surgery. Four key urine proteins were found to be changed in recipients with DGF, with a sensitivity of 77.4% and a specificity of 82.6% (AUC 0.89). Alternative ML strategies were used to investigate how pre-transplant donor maintenance practices contribute to the onset of DGF. This multicenter study used data from 443 deceased donors. An initial multivariate logistic regression model was not able to identify significant variables⁴⁷. In a post hoc analysis, predictive models that use ML tools, including boosted Decision Tree (DT) using the C5.0 algorithm (AUC 0.79), boosting ANN (AUC 0.88), and SVM with polynomial kernel (AUC 0.78), were able to identify key donor maintenance variables for DGF. The models were also able to recognize nonlinear relationships between variables. This study showed the importance of donor maintenance variables such as urine output and mean arterial pressure which were not included in other regression-based risk scores for predicting post-transplant DGF. However, with its retrospective nature, the data collected were not time-sensitive and may not represent the full picture in terms of donor kidney maintenance. The RF approach was also used to evaluate the health-related quality of life (HRQOL) and its determinants within the first 3 years in 337 kidney transplant recipients. This approach found a significant association between HRQOL at 1 month after transplantation and HRQOL between 3 and 36 months after transplantation⁴⁸. Also, in contrast to conventional models, the ensemble of ML algorithms allowed analyses of both quantitative and qualitative variables together without limitation of the covariates tested⁴⁸.

Another multicenter study applied probabilistic unsupervised classification techniques called archetype analysis to risk-stratify five distinct groups of patients (archetypes) with transplant glomerulopathy although all of them had similar histological morphology (double contour of the glomerular basement membrane)⁴⁹. The study consisted of kidney biopsies from 385 recipients with confirmed glomerulopathy. These archetypes were classified based on post-transplant clinical, functional, immunologic, and histologic data, and were shown to have significantly distinct graft outcomes. These were used to produce an online application which can potentially be used in clinical contexts to flag those kidney transplant recipients with glomerulopathy who are at higher risk of graft failure. The study was successful in using an unbiased approach to classify heterogeneous data, and further studies should elucidate the driving mechanisms of the underlying pathology to understand the produced archetypes.

ML and liver transplant complications

Before a liver transplantation, one of the key features that can help surgeons with predicting post-transplant graft function is the amount of graft steatosis. Since biopsy and histological methods of steatosis assessment are expensive and inefficient, physicians are left to use donor clinical characteristics and visual analyses of the graft. The subjectivity and variability of this process remains a challenge, to which researchers employed ML tools on photographic data to detect hepatic steatosis. Using a semi-supervised classification approach on 40 liver images obtained by smartphone camera in the operating room translating to 600 liver patches as well as clinical variables and blood sample tests, an SVM model was able to qualitatively assess grafts steatosis before transplantation with an accuracy of 0.88, sensitivity of 0.95 and specificity of 0.81 considering liver biopsy as the reference method⁵⁰. However, due to the small sample size they used leave-one-patient-out cross-validation for the evaluation of their algorithms which may lead to overestimation of the performance.

Metabolic side effects of immunosuppressive medications including diabetes mellitus compromise the long-term survival of solid-organ transplant recipients. Accordingly, in a study for the prediction of new-onset diabetes after transplantation (NODAT) within the first year after transplant, a RF classifier was able to robustly identify the most important risk factors for NODAT using the SRTR database⁵¹. This study identified sirolimus-based immunosuppression as a risk factor for NODAT. However, since the SRTR database does not include data about immunosuppressive medication serum levels and fasting glucose, it was not possible to accurately evaluate the association of sirolimus with hyperglycemia. Another common side effect of long-term immunosuppression is de novo non-melanoma skin cancers (NMSC) including basal cell carcinoma (BCC) and squamous cell carcinoma (SCC). This elevated risk begs early detection and prediction methods to allow for timelier adjustments in care. A study conducted by Tanaka & Voigt adopted a decision tree approach to stratify patients based on the risk of developing NMSCs⁵². First, using a cox regression analysis, main independent risk factors including BMI, not receiving sirolimus, and recipient age was identified and fed into the decision tree analysis. Then, these risk factors were ranked based on their importance using decision tree analysis, and various value ranges were defined to stratify patients from low to high risk (validation set: R²= 0.971, P < 0.0001). This study showed that recipients with BMI < 40 kg/m² and over 47 years were at higher risk of NMSC and may benefit from more frequent cancer screening compared to other recipients who undergo annual screening for skin cancers. As this study showed, compared to other ML methods, decision trees are more interpretable; this model is able to provide clinicians with guiding algorithms for their clinical management based on the identified risk factors and their cut-offs for different post-transplant outcomes.

Another study developed a novel scoring system using the LASSO regression method to predict the risk of sustained alcohol consumption post-transplant for patients with alcohol hepatitis using to prioritize those candidates with lower risk of relapse for early transplantation. This study used data from 134 liver transplant recipients and the resulting model had a C-statistic of 0.76, after internal cross-validation⁵³. Four objective pre-transplant variables including greater than ten drinks per day at initial hospitalization, history of illicit substance abuse, history of any alcohol-related legal issues, and history of multiple rehabilitation attempts were identified as the main predictive risk factors for sustained alcohol consumption after transplant. Accordingly, the LASSO regression method was shown to work well when the number of events is low. More importantly, this model can be used for building risk scores based on the provided coefficients, leading to easier and more explainable implementation of this algorithm in clinical practice. This model is simple to implement and has the potential to open up organ access for those traditionally restricted, however larger and prospective studies would be needed for validation.

Some studies have also directly compared how traditional statistics perform against ML methods. In the context of predicting acute kidney injury (AKI) after liver transplantation, one study compared Logistic Regression to several ML and neural networks models⁵⁴. Models were developed using pre- and perioperative factors from a single-center dataset of 1211 cases with both living and deceased donors. Gradient Boosting yielded the best AUC value (0.90, 95% CI 0.86–0.93) in predicting stages of AKI and performed better than logistic regression methods (AUC 0.61, 95% CI 0.56–0.66). Gradient Boosting builds a sequential series of decision trees, where each tree corrects the residuals in the predictions made by the previous trees. This ML technique is robust when the data contains correlated features, however, this methodology requires tuning of many hyperparameters therefore, rendering model development slower⁵⁵.

ML and lung transplant complications

Among lung transplant recipients, chronic lung allograft dysfunction (CLAD) affects more than 50% in the first 5 years post-transplant, and negatively impacts long-term survival. Bronchiolitis obliterans syndrome (BOS) is an obstructive form of CLAD due to chronic immune-mediated rejection. BOS can lead to decreased airflow represented by reduced forced expiratory volume in the first second (FEV1). Early diagnoses can pave the way for future research in interventions. To this end, an SVM to analyze quantitative CT imaging data from 71 patients was investigated as a diagnostic tool for early CLAD detection⁵⁶. Using quantitative CT scan at first post-transplant visit, SVM algorithm was able to identify those recipients who are at higher risk of developing BOS in near future while pulmonary function tests did not show any significant changes in early stages. The model resulted in an 85% accuracy rate using three features from patient images. SVM model identified smaller lobar and airways volumes, smaller airways surfaces, and higher airways resistance as predictive factors for BOS at early quantitative CT scan. Earlier identification of these patients can guide clinicians with appropriate management to avoid further progression of this pathology.

Strengths and weaknesses of machine-learning algorithms in post-transplant settings

For post-transplant complications, ML techniques such as SVMs, Random Forests and ANNs have the ability to model evolving and heterogeneous patient-level data over time, identify predictors of poor outcomes, and inform care. Uses of ML in predicting post-transplant complications have numerous applications including decision-support systems and even the development of data-driven assessments. CART and LASSO regression models can guide post-transplant patient management by providing clinicians with transparent algorithms that can be easily understood and explained from a clinical perspective. Moreover, recent research also strives to make ML models more interpretable in terms of identifying features playing a key role during the prediction tasks. Some recent methods including SHapley Additive exPlanations (SHAP) as used for evaluating feature importance and explain the predictions made by ML algorithms toward post-LT AKI⁵⁷, Local Interpretable Model-Agnostic Explanations (LIME) used to assess relative impact of key predictors in post-transplant patient survival⁵⁸, and integrated gradients used to identify important predictors in diagnosing allograft rejection⁵⁹, make ML models more explainable. However, further validation of their results using multicenter prospectively collected data is important before wider application of these algorithms in daily clinical practice.

Limitations of ML in transplant medicine

While studies show promise for ML; data quality, small sample size, inconsistency regarding the number of cases adequately powering these models and inter-center variability may affect model generalization. Inconsistent data collection, recording and classification included in models can potentially lead to the wrong features being used for predictions and a potential bias. There is often no inherent benefit to ML models as a tool, especially when the number of predictors in a setting is low.

In addition, there is still a lack of prospective and external validation for these models. The anticipated improvement in outcomes improvement to traditional methods is marginal in some aspects of transplantation such as predicting graft pathologies after liver transplantation. Therefore, clinical integration and post-deployment monitoring of these algorithms in real-time will be necessary to evaluate the true impact of algorithms in clinical practice. In external validation studies, reductions in the predictive accuracy of models (relative to their original performance in development studies) is expected. Therefore, given the additional complexities introduced by ML algorithms, it should be ensured that models undergo rigorous but fair external validation in cohorts or simulated data. As a result, there is a potential lack of generalizability and the performance of these models should be assessed with caution before considering their application in daily clinical work, as well as more multicenter studies with distinct demographic variables, should be used as input while creating an ML model to increase it robustness on varied datasets.

For more equitable systems, studies should also consider non-clinical variables of transplantation like geographic disparity, physical compatibility between donor and recipients, and resource availability, all of which may significantly impact transplant outcomes. Depending on the ML model development, representation of the key population (by sex, age, and ethnicity) can influence the predictive accuracy of the algorithm in different subgroups, thus, creating inequities. Hence, methodologies to ensure fairness evaluation should be employed.

Another potential challenge with ML algorithms is that they are computationally complex and time-intensive, whereas some models like neural networks require more computational resources and higher number of iterations for better training. Hence, an ML model should be chosen for analysis after carefully estimating whether the ML model is better able to address key clinical questions, in a scenario where there is marginal gain in implementing a complex ML solution as compared to traditional statistical methods. Also, each ML approach has pros and cons associated with it and should be employed after careful analysis of the nature of the data and clinical questions at hand.

Another limitation of ML modeling is that they are perceived as black boxes, therefore in-depth analysis to improve the interpretability of machine-learning-based outcome predictions should be done to make ML models more clinically relevant and explainable. As supported by recent methodologies such as Local Interpretable Model-Agnostic Explanations (LIME), SHapley Additive exPlanations (SHAP), and Integrated gradients, there have been significant efforts to improve the interpretability of ML models^60,61,62. How to effectively incorporate an ML algorithm in clinical care is an additional frontier to be overcome. Whereas a few key parameters can be input into biostatistical models online (online calculators including the MELD Na), ML models involve multiple clinical and laboratory values that will need to be fed into an algorithm to generate an output. Future potential of ML also lies in integrating data from various omics sources such as genetics, proteomics, and metabolomics which are high-dimensional datasets appropriate for ML to investigate nonlinear relationships as well as which have a great application in disease prediction, patient stratification, and delivery of precision medicine in the area of solid-organ transplantation.

Although ML approaches are usually evaluated in terms of their accuracy and precision which is aimed at being 100%, many factors such as the incompleteness of data sample, noise in the data and stochastic nature of the modeling algorithm can reduce the accuracy. Therefore, while application of ML approaches, the aim should be to achieve the maximum potential for the prediction ability using the dataset at hand and also tune the parameters adequately and compare to other baseline approaches to establish generalizability and robustness.

Future directions

Training ML algorithms using multicenter data and with larger databases can decrease the risk of overfitting and provide the opportunity to examine the generalizability of these models. Therefore, larger international databases for organ transplant populations will be essential for future research in this field. Also, applying ML algorithms to molecular, genetic, and radiological data as well as their combination could inform a more personalized approach to patient management.

ML techniques hold tremendous potential to further improve the outcomes of transplant recipients, given the complexity and diversity of factors that impact health in transplant medicine. With the increasing amount of data available, and the ability of ML algorithms to uncover hidden interrelationships, these tools hold great promise in informing a precision medicine approach to transplant and improving overall outcomes.

Summarized recommendations

The performance of ML algorithms significantly correlates with the quality of the input data. Therefore, organizing data registries with low missing rates is imperative to the development of robust ML algorithms. Accordingly, large clinical datasets might lack the accuracy and granularity needed for ML algorithms to uncover hidden nonlinear associations.
Training ML and DL algorithms usually require systems with high computing power which may not be accessible by hospitals. Therefore, developing models which require lower computing power and time could result in more universal application of these models in clinical settings.
Due to heterogeneity and variation in the variables collected by different transplant data registries, it is important to fine-tune ML algorithms using new data available at every transplant center to optimize the model’s performance.
By using ML algorithms such as Natural Language Processing (NLP), clinicians can take advantage of non-structured data (such as free texts from patient medical records and progress notes) along with structured organized data to improve organ allocation and post-transplant outcome.
Routinely used ML algorithms may not be able to work efficiently with imbalanced datasets. Therefore, applying more recent methods such as weighted LSTM or ordinal oversampling techniques can improve the performance of ML models trained on imbalanced datasets which are very common in medical research.
Advantages of Bayesian Additive Regression Trees (BART) compared to logistic regression models include easier pre-processing and visualization of data, as well as capturing nonlinear relationships during prediction, whereas disadvantages include high model complexity and instability in the tree in case of small changes in data.
ANNs require large data to be trained on to estimate their predictive parameters accurately. On the other hand, the conventional logistic regression method benefits from a small number of hyperparameters in its modeling but is not suitable to capture nonlinear and complex relationships among them.
Training ML models using identified important predictive features can improve the performance of these algorithms compared to developing models using all the available variables
Although ML models especially ANNs used to be considered as “black box”, recent methods including Local Interpretable Model-Agnostic Explanations (LIME), SHapley Additive exPlanations (SHAP), and Integrated gradients have improved the interpretability of these algorithms by providing a list of the most important predictive variables for ultimate outcomes, enabling clinicians to assess the performance of ML algorithms based on the clinical perspective.
Compared to other ML methods which are usually perceived as a black box, decision trees are more interpretable providing a tree-like structure distinguishing the key features from the feature set used to split the tree at each node; this model requires basic clinical variables for input to produce an algorithm guiding clinicians with their patient management.
The CART and LASSO regression models can be used for building risk scores based on the provided coefficients, leading to the easier implementation of this algorithm in clinical practice.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

References

Giwa, S. et al. The promise of organ and tissue preservation to transform medicine. Nat. Biotechnol. 35, 530–542 (2017).
Article CAS PubMed PubMed Central Google Scholar
Haugen, C. E. et al. National trends in liver transplantation in older adults. J. Am. Geriatrics Soc. 66, 2321–2326 (2018).
Article Google Scholar
Abecassis, M. et al. Solid‐organ transplantation in older adults: current status and future research. Am. J. Transplant. 12, 2608–2622 (2012).
Article CAS PubMed PubMed Central Google Scholar
Mitchell, A. B. & Glanville, A. R. Lung transplantation: a review of the optimal strategies for referral and patient selection. Therapeutic Adv. respiratory Dis. 13, 1753466619880078 (2019).
Google Scholar
Schwager, Y. et al. Prediction of three-year mortality after deceased donor kidney transplantation in adults with pre-transplant donor and recipient variables. Ann. Transplant. 24, 273 (2019).
Article PubMed PubMed Central Google Scholar
Jadlowiec, C. C. & Taner, T. Liver transplantation: current status and challenges. World J. Gastroenterol. 22, 4438 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ortega, F. Organ transplantation in the 21th century. In López-Larrea, C., López-Vázquez, A., Suárez-Álvarez, B (eds) Stem Cell Transplantation 13–26 (Springer, 2012).
Piao, D., Hawxby, A., Wright, H. & Rubin, E. M. Perspective review on solid-organ transplant: needs in point-of-care optical biomarkers. J. Biomed. Opt. 23, 080601 (2018).
Google Scholar
Tonsho, M., Michel, S., Ahmed, Z., Alessandrini, A. & Madsen, J. C. Heart transplantation: challenges facing the field. Cold Spring Harb. Perspect. Med. 4, a015636 (2014).
Article PubMed PubMed Central Google Scholar
Mitchell, T. M. Learning M (The McGraw-Hill Companies. Inc, 1997).
Rajkomar, A., Dean, J. & Kohane, I. Machine learning in medicine. N. Engl. J. Med. 380, 1347–1358 (2019).
Article PubMed Google Scholar
Connor, K. L., O’Sullivan, E. D., Marson, L. P., Wigmore, S. J. & Harrison, E. M. The future role of machine learning in clinical transplantation. Transplantation 105, 723–735 (2021).
Article CAS PubMed Google Scholar
Ishwaran, H., Kogalur, U. B., Blackstone, E. H. & Lauer, M. S. Random survival forests. Ann. Appl. Stat. 2, 841–860. (2008).
Article Google Scholar
Hsich, E. M. et al. Variables of importance in the Scientific Registry of Transplant Recipients database predictive of heart transplant waitlist mortality. Am. J. Transplant. 19, 2067–2076 (2019).
Article CAS PubMed PubMed Central Google Scholar
Medved, D., Nugues, P. & Nilsson, J. Simulating the outcome of heart allocation policies using deep neural networks. in 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) 6141–6144 (IEEE, 2018).
Sauthier, N. B. R., Carreir, F. M. & Chassé, M. Detection of Potential Organ Donors; An Automatic Approach on Temporal Data (Critical Care Canada Forum, 2020).
Goodfellow, I., Bengio, Y. & Courville, A. Deep Learning (MIT Press, 2016).
Wright, R. E. Logistic Regression. In Grimm, L. G. & Yarnold P. R. (eds), Reading and Understanding Multivariate Statistics (pp. 217–244). Washington DC: American Psychological Association (1995).
Hamouda, E., El-Metwally, S. & Tarek, M. Ant Lion Optimization algorithm for kidney exchanges. PLoS ONE 13, e0196707 (2018).
Article PubMed PubMed Central CAS Google Scholar
Briceño, J. et al. Use of artificial intelligence as an innovative donor-recipient matching model for liver transplantation: results from a multicenter Spanish study. J. Hepatol. 61, 1020–1028 (2014).
Article PubMed Google Scholar
Ayllón, M. D. et al. Validation of artificial neural networks as a methodology for donor‐recipient matching for liver transplantation. Liver Transplant. 24, 192–203 (2018).
Article Google Scholar
Dorado-Moreno, M. et al. Dynamically weighted evolutionary ordinal neural network for solving an imbalanced liver transplantation problem. Artif. Intell. Med. 77, 1–11 (2017).
Article PubMed Google Scholar
Placona, A. M. et al. Can donor narratives yield insights? A natural language processing proof of concept to facilitate kidney allocation. Am. J. Transplant. 20, 1095–1104 (2020).
Article PubMed Google Scholar
Marrero, W. J., Lavieri, M. S., Guikema, S. D., Hutton, D. W. & Parikh, N. D. Development of a Predictive Model for Deceased Donor Organ Yield (LWW, 2018).
Medved, D. et al. Improving prediction of heart transplantation outcome using deep learning techniques. Sci. Rep. 8, 1–9 (2018).
Article CAS Google Scholar
Yoon, J. et al. Personalized survival predictions via trees of predictors: an application to cardiac transplantation. PLoS ONE 13, e0194985 (2018).
Article PubMed PubMed Central CAS Google Scholar
Miller, P. E. et al. Predictive abilities of machine learning techniques may be limited by dataset characteristics: insights from the UNOS database. J. Card. Fail. 25, 479–483 (2019).
Article PubMed Google Scholar
Mark, E., Goldsman, D., Gurbaxani, B., Keskinocak, P. & Sokol, J. Using machine learning and an ensemble of methods to predict kidney transplant survival. PLoS ONE 14, e0209068 (2019).
Article CAS PubMed PubMed Central Google Scholar
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc.: Ser. B (Methodol.) 58, 267–288 (1996).
Google Scholar
Liaw, A. & Wiener, M. Classification and regression by randomForest. R. N. 2, 18–22 (2002).
Google Scholar
Yoo, K. D. et al. A machine learning approach using survival statistics to predict graft survival in kidney transplant recipients: a multicenter cohort study. Sci. Rep. 7, 1–12. (2017).
Article CAS Google Scholar
Molinari, M. et al. Prediction of perioperative mortality of cadaveric liver transplant recipients during their evaluations. Transplantation 103, e297–e307 (2019).
Ershoff, B. D. et al. Training and validation of deep neural networks for the prediction of 90-day post-liver transplant mortality using Unos registry data. Transplantation Proc. 52, 246–258 (2020).
Khosravi, B., Pourahmad, S., Bahreini, A., Nikeghbalian, S. & Mehrdad, G. Five years survival of patients after liver transplantation and its effective factors by neural network and cox poroportional hazard regression models. Hepat. Mon. 15, e25164 (2015).
Raeisi Shahraki, H., Pourahmad, S. & Ayatollahi, S. M. T. Identifying the prognosis factors in death after liver transplantation via adaptive LASSO in Iran. J. Environ. Public Health 2016, 7620157 (2016).
Kazemi, A., Kazemi, K., Sami, A. & Sharifian, R. Identifying factors that affect patient survival after orthotopic liver transplant using machine-learning techniques. Exp. Clin. Transpl. 17, 775–783 (2019).
Article Google Scholar
Lau, L. et al. Machine-learning algorithms predict graft failure after liver transplantation. Transplantation 101, e125 (2017).
Article PubMed PubMed Central Google Scholar
Zare, A. et al. A neural network approach to predict acute allograft rejection in liver transplant recipients using routine laboratory data. Hepatitis Monthly 17, (2017).
Tapak, L., Hamidi, O., Amini, P. & Poorolajal, J. Prediction of kidney graft rejection using artificial neural network. Healthc. Inform. Res. 23, 277–284 (2017).
Article PubMed PubMed Central Google Scholar
Yabu, J. M., Siebert, J. C. & Maecker, H. T. Immune profiles to predict response to desensitization therapy in highly HLA-sensitized kidney transplant candidates. PLoS ONE 11, e0153355 (2016).
Article PubMed PubMed Central Google Scholar
Suykens, J. A. & Vandewalle, J. Least squares support vector machine classifiers. Neural Process. Lett. 9, 293–300 (1999).
Article Google Scholar
Abdeltawab, H. et al. A novel CNN-based CAD system for early assessment of transplanted kidney dysfunction. Sci. Rep. 9, 1–11. (2019).
Article CAS Google Scholar
Parkes, M. D. et al. An integrated molecular diagnostic report for heart transplant biopsies using an ensemble of diagnostic algorithms. J. Heart Lung Transplant. 38, 636–646 (2019).
Article PubMed Google Scholar
Halloran, K. M. et al. Molecular assessment of rejection and injury in lung transplant biopsies. J. Heart Lung Transplant. 38, 504–513 (2019).
Article PubMed Google Scholar
Halloran, K. et al. Molecular phenotyping of rejection‐related changes in mucosal biopsies from lung transplants. Am. J. Transplant. 20, 954–966 (2020).
Article CAS PubMed Google Scholar
Williams, K. R. et al. Use of a targeted urine proteome assay (TUPA) to identify protein biomarkers of delayed recovery after kidney transplant. PROTEOMICS–Clin. Appl. 11, 1600132 (2017).
Article CAS Google Scholar
Costa, S. D. et al. The impact of deceased donor maintenance on delayed kidney allograft function: A machine learning analysis. PLoS ONE 15, e0228597 (2020).
Article CAS PubMed PubMed Central Google Scholar
Villeneuve, C. et al. Evolution and determinants of health-related quality-of-life in kidney transplant patients over the first 3 years after transplantation. Transplantation 100, 640–647 (2016).
Article CAS PubMed Google Scholar
Aubert, O. et al. Archetype analysis identifies distinct profiles in renal transplant recipients with transplant glomerulopathy associated with allograft survival. J. Am. Soc. Nephrol. 30, 625–639 (2019).
Article CAS PubMed PubMed Central Google Scholar
Moccia, S. et al. Computer-assisted liver graft steatosis assessment via learning-based texture analysis. Int. J. computer Assist. Radiol. Surg. 13, 1357–1367 (2018).
Article Google Scholar
Bhat, V., Tazari, M., Watt, K. D. & Bhat, M. New-onset diabetes and preexisting diabetes are associated with comparable reduction in long-term survival after liver transplant: a machine learning approach. Mayo Clinic Proc. 93, 1794–1802 (2018).
Tanaka, T. & Voigt, M. D. Decision tree analysis to stratify risk of de novo non-melanoma skin cancer following liver transplantation. J. Cancer Res. Clin. Oncol. 144, 607–615 (2018).
Article PubMed Google Scholar
Lee, B. P. et al. Predicting low risk for sustained alcohol use after early liver transplant for acute alcoholic hepatitis: the sustained alcohol use post–liver transplant score. Hepatology 69, 1477–1487 (2019).
Article PubMed Google Scholar
Lee, H.-C. et al. Prediction of acute kidney injury after liver transplantation: machine learning approaches vs. logistic regression model. J. Clin. Med. 7, 428 (2018).
Article CAS PubMed Central Google Scholar
Natekin, A. & Knoll, A. Gradient boosting machines, a tutorial. Front. Neurorobotics 7, 21 (2013).
Article Google Scholar
Barbosa, E. J. M. Jr et al. Machine learning algorithms utilizing quantitative CT features may predict eventual onset of bronchiolitis obliterans syndrome after lung transplantation. Academic Radiol. 25, 1201–1212 (2018).
Article Google Scholar
Zhang, Y. et al. An explainable supervised machine learning predictor of acute kidney injury after adult deceased donor liver transplantation. J. Transl. Med. 19, 1–15. (2021).
Article Google Scholar
Kampaktsis, P. N. et al. State‐of‐the‐art machine learning algorithms for the prediction of outcomes after contemporary heart transplantation: results from the UNOS database. Clin. Transplant. 35, e14388 (2021).
Article PubMed Google Scholar
Peyster, E. G., Madabhushi, A. & Margulies, K. B. Advanced morphologic analysis for diagnosing allograft rejection: the case of cardiac transplant rejection. Transplantation 102, 1230 (2018).
Article PubMed PubMed Central Google Scholar
Ribeiro, M. T., Singh, S. & Guestrin, C. “Why should I trust you?” Explaining the predictions of any classifier. in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1135–1144 (ACM, 2016).
Štrumbelj, E. & Kononenko, I. Explaining prediction models and individual predictions with feature contributions. Knowl. Inf. Syst. 41, 647–665 (2014).
Article Google Scholar
Sundararajan, M., Taly, A. & Yan, Q. Axiomatic attribution for deep networks. in International Conference on Machine Learning. 3319–3328 (PMLR, 2017).

Download references

Acknowledgements

We wish to thank the Canadian Donation and Transplantation Research Program for supporting N.G. and N.S. and Paladin Labs Inc for supporting A.A. and D.S.

Author information

Authors and Affiliations

Ajmera Transplant Program, University Health Network, Toronto, ON, Canada
Neta Gotlieb, Amirhossein Azhie, Jason Tran & Mamatha Bhat
Department of Medicine, University of Ottawa, Ottawa, ON, Canada
Neta Gotlieb
Department of Gastroenterology, Toronto General Hospital Research Institute, Toronto, ON, Canada
Divya Sharma
Division of Gastroenterology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
Ashley Spann
Department of Cell and Systems Biology, University of Toronto, Toronto, ON, Canada
Nan-Ji Suo
Library and Information Services, University Health Network, Toronto, ON, Canada
Ani Orchanian-Cheff
Vector Institute for Artificial Intelligence, Toronto, ON, Canada
Bo Wang & Anna Goldenberg
Department of Medicine (Critical Care), University of Montreal Hospital, Montréal, QC, Canada
Michael Chassé
Canadian Donation and Transplantation Research Program, Data and Innovation Expert Group, Toronto, ON, Canada
Michael Chassé, Heloise Cardinal, Joseph Paul Cohen, Andrea Lodi, Melanie Dieude & Mamatha Bhat
Centre hospitalier de l’Université de Montréal Research Center, Université de Montréal, Montréal, QC, Canada
Heloise Cardinal & Melanie Dieude
Center for Artificial Intelligence in Medicine & Imaging, Stanford University, Stanford, CA, USA
Joseph Paul Cohen
Mila, Quebec Artificial Intelligence Institute, Montréal, QC, Canada
Joseph Paul Cohen
Canada Excellence Research Chair, Polytechnique Montréal, Montréal, QC, Canada
Andrea Lodi
Department Microbiology, Infectiology and Immunology, Faculty of Medicine, Université de Montréal, Montréal, QC, Canada
Melanie Dieude
Héma-Québec, Montréal, QC, Canada
Melanie Dieude
Division of Gastroenterology and Hepatology, Department of Medicine, University of Toronto, Toronto, ON, Canada
Mamatha Bhat

Authors

Neta Gotlieb
View author publications
You can also search for this author in PubMed Google Scholar
Amirhossein Azhie
View author publications
You can also search for this author in PubMed Google Scholar
Divya Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Ashley Spann
View author publications
You can also search for this author in PubMed Google Scholar
Nan-Ji Suo
View author publications
You can also search for this author in PubMed Google Scholar
Jason Tran
View author publications
You can also search for this author in PubMed Google Scholar
Ani Orchanian-Cheff
View author publications
You can also search for this author in PubMed Google Scholar
Bo Wang
View author publications
You can also search for this author in PubMed Google Scholar
Anna Goldenberg
View author publications
You can also search for this author in PubMed Google Scholar
Michael Chassé
View author publications
You can also search for this author in PubMed Google Scholar
Heloise Cardinal
View author publications
You can also search for this author in PubMed Google Scholar
Joseph Paul Cohen
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Lodi
View author publications
You can also search for this author in PubMed Google Scholar
Melanie Dieude
View author publications
You can also search for this author in PubMed Google Scholar
Mamatha Bhat
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.G., A.A., D.S., A.S., N.J.S., and J.T. conducted the review, prepared figures, and wrote the initial draft. A.O.C. performed the literature search. B.W., A.G., M.C., H.C., J.P.C., A.L., and M.L. revised the manuscript. M.B. revised the manuscript and supervised the project. All authors reviewed the manuscript, read and approved the final manuscript submitted for publication.

Corresponding author

Correspondence to Mamatha Bhat.

Ethics declarations

Competing interests

M.B. received grants from Paladin, Novo Nordisk, Oncoustics, Natera, MedoAI, Lupin Speakers Bureau: Novartis, Paladin. The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gotlieb, N., Azhie, A., Sharma, D. et al. The promise of machine learning applications in solid organ transplantation. npj Digit. Med. 5, 89 (2022). https://doi.org/10.1038/s41746-022-00637-2

Download citation

Received: 23 November 2021
Accepted: 24 June 2022
Published: 11 July 2022
DOI: https://doi.org/10.1038/s41746-022-00637-2
Springer Nature Limited

This article is cited by

Exploring the variable importance in random forests under correlations: a general concept applied to donor organ quality in post-transplant survival
- Christoph Wies
- Robert Miltenberger
- Antje Jahn-Eimermacher
BMC Medical Research Methodology (2023)
Development of a calculated panel reactive antibody calculator for the United Arab Emirates: a proof of concept study
- Marion Alvares
- Siddiq Anwar
- Gehad ElGhazali
Scientific Reports (2023)
Enhanced survival prediction using explainable artificial intelligence in heart transplantation
- Paulo J. G. Lisboa
- Manoj Jayabalan
- Johan Nilsson
Scientific Reports (2022)

The promise of machine learning applications in solid organ transplantation

Abstract

Similar content being viewed by others

Application of machine learning in liver transplantation: a review

Personalized prediction of delayed graft function for recipients of deceased donor kidney transplants with machine learning

Development and validation of a risk index to predict kidney graft survival: the kidney transplant risk index

Introduction

Methods

Organ allocation and predictive modeling for waitlist mortality

Optimization of donor–recipient matching

Donor identification and matching

Decision-support tools for organ donation

Strengths and weaknesses of machine-learning algorithms in pre-transplant settings

Post-transplant outcomes

Prediction of post-transplant survival

Prediction of rejection

Post-transplant complications

ML and kidney transplant complications

ML and liver transplant complications

ML and lung transplant complications

Strengths and weaknesses of machine-learning algorithms in post-transplant settings

Limitations of ML in transplant medicine

Future directions

Summarized recommendations

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Exploring the variable importance in random forests under correlations: a general concept applied to donor organ quality in post-transplant survival

Development of a calculated panel reactive antibody calculator for the United Arab Emirates: a proof of concept study

Enhanced survival prediction using explainable artificial intelligence in heart transplantation

Search

Navigation