Introduction

Solid-organ transplantation (SOT) has evolved from an experimental to definitive treatment for patients with end-stage liver, kidney, pancreas, heart and lung dysfunction1. Advances in surgical techniques and immunosuppressive therapy have seen reduced perioperative and medical complications and improved graft survival [1]. In turn, patient survival rates are high; for example, in the United States, as of 2019, one-year patient survival rates were equal to or greater than 90% for all single SOT except intestine transplantation [2]. Improvements in other patient-important outcomes, such as quality of life and functional performance, have also been reported [3]. Consequently, demand for SOT now exceeds donor organ supply and the field must explore strategies to address this balance. One such strategy is to optimize organ allocation processes through greater understanding of potential transplant candidates’ risk profiles [4, 5].

Determining transplant eligibility is a complex process undertaken by a multidisciplinary team, including transplant physicians and surgeons, specialist nurses and allied health professionals. Even though there are established scoring methods (such as the Model for End-Stage Liver Disease) and listing criteria for each organ, the decision to place a patient on the transplant waiting list often comes down to expert clinical judgement [6]. The multidisciplinary team must weigh the inherent risks of surgery and immunosuppression against the potential benefits for each individual, which in turn must be weighed against the need to ensure that there is maximum benefit derived from a finite resource. Moreover, transplant teams are increasingly being asked to evaluate older patients with more complex medical and functional needs [7]. Differences in health status, which corresponds with differences in risk of adverse outcomes, can be captured by measuring frailty [8]. Routinely identifying and quantifying frailty in potential transplant candidates would support patients and the multidisciplinary team to make well-informed, individualized, management decisions, a viewpoint shared by more than 250 kidney, liver, heart and lung transplantation specialists at the 2018 consensus conference on frailty [9].

Frailty is a state of increased vulnerability to stressors that is associated with adverse health outcomes, including death, disability and hospitalization [10, 11]. A frail individual has reduced physiological reserve and a reduced ability to compensate for disruptions to homeostasis [10]. Frailty increases with, but is not synonymous with, chronological age [12]. Frailty is prevalent in adults with organ failure and tends to develop at a younger age than the general population [13,14,15,16,17]. For example, in one study of dialysis-dependent patients, 73% of the entire cohort and 64% of those younger than 40 years of age were frail [18]. Even though kidney transplant candidates are likely to be a younger and healthier subset of patients with end-stage kidney disease, approximately 15% of wait-listed patients have been assessed as frail in recent studies [19, 20]. Similarly, frailty prevalence has been found to be high among younger heart failure patients, and the lack of relationship between age and frailty in this group indicates frailty in heart failure is not confined to older adults [16, 17].

Currently, there is no consensus as to a standardized tool for frailty measurement in any patient population. In a 2018 scoping review, 89 different measures were being utilized in the acute care setting alone [21]. To our knowledge, no systematic reviews have been conducted with a focus on the SOT population. The consensus conference concluded that a single frailty measure across all SOT candidates may not be appropriate due to different aspects of frailty being relevant to different patient (organ) groups [22]. However, we argue that frailty assesses intrinsic vulnerability rather than the impact of individual diseases and, as a result, there is merit in comparing frailty measurement across groups. The aim of our scoping review, therefore, was to synthesise the literature around frailty measurement in SOT candidate populations.

Materials and methods

Protocol and registration

This scoping review is reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) criteria [23]. The protocol was registered with Open science Framework (OSF; Digital Object Identifier https://doi.org/10.17605/OSF.IO/GZN38).

Search strategy

The search strategy was developed by JK, EG and NR, with the assistance of a librarian, and conducted by JK. We searched Cochrane (Cochranelibrary.com), Pubmed (PubMed.gov), EMBASE (Embase.com) and CINAHL (EBSCOhost) databases. Search terms used were broad and included ‘transplant*’ OR ‘allog*’ (title/abstract) and ‘frail*’ (title/abstract), which we deemed would capture all types of transplants and allograft studies, as well as any studies where frailty was an intended measure, regardless of which specific measure was used. The search included all studies up until 31st July 2022, and there were no limits placed. References lists of included full-text articles were searched for additional relevant studies.

Study selection

Studies were eligible for inclusion if they purported to measure ‘frailty’ in solid-organ (including kidney, liver, pancreas, heart or lung) transplant candidates. Additionally, studies of transplant recipients that measured frailty at, or just prior to, admission for transplant surgery were included as participants were ‘transplant candidates’ at the time of frailty assessment. Included studies could be of any design, but were to be conducted in an adult human population and published in English. Studies were excluded if they were not an original study (e.g., a protocol or review paper), were an abstract only or were not published in English. Abstract only studies were excluded as it was felt there would not be sufficient information for data analysis. Studies of non-solid organ transplant candidates (such as bone marrow transplant candidates) and studies that only measured frailty in transplant recipients were also excluded. As this review was interested in the methods by which these studies measured frailty, the measurement tool used was neither an inclusion nor exclusion criteria.

After removal of duplicates, reviewers JK and SK screened titles, abstracts and full-text of the studies. Any ambiguity regarding the study eligibility was resolved by an independent review of the study by a second and/or third reviewer (EG and/or NR).

Data extraction and analysis

A data extraction template was devised by three reviewers (JK, NR, EG) and imported into Covidence [24]. The following data was extracted from each included article: year and country of publication, transplant organ, study design, sample size, participant sex and age, frailty measurement tool used, timing of frailty measurement, reason for measuring frailty and, where relevant, adverse outcome measures examined in relation to frailty. Frailty tools were identified as ‘established’ if they were tools specifically developed and validated as a measure of frailty in a general population. ‘Other frailty tools’ referred to all other measures, including operational definitions developed for a SOT or disease-specific group (e.g., the Liver Frailty Index) and scales not specifically developed to measure frailty (e.g., the Short Physical Performance Battery).

These are definitions that have been employed in other studies of frailty measurement [21]. No specific adverse outcomes were of interest, we sought to identify all adverse outcomes that the included studies reported on. Data was extracted for all studies, and each reviewer (NR, EG, LS, EP, BL, SK, RH, SF) was given eight to ten unique studies to extract. Another reviewer (JK) independently extracted and cross-checked data for all studies. Disagreements were resolved by consensus.

Results

Search and study selection

The search strategy yielded 3124 articles, which were imported into Covidence (Fig. 1). After removal of duplicates, title and abstract screening and full-text review, a total of 101 studies were eligible for inclusion.

Fig. 1
figure 1

PRISMA diagram of study selection

Study characteristics

The characteristics of included studies are presented in Table 1. The majority of studies were conducted in kidney (36 studies; [14, 19, 20, 25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57]) and liver (36 studies; [15, 58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92]) transplant candidates. Nine studies [93,94,95,96,97,98,99,100,101] and 16 studies [102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117] were conducted in heart and lung transplant candidates, respectively. Four studies [118,119,120,121] included double organ transplant candidates (i.e., kidney and liver transplant candidates or heart). No studies that met our inclusion criteria have been published in pancreas transplant candidates. The majority of studies (N = 93) were published within the last seven years, with 62.4% (N = 63) published between 2018 and 2022. The majority of the studies were published in North America (N = 74). Nearly all studies were observational in nature, with only three being experimental. Study sample sizes ranged from 15 participants to over 120,000 participants. The median age of participants was 56.4 years (IQR 53.1–59.0) and the majority of participants were male (median percentage male = 60.0% (IQR 57.9–66.7)).

Table 1 Characteristics of the included studies

Frailty measurement

Frailty measurement characteristics of the included studies are presented in Table 2. Most studies measured frailty at time of transplant eligibility assessment (N = 27) or at admission to hospital for transplant surgery (N = 16). Ten studies measured frailty retrospectively in transplant recipients and eight studies measured frailty when the participant was added to the transplant waitlist. Fifteen studies measured frailty at other times, such as four weeks after the participant was added to transplant waitlist. There were 18 studies that did not specify the point at which frailty was measured during the participant’s transplant journey. Seven studies measured frailty at more than one time point.

Table 2 Frailty measurement

Overall, 22 different frailty measurement tools were used 123 times in the 101 included articles. The majority of studies used one frailty measurement tool (N = 83, 82.2%) and 18 studies utilized two or more frailty measurement tools. Of the 22 different measures used, four were ‘established frailty tools’ used in 51.2% of cases (N = 63) and the other 27 were ‘other frailty tools’ used in 48.8% of cases (N = 60). Descriptions of the frailty measurement tools are presented in Table 3. The most commonly used frailty tool across all included studies was the Fried Frailty Phenotype (standard and modified; N = 55/123, 44.7%). However, in liver transplant studies, the most commonly used frailty tool was the Liver Frailty Index (N = 19/43, 44.2%). Other established tools included the Clinical Frailty Scale, Rockwood’s Frailty Index and the Groningen Frailty Indicator. Other tools included validated scales such as the Short Physical Performance Battery and physical metrics such as grip-strength.

Eighty-two studies reported a frailty prevalence rate. Prevalence ranged from 2.7% in heart transplant candidates [96] to 100% in HIV-positive liver transplant candidates [118]. Risk stratification was the most common reason for frailty measurement (N = 74). Twenty studies had more than one purpose for frailty measurement. For example, some studies evaluated the feasibility of incorporating frailty assessment into transplant eligibility assessment as well as estimated frailty prevalence and its associated risk of adverse outcomes.

Table 3 Description of frailty measurement tools

Adverse outcome measures examined in relation to frailty

The association between frailty and adverse outcomes was investigated in 74 studies. The majority of these studies measured more than one adverse outcome, which are listed in Table 4. The most commonly measured adverse outcomes were mortality (N = 29), hospital length of stay (N = 20), waitlist mortality (N = 21) and transplant status (e.g., de-listing; N = 17). Of these 74 studies, 70 (94.6%) found that pre-transplant frailty was predictive of at least one adverse outcome, where those who were frail (or had higher levels of frailty) were more likely to experience an adverse event (Fig. 2). For example, frailty was predictive of mortality in 23 of the 29 studies (79.3%) examining this outcome.

Table 4 Adverse outcome measures examined in relation to frailty
Fig. 2
figure 2

Association between frailty and adverse outcomes. (Note: ‘Predictive’ = statistically significant association between adverse outcome and pre-transplant frailty)

Discussion

This scoping review synthesizes the literature around frailty measurement in solid-organ transplant candidates. We found this to be an emerging field of research, with most studies conducted in kidney and liver transplant candidates in North America in the last seven years. Most studies were observational and examined the relationship between frailty and adverse outcomes, particularly mortality and hospital length of stay. Overall, there were 22 different frailty measures used across the 101 studies, including four ‘established frailty tools’. The two most commonly used tools were the Fried Frailty Phenotype and Liver Frailty Index. Frailty prevalence estimates for this middle-aged, predominantly male, population varied widely, from 2.7 to 100%.

In this review, we aimed to include any article claiming to measure frailty to ascertain the full range of frailty measurement tools utilized in the SOT literature. We demonstrated that whilst many articles purport to measure frailty, almost half did not use an established frailty measurement tool. Using non-established measures to identify frail patients is not unique to this population, and is problematic as it constrains the generalisability of studies [122].

Defined by Fried et al. in 2001, the Fried Frailty Phenotype conceptualizes frailty as a clinical syndrome (a set of signs and symptoms that tend to occur together, thus characterizing a specific medical condition) [123]. More specifically, it identifies frailty as the presence of ≥ 3 of 5 criteria: unintentional weight loss of ≥ 10lbs in the preceding year, self-reported exhaustion, weak grip strength, slow walking speed, low physical activity [123]. It is a well-known and widely-used tool in the non-transplant frailty literature and it has many strengths: it is clinically coherent, reproducible, and identifies frailty as a wasting disorder with sarcopenia as a key pathophysiological feature [123]. The most commonly used frailty measurement tool in liver transplant candidates, the Liver Frailty Index, also focuses on physical signs and symptoms of frailty. Developed only four years ago, it comprises three performance-based tests (of grip strength, sit-to-stand transfers and balance) and was originally designed to capture extrahepatic complications of cirrhosis and to enhance mortality prediction in patients with cirrhosis [64].

While the physical manifestations of frailty are important, the omission of disorders of cognition and mood from frailty assessment is controversial: frailty in the clinical setting consists of more than weakness, slowness and wasting [124]. In the non-transplant literature, an alternative conceptualization of frailty is a multidimensional risk state which can be quantified by the number rather than by the nature of health problems [125, 126]. The Frailty Index (FI) model, defined by Rockwood and colleagues, employs a well-defined methodology to create an index as a proportion of deficits [127]. Since FIs can be constructed from different numbers and types of deficits, a measure of frailty status can be derived from information routinely collected during patient assessment and, as a result, it is often touted as the ‘ideal’ frailty measure in the clinical setting. The FI methodology allows for organ-specific deficits to be included and as a result it may be a suitable standardized measure that addresses the concern that not all frailty measures are applicable to all patient (organ) groups [9], An added advantage of using the FI in studies of SOT populations would be the ability to compare prevalence levels and adverse outcomes across groups, which would unify the literature and generate greater impact and advocacy. There are many other tools that measure the ‘multidimensionality’ of frailty. For example, the Clinical Frailty Scale (CFS), also defined by Rockwood [128], is a judgement-based tool that estimates frailty severity according to increasing morbidity and functional dependence. Despite the prominence of these tools in the frailty literature, this scoping review found that they are infrequently utilized in the SOT population.

Different frailty tools do yield different estimates of prevalence. For example, in a meta-analysis of general population-based data from 62 countries and territories, frailty prevalence estimates ranged between 12% (Fried Frailty Phenotype) and 24% (Frailty Index) in adults aged 50 years and over [129]. While there is some overlap in identification of frailty, it is likely that the phenotypic and multidimensional models (and their associated measurement tools) capture different patient groups [130]. In the SOT literature, prevalence rates vary greatly, reflecting between-study methodological differences in sample size and participant inclusion criteria (such as HIV positive status) as well as frailty measurement tools. It is possible that the predominance of phenotypic frailty tools in this field of research may result in an underestimation of frailty burden in the SOT population. This in turn may lead to an underestimation of risk in this population. Consequently, we would further advocate that future SOT studies utilize an established multidimensional frailty tool, such as Rockwood’s Frailty Index, in their assessment of frailty in potential candidates.

In the SOT candidate population, this scoping review found frailty to be associated with a range of adverse outcomes, with most evidence for increased mortality (including post-transplant and wait-list mortality), post-operative complications and prolonged hospitalisation. Fewer studies explored the relationship between frailty and patient-centered outcomes, such as functional and cognitive decline, which are of critical importance to the informed decision-making process. In the general population literature, there has been an extensive evaluation of the ability of frailty tools to predict adverse outcomes, particularly survival. In comparison studies (wherein multiple frailty measures are applied to the same patient population), predictive power varies between tools, with a trend towards multidimensional frailty tools having higher predictive power for short-term survival [131]. At the present time there is limited data to compare the predictive validity of frailty tools in the SOT population.

SOT studies also varied greatly with respect to the timing of frailty measurement relative to transplantation. We would argue that measurement of frailty at transplant eligibility assessment would be most clinically informative, yet only approximately one quarter (N = 27; 26.7%) of studies measured frailty at this time-point. In the kidney transplant population, frail candidates experience a greater deterioration in health-related quality of life and face a higher risk of mortality while awaiting a transplant when compared with non-frail candidates [25, 132]. Similar associations between frailty, reduction of quality of life and wait-list mortality have been found in heart, lung, and liver transplant candidates [60, 69, 133,134,135]. Overall, frail candidates are less likely to receive a transplant than non-frail candidates [136]. While frailty at time of kidney transplantation has been associated with several post-transplantation complications, including delirium, prolonged hospitalisation, delayed graft function, immunosuppression intolerance and mortality [22], it remains unclear how frailty at time of transplant eligibility assessment relates to these outcomes.

This scoping review has a number of limitations. Firstly, the scope of this review was very broad, not only in terms of organ systems, but also frailty measures and time-points. As a result, quality assessments and meta-analysis were not conducted. Further, if a study used other measures of physical function, such as muscle strength or gait speed tests, but did not claim to measure frailty itself, it would not have been identified by our search. Additionally, our search terms were conducted in title/abstract only, however we consider this strategy would have captured almost all relevant studies. Secondly, non-English studies were not included and the grey literature was not reviewed. Thirdly, initial screening was performed by mainly one rather than two reviewers.

The broad scope of this review is also a strength. By synthesizing frailty data for all SOT groups, clinicians and researchers will be able to consider the evidence relevant to their specialty. It highlights gaps in frailty research in this field, such as the lack of experimental and qualitative studies addressing frailty and the small number of studies utilizing established multidimensional frailty tools such as the Frailty Index. Furthermore, it emphasizes the need for future research to address more focused research objectives such as determining the predictive power of different frailty tools in different organ groups for standard medical/surgical as well as patient-centered outcomes.

Ultimately, identifying frailty in potential SOT candidates may improve patient care. While the primary benefit may be in risk-stratification (i.e., defining an individual’s risk profile so to inform the decisions of patients and specialists), it is possible that, in the future, frailty measurement may also shed light on potentially reversible factors amenable to targeted interventions pre- and post-transplant. At a health service level, identifying frailty in SOT candidates has the potential to influence the balance of organ supply and demand by informing the clinician and the patient regarding the risks and benefits of transplantation. Since misclassification of frailty may have significant, negative implications for the patient and the health service, relying solely on clinical acumen is not ideal. Preferably, a validated frailty measurement tool would be incorporated into SOT eligibility assessments internationally. This would facilitate comparisons between patient sub-groups and national and international transplant services and would permit large-scale synthesis of relevant data, which in turn would foster improvements in SOT patient care.