Background

Compared with dialysis, renal transplantation is a more preferred option for end-stage renal disease (ESRD) [1]. In recent report of global database on donation and transplantation (http://www.transplant-observatory.org), about 80,000 renal transplants were performed annually [2]. However, in 2016 United States Renal Data System (USRDS) Annual Data Report, the long-term survival benefit remained unsatisfactory, with ten-year graft survival probabilities of 46.9% for deceased donor transplant [3].

Human leukocyte antigen (HLA) was important biological barrier to a successful transplantation and has substantial impact on the prolongation of graft survival [4]. However, the emergency of modern immunosuppressive agents minimized the effect of HLA compatibility. The US kidney allocation system was extensively modified to eliminated HLA-A similarity in 1995 [5] and HLA-B similarity in 2003 [6]. In the revised United Kingdom kidney allocation scheme, HLA-A matching is no longer considered [7]. But the latest European Renal Best Practice Transplantation Guidelines still recommended that matching of HLA-A, -B, and -DR whenever possible, while gave more weight to HLA-DR locus [8]. So far, the current kidney allocation guideline recommendations were inconsistent in term of HLA compatibility. Besides, for the primary aim to make the kidney last as long as possible, all the current kidney allocation systems were not perfect. Here, we sought to conduct a meta-analysis to assess the magnitude effect of HLA mismatching in adult kidney transplantation, with a particular focus on graft survival and recipient mortality.

Methods

The study was registered in the PROSPERO international prospective register of systematic reviews (CRD42017071894). Details of protocol are described in Additional file 1: Supplemental Methods. The meta-analysis was performed in accordance with the Meta-analysis of Observational Studies in Epidemiology (MOOSE) protocol [9] (Additional file 2: Table S1) and the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guideline [10] (Additional file 3: Table S2).

Literature search strategy

We searched PubMed, EMBASE and the Cochrane Library from their inception to December, 2016, without language restriction. We used the following combinations of Medical Subject Heading (MeSH) terms and corresponding text words: “kidney transplantation”, “renal transplantation”, “human leukocyte antigen”, “HLA” and all possible spellings of “survival”. Further details are described in Additional file 1. Reference lists of articles were manually screened to identify further relevant studies. The literature search was performed independently by two investigators (XMS and XHZ). Differences were resolved by consensus.

Study selection

We included studies that (1) included a study cohort comprising adult post-kidney transplant recipients; (2) were cohort studies/trials reporting associations between HLA mismatching and post-transplant survival outcomes; and (3) provided effect estimates of hazard ratios (HRs) with 95% confidence interval (CIs). Studies reporting data on children or animals or in vitro research were excluded. Besides, reviews, meta-analyses, case reports, case series and technical descriptions with insufficient data or unrelated topics were also excluded. For studies covered overlapping data, we included the most recent and informative one. XMS and XHZ independently screened the titles and abstracts for eligibility. Discrepancies were resolved by consensus.

Outcome measures

Our primary clinical endpoint was overall graft failure; secondary clinical endpoints were death-censored graft failure and all-cause mortality. The European Renal Best Practice Transplantation Guidelines and Kidney Disease: Improving Global Outcomes Guidelines was used to evaluate the incidence of measured outcomes [11, 12].

Data extraction and quality assessment

Data were extracted from predefined protocol, then recorded in a standardized Excel form, including the first author’s name, publication date, study location, study design, cohort size, recipient age, sex distribution, duration, donor source, data source (multi-centered or single-centered), follow-up, unadjusted and adjusted HRs of overall graft failure, death-censored graft failure and all-cause mortality per HLA-mismatch increased, and adjusted covariates in reported multivariable analysis. We contacted libraries abroad or corresponding author of relevant articles by email when detailed data for pooling analysis was unavailable. The methodological quality of included studies was described using the Newcastle-Ottawa Scale. High-quality studies were defined by a score of > 5 points [13]. Disagreements in the scores were resolved by consensus between XMS, XHZ and JD.

Statistical analysis

Hazard ratios (HRs) with corresponding 95% confidence intervals (CIs) were directly retrieved from each study. We chose HRs as the statistic estimates because they correctly reflect the nature of data and account for censoring. Cochran’s Q test and I2 statistic were applied to assess heterogeneity between studies. The following criteria were used: I2 < 50%, low heterogeneity; 50–75%, moderate heterogeneity and > 75%, high heterogeneity [14, 15]. When significant heterogeneity was found between studies (P < 0.10 or I2 > 50%), the effect estimates were calculated using a random-effects model and the DerSimonian-Laird method [16]; otherwise, a fixed-effects model with the Mantel-Haenszel method was used [17]. Subgroup analyses included recipient sample size (≥10,000 vs < 10,000), the nature of data (univariable-unadjusted vs multivariable-adjusted effect estimates), donor source (deceased vs living and deceased), data source (multi-centered vs single-centered) and geographical locations (Europe, North America, Asia and Oceania). A sensitivity analyses was performed by omitting one study at a time and then reanalyzing the data to assess the change in effect estimates. To further explore heterogeneity, a random-effects univariate meta-regression was conducted when at least 10 studies were available. For outcomes of at least 10 studies included, publication bias was assessed by funnel plot and Egger test [18]. Egger test with two-sided P < 0.10 was considered to be statistically significant. Analyses were performed using STATA software, version 13.0 (STATA Corporation, College Station, Texas, USA).

Results

Of 5647 articles identified, we reviewed the full text of 541 reports, and 23 studies [19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41] with 486,608 adult post-transplant recipients were included in the meta-analysis (Fig. 1). Detailed characteristics of included studies are presented in Table 1. Among these studies, 18 studies provided multivariable-adjusted effect estimates [19, 21,22,23,24, 27,28,29,30,31,32,33,34,35,36,37,38,39], 3 studies provided both multivariable-adjusted and univariable-unadjusted data [20, 25, 26], and 2 studies provided univariable-unadjusted data [40, 41]. Besides, 8 studies were multi-centered [19, 23, 27, 32, 33, 35, 37, 38]; another 15 studies were single-centered [20,21,22, 24,25,26, 28,29,30,31, 34, 36, 39,40,41]. When considering HLA locus as categories, 11 studies reported survival outcomes of all HLA locus (HLA-A, -B and -DR) [20, 23,24,25,26,27,28, 32, 35, 36, 40], 8 [19, 20, 22, 29,30,31, 33, 41] reported HLA-DR locus, 4 with HLA-B locus [19, 30, 31, 33], and 3 with HLA-A locus [30, 31, 33]. The methodological quality score was high, ranging from 6 to 8 points (details of quality assessment are provided in Additional file 4: Table S3).

Fig. 1
figure 1

PRISMA flow chart of study selection

Table 1 Baseline characteristics of the studies included in the meta-analysis

Primary outcomes

HLA per mismatch and overall graft failure

Eleven studies (289,987 adult recipients) reported data on HLA mismatching and overall graft failure. The pooled analysis revealed that each incremental increase of HLA-mismatches was significant associated with a higher risk of overall graft failure, both in univariable-unadjusted summary estimates (HR: 1.14; 95% CI: 1.04–1.26; P = 0.008; Fig. 2) and multivariable-adjusted summary estimates (HR: 1.06; 95% CI: 1.05–1.07; P < 0.001; Fig. 2). The heterogeneity was low (I2 = 24.8 and 27.4%, respectively). Detailed predefined subgroup analyses were listed in Table 2. The effect estimates did not changed significantly after stratification for sample size (≥10,000 vs < 10,000), data source (multi-centered vs single-centered), donor source (cadaveric vs living and cadaveric), geographic locations (European, North America, Asia and Oceania) and year period (prior to 1995 vs not prior to 1995). In sensitivity analysis, the summary estimates were not modified after excluding one study at a time. Subsequent univariate meta-regression indicated that these factors did not significantly change the overall effect (Additional file 5: Fig. S1). Publication bias was not significant (Additional file 6: Fig. S2).

Fig. 2
figure 2

Forest plots of the association between HLA per mismatch and overall graft failure, using both of univariable-unadjusted and multivariable-adjusted effect estimates

Table 2 Subgroup analyses of overall graft failure associated with HLA per mismatch and HLA-DR mismatches

HLA-DR mismatches and overall graft failure

Eight studies with 152,105 adult recipients were analyzed to investigate the association between HLA-DR mismatching and overall graft failure. The pooled results revealed an unadjusted HR of 1.44 (95% CI: 0.86–2.41; P = 0.160) with moderate heterogeneity (I2 = 70.0%). After adjustment, each incremental increase of HLA-DR mismatches was significant associated with 12% higher risk of overall graft failure (HR: 1.12; 95% CI: 1.05–1.21; P = 0.002; Fig. 3), with moderate heterogeneity (I2 = 58.3%). Sensitivity analysis with a fixed-effects model obtained similar results (HR: 1.08; 95% CI: 1.05–1.11; P < 0.001). Subsequent subgroup analysis demonstrated that the effect was not modified after stratification for sample size, data source, donor source and geographical locations (Table 2). The effect estimates remained stable after excluding one study at a time. Considering only 8 studies included in meta-analysis, we did not perform a meta-regression.

Fig. 3
figure 3

Forest plots of the association between HLA-A, -B, -DR mismatches and overall graft failure

In addition, three studies with 41,957 recipients evaluated 1 or 2 DR-mismatches versus 0 DR-mismatches. Compared with 0 mismatches in HLA-DR antigen, 1 mismatches and 2 mismatches were all associated with higher risk of overall graft failure, with pooled HRs of 1.12 (95% CI: 1.04–1.21; P = 0.002) and 1.15 (95% CI: 1.05–1.25; P = 0.002), respectively (Additional file 7: Fig. S3). In both pooled analysis, there was no heterogeneity (I2 = 0%).

HLA-B mismatches and overall graft failure

Associations of HLA-B epitope and overall graft failure were reported in 4 studies with 146,019 recipients. The pooled analysis demonstrated that each incremental increase of HLA-B mismatches was not associated with higher risk of overall graft failure (HR: 1.01; 95% CI: 0.90–1.15; P = 0.834; Fig. 3), with moderate heterogeneity (I2 = 66.0%). Sensitivity analysis with a fixed-effects model obtained similar effect estimates (HR: 1.01; 95% CI: 0.89–1.14; P = 0.079). In addition, the effect estimates did not changed significantly after stratification for sample size (≥10,000 vs < 10,000) of cohorts.

HLA-A mismatches and overall graft failure

Only 3 studies (40,000 recipients) reported data on the association of HLA-A epitope and overall graft failure. The results revealed an insignificant association (HR: 1.06; 95% CI: 0.98–1.14; P = 0.121; Fig. 3), with no heterogeneity (I2 = 0%). Sensitivity analysis with a random-effects model showed similar results (HR: 1.06; 95% CI: 0.98–1.15; P = 0.121). The results should be cautiously interpreted because of only three studies included.

Secondary outcomes

Death-censored graft failure

We included 101,093 recipients from 4 cohorts. Each incremental increase of HLA mismatches was associated with a higher risk of death-censored graft failure, with summary HR of 1.09 (95% CI: 1.06–1.12; P < 0.001; Fig. 4), and moderate heterogeneity (I2 = 70.9%). Sensitivity analysis with a fixed-effects model showed similar results (HR: 1.09; 95% CI: 1.08–1.10; P < 0.001). The summary estimates were not modified after including only large sample size of cohorts (> 10,000 recipients) (Additional file 8: Fig. S4).

Fig. 4
figure 4

Forest plots of the association between HLA per mismatch and death-censored graft failure

All-cause mortality

We included 180,766 recipients from 4 cohorts. Each incremental increase of HLA mismatches was associated with a higher risk of all-cause mortality rates (HR: 1.04; 95% CI: 1.02–1.07; P = 0.001; Fig. 5). The heterogeneity was moderate (I2 = 65.3%). Summary estimates did not changed significantly after analyzing with a fixed-effects model (HR: 1.04; 95% CI: 1.02–1.07; P = 0.001). After stratification for sample size of cohorts (≥10,000 vs < 10,000), the effect estimates were not modified (HR: 1.04; 95% CI: 1.02–1.05; P < 0.001; I2 = 27.8%, Additional file 8: Fig. S4). However, the results should be cautiously interpreted due to small number of included studies (n = 4).

Fig. 5
figure 5

Forest plots of the association between HLA per mismatch and all-cause mortality

Discussion

This is the first meta-analysis to evaluate the magnitude effect of HLA mismatching on post-transplant survival outcomes of adult kidney transplantation. The analysis included 23 studies with a large sample of subjects (totally 486,608 recipients). The results indicated that each incremental increase of HLA mismatches was significantly associated with higher risks of overall graft failure, death-censored graft failure and all-cause mortality. The pooled results also indicated that HLA-DR mismatches were significantly associated with a 12% higher risk of overall graft failure. We also observed that HLA-A per mismatch was associated with a 6% higher risk of overall graft failure, but the association was insignificant. There was no significant association between HLA-B mismatching and graft survival. All included studies were in high methodological quality and the heterogeneity between studies was acceptable in each pooling analysis. In addition, we found that sample size or recipient ethnicity may be potential sources of heterogeneity.

Human HLA genes are located on chromosome 6 and code for 3 major class I alleles (HLA-A, -B, -C) and 3 major class II alleles (HLA-DR, -DQ, -DP). Polymorphisms in HLA, especially HLA-A, -B, and -DR loci, are important biological barriers to a successful transplantation [42, 43]. As closely HLA-matched graft is less likely to be recognized and rejected, HLA mismatching has a substantial impact on prolongation of graft survival. With the emergence of potent immunosuppressive agents that steadily improved the graft survival rates, the impact of HLA compatibility seems to be minimized [42, 44]. But the recent Australia and New Zealand Dialysis and Transplant Registry (ANZDTR) survey with 12,662 recipients still demonstrated that each incremental increase of HLA mismatches was significantly associated with higher risk of graft failure and rejection [27]. Another recent survey from Massie et al. [19] with 106,019 recipients from the Scientific Registry for Transplant Recipients (SRTR) database revealed that HLA-B and -DR mismatches were all significant associated with worse graft survival outcomes. Using multivariable-adjusted data (adjusting for other determinant confounders such as donor and recipient age, gender, combined disease, serum creatinine levels, ischemic times, etc.), the present analysis indicated that HLA per mismatch was associated with an increased risk of overall graft failure (9%), death-censored graft failure (6%) and all-cause mortality (4%). The pooled results were in favor of recommendations of the latest European Renal Best Practice Transplantation Guidelines, which recommended that matching of HLA-A, -B, and -DR whenever possible [8].

The meta-analysis suggested that HLA-DR per mismatch was significant associated with a 12% higher risk of overall graft failure. Besides, a subsequent analysis suggested that compared with 0 DR-mismatches, 1 and 2 mismatches were significant associated with 12 and 15% higher risk of overall graft failure, respectively. The pooled results were in favor of the kidney allocation guideline recommendations in almost all countries, such as the current US kidney allocation system, the revised United Kingdom kidney allocation scheme, and the latest European Renal Best Practice Transplantation Guidelines, which all highlighted the importance of HLA-DR testing [5,6,7,8].

Notably, the present analysis revealed a tendency that HLA-A mismatching had an impact on overall graft survival as there were only 3 studies included with a pooled HR of 1.06 (95% CI: 0.98–1.14). However, we did not observe a significant association between HLA-B mismatching and overall graft survival (HR: 1.01; 95% CI: 0.90–1.15). Our pooled results were inconsistent with the recommendations of the revised United Kingdom kidney allocation scheme, which eliminated the impact of HLA-A similarity instead of HLA-B similarity [7]. Moreover, miscellaneous factors can result in inferior outcomes [45]. For instance, inferior graft outcomes could be related to high risk for rejection particularly antibody-mediated rejection [45,46,47]. Inferior patient survival could partly be associated with consequences of enhanced immunosuppression [45]. Consequently, the pooled results should be cautiously interpreted and further studies should be conducted to investigate the impact of HLA-A mismatching on graft and recipient survival outcomes.

Subgroup analysis and meta-regression was conducted to explore heterogeneity between studies. In subgroup analysis of the association between HLA per mismatch and overall graft failure, we found that after stratification for donor source (cadaveric vs living and cadaveric), the heterogeneity decreased to insignificant (I 2 = 0 and 16.4, respectively). But subsequent meta-regression analysis revealed that donor source did not change the overall effect significantly. In subgroup analysis of the association between HLA-DR mismatching and overall graft failure, we found that ethnicity and recipient sample size were potential source of heterogeneity. Large sample size of cohorts usually demonstrated more stable results. Besides, ethnic diversity was a potential source of heterogeneity probably because of varying HLA polymorphisms in the genetic makeup of the geographically distinct cohorts.

Strengths and limitations

Strengths of our meta-analysis are large sample of subjects (totally 486,608 recipients) and strict study design. Besides, we used multivariable-adjusted data for pooling analysis, which adjusted for some primary determinant confounders. However, the present meta-analysis had some limitations. Firstly, the absence of randomized controlled trials was the biggest limitation of this meta-analysis. Secondly, several studies have suggested that other HLA loci, such as HLA-C and -DQ locus, may contribute to poorer graft outcomes [48,49,50], but this meta-analysis only included the HLA-A, -B and -DR loci. Thirdly, heterogeneity is inevitable in some outcomes. We conducted several subgroup and meta-regression analyses to explore the potential source of heterogeneity, and used random-effects models to incorporate heterogeneity between studies. Fourthly, few studies included could provide data about induction agent, maintenance agent or PRA, so that it cannot be achieved to do the stratified analysis.

Conclusions

HLA mismatching was still a critical prognostic factor that affects graft and recipient survival. HLA-DR mismatching has a substantial impact on recipient’s graft survival. HLA-A mismatching has minor but not significant impact on graft survival outcomes. Further studies should be conducted to confirm the impact of HLA-A similarity.