Introduction

Lung transplantation is a major surgical procedure for patients with end-stage pulmonary diseases such as idiopathic pulmonary fibrosis [1]. Since the first transplant performed by Hardy in 1963 the delivered care in this field has significantly improved in terms of surgical techniques, candidate selection, post-operative immunosuppressive treatment, and diagnosis and management of organ rejection [1, 2]. Regarding this last aspect, allograft rejection is due to an immune response toward the transplanted organ and can be categorized as hyperacute, acute, and chronic. The hyperacute response is usually caused by preformed antibodies of the recipients against the human leukocyte antigen (HLA) of the donor, the acute form can be T-cell or antibody mediated while the chronic type is still considered multifactorial with various risk factors contributing to its occurrence (e.g., infections and HLA-mismatching) [3,4,5]. The diagnosis of rejection requires a specialized multidisciplinary team including a thoracic radiologist [5,6,7]. Computed tomography (CT) signs of acute primary graft dysfunction and acute allograft rejection comprise ground-glass opacities, consolidation, interstitial thickening, and pleural effusion while chronic allograft lung dysfunction can be characterized at imaging by bronchiectasis, small nodules, and air trapping as part of the bronchiolitis obliterans syndrome pattern or by reticulation and upper lobes fibrosis representative of the restrictive allograft syndrome [7, 8].

In the complex management of the post-transplant phase, also the impact of muscle composition has been assessed by mechanical tests and imaging [9,10,11,12]. Various radiological techniques can be used for muscle assessment including ultrasound, dual-energy X-ray absorptiometry (DXA), CT, and magnetic resonance (MR). For instance, by ultrasound information regarding muscle thickness, architecture and composition such as muscle echogenicity, fiber length, and pennation angle can be collected [13]. By DXA several parameters reflecting body composition and muscle status can be obtained, like the appendicular lean mass and the appendicular lean mass index. Nevertheless, it should not be overlooked that these variables should be associated with a qualitative assessment performed by physical evaluation [14]. CT allows the extraction of quantitative data like muscle density, cross-sectional area, and the skeletal muscle index while MR enables a qualitative and quantitative assessment of muscle structure and characteristics by semi-quantitative scores applied on turbo spin-echo images or using specific sequences such as Dixon, Diffusion Weighted Imaging (DWI), and mapping, respectively [15, 16].

Nowadays, advanced quantitative imaging techniques like radiomics and machine learning have been successfully used also to investigate muscle characteristics showing promising results in predicting sarcopenia in patients with non-small cell lung cancer [17,18,19]. These methods which are based on complex computational analyses of radiological images providing crucial information beyond the visible were initially applied in the oncological field but as mentioned above are currently used for different types of diseases [20,21,22,23].

Nevertheless, until now, to the best of our knowledge, the application of radiomics on muscles in transplantology has not been explored yet.

Thus, aim of this study was to evaluate if muscle composition, expressed as muscle density, and radiomics of the paravertebral muscle act as predictors of allograft dysfunction in bilateral lung transplant candidates.

Methods

Study design

The last available high-resolution chest CT (HRCT) before surgery of bilateral lung transplant candidates (i.e., performed in the two months preceding operation) referring to our tertiary center from January 2010 to February 2020 was examined for this retrospective institution review board approved study. The following inclusion criteria were applied: scans performed with volumetric acquisitions, 1 mm slice thickness, and B30 kernel reconstructions.

For each patient, clinical and first-line laboratory variables [body mass index (BMI), Barthel index, creatinine (umol/L), white (× 109 L) and red blood cell (× 1012 L) count, hematocrit (vol%), hemoglobin levels (g/dl), and erythrocyte sedimentation rate (ESR) (mm/h)] were collected.

Muscle segmentation and radiomic analysis

One radiologist with ten years of experience in chest and musculoskeletal imaging used a semi-automatic tool of an opens source software (3D Slicer, www.slicer.org) to segment the spinal muscles of each patient at the level of the 11th dorsal vertebra, covering a volume of 1 cm height, and applying the usual Hu range for muscle assessment (− 29 to 150 Hu) [24, 25] (Fig. 1A, B, C). Aiming to standardize the approach and guarantee repeatability, the first slice including the upper plate of the 11th dorsal vertebra was considered as reference point to start the segmentation.

Fig. 1
figure 1

Graphic representation of the study design: in A axial HRCT scan indicating the examined population, composed of candidates to bilateral pulmonary transplant referring to our tertiary center; in B the selection of the paravertebral muscles at the level of the 11th dorsal vertebra with an extension of 1 cm height, used for the segmentation; in C the segmented muscle area, performed applying the threshold − 29 to + 150 Hu; in D the examined volume from which were extracted the radiomic features; in E the factor analysis applied for variables reduction

We considered the value < 30 Hu of the examined volume as threshold for the definition of muscle loss [15, 26, 27].

For the radiomics analysis, the radiomics plugin of 3D Slicer which encapsulates pyRadiomics library was applied. The previously segmented volume was loaded in the radiomics module interface. Range re-segmentation and intensity outlier filtering were not needed since the range for muscle assessment (− 29 to 150 Hu) was previously applied [28].

Aiming to avoid model overfitting due to the small sample size, only 72 radiomic features extracted (Fig. 1D). The radiomic features were of first and second order: intensity-based features (FOS) and texture features [gray-level co-occurrence matrix (GLCM) as well as gray-level run length matrix (GLRLM)]. Among the FOS, mean, standard deviation, skewness, kurtosis, and energy were included, while for the GLCM, for instance, inverse variance, sum average, joint entropy, and cluster shade were extracted; last, among the GLRLM, short run emphasis, run percentage, and long run low gray level emphasis were included.

Factor analysis was applied to select highly correlating radiomic features (Fig. 1E).

Lung transplant allograft rejection

Allograft rejection was defined by a combination of tests and evaluations including: (i) immunological test, for instance demonstrating donor-specific circulating antigen; (ii) histopathological changes in the specimen obtained by transbronchial lung biopsy such as perivascular mononuclear cell infiltrates typical of acute rejection; (iii) visual assessment of HRCT scans, detecting findings such as ground-glass opacities and interstitial thickening in acute rejection; (iv) and pulmonary function test for example demonstrating 20% or greater decline in forced expiratory volume in the first second (FEV1) or forced vital capacity (FVC) from the best postoperative value in patients with chronic lung allograft disfunction [7, 29, 30]. In our tertiary center a weekly multidisciplinary meeting for monitoring lung transplanted patients and early detecting lung allograft rejection is performed.

Statistical analysis

Descriptive statistics were performed. The Student’s t-test, the chi-square, and the Mann–Whitney tests were used, respectively, for continuous, categorical, and ordinal variables, to evaluate if any difference regarding muscle volume and the above-mentioned clinical and laboratory variables occurred between patients with and without muscle loss or between patients with and without allograft rejection.

To investigate the prognostic value of the features, previously selected by factor analysis, for allograft rejection, logistic regression analysis was used. In case of statistically significant results, the diagnostic accuracy was computed using receiver operating characteristic curves and the value with the highest Youden index was selected as cutoff.

To evaluate the robustness of the proposed method, all segmentations and data extraction were repeated by a second reader with four years of experience in chest imaging and the intraclass correlation coefficient (ICC) of the variables highly correlating at factor analysis, computed. ICC values > 0.750 were considered excellent [31].

All statistical analyses were performed with SPSS (IBM SPSS Statistics version 27, IBM Armonk, NY, USA), applying p < 0.05 as significant level.

Results

An overall number of 200 patients underwent a HRCT prior to the transplant. Nevertheless, only ninety-seven matched the inclusion criteria and were examined since 75 HRCT did not satisfy the technical criteria while 28 scans were performed earlier than two months before the transplant. The main characteristics of the examined population are summarized in Table 1. The investigated population was composed of 29 women (29.9%) and 68 men (70.1%) and the mean age was 50.4 years old. Most of the patients underwent bilateral lung transplant for idiopathic pulmonary fibrosis (n = 37, 38.1%), followed by cystic fibrosis (n = 12, 12.4%) (Table 1). Overall, 21 (21.6%) patients showed allograft rejection (i.e., fifteen acute and six chronic; Figs. 2, 3).

Table 1 Main characteristics of the examined population
Fig. 2
figure 2

Forty-year-old female transplanted for cystic fibrosis who developed acute cellular allograft rejection six months after the transplant. In A the coronal reconstruction of the high resolution computed tomography performed before the bilateral lung transplant well demonstrating the severe bronchiectasis affecting the entire left lung as well as the bronchiectasis and ground-glass opacities in the right lung. In B an axial high resoution computed tomography image demonstrating ground-glass opacities and interlobular thickening in the left lower lobe suggesting acute allograft rejection then confirmed by laboratory and histological findings. The patient died a few weeks after the transplant

Fig. 3
figure 3

Twenty-four-year-old female transplanted for, histologically proven, lymphocytic interstitial pneumonia who developed restrictive chronic allograft rejection 20 months after the transplant. In A an axial image of the high-resolution computed tomography performed before the bilateral lung transplant well demonstrating bilateral severe interstitial thickening, bronchiectasis, and fibrotic ground-glass as well as cysts in the left lower lobe. In B an axial high-resolution computed tomography image demonstrating peri-hilar consolidations on the right side as well as band atelectasis and ground-glass opacities in the lower lobes; in C an axial image of the expiratory phase showing air trapping in the lower lobes. The diagnosis of restrictive chronic rejection was achieved including respiratory test as well as laboratory and histological findings

The mean Hu value and volume of the examined spinal muscles were 36.6 ± 8 and 68.8 ± 54 cm3, respectively. Sixteen patients had radiological signs of muscle loss (i.e., Hu values < 30; in particular 23.2 ± 5 Hu), 13 were males and 10 affected by idiopathic pulmonary fibrosis (mean age 54.7 ± 9 years old). There were no differences of muscle volume neither between patients with and without muscle loss (70.8 ± 59 vs. 58.7 ± 12 cm3, p = 0.423) nor between patients developing or not allograft rejection (59 ± 16 vs. 71.7 ± 60 cm3, p = 0.334).

Patients with muscle loss showed significantly higher levels of inflammation expressed as ESR (43 ± 23 vs. 27.8 ± 19 mm/h, p = 0.023). Allograft rejection occurred most frequently in men (p = 0.016). None of the other clinical and laboratory variables, including BMI and Barthel index, showed any significant difference comparing patients with and without muscle loss as well as patients with and without allograft rejection (Table 2, Fig. 4).

Table 2 Comparisons of population characteristics according to muscle composition and allograft rejection
Fig. 4
figure 4

Box-plots demonstrating the lack of statistically significant differences regarding Barthel index and body mass index (BMI) in patients with or without muscles loss (A and B) and with or without allograft rejection (C and D), respectively

The following features were selected by factor analysis: cluster prominence, Imc2, gray level non-uniformity normalized, median, kurtosis, gray level non-uniformity, and inverse variance. The radiomics-based model including also Hu demonstrated that only the feature Imc2 acts as a predictor of allograft rejection (p = 0.021). Such feature showed 76.6% accuracy and the value 0.19 demonstrated 81% sensitivity and 64.5% specificity in predicting lung transplant rejection (Fig. 5). The variables selected by factor analysis showed excellent repeatability: cluster prominence ICC = 0.984, 95%CI (0.976–0.989); Imc2, ICC = 0.970, 95% CI (0.956–0.980); gray level non-uniformity normalized, ICC = 0.955, 95% CI (0.934–0.970); median, ICC = 0.985, 95% CI (0.934–982); kurtosis, ICC = 0.937, 95% CI (0.907–0.957); gray level non-uniformity, ICC = 0.846, 95% CI (0.820–0.963); inverse variance, ICC = 0. 917, 95% CI (0.901–0.947).

Fig. 5
figure 5

Receiver operating characteristic curve of the radiomic variable Imc2 extracted from the paravertebral muscle which demonstrated 76.6% accuracy in predicting allograft rejection

Discussion

Our study suggests that advanced CT analysis of paravertebral muscles may play a role in the prediction of allograft rejection in bilateral lung transplant candidates and that the radiomic variable Imc2, which quantifies the complexity of the texture in terms of correlation between the probability distributions of the elements of the matrix, could be then considered as a novel radiological biomarker [32]. Currently, the diagnosis of acute or chronic allograft rejection relies on a multidisciplinary approach including histopathology, radiology, functional and immunologic tests [4,5,6,7]. Diagnostic imaging is of support, for instance, leading to targeted transbronchial biopsy or, in case of suspected chronic allograft dysfunction, detecting signs compatible with bronchiolitis obliterans or restrictive allograft syndrome [7].

Despite our encouraging and novel results, it certainly has to be highlighted that the proposed model has moderate sensitivity and low specificity. Nevertheless, our accuracy is higher than the one reached by de Jong and colleagues (AUC 0.49), who used radiomics to predict muscle loss in patients affected by non-small cell lung cancer. Similarly, it is also higher than the one obtained by the model of Kim et al. when only the erector spinae (AUC range 0.55–0.75) was assessed in patients affected by the same type of tumor. In fact, this latter group of researchers has also demonstrated that a multifactorial model including gender, coarseness, skewness, and cluster prominence has a good performance in defining sarcopenia especially if all muscles of the thoracic wall at the level of the third lumbar vertebra are used (AUC > 0.75) [17, 18].

Although our results need to be further explored in multifactorial models, maybe expanding the analysis to all thoracic muscles, our evidence underlines the importance of a systemic approach to the overall management of lung transplant candidates. Indeed, the fact that muscle characteristics and strength may have an impact on the outcome is in agreement with previous studies showing that muscle composition plays a role on the general status and performance of patients after bilateral lung transplant [10,11,12]. Thus, our study further supports the evidence that several systemic factors may influence the success of the procedure [33,34,35].

It cannot be overlooked that the prognostic value of muscle loss assessed by CT is still controversial in the literature. In fact, Rozenberg et al. and Hsu and colleagues indicated that sarcopenia is a poor prognostic factor in patients undergoing bilateral lung transplant while our results are in accordance with Lee et al. who suggested that sarcopenia, assessed as muscle cross sectional area, does not affect the early outcome [9, 12, 36]. This heterogeneity of results could be due to the different muscle target used for the analysis (psoas vs. paraspinal muscles), to the type of assessment (single slice vs. volumetric), and to the extracted parameters (muscle density vs. muscle area).

The absence of differences in terms of allograft rejection between patients with and without muscle loss, lets us assume that Imc2 may either reflect a very subtle onset of myosteatosis or more complex changes constituted of fat and inflammatory infiltrates considering also that significantly higher ESR occurred only in patients with muscle loss.

Certainly, correlating the radiological findings with muscle histological features could provide deeper insights into the characteristics of such changes. This type of investigation was not feasible in our study given the retrospective design of the project. Moreover, it cannot be overlooked that a biopsy of the paravertebral muscles would represent an invasive procedure. Considering the type of surgical intervention, a different muscle target, such as the diaphragm could be more easily accessible although it would be a challenge for the assessment at imaging. To overcome such difficulties, in future studies noninvasive techniques like diffusion tensor imaging, which enables the collection of quantitative information such as fractional anisotropy and mean diffusivity providing detailed information about muscle structure, could be compared or even associated with radiomics to provide new insights into the changes occurring in the paravertebral muscles of this group of patients [37, 38].

This study is affected by several limits. First, we did not assess the prognostic value of muscle composition on the overall outcome because this information was not available for most of the patients. Certainly, further studies including also this type of analysis are needed to further investigate the role of radiomics in transplantology.

We did not subdivide our patients according to the type of rejection (i.e., acute or chronic) because of the overall low number of cases of allograft dysfunction. It has to be underlined that our rate of allograft rejection is in the range reported in the literature and that this limit could be overcome by multicenter studies with a larger sample size [5].

Given the retrospective study design, we could not investigate the role of additional clinical and laboratory variables, such as the levels of creatine phosphokinase because they were not available for all or at least most of the included patients. Prospective studies with tailored collections of muscle-related laboratory and clinical variables are highly recommended. Moreover, further muscle parameters like the skeletal muscle index and/or the application of additional methods like DXA to assess muscle loss were not used but our study was focused on a CT-based radiomic evaluation of a muscle volume extracted from the paravertebral area [14, 39]. In future a multiparameter model including also these elements may be developed. Last, we must address that from a pool of 200 patients only 97 were included. This is given to the retrospective study design with a large time interval. In fact, given the technical improvements of the last decade, most of the excluded HRCT scans were performed with different techniques and heterogeneous parameters. Certainly, future prospective studies may overcome all limits. Nevertheless, considering that bilateral lung transplant is performed for rare diseases, we consider the size of our population satisfying for this preliminary study.

In conclusion, in our population the use of a muscle CT-based model of radiomics allowed the prediction of allograft dysfunction in lung transplanted patients. This evidence, if confirmed in larger, multicentric, and prospective studies, may improve the overall clinical management of patients undergoing bilateral lung transplant, influence the selection of optimal candidates, and guide the development of specific tailored physiotherapy programs in patients considered at risk at imaging.