Abstract
Objectives
To predict kidney fibrosis in patients with chronic kidney disease using radiomics of two-dimensional ultrasound (B-mode) and Sound Touch Elastography (STE) images in combination with clinical features.
Methods
The Mindray Resona 7 ultrasonic diagnostic apparatus with SC5-1U convex array probe (bandwidth frequency of 1–5 MHz) was used to perform two-dimensional ultrasound and STE software. The severity of cortical tubulointerstitial fibrosis was divided into three grades: mild interstitial fibrosis and tubular atrophy (IFTA), fibrotic area < 25%; moderate IFTA, fibrotic area 26–50%; and severe IFTA, fibrotic area > 50%. After extracting radiomics from B-mode and STE images in these patients, we analyzed two classification schemes: mild versus moderate-to-severe IFTA, and mild-to-moderate versus severe IFTA. A nomogram was constructed based on multiple logistic regression analyses, combining clinical and radiomics. The performance of the nomogram for differentiation was evaluated using receiver operating characteristic (ROC), calibration, and decision curves.
Results
A total of 150 patients undergoing kidney biopsy were enrolled (mild IFTA: n = 74; moderate IFTA: n = 33; severe IFTA: n = 43) and randomized into training (n = 105) and validation cohorts (n = 45). To differentiate between mild and moderate-to-severe IFTA, a nomogram incorporating STE radiomics, albumin, and estimated glomerular filtration (eGFR) rate achieved an area under the ROC curve (AUC) of 0.91 (95% confidence interval [CI]: 0.85–0.97) and 0.85 (95% CI: 0.77–0.98) in the training and validation cohorts, respectively. Between mild-to-moderate and severe IFTA, the nomogram incorporating B-mode and STE radiomics features, age, and eGFR achieved an AUC of 0.93 (95% CI: 0.89–0.98) and 0.83 (95% CI: 0.70–0.95) in the training and validation cohorts, respectively. Finally, we performed a decision curve analysis and found that the nomogram using both radiomics and clinical features exhibited better predictability than any other model (DeLong test, p < 0.05 for the training and validation cohorts).
Conclusion
A nomogram based on two-dimensional ultrasound and STE radiomics and clinical features served as a non-invasive tool capable of differentiating kidney fibrosis of different severities.
Key Points
• Radiomics calculated based on the ultrasound imaging may be used to predict the severities of kidney fibrosis.
• Radiomics may be used to identify clinical features associated with the progression of tubulointerstitial fibrosis in patients with CKD.
• Non-invasive ultrasound imaging-based radiomics method with accuracy aids in detecting renal fibrosis with different IFTA severities.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Chronic kidney disease (CKD) describes a state of progressive structural and functional deterioration of the kidney, presenting as a reduced estimated glomerular filtration rate (eGFR). CKD can lead to end-stage kidney disease (ESKD) and is responsible for 9.1% and 4.6% of noncommunicable disease-related morbidity and mortality, respectively [1]. It is projected that CKD will become the fifth leading global cause of death by 2040 [2]. Consequently, timely diagnosis followed by early treatment initiation for those with CKD is crucial for optimizing their outcomes.
Interstitial fibrosis and tubular atrophy (IFTA) are tightly correlated with CKD severity and impact patients’ long-term prognosis. Moderate and severe IFTA, compared to mild IFTA, and global glomerulosclerosis are associated with more than a two- and three-fold increased risk of kidney function loss, respectively [3]. However, current methods for monitoring kidney fibrosis remain unsatisfactory. In clinical practice, eGFR is not always consistent with the degree of renal fibrosis. eGFR can be quite insensitive to subclinical kidney function impairment. Kidney biopsy is considered the gold standard for confirming CKD diagnosis and fibrosis grading [4,5,6]. However, kidney biopsy carries the risk of complications, and spatial sampling bias reduces the accuracy of pathological diagnosis; therefore, kidney biopsy has not been considered the preferred follow-up approach for patients with CKD [7, 8].
Morphological changes in the kidney cortex and volume mostly occur during ESKD. Under pathological examination, CKD is characterized by kidney fibrosis, or the pathological deposition of massive extracellular matrices related to an increasing number of fibroblasts [9, 10]. These changes are associated with subsequent scarring and sclerosis of kidney tissues, leading to kidney morphological alterations [11]. Ultrasound examinations can assess changes in speckling pattern and signal scattering, both of which variably correlate with changes in kidney morphology and rising parenchymal stiffness. However, distinguishing diseased kidneys from healthy ones using two-dimensional (2D) ultrasound can be difficult for sonographers. These limitations lead to the increased utility of radiomics. Radiomics are quantifications of medical images using statistical algorithms. The machine learning part is used for outcome prediction in subsequent steps. Radiomics aims to support diagnostic decisions through differentiating between different tissue types [12, 13]. Among radiomics, texture analysis is an emerging tool for quantitating the severity of kidney diseases. Radiomics has been applied to different imaging modalities for the identification and differentiation between kidney diseases, including kidney tumors, carcinomas [14,15,16,17], the discrimination of malignant and benign clinical T1 renal masses [18] and renal tumor histological subtypes [19], early kidney damage in patients with diabetes mellitus [20], the detection of kidney stones [21], and the differentiation between normal and diseased kidneys in those with CKD [22].
Based on the reasons outlined above, we combined radiomics data from 2D ultrasound and Sound Touch Elastography (STE) images, as well as clinical factors to construct models for application, followed by model verification. We tried to use a nomogram to predict the degree of IFTA among CKD patients without histopathological data. We aimed to provide a non-invasive diagnosis approach for CKD and used this approach to monitor the treatment responses and disease course of these patients.
Materials and methods
Ethics statement
The current study complied with the Declaration of Helsinki and was approved by the local ethics review board (KY2021146). We obtained written informed consent from each participant.
Selection of study participants
The definition of CKD was made based on an eGFR < 60 mL/min/1.73 m2 for at least 3 months [23]. The inclusion criteria were CKD patients who had a clinical indication of kidney biopsy. The exclusion criteria were as follows: patients with any contraindications for kidney biopsy, asymmetric bilateral kidney atrophy, abnormal kidney structure, or poor resolution of kidney cortex and medulla on 2D ultrasound. Clinical and laboratory tests were collected from each patient within 2 days before they underwent kidney biopsy.
Ultrasonography procedures
We used the Mindary Resona 7 Ultrasound System and SC5-1U convex array probe (bandwidth frequency of 1–5 MHz) (Mindray Bio-Medical Electronics Co., Ltd.) to perform 2D ultrasound and STE software. STE measurements were performed 5 times with uniform color fill, and the final standard deviation (SD) of the STE values was set at less than 2.0 as quality control. All examinations were performed by a sonographer with 8 years of experience, who was blinded to serological, imaging, and kidney biopsy pathological results.
Kidney biopsy and pathological examination
Renal biopsy specimens within 3 days of renal ultrasound were obtained from patients with CKD. A renal needle biopsy was done to sample the lower pole parenchyma of the target kidney under ultrasound (US) guidance. To ensure that the selected US images matched the US biopsy location, the kidney puncture operation, 2D ultrasound, and STE examination were performed by the same sonographer. Two experienced pathologists scored the severity of glomerular sclerosis, tubulointerstitial injury, and vascular sclerosis based on the Banff scoring system and experiences from Farris et al [24, 25]. Any disagreement between pathologists was resolved by consensus. We used the Image-Pro Plus 6.0 software to evaluate the proportion of tubulointerstitial fibrotic areas. Patients with CKD were classified according to the Banff scoring system for kidney cortical fibrosis [26]. In this scoring system, the severity of cortical tubulointerstitial fibrosis was divided into three grades: mild IFTA, fibrotic area < 25%; moderate IFTA, fibrotic area 26–50%; and severe IFTA, fibrotic area > 50%.
Processing flow of radiomics
Image segmentation
Images of Digital Imaging and Communications in Medicine (DICOM) format acquired during B-mode and STE examination were imported into ITK-snap software for manual image segmentation. We evaluated the region of interest (ROI) containing the kidney cortex but removed the kidney medulla and perirenal fat tissues during image curation. Any difference between the two interpreters was resolved by group discussions.
Feature extraction and establishment of radiomics label
The DICOM images and ROIs obtained from ITK-SNAP software were imported into the AK software (Artificial Intelligence Kit, GE Healthcare) for extracting radiomics. The extracted features included first-order (histogram and morphologic features) alongside second-order parameters. The second-order parameters mainly involved Gray Level Co-Occurrence Matrix (GLCM), Gray Level Run Length Matrix (GLRLM), Gray Level Size Zone Matrix (GLZSM), Neighboring Gray Tone Difference Matrix (NGTDM), and Gray Level Dependence Matrix (GLDM). The ROI of all images was delineated by two sonographers. The inter-observer agreement was evaluated using interclass correlation coefficient (ICC) analysis, which was defined as good consistency for values between 0.75 and 1, fair consistency for values between 0.4 and 0.75, and poor for values under 0.4. ICC values higher than 0.75 were selected for further analysis. Patients were randomly divided into training and validation cohorts at a ratio of 7:3. We planned for two types of comparisons: mild versus moderate-to-severe IFTA and mild-to-moderate versus severe IFTA.
Feature selection
Minimum redundancy maximum relevance (mRMR) was used to eliminate redundant and irrelevant features, retain the optimal ones, filter out the optimal feature subset through the least absolute shrinkage and selection operator (LASSO) algorithm, and build a final model. After determining the number of optimal features, we selected the most predictive feature subset and calculated the corresponding coefficients [27].
Model construction and result validation
Feature extraction based on B-mode and STE images yielded a radiomics quality score (Rad-Score), which was the radiomics label calculated by the weighted summation of selected features by their coefficients. We used receiver operating characteristic curve (ROC) analysis to evaluate the performance of each constructed model. The Akaike information criterion of the clinical model was applied to determine the most appropriate clinical model. Multivariate logistic regression combining clinical features with the Rad-Score was conducted to establish a predictive model and generate a clinical nomogram. The usefulness of a nomogram lies in its ability to map prediction probability to points on a picture with a scale between 0 and 100. The total points accrued based on different types of features corresponded with the predicted probabilities of the index patient [28, 29]
The predictive accuracy of each model was assessed by the area under the ROC curve (AUC) value for the training and validation cohorts. We tested the performance of the Knott diagram in the validation cohort. Using the logistic regression model established in the training cohort, we calculated the total score for each patient in the validation cohort and obtained the AUC and calibration curve. To estimate the prediction error of each model, we further tested the proposed model using the 1000-iteration bootstrap analysis for both the training and validation cohorts. We randomly selected 70% of patients from the training or validation cohort and calculated the corresponding AUC values.
Comparisons between AUCs were made with the DeLong test. The calibration curves and Hosmer–Lemeshow test were used to investigate the performance of the nomogram. Finally, to evaluate the clinical practicability by quantifying the net benefits of the nomogram model in both the training and validation cohorts, the decision curve analysis (DCA) was performed based on clinical features and radiomics labels from B-model, STE-model, B plus STE model, and the combined models. DCA determines the clinical practicability of radiomics nomograms by quantifying the net benefits under different threshold probabilities in the validation set.
Statistical analyses
SPSS (version 26.0; IBM), GraphPad Prism 8.0 (GraphPad Software), and R statistical software (version 4.0.2) were used for statistical tests. p < 0.05 was considered statistically significant.
Results
Basic clinical information from participants
The flowchart of patient selection is provided in Fig. 1. A total of 150 patients with CKD were identified in Table 1, along with their pathological diagnoses (Supplementary Table 1). The course of processing radiomics is shown in Fig. 2. We also illustrated how the ITK-SNAP software delineated the ROI of the punctured kidney cortex (Fig. 3). Table 2 shows the clinical characteristics of training and validation cohorts.
Radiomics flow chart of this study. We exported the collected images in DICOM format, used the ITK software to delineate ROIs, and performed image segmentation. We used the AK software to extract ultrasound radiomics, and built models based on the clinical characteristics of patients with CKD. Later, we performed model calibration and validation.
Pathological examinations using Masson staining from kidney biopsy contrasting images from B-mode, and ITK-SNAP ROI in patients with different IFTA severities. A–C A 38-year-old female patient with systemic lupus erythematosus. Kidney biopsy showing focal proliferative lupus nephritis, III-(A), tubulo-interstitial atrophy was 5% assessed by Masson staining, mild IFTA. D–F A 36-year-old female patient with chronic kidney disease, 40% tubulo-interstitial atrophy assessed by Masson staining, moderate IFTA; G–I A 28-year-old male patient with chronic kidney disease. Tubulo-interstitial atrophy assessed by Masson staining was 70%, with severe IFTA
Feature selection, model construction, and results validation
We extracted 1156 radiomics from the B-mode and STE images for each participant, based on the result of reproducibility analysis by two sonographers, 739 radiomics had good consistency (ICC > 0.75), and retained 120 features after being filtered by the mRMR method. We also did texture feature selection based on the LASSO logistic regression (Supplementary Figure 1) and selected 36 radiomics after the procedure. These features were used to construct the radiomics signature (Fig. 4). The final formula for calculating Rad-Scores is shown in the Supplementary Materials. We compared the Rad-Scores between the training and the testing groups, as shown in Supplementary Figure 2.
Radiomics signatures for B-mode and STE images. A Four features from B-mode images of mild versus moderate-to-severe IFTA; B Eleven features from STE images of mild versus moderate-to-severe IFTA; C Nine features from B-mode images of mild-to-moderate versus severe IFTA; D Twelve features from STE images of mild-to-moderate versus severe IFTA
We further compared the results of B-model, STE model, clinical model and combined model between groups of different IFTA grades, accompanied by model validation, and prediction parameters were calculated using the Youden index (Table 3). We also showed the results using decision curve evaluation models in Supplementary Figure 3. The predicting models built on clinical features for identifying mild vs. moderate-to-severe IFTA and for mild-to-moderate vs. severe IFTA are shown in Supplementary Figure 4.
Clinical features combined with ultrasound radiomics model performance and nomogram validation in analyses involving different IFTA group comparisons
During the validation of models comparing mild IFTA to moderate-to-severe IFTA, the clinical model established using serum albumin and eGFR achieved moderate prediction ability. Moderate prediction ability was also achieved using the STE radiomics model (Table 3). After adding the results of the STE radiomics model to the clinical model, the predictive performance of the combined model was significantly improved, with the nomogram shown in Fig. 5A, with AUCs of 0.91 (95% CI: 0.85–0.97) and 0.85 (95% CI: 0.77–0.98) for the training cohort and testing cohorts, respectively (DeLong test, p < 0.05) (Fig. 5B, C). The nomogram calibration curve showed good agreement between the predictions and observations in the two groups (Fig. 5D, E). The DCA of the nomogram is shown in Fig. 5F. The DCA based on the combined models (clinical and STE) showed greater benefits in the prediction of IFTA severity in the 20–80% threshold probabilities compared to the clinical and STE models.
A Nomogram for clinical features (albumin and eGFR) of mild vs. moderate-to-severe IFTA combined with STE Rad-Scores. B, C Clinical characteristics of mild vs. moderate-to-severe IFTA combined with the ROC curves of STE model in the training and validation sets. D, E Calibration curves of the nomogram for clinical model of mild vs. moderate-to-severe IFTA combined with STE model in the training and validation cohorts. F Analysis of the cut curve of the histogram for comparison of mild with moderate-to-severe IFTA in the clinical model alone, STE model alone, and combined model. The Y-axis is net income. The blue line represents the decision curve of the STE model. The green line represents the clinical model curve, whereas the red line represents the decision curve of the STE model combined with clinical model of patients with CKD
During the validation of models comparing mild-to-moderate IFTA to severe IFTA, the clinical model established based on age and eGFR achieved moderate prediction ability, with AUCs of 0.67 (95% CI: 0.55–0.79) and 0.55 (95% CI: 0.34–0.76) for the training and testing cohorts, respectively. Moderate prediction ability was also achieved using the B-mode radiomics model, with AUCs of 0.80 (95% CI: 0.71–0.90) and 0.78 (95% CI: 0.65–0.92) for the training and testing cohorts, respectively. Moderate prediction ability was similarly achieved using the STE radiomics model, with AUCs of 0.81 (95% CI: 0.73–0.89) and 0.73 (95% CI: 0.58–0.88) for the training and testing cohorts, respectively. Higher prediction ability was achieved using the B-mode plus STE radiomics model, with AUCs of 0.93 (95% CI: 0.88–0.98) and 0.86 (95% CI: 0.75–0.97) for the training and testing cohorts, respectively. Finally, models established using age and eGFR, B-mode, and STE radiomics data showed that the prediction ability of the combined model was high, with the nomogram shown in Fig. 6A. The AUCs of the training and the testing cohorts were 0.93 (95% CI: 0.89–0.98) and 0.83 (95% CI: 0.70–0.95), respectively (Fig. 6B, C). The AUCs of the combined model significantly differed from those of the clinical model, B model, or STE model (DeLong test, p < 0.005 for the training and validation cohorts). The nomogram calibration curves showed good agreement between predictions and observations in the two groups (Fig. 6D, E). The DCA of the nomogram is shown in Fig. 6F. Compared to other models, the combined nomogram model, showing the highest area under the curve, is the optimal decision making for maximal net benefit in classifying IFTA severity.
A Nomogram for clinical features (age and eGFR) of mild-to-moderate IFTA vs. severe IFTA combined with B-mode Rad-Scores and STE Rad-Scores. B, C Clinical characteristics model of mild-to-moderate vs. severe IFTA combined with ROC curves of B-model and STE modelin the training set and validation set. D, E Calibration curves of this nomogram for clinical features model of mild-to-moderate vs. severe IFTA in combination with B-model and STE-model in the training and validation cohorts. F Analysis of the cutting curve of the nomogram for the clinical model alone, B model alone, STE model alone, and B+STE combined model and the clinical+B+STE combined model comparing mild-to-moderate with severe IFTA. The Y-axis is net income. The gray, green, blue, purple, and red lines represent the clinical model curve, B model curve, decision curve of the STE model, B+STE model curve, and decision curve of the clinical +B+STE model, respectively
Discussion
The accurate and non-invasive classification of kidney fibrosis severities is crucial for clinical practice. Recently, researchers used machine learning based on elastography ultrasound images to gauge the severity of kidney fibrosis, with promising results [30]. The construction of a binary classification model is mostly used for comparing liver and kidney fibrosis severities [30,31,32,33]. In this study, we performed a binary classification by comparing one IFTA grade with the other grades as an approach. A combined model incorporating 2D ultrasound radiomics, STE radiomics, and clinical features for predicting IFTA severities was constructed and validated.
Among clinical features analyzed in this study, eGFR was an independent parameter as shown in different IFTA prediction models (Figs. 5 and 6), consistent with results from Zhu et al [30]. eGFR is an important indicator for estimating kidney function and assessing IFTA severity in patients with CKD [34]. eGFR is calculated based on a standardized formula using Scr, a laboratory index that is widely used for the clinical follow-up of these patients [35]. In the comparative model of mild and moderate-to-severe IFTA, the combined model based on clinical features (serum albumin and eGFR) and STE radiomics further improved the diagnostic performance. eGFR has limitations as an indicator for kidney fibrosis, since the levels of eGFR are frequently inconsistent with the degree of kidney fibrosis. Furthermore, eGFR is not sensitive to subclinical kidney damage [36]. In our training and testing cohorts, comparing mild-to-moderate IFTA to severe IFTA, the AUC of the clinical features (eGFR and age) model for discrimination was 0.67 (95% CI: 0.55–0.79) and 0.55 (95% CI: 0.34–0.76) in the training and testing cohorts, respectively, suggesting that clinical features model only performed worse than B-model or STE-model only or the combined model (Fig. 6B, C). Judging from the above arguments, we selected IFTA severity as the grouping variable and prediction model construction instead of eGFR. Therefore, a combined model consisting of clinical factors of eGFR and ultrasonography radiomics features can be helpful for achieving non-invasive monitoring of kidney fibrosis.
The main factors affecting STE elasticity measurements are anisotropy and the heterogeneity of kidney fibrosis. Other confounding factors for STE measurement include age and BMI [30]. In this study, age was used to construct a nomogram for predicting the comparison between mild and moderate-to-severe IFTA. Clinical model established by age and eGFR, and the combined model all achieved a fair predictive performance. In reality, kidneys become stiffened due to collagen deposition during ESKD, and STE measurement results will increase. However, with renal function further declining, kidneys may become softer due to poor blood perfusion, and the STE measurement results may decrease, whereas the kidney length becomes smaller on 2D ultrasound examination [37,38,39]. These factors likely lead to the emergence of a complex nonlinear relationship between 2D ultrasound measurements, STE measurements, and IFTA severity. In our study, a combined model built based on B-mode and STE results significantly improved the diagnostic performance of traditional ultrasound alone. Possible explanations for this finding include the ability of STE to capture the stiffness feature of patients’ kidneys, which is suitable for application during machine learning whose strength includes combining variables with nonlinear relationships and interactions [40]. Therefore, we used all variables including 2D ultrasound, STE radiomics, and clinical factors from these patients with CKD to model IFTA.
In the nomogram differentiating mild and moderate-to-severe IFTA models, STE radiomics and clinical factors were included, whereas 2D ultrasound radiomics were not. The reason is that the 2D ultrasound radiomics consist of data including the diameter of the kidney’s long axis and its cortical thickness. However, in patients with mild and moderate IFTA, changes in their kidney morphology remain minimal due to their early CKD stages [41]. In this study, there were no differences in kidney lengths and cortical thickness between different IFTA groups (p = 0.487 and p = 0.927 for the mild and moderate IFTA groups, respectively). During our construction of a comparison model between the mild and moderate-to-severe IFTA groups, we extracted 2D ultrasound image features from those with moderate-to-severe IFTA. Since the radiomics of moderate IFTA were included, the kidney morphological features that did not significantly differ between those with mild and moderate IFTA were extracted. The presence of redundant information might increase the probability of model overfitting, reducing model performance after constructing a joint model. However, when we compared between those with mild-to-moderate and severe IFTA, patients with ESKD and severe IFTA were more likely to have morphological kidney atrophy and cortical thinning. When we compared renal long-axis diameter and cortical thickness between severe and mild-to-moderate IFTA groups, there were differences between groups (p < 0.01). Therefore, the addition of radiomics including morphological differences of the kidneys in 2D ultrasound greatly increased the diagnostic performance of the combined model.
The combined model incorporating B-mode, STE, and clinical features can be applicable for IFTA detection for patients outside our training cohort, particularly during the follow-up of patients unable to receive a renal biopsy. The establishment of ultrasound radiomics model can be a great support for clinical ultrasound practice, and radiomics findings may assist in IFTA prediction in the future.
This study has some limitations. Patients selected were those with CKD and renal biopsy indications. The renal cortical tissues of patients with ESKD could be thin, precluding the derivation of histopathological results based on renal biopsy. The sample size of patients with severe IFTA was small, necessitating further expansion to reduce data redundancy during model construction, in order to facilitate the establishment of multi-classification models. In addition, this study was done based on data from one center, using a single-mode ultrasound diagnostic apparatus to collect ultrasound radiomics. Multi-center and different ultrasound modes may be needed to extract more 2D and ultrasound elasticity radiomics to construct a combined model and to test the generalizability of our established combined model. Finally, changes in 2D ultrasound and STE features and the course of CKD among these patients need to be further monitored and validated in the future.
Conclusion
STE combined with 2D ultrasound examinations can improve the diagnostic performance of traditional ultrasound for tubulointerstitial fibrosis in patients with CKD. The radiomics nomograms constructed based on 2D ultrasound and STE imaging features in combination with clinical features are non-invasive tools with high accuracy in detecting renal fibrosis with different IFTA severities. This approach can be helpful for non-invasive monitoring of kidney fibrosis.
Abbreviations
- AUC:
-
Area under the curve
- CKD:
-
Chronic kidney disease
- DCA:
-
Decision curve analysis
- eGFR:
-
Estimated glomerular filtration rate
- ESKD:
-
End-stage kidney disease
- IFTA:
-
Interstitial fibrosis and tubular atrophy
- LASSO:
-
Least absolute shrinkage and selection operator
- mRMR:
-
minimum Redundancy Maximum Relevance
- NPV:
-
Negative-predictive value
- PPV:
-
Positive-predictive value
- ROC:
-
Receiver operating characteristic
- ROI:
-
Region of interest
- Scr:
-
Serum creatinine
- STE:
-
Sound Touch Elastography
- UPCR:
-
Urinary protein-to-creatinine ratio
References
GBD Chronic Kidney Disease Collaboration (2020) Global, regional, and national burden of chronic kidney disease, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet 395:709–733
Foreman KJ, Marquez N, Dolgert A et al (2018) Forecasting life expectancy, years of life lost, and all-cause and cause-specific mortality for 250 causes of death: reference and alternative scenarios for 2016-40 for 195 countries and territories. Lancet 392:2052–2090
Srivastava A, Palsson R, Kaze AD et al (2018) The prognostic value of histopathologic lesions in native kidney biopsy specimens: results from the Boston kidney biopsy cohort study. J Am Soc Nephrol 29:2213–2224
Floege J, Amann K (2016) Primary glomerulonephritides. Lancet 387:2036–2048
Hahn BH, McMahon MA, Wilkinson A et al (2012) American college of rheumatology guidelines for screening, treatment, and management of lupus nephritis. Arthritis Care Res (Hoboken) 64:797–808
Binda V, Moroni G, Messa P (2018) ANCA-associated vasculitis with renal involvement. J Nephrol 31:197–208
Korbet SM, Volpini KC, Whittier WL (2014) Percutaneous renal biopsy of native kidneys: a single-center experience of 1,055 biopsies. Am J Nephrol 39:153–162
Franke M, Kramarczyk A, Taylan C, Maintz D, Hoppe B, Koerber F (2014) Ultrasound-guided percutaneous renal biopsy in 295 children and adolescents: role of ultrasound and analysis of complications. PLoS ONE 9:e114737
Muñoz-Félix JM, González-Núñez M, Martínez-Salgado C, López-Novoa JM (2015) TGF-β/BMP proteins as therapeutic targets in renal fibrosis. Where have we arrived after 25 years of trials and tribulations? Pharmacol Ther 156:44–58
Klinkhammer BM, Goldschmeding R, Floege J, Boor P (2017) Treatment of renal fibrosis-turning challenges into opportunities. Adv Chronic Kidney Dis 24:117–129
Berchtold L, Friedli I, Vallée JP, Moll S, Martin PY, De Seigneux S (2017) Diagnosis and assessment of renal fibrosis: the state of the art. Swiss Med Wkly 147:w14442
Kumar V, Gu Y, Basu S et al (2012) Radiomics: the process and the challenges. Magn Reson Imaging 30:1234–1248
Gillies RJ, Kinahan PE, Hricak H (2016) Radiomics: images are more than pictures, they are data. Radiology 278:563–577
Meng Xl, Shu J, Xia YW, Yang RW (2020) A CT-based radiomics approach for the differential diagnosis of sarcomatoid and clear cell renal cell carcinoma. Biomed Res Int 2020:7103647
Shin HJ, Kwak JY, Lee E et al (2019) Texture analysis to differentiate malignant renal tumors in children using gray-scale utrasonography images. Ultrasound Med Biol 45:2205–2212
Diaz de Leon A, Kapur P, Pedrosa I (2019) Radiomics in kidney cancer: MR Imaging. Magn Reson Imaging Clin N Am 27:1–13
Yu HS, Scalera J, Khalid M et al (2017) Texture analysis as a radiomic marker for differentiating renal tumors. Abdom Radiol (NY) 42:2470–2478
Uhlig J, Biggemann L, Nietert MM et al (2020) Discriminating malignant and benign clinical T1 renal masses on computed tomography: a pragmatic radiomics and machine learning approach. Medicine (Baltimore) 99:e19725
Uhlig J, Leha A, Delonge LM et al (2020) Radiomic features and machine learning for the discrimination of renal tumor histological subtypes: a pragmatic study using clinical-routine computed tomography. Cancers (Basel) 12:3010
Deng Y, Yang BR, Luo JW, Du GX, Luo LP (2020) DTI-based radiomics signature for the detection of early diabetic kidney damage. Abdom Radiol (NY) 45:2526–2531
De Perrot T, Hofmeister J, Burgermeister S et al (2019) Differentiating kidney stones from phleboliths in unenhanced low-dose computed tomography using radiomics and machine learning. Eur Radiol 29:4776–4782
Bandara MS, Gurunayaka B, Lakraj G, Pallewatte A, Siribaddana S, Wansapura J (2022) Ultrasound based radiomics features of chronic kidney disease. Acad Radiol 29:229–235
National Kidney Foundation (2002) K/DOQI clinical practice guidelines for chronic kidney disease: evaluation, classification, and stratification. Am J Kidney Dis 39:S1–S266
Mariani LH, Martini S, Barisoni L et al (2018) Interstitial fibrosis scored on whole-slide digital imaging of kidney biopsies is a predictor of outcome in proteinuric glomerulopathies. Nephrol Dial Transplant 33:310–318
Farris AB, Alpers CE (2014) What is the best way to measure renal fibrosis?: a pathologist’s perspective. Kidney Int Suppl 4:9–15
Solez K, Colvin RB, Racusen LC et al (2008) Banff 07 classification of renal allograft pathology: updates and future directions. Am J Transplant 8:753–760
Alshamlan H, Badr G, Alohali Y (2015) mRMR-ABC: a hybrid gene selection algorithm for cancer classification using microarray gene expression profiling. Biomed Res Int 2015:1–15
Iasonos A, Schrag D, Raj GV, Panageas KS (2008) How to build and interpret a nomogram for cancer prognosis. J Clin Oncol 26:1364–1370
Stephenson AJ, Scardino PT, Eastham JA et al (2005) Postoperative nomogram predicting the 10-year probability of prostate cancer recurrence after radical prostatectomy. J Clin Oncol 23:7005–7012
Zhu MY, Ma LY, Yang WQ et al (2021) Elastography ultrasound with machine learning improves the diagnostic performance of traditional ultrasound in predicting kidney fibrosis. J Formos Med Assoc. https://doi.org/10.1016/j.jfma.2021.08.011
Xue LY, Jiang ZY, Fu TT et al (2020) Transfer learning radiomics based on multimodal ultrasound imaging for staging liver fibrosis. Eur Radiol 30:2973–2983
Pickhardt PJ, Graffy PM, Said A et al (2019) Multiparametric CT for noninvasive staging of hepatitis C virus-related liver fibrosis: correlation with the histopathologic fibrosis score. AJR Am J Roentgenol 212:547–553
Park HJ, Lee SS, Park B et al (2019) Radiomics analysis of gadoxetic acid-enhanced MRI for staging liver fibrosis. Radiology 290:380–387
Chen CJ, Pa TW, Hsu HH, Chien Hung L, Chen KS, Chen YC (2019) Prediction of chronic kidney disease stages by renal ultrasound imaging. Enterp Inf Syst 14:178–195
Floege J, Barbour SJ, Cattran DC et al (2019) Management and treatment of glomerular diseases (part 1): conclusions from a kidney disease: improving Global Outcomes (KDIGO) Controversies Conference. Kidney Int 95:268–280
Ruiz-Ortega M, Rayego-Mateos S, Lamas S, Ortiz A, Rodrigues-Diez RR (2020) Targeting the progression of chronic kidney disease. Nat Rev Nephrol 16:269–288
LeBleu VS, Taduri G, O’Connell J et al (2013) Origin and function of myofibroblasts in kidney fibrosis. Nat Med 19:1047–1053
Syversveen T, Brabrand K, Midtvedt K, Strøm EH, Hartmann A, Berstad AE (2011) Non-invasive assessment of renal allograft fibrosis by dynamic sonographic tissue perfusion measurement. Acta Radiol 52:920–926
Warner L, Yin M, Glaser KJ et al (2011) Noninvasive In vivo assessment of renal tissue elasticity during graded renal ischemia using MR elastography. Invest Radiol 46:509–514
Sealfon RSG, Mariani LH, Kretzler M, Troyanskaya OG (2020) Machine learning, the kidney, and genotype-phenotype analysis. Kidney Int 97:1141–1149
Hoi S, Takata T, Sugihara T et al (2018) Predictive value of cortical thickness measured by ultrasonography for renal impairment: a longitudinal study in chronic kidney disease. J Clin Med 7:527
Funding
The authors state that this work has not received any funding.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Guarantor
The scientific guarantor of this publication is Xin-Yue Ge.
Conflict of interest
The authors of this manuscript declare no relationships with any companies, whose products or services may be related to the subject matter of the article.
Statistics and biometry
One of the authors has significant statistical expertise.
Informed consent
Written informed consent was obtained from all patients in this study.
Ethical approval
Institutional Review Board approval was obtained.
Methodology
• retrospective
• diagnostic or prognostic study
• performed at one institution
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
ESM 1
(DOCX 1.35 mb)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ge, XY., Lan, ZK., Lan, QQ. et al. Diagnostic accuracy of ultrasound-based multimodal radiomics modeling for fibrosis detection in chronic kidney disease. Eur Radiol 33, 2386–2398 (2023). https://doi.org/10.1007/s00330-022-09268-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00330-022-09268-3