Introduction

Polycystic kidney disease (PKD) is a genetic disorder in which cysts develop within the kidneys, causing the kidneys to enlarge and lose function over time [1]. Nine out of ten people with PKD have the autosomal dominant form (ADPKD) [2, 3]. In ADPKD, cysts develop primarily in the kidneys but can also be present in other organs, such as the liver. Currently, ~140,000 people are diagnosed with ADPKD in the United States [4]. Over time, kidney and liver volumes steadily increase, resulting in renal function decline [5, 6]. In particular, renal elasticity is associated with kidney function [7]. There is no cure for PKD, but dialysis, kidney transplant, blood pressure medication, and surgical removal of cysts are treatment options. Early diagnosis and monitoring open up better treatment options.

Kidney and liver volumes are among the most important biomarkers for quantifying the severity of ADPKD and are used in clinical decision making [8,9,10]. Many studies have also found that total kidney volume (TKV), along with age, height, and estimated glomerular filtration rate (eGFR), is a useful prognostic biomarker for predicting renal function decline [11,12,13]. Bae et al. [14] reported MRI-based kidney volume measurement in ADPKD by manually segmenting the slices through the kidney volume. However, annotating each slice for volume measurement is laborious, and to overcome this, many researchers have recently applied AI to automatic kidney segmentation [15,16,17,18,19]. Keshwani et al. [20] used a 3D convolutional neural network (CNN) for automated kidney segmentation in CT scans. Sharma et al. [16] used a CNN with a visual geometry group (VGG)-like structure for automated kidney segmentation on a CT dataset of ADPKD. In MR images, Mu et al. [21] used a 3D V-Net model for automated kidney segmentation in ADPKD data. van Gastel et al. [22] used semantic segmentation for automated measurement of both kidney and liver volumes in MR images of patients affected by ADPKD. Kline and colleagues used instance segmentation [23] and semantic segmentation [24] of kidney cysts in T2-weighted MR images of ADPKD patients to measure total cyst volume.

Apart from MRI and CT, ultrasound (US) imaging is popular and widely used to diagnose acute and chronic kidney diseases [25, 26]. Kuo et al. [27] used 2D US images for automated classification of kidney images with ResNet to determine chronic kidney disease status, but did not use segmentation. Mahmud et al. [28] used vector-graphic detection image analysis for kidney and cyst boundary detection from 2D images, along with various texture analyses, filtering, and patches, to detect kidney boundaries from limited data. However, no further updates on larger datasets were found for this study, and no further development was reported by any other group. Imaging features computed from US data using deep CNNs improved the classification of children with congenital abnormalities of the kidney and urinary tract versus controls [29]. However, the computation of these anatomic measures typically involves manual or semi-automatic segmentation of kidneys in US images, requiring multiple human annotators, which increases inter-operator variability, reduces reliability, and limits utility in clinical medicine. Automatic kidney segmentation of US images with AI has seen little recent progress. US images have irregular scan-plane acquisition and low image contrast, making accurate kidney segmentation difficult. Image quality, image size and magnification, gain, nonuniform intensity and contrast, and human variability in moving the US probe all contribute to the challenges of segmenting US images [30]. An extensive dataset could help overcome artifacts in individual images and result in better model training. With a small dataset, data augmentation is a technique used to introduce variability into the training data, which helps model training [31].

Automatic and reliable kidney segmentation from US images would improve precision and efficiency in many clinical conditions, including congenital renal disease, renal mass detection, and kidney stones. Very few studies have reported US kidney segmentation using deep learning. Wu et al. [32] reported a cascaded fully convolutional DenseNet for automatic kidney segmentation of 2D US images; the mean intersection over union of FC-DenseNet improved slightly with the cascaded FC-DenseNet on 461 images from 68 patients. Yin et al. [33] performed automatic kidney segmentation of 2D US images using a pre-trained VGG16 model and weights (i.e., a transfer learning-based approach), followed by boundary distance regression (Bnet) and pixel-wise classification with a Deeplab network. The algorithm was trained on 289 images, but the study was limited to the largest 2D sagittal image of the whole kidney. Because a single 2D image does not produce an accurate measurement of kidney volume, particularly in a disease like ADPKD where cysts often produce a very irregular shape, a 3D US scan is vital for obtaining accurate TKV measurements. Variability in kidney shape, imaging protocol, instrument resolution, and imaging field of view severely degrades the performance of AI models. Recently, Breysem et al. [34] used 3D US as an alternative to MRI for measuring renal volume in children with ADPKD and found that 2D US yielded significantly lower kidney volumes than 3D US, while 3D US measurements of TKV were close to MRI measurements. Hence, there is a need for further development of US-based kidney imaging and segmentation to understand the problems and improve the performance of AI segmentation models.

In this study, to mitigate some of the aforementioned issues with 2D images, we acquired 3D US images using an electromagnetic tracker attached to the US probe. The tracker records the position and angle of the probe in space, yielding a 3D stack of aligned 2D images. TKV was calculated from the 3D images of the kidneys. We trained a U-Net model to segment kidneys in the 3D US images, allowing us to measure TKV automatically. All participants in this study were also imaged by MRI, and kidney volume measured from MRI serves as the reference standard. TKV measurement using AI-empowered 3D US could be an alternative approach for diagnosing and monitoring ADPKD patients where MRI is challenging.

Methods and materials

The Institutional Review Board of Mayo Clinic, Rochester, USA, approved this study protocol, and informed consent was obtained from each participant. Patient data were anonymized before use. Table 1 shows the demographic information of the 22 patients and computed body mass index (BMI). Fourteen participants in the study were women (64%) and eight were men (36%). The mean and median ages of the study cohort were 51 years (min = 28, max = 70) and 48 years, respectively. The mean and median patient heights were 1.71 m and 1.68 m, respectively. Five patients had a normal BMI (i.e., BMI between 18.5 and 24.9 kg/m2), eleven patients were overweight (i.e., BMI between 25 and 29.9 kg/m2), and six patients were obese (i.e., BMI of 30 kg/m2 or higher).

Table 1 The demographics from the ADPKD study cohort

Ultrasound imaging

We used axially acquired 3D US kidney images from 22 ADPKD patients. Each patient had both kidneys imaged three times, resulting in 132 image sets (22 patients × (3 scans of the left kidney + 3 scans of the right kidney)). Images were acquired with a Philips EPIQ 7 system using the C5-1 curved linear probe with a broadband 1–5 MHz frequency range and an electromagnetic probe positioning system. 2D B-mode imaging was selected for this study. The resolution/speed settings were adjusted to improve image resolution. The time-gain compensation and image gain were optimized per patient, with the image gain ranging from 54 to 68%. A freehand sweep, with an electromagnetic tracker attached to the probe to record probe orientation, was used to build a 3D volume stack from 2D cross-sectional images of the kidneys aligned by probe orientation. The image size was either 256 × 256 or 512 × 512; the number of frames (Z dimension) varied from 300 to 700. The axially acquired DICOM-format US images were converted to NIfTI with SimpleITK and Python. Semi-automated in-house software customized for US images was used to annotate the kidney in every frame [35]. Two experienced readers performed the manual tracing for kidney segmentation on the 3D US images. Reader1 (A.V.G.) performed annotations for all the data, whereas Reader2 (H.L.H.) performed annotations on a subset of 54 scans (9 patients × 6 scans) to measure inter-rater agreement.
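
As an illustration of the conversion step, below is a minimal sketch using SimpleITK. The directory and file names are hypothetical, and depending on how the scanner exports the data (an image series vs. a single multi-frame file), sitk.ReadImage may be needed instead.

```python
# Minimal sketch of DICOM-to-NIfTI conversion with SimpleITK.
# Paths are hypothetical; the study's exact pipeline may differ.
import SimpleITK as sitk

def dicom_series_to_nifti(dicom_dir: str, out_path: str) -> None:
    """Read an axially acquired US DICOM series and save it as NIfTI."""
    reader = sitk.ImageSeriesReader()
    file_names = reader.GetGDCMSeriesFileNames(dicom_dir)
    reader.SetFileNames(file_names)
    volume = reader.Execute()          # 3D stack of aligned 2D frames
    sitk.WriteImage(volume, out_path)  # NIfTI inferred from .nii.gz suffix

dicom_series_to_nifti("us_dicom/PKD_001/left_scan1", "nifti/PKD_001_left_scan1.nii.gz")
```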

Deep learning model

Each image set was zero-padded or cropped to 320 × 320 × Z, such that the entire kidney was always fully included in the volume. Many frames at the start and end of each scanned sequence contained no data, and such blank (zero-intensity) frames were omitted to avoid data imbalance issues. The threshold was chosen to include frames in which at least 20% of pixels were non-zero. The train:validation:test split was 15:2:5 at the patient level, resulting in 90:12:30 scans, respectively, where each scan consisted of a series of 300–700 slices. Data augmentation, including ±15° rotation and random elastic deformation, tripled the training dataset size and was performed to introduce variability into the training data for better model generalization. We observed that random flips worsened model performance, and Gaussian noise had minimal or no effect on model performance (and was thus not used in the final training experiments).
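
A minimal sketch of this preprocessing, assuming (Z, H, W) NumPy volumes, is given below; the 20% threshold follows the text, while the function names and the centered pad/crop are our own illustrative choices.

```python
# Hedged sketch of the preprocessing described above: center pad/crop each
# frame to 320 x 320 and drop near-blank frames (< 20% non-zero pixels).
import numpy as np

TARGET = 320  # in-plane target size in pixels

def pad_or_crop_xy(vol: np.ndarray) -> np.ndarray:
    """Zero-pad or center-crop a (Z, H, W) volume to (Z, 320, 320)."""
    z, h, w = vol.shape
    out = np.zeros((z, TARGET, TARGET), dtype=vol.dtype)
    hs, ws = min(h, TARGET), min(w, TARGET)
    h0, w0 = (h - hs) // 2, (w - ws) // 2            # source crop offsets
    H0, W0 = (TARGET - hs) // 2, (TARGET - ws) // 2  # target pad offsets
    out[:, H0:H0 + hs, W0:W0 + ws] = vol[:, h0:h0 + hs, w0:w0 + ws]
    return out

def drop_blank_frames(vol: np.ndarray, min_nonzero: float = 0.20) -> np.ndarray:
    """Keep frames where at least 20% of pixels carry signal."""
    frac = (vol > 0).reshape(vol.shape[0], -1).mean(axis=1)
    return vol[frac >= min_nonzero]
```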

A transfer learning approach was used in this study. The architecture and weights from our previously reported 2D U-Net model [19] were used to train on the US data for kidney segmentation (a schematic of the U-Net structure is presented in Supplementary data, Fig. S1). A 6-layer 2D U-Net model (filters varied from 32 to 1024; the kernel size decreased from (7,7) to (5,5) and then (3,3) at the base layer, and gradually increased back to (7,7) at the top level of the decoder) was trained to segment the whole kidney in US images. The input data shape (3, 256, 256) was provided to the model, and a user-created mask on the central slice was used. A sigmoid activation was applied to the last layer of the U-Net. The learning rate was 10⁻⁶, and the loss function was the Dice loss. Each model was trained for 200 epochs. The model was implemented in Keras with the TensorFlow backend and trained on an Nvidia V100 GPU (32 GB memory).
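
For concreteness, a minimal Keras-style Dice loss consistent with the sigmoid output is sketched below; the smoothing constant and the choice of Adam optimizer are assumptions not stated in the text.

```python
# Hedged sketch of a soft Dice loss for binary segmentation (1 - Dice).
# The smoothing constant and the Adam optimizer are illustrative assumptions.
import tensorflow as tf

def dice_loss(y_true, y_pred, smooth=1.0):
    y_true_f = tf.reshape(tf.cast(y_true, tf.float32), [-1])
    y_pred_f = tf.reshape(y_pred, [-1])
    intersection = tf.reduce_sum(y_true_f * y_pred_f)
    dice = (2.0 * intersection + smooth) / (
        tf.reduce_sum(y_true_f) + tf.reduce_sum(y_pred_f) + smooth)
    return 1.0 - dice

# model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-6),
#               loss=dice_loss)
```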

MRI imaging

All 22 patients imaged with US were also imaged with MRI, which included 3–4 mm thick coronal slices with T2-weighting and fat saturation [19]. No patients were assigned to only one imaging technique. We applied our previously described model [19] to all 22 cases as the automated MRI measurement. The segmented kidney masks were further quality-checked by an expert image analyst (A.V.G.), and the masks were corrected where needed to finalize the segmentation and measure kidney volume.

ADPKD classification

The Mayo ADPKD classification tool [36, 37] was applied to both the MRI and the US images for ADPKD group classification. This tool uses patient age, height, and TKV.
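
For illustration, a hedged sketch of the classification logic follows. It reflects the published Mayo imaging classification, which bins patients by the estimated annual growth rate of height-adjusted TKV (HtTKV) under an exponential growth model; the exact formula and thresholds should be verified against the official tool [36, 37] before use.

```python
# Hedged sketch of Mayo imaging classification (classes 1A-1E) based on
# estimated annual HtTKV growth, assuming exponential growth from a
# theoretical 150 mL/m baseline. Verify against the official tool [36, 37].

def mayo_class(tkv_ml: float, height_m: float, age_yr: float) -> str:
    httkv = tkv_ml / height_m                         # height-adjusted TKV, mL/m
    growth = (httkv / 150.0) ** (1.0 / age_yr) - 1.0  # estimated annual rate
    for cls, upper in (("1A", 0.015), ("1B", 0.03), ("1C", 0.045), ("1D", 0.06)):
        if growth < upper:
            return cls
    return "1E"

# Example: TKV = 1500 mL, height = 1.70 m, age = 45 y -> "1C" (~4%/yr growth)
print(mayo_class(1500.0, 1.70, 45.0))
```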

Results

Data processing

US kidney images have variable intensity, shape, and size, as demonstrated in Fig. 1a–d. Unlike MRI and CT, US imaging scans each kidney individually due to probe field-of-view limitations. In some cases, the field of view may not be sufficient for imaging large kidneys (Fig. 1a). The variability of kidney volume measurements for the left and right kidneys of each subject is displayed in Fig. 1e; the mean and standard deviation were obtained from the three scans of each kidney.

Fig. 1

US images of the kidney, demonstrating common challenges: a large kidney size, b small field of view, c contrast variation, and d centered kidney. e Left and right kidney US volume measurements (mean and standard deviation) for each subject

U-Net model training and test data evaluation

The U-Net model with pre-trained weights (from the MRI model) was trained on 90 scans (15 patients × 2 kidneys × 3 scans) and validated on 12 scans (2 patients × 2 kidneys × 3 scans), as demonstrated in Fig. 2. The U-Net model trained with transfer learning from pre-trained weights achieved a Dice similarity coefficient (DSC) of 0.86 on the validation data (Fig. 2a). The trained U-Net model was used to predict kidney segmentations on the hold-out test dataset for further analysis. On the test dataset of 30 scans (5 patients × 6 scans), the model achieved a DSC of 0.80, a Jaccard index of 0.67, a volume similarity (VS) of 0.89, a Matthews correlation coefficient (MCC) of 0.83, a mean 95th-percentile Hausdorff distance (HD-95) of 20.65, and an AUC of 0.91. Table 2 tabulates these parameters as mean ± standard deviation over the 30 scans, along with false negatives (FN), and compares the model prediction against Reader1 and Reader2. The last row compares the metrics between Reader1 and Reader2.
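
As a sketch of how these overlap metrics can be computed from binary masks (HD-95 is typically obtained from surface-distance tools such as SimpleITK and is omitted here for brevity):

```python
# Hedged sketch of overlap metrics on binary masks: DSC, Jaccard, volume
# similarity, and MCC. HD-95 (surface distances) is omitted for brevity.
import numpy as np

def overlap_metrics(gt: np.ndarray, pred: np.ndarray) -> dict:
    gt, pred = gt.astype(bool), pred.astype(bool)
    tp = float(np.logical_and(gt, pred).sum())
    fp = float(np.logical_and(~gt, pred).sum())
    fn = float(np.logical_and(gt, ~pred).sum())
    tn = float(np.logical_and(~gt, ~pred).sum())
    dsc = 2 * tp / (2 * tp + fp + fn)
    jaccard = tp / (tp + fp + fn)
    vs = 1 - abs(fn - fp) / (2 * tp + fp + fn)  # volume similarity
    mcc = (tp * tn - fp * fn) / np.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {"DSC": dsc, "Jaccard": jaccard, "VS": vs, "MCC": mcc}
```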

Fig. 2

2D U-Net model trained with pre-trained weights: a training curve and b loss curve

Table 2 Model prediction and comparison with Reader1 and Reader2

Further, the DSC was calculated slice-wise to assess the effect of kidney shape and size on class imbalance and its impact on AI model performance. The slices containing the kidney were split into 10 segments, and the DSC was calculated for each segment, as demonstrated in Table 3, for Reader1 vs. Reader2 and Reader1 vs. AI prediction. Manual annotations and AI-based automated kidney region visualizations are shown in Fig. 3. Values in bold indicate that the AI performs similarly to or better than human tracing in slices from the largest region of the kidney.
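
A minimal sketch of this segment-wise analysis, assuming boolean (Z, H, W) masks, follows:

```python
# Hedged sketch of segment-wise DSC: split kidney-containing slices into
# 10 segments along the scan axis and compute one DSC per segment.
import numpy as np

def dice(a: np.ndarray, b: np.ndarray) -> float:
    denom = a.sum() + b.sum()
    return 2.0 * np.logical_and(a, b).sum() / denom if denom else float("nan")

def segmentwise_dice(gt: np.ndarray, pred: np.ndarray, n_seg: int = 10):
    """gt, pred: boolean (Z, H, W) masks; returns one DSC per segment."""
    kidney_slices = np.where(gt.any(axis=(1, 2)))[0]  # slices containing kidney
    return [dice(gt[s], pred[s]) for s in np.array_split(kidney_slices, n_seg)]
```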

Table 3 Dice score at various slices in the kidney
Fig. 3

A typical example of AI model-based whole kidney prediction compared with ground truth annotated whole kidney

We also compared our model's performance with other state-of-the-art U-Net models having VGG16, EfficientNet-B0, DenseNet101, and ResNet50 backbones, as demonstrated in Supplementary data, Table S1. We chose backbones with trainable parameter counts close to that of our proposed model (~18 M). The baseline U-Net with each backbone was trained on the same training and validation data for comparison, using pre-trained 'imagenet' weights.

Inter-reader and intra-scan variability

Two expert medical imaging analysts annotated kidneys on the 3D US images, and their annotations were used to calculate inter-reader agreement in kidney segmentation. Figure 4 shows the inter-reader correlation and the comparison of reader-annotated volumes against AI-predicted kidney volumes using Bland–Altman analysis. The Bland–Altman plots display the differences between measurements on a percentage scale ((method1 − method2)/mean %), which is useful given the wide range of kidney volumes in the data. The first row shows the inter-reader comparison of Reader1 and Reader2 over 54 scans, which had an R2 of 0.81 in linear regression (Fig. 4a) and a Bland–Altman bias (mean difference) of −4.42%. The bias is significant because the line of equality is not within the 95% confidence interval. The limits of agreement were −72.04 to 62.20% (Fig. 4d). The second row compares Reader1 vs. Reader2 on the subset of 24 test scans traced by both readers; the linear regression coefficient decreased to R2 = 0.75, owing to one scan with a markedly different volume in this small dataset (Fig. 4b), and the bias was 2.85%. The limits of agreement were −46.88 to 52.59% (Fig. 4e). The third row compares Reader1 vs. the AI model on the 30-scan test dataset (Fig. 4c, f), where linear regression gave R2 = 0.93 and the Bland–Altman analysis gave a bias of −4.12% with limits of agreement from −54.88 to 46.64%.
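
For reference, the percentage-difference Bland–Altman quantities used in Fig. 4 can be computed as in the sketch below (bias and 95% limits of agreement as bias ± 1.96 SD):

```python
# Hedged sketch of Bland-Altman analysis on a percentage scale:
# diff % = (method1 - method2) / mean * 100; LoA = bias +/- 1.96 SD.
import numpy as np

def bland_altman_pct(v1: np.ndarray, v2: np.ndarray):
    diff_pct = 100.0 * (v1 - v2) / ((v1 + v2) / 2.0)
    bias = diff_pct.mean()
    sd = diff_pct.std(ddof=1)
    return bias, (bias - 1.96 * sd, bias + 1.96 * sd)
```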

Fig. 4

Inter-reader variability and Bland–Altman analysis comparing reader-annotated kidney volumes with each other and with AI model-predicted kidney volumes

The interscan variability was calculated by subtracting each scan's kidney volume from the mean of its three scans and averaging the absolute differences; it was ~56 mL (interscan variability plot in Supplementary data, Fig. S2).
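
A minimal sketch of this computation, assuming a hypothetical (n_kidneys, 3) array of repeat-scan volumes, is:

```python
# Hedged sketch of interscan variability: mean absolute deviation of the
# three repeat-scan volumes from their per-kidney mean, in mL.
import numpy as np

def interscan_variability(volumes_ml: np.ndarray) -> float:
    """volumes_ml: (n_kidneys, 3) array of repeat-scan kidney volumes."""
    dev = np.abs(volumes_ml - volumes_ml.mean(axis=1, keepdims=True))
    return float(dev.mean())
```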

MRI vs US correlation

Human-corrected kidney volumes (left and right kidneys separately) from MRI were compared with manually annotated kidney volumes (left and right kidneys separately) from the 3D US images at the patient level. Figure 5 demonstrates that the MRI- and US-measured kidney volumes gave a linear regression coefficient of R2 = 0.84 and a bias of 7.47%, with limits of agreement from −70.29 to 55.35% in the Bland–Altman analysis.

Fig. 5

Total kidney volume comparison by linear regression and Bland–Altman analysis for the MR and US methods

ADPKD group classification

Mayo Clinic's ADPKD classification tool [36] was applied to all 22 patients to compare US and MRI performance, as demonstrated in Table 4. The ADPKD classification was also performed separately on the test dataset to compare classification between the US and MRI methods. Groups 1A to 1E indicate increasing severity in the ADPKD classification. For the US method, the left and right kidney volumes were summed to obtain total kidney volume and then averaged over the three observations to yield a per-patient TKV. Of the 22 patients, MRI and US both classified the same three patients as group 1A, whereas US-based manual measurement moved one patient each from 1C to 1B and from 1D to 1C. Conversely, one patient's kidney volume measured higher by US, shifting that patient from group 1D to 1E. On the test dataset (5 patients), the ADPKD classification tool showed agreement in 4 patients for US-manual vs. AI prediction. When compared with the MRI measurements, the AI predictions performed marginally better and were closer to MRI than the manually annotated US kidney volumes. Both manual tracing and AI prediction of US-based kidney volume measured higher volumes in polycystic liver disease (PLD) cases, due to the difficulty of differentiating the liver from the kidney.

Table 4 ADPKD group classification and comparison between MRI and US classification

Discussion

Two-dimensional US is difficult to use for kidney volume measurement due to variability in how the operator moves and holds the probe, resulting in inconsistent image spacing and orientation. However, a recently developed 3D US device can be used to measure kidney volume and has the benefit of acquiring images with high temporal resolution, which limits motion and other artifacts. Still, due to field-of-view limits, only a single kidney can be imaged at a time. Such images also have probe sensitivity, intensity variation, brightness, and time-gain-compensation artifacts, which must be addressed. For example, Fig. 1e demonstrates the average kidney volume from three scans, where PKD_021 and PKD_024 show a significantly larger right kidney than left kidney. MRI confirms a similar left-right difference for PKD_024. Cyst development and kidney swelling in ADPKD patients often result in increased kidney volume. However, in PKD_021, both US readers traced the right kidney as larger than the left, contrary to the MRI measurements. Poor image intensity and contrast severely affected kidney tracing in some US images. This problem could be mitigated by improving the image acquisition protocol and adjusting probe settings based on patient demographic information.

The proposed model trained with pre-trained weights performed better on the validation data than a randomly initialized model. Among the state-of-the-art U-Net models, ResNet achieved performance similar to our proposed model but required 1.7× more parameters and more time to train. Since the pre-trained weights came from our previously published model, trained on a large MRI dataset of 2000+ patients with corresponding kidney segmentations, we chose to report the transfer learning model and perform prediction on the test dataset. Even though the AI model was trained on annotations from Reader1, its performance was comparable against Reader2's annotations (Table 2). Table 3 demonstrates that the Dice score was lower on the first and last frames of the kidney than on the central frames. This reduced DSC was primarily due to model under-performance where class imbalance was greater, i.e., at the start and end frames of the kidney. The contribution from oversegmentation (false positives, shown by arrows in Fig. 3b) resulted in a ~2% increase in volume, which is substantially smaller than the interobserver DSC loss of ~23%. The interobserver DSC between Reader1 and Reader2 (Table 2) was 0.77. The primary reason for reader disagreement was image quality, which makes kidney boundary detection difficult. Although the AI model's performance looks impressive, we note that this is a small test dataset, and performance may further improve with a larger dataset.

Yin et al. [33] reported superior DSC performance on 2D US images, but only for segmenting a single slice containing the largest kidney cross section in the sagittal view. Furthermore, Breysem et al. [34] found that 2D US volumetry was prone to underestimation, whereas 3D US measured kidney volume more accurately and was close to the MRI technique. We believe ours is the first study applying deep learning to 3D US in ADPKD. Further, the AI model's results were compared against both human annotations and MRI as a reference standard.

Considering the artifacts in US images affecting AI model performance, the manual segmentation comparison on 54 kidney volume measurements helped characterize interobserver variability. The linear regression plot (Fig. 4a) of Reader1 and Reader2 showed an interobserver correlation of R2 = 0.81, likely reflecting the lower tissue contrast of US compared to MRI [19]. The R2 coefficient was also low when Reader1 and Reader2 were compared on the small test dataset (Fig. 4b), mainly because a few observations deviated markedly. The linear regression plot in Fig. 4c indicates that the reader annotations and AI predictions are correlated; that said, R2 may not be a good parameter for characterizing performance on a small dataset. The second column of Fig. 4 displays the Bland–Altman analysis, and the bias was small when applied to the test dataset (Fig. 4e).

The kidney volumes from the US method (left and right kidney volumes averaged over three scans) and the MRI method were also highly correlated, with R2 = 0.84. The Bland–Altman bias of −7.47% also confirms that the differences between US and MRI measurements are small, indicating that US imaging may be used to measure total kidney volume for frequent monitoring where MRI is challenging.

The ADPKD group classification tool [37] classified patients into ADPKD groups based on age, height, and total kidney volume measured from MRI, US, and AI prediction. When the whole dataset of 22 patients was compared for MRI measurement vs. US manually traced measurement, one patient (PKD_015 in Fig. 1e) was classified into a higher-risk group based on US, moving from 1D to 1E. On review, this case was found to be difficult to segment, and one of the readers included portions of the liver in the traced volume (the patient also had polycystic liver disease). On the five-patient test dataset, ADPKD group classification disagreed between MRI and AI-US for three patients, including the one PLD case. The interscan variability, i.e., the differences among the three scans of the same kidney, was ~56 mL (Supplementary data, Fig. S2). This variability contributed to group shifts for two patients whose kidney volumes lie near the ADPKD group classification thresholds. Hence, a margin accounting for interscan variability should be considered when performing ADPKD group classification from US data.

Furthermore, on the right kidney of a PLD patient (PKD_015), the AI predicted a 4× smaller volume than Reader1 traced. Interestingly, this lower AI-predicted volume was similar to the MRI measurement, which indicates that AI could help trace kidneys correctly and avoid liver inclusion if trained with enough PLD patient data.

In the future, it could be interesting to add data from polycystic liver disease patients to the training data and perform segmentation of both kidneys and liver. In general, liver volumes are often calculated, alongside TKV, as total liver volume (TLV) in affected patients. We believe that 3D US could also be applied to acquiring liver volumes, though this is beyond the scope of the present study. TKV is a strong biomarker of future renal insufficiency in ADPKD [38]. Various imaging techniques (MRI, CT, and US) and post-processing methods (stereology and ellipsoid-based measurements) are used to determine TKV [38, 39]. Analysis with stereology is time consuming, whereas the ellipsoid method is easy for volume estimation. Since the kidneys are located deep in the body, US images readily show artifacts due to the air/fluid/tissue distribution in the body. 2D US images generally have poorer resolution than MRI. Therefore, MRI is the preferred method for accurate measurement of renal volume compared with both US and CT. However, recent developments in 3D US could help improve results with US imaging. With a larger dataset, model performance would improve, and 3D US imaging may become incorporated into clinical practice for PKD monitoring. Another potential application of the US method is pediatric patients, since it would be desirable to avoid MRI or CT imaging in that population.

Conclusions

To the best of our knowledge, this is the first study to measure total kidney volume from 3D US images using deep learning. Our method shows promising performance for auto-segmentation of kidneys and calculation of total kidney volume, close to human tracing and measurement. We also compared its measurements with MRI and found good agreement, suggesting it may be useful in populations where MRI is more challenging, such as children.