Introduction

Renal ultrasound plays a critical role in measuring kidney dimensions and evaluating kidney function. The modality is used to assess renal anatomy and to provide image guidance for renal interventions1. Ultrasound is the modality of choice due to its low cost, accessibility, and lack of radiation2. Several studies indicate the robustness and reliability of ultrasound volume measurements validated against magnetic resonance imaging (MRI)3,4. Specifically, variations in the anatomical characteristics of the kidney (such as kidney length and parenchymal thickness) are associated with clinical disorders. For instance, a small kidney length may indicate subclinical atherosclerotic disease or irreversible chronic renal disease4, a large kidney length may be related to higher cardiovascular risk5, and a diminished parenchymal thickness may increase the risk of end-stage renal disease (ESRD) in boys with posterior urethral valves6. However, the quality of ultrasound imaging and its interpretation depend entirely on the radiologist's skills and expertise. At the same time, investigators favor the modality because it is a noninvasive diagnostic and screening approach. Many kidney disorders, such as autosomal dominant polycystic kidney disease (ADPKD), require repeated follow-up examinations, and kidney length appears relevant to the early assessment of therapy efficacy7,8.

Despite the significance of determining the kidney's anatomical characteristics in ultrasound images, the procedure is challenging and suffers from inter- and intraobserver variability4. Thus, several automatic and semi-automatic approaches have been developed to overcome this issue, including the level-set approach9, statistical shape models10,11, graph cuts12, texture analysis13, and support vector machines (SVM)14. Marsousi et al.9 used offline training datasets to generate a 3-D kidney shape model based on a proposed shape representation called the complex-valued implicit shape model. Zheng et al.12 constructed a graph of image pixels near the kidney boundaries, rather than over the entire image, and combined the pixel intensities with texture feature maps derived from Gabor filters. Xie et al.13 combined the shape priors of ultrasound images with their texture; the texture was obtained by applying a bank of Gabor filters, and the shape priors were derived using Leventon and Faugeras's method15. Ardon et al.14 proposed an SVM method to handle the high variability in kidney appearance in 3D ultrasound images.

In recent years, deep learning has influenced many aspects of medical image analysis16,17,18,19,20,21,22, with image segmentation affected the most. Various deep architectures have been proposed to distinguish the object of interest from its background in medical images, including U-net23, V-net24, SegNet25, Fully Convolutional Networks (FCN)26, and several extensions of U-net19,27,28,29,30. These extensions are a testament to the performance and popularity of U-net-based medical image segmentation. Several deep learning-based methods have recently been proposed for kidney image segmentation, using boundary distance regression and pixel-wise segmentation31,32, 3D U-net33, a modified FCN34, and regular convolutional neural networks (CNN)35. Yin et al.32 utilized a pre-trained CNN to extract high-level features from kidney images, created a boundary distance map of each image using a boundary distance regression network, and fed the map to a pixel-wise classification network to predict kidney masks. Ravishankar et al.34 incorporated prior shape information into an FCN through an augmented dataset, achieving a best Dice similarity coefficient (DSC) of 83.95%. A table comparing kidney segmentation methods using ultrasound images from the literature is provided in the Supplementary information file.

There have been several attempts at kidney segmentation in computed tomography (CT) images. Sharma et al.35 concentrated on kidney segmentation in ADPKD patients, whose kidneys are characteristically enlarged, using a dedicated CNN architecture to segment kidneys in CT images. Türk et al. developed a hybrid segmentation model based on V-net24 for kidney segmentation on 210 CT images36. They extended their work with an improved U-Net3D model for kidney segmentation37 and a two-stage bottleneck block architecture for renal tumor segmentation38 on the same dataset.

The main bottleneck of the existing methods is the relatively high computational cost of predicting the object mask, caused by the excessive number of parameters in these models. We have proposed a modification of Fast-Unet to overcome this problem. Fast-Unet is a high-performance CNN architecture originally developed to segment fetal ultrasound images39. The key point of the network is the use of a 2 × 2 stride in the convolution layers, which downsamples the spatial resolution of the feature maps and thus eliminates the need for pooling layers. Inspired by U-net++, we introduce the Fast-Unet++ architecture, which utilizes diagonal layers to produce the final feature map, thereby creating a more precise prediction.
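As a minimal illustration of this design choice, the following Keras snippet (the layer sizes here are illustrative, not the network's actual configuration) shows how a 5 × 5 convolution with a 2 × 2 stride halves the spatial resolution without any pooling layer:

```python
import tensorflow as tf
from tensorflow.keras import layers

# Illustrative values only: one 5x5 convolution with a 2x2 stride halves the
# spatial resolution of the feature maps, so no MaxPooling2D layer is needed.
x = tf.random.normal((1, 320, 480, 1))  # one grayscale image, as in the dataset
down = layers.Conv2D(64, kernel_size=5, strides=2, padding="same")
print(down(x).shape)  # (1, 160, 240, 64)
```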

In summary, the main contributions of this manuscript are:

  1. We have proposed a novel CNN architecture that accurately segments the kidney in ultrasound images in sagittal and axial views.

  2. This is the first contribution to introduce a method for automatically predicting all kidney parameters, including length, width, thickness, volume, and parenchymal thickness.

The latter contribution demonstrates the main clinical value of the proposed method, as the results show. To our knowledge, this is the first attempt to extract five kidney parameters automatically from ultrasound images31,32,40. Supriyanto et al.41 proposed a semi-automatic approach based on the level set and measured kidney dimensions (length, width, thickness, and volume) in only six samples. Kim et al.42 developed an automated method for kidney volume measurement in children using ultrasound; however, they validated the predicted values only against a CT dataset.

The proposed network was evaluated both for segmentation and for the predicted kidney image biomarkers. In addition, a publicly available dataset was used to compare the segmentation performance with other related networks43. To date, one published article44 has addressed segmentation of this dataset, using nnUnet45 and the contrastive learning method proposed by Chaitanya et al.46. We therefore compared our approach with Fast-Unet, nnUnet, and Chaitanya et al.'s method. The Fast-Unet architecture, the modifications introduced to create Fast-Unet++, the results achieved, and a discussion of those results are presented in the sections below.

Materials and methods

Prepared dataset

We trained and validated the proposed architecture on a collated dataset of kidney ultrasound images in the sagittal and axial views. To ensure sufficient image variability in the training phase, the dataset was collected from several imaging centers using several ultrasound devices between May 2020 and January 2023. The imaging centers were (1) Shahid Hasheminejad, Tehran, Iran; (2) Javadolaemmeh Hospital, Jajarm, Iran; and (3) a private clinic in Bardaskan, Iran. Image acquisition was performed using a Luna ultrasound device (SIMUT, Karaj, Iran), an Affinity 50 device (Philips, Amsterdam, Netherlands), a Voluson 730 Expert ultrasound scanner (General Electric, Austria), and a Logiq S7 ultrasound device (General Electric, Austria). Two observers delineated the kidney contours using ImageJ (National Institutes of Health, US). The total dataset consisted of 744 2D ultrasound images in the sagittal and axial views (372 left and 372 right kidneys in 372 subjects). The subjects were, on average, 45.2 years old, with a male-to-female ratio of 63:37. The dataset was split into training (80%) and test (20%) sets.

The image spacing parameter varies across the dataset, ranging from 0.23 to 0.36 mm/pixel. In addition, the images were captured at different resolutions and resized to 320 × 480 pixels.

Subjects with decreased renal function, abnormal urinalysis findings, or renal parenchymal abnormalities were excluded. Among the 372 subjects, 13 underwent both kidney ultrasound and CT. The main reason for using CT images was to compare the predicted biomarkers with values obtained from an imaging modality with a higher level of anatomical representation.

Fast-Unet++

Our previous paper proposed Fast-Unet39, which uses convolution layers with a 5 × 5 kernel and a 2 × 2 stride. Leaky Rectified Linear Units (Leaky ReLU) with a 0.2 negative slope coefficient serve as the activation function in the encoder path, and the ReLU activation function is used in the decoder path. The decoder path also uses dropout layers with a probability of 50% and batch normalization layers.

As described in the Fast-Unet paper39, the architecture incorporates several modifications that enhance its computational efficiency compared to the classical U-Net: the spatial dimensions of the inputs are reduced by convolutional strides rather than max-pooling layers, up-sampling is performed by transposed convolutions with a 2 × 2 stride, two additional 2D convolutional layers at the output are eliminated, and batch-normalization layers are used in both the encoder and decoder modules. These optimizations, also present in the Fast-Unet++ architecture, collectively reduce the model's computational complexity significantly compared to the classical U-Net.
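A hedged sketch of one encoder and one decoder stage under this design (filter counts are assumptions; the exact configuration follows the Fast-Unet paper39):

```python
from tensorflow.keras import layers

def encoder_stage(x, filters):
    # 5x5 convolution with 2x2 stride replaces max-pooling for down-sampling;
    # batch normalization and LeakyReLU (0.2 negative slope) as described above.
    x = layers.Conv2D(filters, 5, strides=2, padding="same")(x)
    x = layers.BatchNormalization()(x)
    return layers.LeakyReLU(0.2)(x)

def decoder_stage(x, filters):
    # Transposed convolution with 2x2 stride for up-sampling, followed by
    # batch normalization, 50% dropout, and ReLU as described above.
    x = layers.Conv2DTranspose(filters, 5, strides=2, padding="same")(x)
    x = layers.BatchNormalization()(x)
    x = layers.Dropout(0.5)(x)
    return layers.ReLU()(x)
```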

Inspired by the UNet++47 and MFP-Unet27 architectures, we introduce a nested convolution network in this paper, namely Fast-Unet++. The proposed network has two main advantages over the Fast-Unet architecture: the use of nested blocks, and the utilization of all feature maps at the top-most level of abstraction to produce the output layer. The latter, known as deep supervision, was presented in our previously published MFP-Unet paper27. Figure 1 depicts the proposed architecture.

Figure 1

Fast-Unet++ architecture. The nested blocks and deep supervision are the main improvements of Fast-Unet++.

As shown in Fig. 1, the skip connections are re-designed in the Fast-Unet++ architecture. Instead of a direct link between corresponding encoder-decoder blocks, one or more convolution layers are placed along the path. The number of convolution layers and feature maps depends on the encoding/decoding level. These nested convolution blocks fall into two groups according to their inputs. Group-one layers have two inputs, consisting of the convolution blocks of the same and the next encoding level. Group-two layers have three inputs: (1) a nested layer at the upper encoding level, (2) the previous nested layer at the same level, and (3) the convolution block of the same encoding level. Similar to the last decoder layer of the Fast-Unet architecture, the nested layers are composed of up-convolution, a ReLU activation function, and batch normalization, as sketched below.
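The following Keras sketch gives one possible reading of such a nested layer; the operation ordering and kernel size are assumptions based on Fig. 1, not the exact implementation:

```python
from tensorflow.keras import layers

def nested_layer(same_level_inputs, deeper_input, filters):
    # same_level_inputs holds one tensor for group-one layers (the encoder
    # block of the same level) or two for group-two layers (that block plus
    # the previous nested layer at the same level). The deeper input is
    # brought to the same spatial size by a 2x2-stride up-convolution, then
    # batch normalization and ReLU are applied, as described above.
    up = layers.Conv2DTranspose(filters, 5, strides=2, padding="same")(deeper_input)
    x = layers.Concatenate()(same_level_inputs + [up])
    x = layers.BatchNormalization()(x)
    return layers.ReLU()(x)
```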

In addition to the nested skip connections and deep supervision, the Fast-Unet++ architecture exhibits several features that distinguish it from previous models and contribute to its efficacy in kidney segmentation. First, through feature fusion (the three input types of group two listed above), the model can dynamically adjust the importance of feature maps based on local image cues, ensuring that every segmentation decision uses the most relevant data. Second, the additional convolution layers in the decoder path enhance feature refinement, leading to smoother and more accurate segmentation boundaries. Third, the model implements a progressive refinement strategy in the decoder path, where each nested convolution block iteratively refines segmentation details, culminating in a more granular and contextually informed output. Finally, the model's robustness to image variations, including noise, artifacts, and variations in kidney morphology, supports its generalizability to a wide range of ultrasound images.

These architectural choices translate into tangible advancements, delivering not merely incremental improvements but a noteworthy leap in performance, evidenced by superior segmentation accuracy over both Fast-Unet and UNet++ in our empirical evaluations. Our experiments and comparative analyses substantiate the distinctiveness and efficacy of Fast-Unet++ in addressing challenging ultrasound image segmentation tasks.

Measurement of kidney image biomarkers

After kidney segmentation in the sagittal and axial views, post-processing algorithms were applied to calculate the renal dimensions and volume. These algorithms, grouped by input image and illustrated in Fig. 2, are described below:

Figure 2

Post-processing of sagittal view image. (a) Segmentation of the capsule; the ground truth is the green contour and the predicted mask is the red one. (b) finding the farthest pair of points between the two poles, (c) kidney length (KL), (d) tailored crop, (e) rotation of the image to make it horizontal, (f) finding kidney thickness, (g) segmentation of sinus, (h) finding the farthest pair of points between the sinus and capsule masks, and (i) parenchyma length.

Post-processing of sagittal images

The segmented area of the sagittal-view images was used to calculate the kidney length (KL), kidney thickness (KT), and parenchymal thickness. Figure 2a–e represents the post-processing related to the calculation of KL. Figure 2a shows the segmented kidney capsule in the sagittal view; the green contour represents the ground truth delineated by the expert radiologist, and the red contour represents the contour predicted by Fast-Unet++. The upper and lower poles of the kidney, extracted from the mask, are shown in Fig. 2b. The farthest pair of points between the two poles was found using an exhaustive grid search; the colored lines in Fig. 2b illustrate the procedure, and the determined points are shown in Fig. 2c. The KL is the distance between these two points.
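A minimal sketch of this search, assuming the two pole contours are available as (N, 2) arrays of pixel coordinates:

```python
import numpy as np
from scipy.spatial.distance import cdist

def farthest_pair(pole_a, pole_b):
    # Exhaustive search over all point pairs between the two poles; the pair
    # with the maximum Euclidean distance defines the kidney length (KL).
    d = cdist(pole_a, pole_b)
    i, j = np.unravel_index(np.argmax(d), d.shape)
    return pole_a[i], pole_b[j], d[i, j]  # KL in pixels; scale by mm/pixel
```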

The second row of Fig. 2 relates to the calculation of the kidney thickness. First, the image was cropped using a tailored-crop algorithm based on the two points annotating the KL48. The resulting square-shaped image encloses the kidney and some of the background, as shown in Fig. 2d. Next, the cropped image was rotated and padded based on the angle of the KL line so that the kidney lies horizontally (Fig. 2e). Finally, to obtain the kidney thickness, the topmost point of the rotated mask was found, and the corresponding bottom-most point was located; these points annotate the kidney thickness depicted in Fig. 2f.
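An assumed OpenCV implementation of the rotate-and-pad step (the padding value and interpolation are illustrative choices, not the exact ones used):

```python
import cv2
import numpy as np

def rotate_to_horizontal(cropped, p1, p2):
    # Rotate the tailored crop by the angle of the KL line (p1, p2 are its
    # endpoints) so that the kidney lies horizontally, padding the borders.
    angle = np.degrees(np.arctan2(p2[1] - p1[1], p2[0] - p1[0]))
    h, w = cropped.shape[:2]
    m = cv2.getRotationMatrix2D((w / 2, h / 2), angle, 1.0)
    return cv2.warpAffine(cropped, m, (w, h), borderValue=0)
```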

The algorithm for the parenchymal thickness calculation is shown in the third row of Fig. 2. The kidney sinus was segmented using Fast-Unet++, and the farthest points between the upper contour of the kidney mask and the sinus mask were found (Fig. 2g,h). The distance between these two points is considered the parenchymal thickness.

Post-processing of axial images

We applied an algorithm to calculate the kidney thickness using the axial plane, which some radiologists prefer. As Fig. 3 shows, the kidney thickness was measured in the axial plane as the maximum length parallel to the hilum. After kidney segmentation, the major axis of the mask was calculated using the first eigenvector of a principal component analysis (PCA). The mask was then rotated to the horizontal using the angle derived from the eigenvectors (Fig. 3b). When the hilum is on the left, the rightmost point of the mask represents the kidney border terminal, and vice versa; this point is shown as point 2 in Fig. 3c. Point 1 was determined by finding the first line that passes through four points on the mask contour and assigning the middle point as point 1.
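A sketch of the PCA step, computing the rotation angle from the first eigenvector of the mask's pixel coordinates:

```python
import numpy as np

def major_axis_angle(mask):
    # The first principal component of the mask's pixel coordinates gives
    # the major axis of the segmented kidney; the returned angle (degrees)
    # is then used to rotate the mask to the horizontal.
    ys, xs = np.nonzero(mask)
    coords = np.stack([xs, ys], axis=1).astype(float)
    coords -= coords.mean(axis=0)
    eigvals, eigvecs = np.linalg.eigh(np.cov(coords.T))
    major = eigvecs[:, np.argmax(eigvals)]
    return np.degrees(np.arctan2(major[1], major[0]))
```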

Figure 3

Post-processing of axial view image. (a) Segmentation of axial image, (b) rotation of the image to make it horizontal, (c) estimating kidney thickness (KT) by finding two terminal points, (d) separation of upper and lower parts of the mask and finding the longest distance, (e) kidney width (KW) and KT, and (f) KW and KT in the original image.

The upper and lower parts of the contour were separated to measure the kidney width. For each point in the upper part, the nearest point in the lower part was found, and the longest of these distances was taken as the kidney width (Fig. 3d,e). Finally, the rotation was applied in the opposite direction to recover the true locations of the determined points, as shown in Fig. 3f.
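A compact sketch of this measurement, assuming the two contour halves are (N, 2) coordinate arrays:

```python
import numpy as np
from scipy.spatial.distance import cdist

def kidney_width(upper, lower):
    # For every upper-contour point find its nearest lower-contour point;
    # the longest of these nearest-point distances is the kidney width (KW).
    d = cdist(upper, lower)
    nearest = d.min(axis=1)
    i = int(np.argmax(nearest))
    j = int(np.argmin(d[i]))
    return upper[i], lower[j], nearest[i]  # KW in pixels
```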

Kidney volume prediction

According to various publications3,4,49, the kidney volume extracted from ultrasound images correlates highly with the values extracted from MRI and CT scans. Predicting the kidney volume requires KL and KT, measured from the sagittal view image, and KW, measured from the axial view image. The kidney volume was thus calculated using the following formula:

$$Kidney\;Volume = KL \times KW \times KT \times 0.523$$
(1)
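The constant 0.523 approximates π/6 ≈ 0.5236, making Eq. (1) the standard ellipsoid volume formula with KL, KW, and KT as the three diameters. A direct implementation:

```python
def kidney_volume(kl, kw, kt):
    # Ellipsoid approximation of Eq. (1): 0.523 ~ pi/6; if the dimensions
    # are given in mm, the returned volume is in mm^3.
    return kl * kw * kt * 0.523
```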

Implementation details and evaluation metrics

The proposed model was implemented using Python 3.8.12, TensorFlow 2.3.0, and Keras 2.4.3. The batch size was 32, and the maximum number of iteration steps was 47. The hardware used to train the deep learning model comprised a GeForce RTX 2060 graphics processing unit (GPU), 32 GB of HP DDR4 RAM, and an Intel Core i5-7400 CPU. Adam stochastic optimization50 was used to compile the model with a learning rate of 10^-4. A random normal initializer with a mean of 0 and a standard deviation of 0.02 was used to initialize the filters of the proposed model. The hyperparameters (including the learning rate, batch size, and convolution kernel size) were selected via grid search: the model was trained over a range of values for each parameter, and the values producing the best results, evaluated using the mean Dice similarity coefficient (DSC)51, were selected.
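A minimal, self-contained sketch of this configuration; the single-layer model and the binary cross-entropy loss are placeholders, not the actual Fast-Unet++ network or its loss function:

```python
import tensorflow as tf
from tensorflow.keras import layers, initializers

# Random normal filter initializer (mean 0, standard deviation 0.02) and
# Adam optimization with a learning rate of 1e-4, as described above.
init = initializers.RandomNormal(mean=0.0, stddev=0.02)
model = tf.keras.Sequential([
    layers.Conv2D(1, 5, padding="same", activation="sigmoid",
                  kernel_initializer=init, input_shape=(320, 480, 1)),
])
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
              loss="binary_crossentropy")
```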

Segmentation performance was measured using DSC, Jaccard coefficient (JC)52, and mean absolute distance (MAD)53. The predicted measurements were also evaluated using accuracy, the area under the curve (AUC), sensitivity, specificity, precision, and F1 score54,55.
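For binary masks, the two overlap metrics reduce to a few lines of NumPy (a reference sketch, not the evaluation code used in the study):

```python
import numpy as np

def dice(pred, gt):
    # Dice similarity coefficient: 2|A ∩ B| / (|A| + |B|) for binary masks.
    inter = np.logical_and(pred, gt).sum()
    return 2.0 * inter / (pred.sum() + gt.sum())

def jaccard(pred, gt):
    # Jaccard coefficient: |A ∩ B| / |A ∪ B| (intersection over union).
    inter = np.logical_and(pred, gt).sum()
    return inter / np.logical_or(pred, gt).sum()
```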

The convolutional layers dominate the computational complexity of the proposed model. The number of floating-point operations (FLOPs) required to compute the output of a convolutional layer is given by:

$$FLOPs = 2K^{2} \times C_{in} \times C_{out} \times N$$
(2)

where K is the kernel size of the convolutional filter, Cin is the number of input channels, Cout is the number of output channels, and N is the number of pixels in the input image.

For the Fast-Unet++ model, the total number of FLOPs is approximately 1.5 × 10^10.
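As an illustrative check, Eq. (2) can be evaluated per layer; the channel counts and feature-map size below are examples, not the network's actual values:

```python
def conv_flops(k, c_in, c_out, n):
    # FLOPs of one convolutional layer per Eq. (2): 2 * K^2 * C_in * C_out * N.
    return 2 * k ** 2 * c_in * c_out * n

# Example: a 5x5 layer with 64 input and 64 output channels over a
# 160 x 240 feature map:
print(conv_flops(5, 64, 64, 160 * 240))  # 7864320000
```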

Results

Figure 4 shows the results of the segmentation network on ultrasound images of the kidney in the sagittal and axial views. In this figure, the green contour shows the ground truth delineated by an expert, and the red contour depicts the mask predicted by the proposed network. The close agreement between the manual and automatic contours demonstrates the robustness of the proposed method.

Figure 4

Segmentation results of the ultrasound kidney in the sagittal (top row) and axial (bottom row) images. The green contours represent the manual segmentation, and the red contours represent the automatic results.

To enhance the interpretability of the proposed Fast-Unet++ architecture, we applied Grad-CAM analysis to visualize the model's activation patterns and identify the regions that contribute most significantly to the segmentation predictions. The results are visualized in Fig. 5.
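A hedged sketch of the standard Grad-CAM computation for a Keras segmentation model; the layer name and the scalar score (mean predicted foreground) are assumptions, not necessarily the exact choices used here:

```python
import numpy as np
import tensorflow as tf

def grad_cam(model, image, layer_name):
    # Gradients of a scalar score with respect to a convolutional layer's
    # activations weight its feature maps; the ReLU of the weighted sum
    # gives the activation map, normalized to [0, 1].
    grad_model = tf.keras.Model(
        model.inputs, [model.get_layer(layer_name).output, model.output])
    with tf.GradientTape() as tape:
        conv_out, pred = grad_model(image[np.newaxis, ...])
        score = tf.reduce_mean(pred)
    grads = tape.gradient(score, conv_out)
    weights = tf.reduce_mean(grads, axis=(1, 2), keepdims=True)
    cam = tf.nn.relu(tf.reduce_sum(weights * conv_out, axis=-1))[0]
    return (cam / (tf.reduce_max(cam) + 1e-8)).numpy()
```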

Figure 5

Grad-CAM analysis results for samples of sagittal and axial images. The top row shows the original images and the bottom row the resulting Grad-CAM maps.

We also calculated the DSC for 100 random samples of the test set and present them in a swarm scatter plot, i.e., a scatter plot with the points offset (jittered) along the x-axis. In this figure (provided in the Supplementary information file), sagittal images are labeled as one and axial images as two. As the figure shows, the network performs better on sagittal than on axial images.

Speckle noise is a common artifact in ultrasound images that can degrade the performance of image segmentation models, and previous studies have shown that denoising algorithms can improve the segmentation accuracy of ultrasound images56,57. We therefore evaluated the performance of our Fast-Unet++ model on ultrasound images with and without denoising, using the adaptive bilateral filter58 method. We found that denoising did not significantly improve the segmentation accuracy of our model.
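For reference, the comparison can be sketched with OpenCV; cv2.bilateralFilter is a standard (non-adaptive) stand-in for the adaptive bilateral filter58 used, and the file name and parameter values are illustrative:

```python
import cv2

# Hypothetical input file; parameter values are illustrative, not tuned.
image = cv2.imread("kidney_sagittal.png", cv2.IMREAD_GRAYSCALE)
denoised = cv2.bilateralFilter(image, d=9, sigmaColor=75, sigmaSpace=75)
# Both the raw and the denoised images were then segmented, and the
# resulting DSCs were compared.
```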

We attribute this to the design of Fast-Unet++, which is robust to noise: the skip connections allow information from earlier layers of the network to propagate to later layers, helping the model learn to distinguish relevant features from noise.

Quantitative analysis of the proposed segmentation network on our prepared dataset was performed using the DSC, JC, and MAD metrics, and the results were compared with those of two related networks (Fast-Unet and UNet++) to evaluate how the proposed modifications influence the results. Table 1 shows the results by imaging view (sagittal and axial). The metrics achieved on sagittal view images are better than those on axial view images, as expected from the results presented above.

Table 1 Evaluation of the segmentation of kidney images. The values are in the form of mean ± standard deviation.

Beyond that, the network was compared with several networks using a publicly available dataset43. The compared models comprised Fast-Unet39, nnUnet45, SwinUNETR59, UNet++47, DeepLabV3+60, FCN26, PSPNet61, the traditional U-Net23, and the method proposed by Chaitanya et al.46. The dataset was collected over 5 years from over 500 patients, and all images were categorized into three groups according to the acquisition view. Ground truth annotations were delineated for the kidney capsule, cortex, medulla, and sinus. Since our proposed method was trained to segment the kidney capsule and sinus, we compared our results in these two regions. Some of the results were presented by Singla et al.44 and the rest by Valente et al.62, who used different evaluation metrics; we therefore quote the results from both studies with their own metrics. The results are presented in Table 2. Because the dataset provides segmentations from two experts, Table 2 shows the predictions evaluated against both ground truths; as Singla et al.44 did not report results for both experts' masks, only one could be presented for their method.

Table 2 Evaluation of the segmentation of kidney images using the public dataset. The HD1 and SSD2 metrics are reported in millimeters.

The final evaluation of the proposed network compares the predicted kidney image biomarkers (length, width, thickness, volume, and parenchymal thickness) with those obtained via CT. As mentioned, 13 cases had CT images along with kidney ultrasounds. We used accuracy, AUC, sensitivity, specificity, precision, and F1 score to compare the results. According to Table 3, the proposed method produces robust results across all metrics. For a more accurate comparison, we used the kidney width estimated from sagittal images here.

Table 3 Comparing the predicted kidney dimensions in ultrasound images with the results from CT images. The values are in the form of mean ± standard deviation.

Discussion

In this study, we proposed a novel CNN-based model for kidney segmentation from ultrasound images in the sagittal and axial views and for predicting kidney image biomarkers and volume. To our knowledge, this is the first attempt to predict three kidney dimensions in addition to the volume and parenchymal thickness. We extended our previously published model, Fast-Unet39, by adding nested layers inspired by Unet++47. Compared with these networks, Fast-Unet++ combines the low computational cost of Fast-Unet with the nested layers of Unet++, and this combination yields better results, as reported in Table 1. Fast-Unet nevertheless performed better than Unet++, which is explainable by the intrinsic features of its architecture. As the quantitative and qualitative results show, segmentation of the kidney in the sagittal frame yields more satisfactory results owing to the clearer kidney borders in these images; thus, this project did not use the axial view for parenchymal thickness, as some radiologists do. In addition, the Grad-CAM results demonstrate that the model effectively focuses on the kidney region, with high activation values highlighting the key anatomical features that guide segmentation. This analysis provides valuable insight into the model's decision-making process and reinforces its ability to segment the kidney accurately in ultrasound images.

A comprehensive dataset was acquired from various imaging centers and vendors (GE, Philips, and SIMUT) for training and evaluating the network. Since kidney ultrasound images are affected by the operator's experience, the imaging system, and the defined preset, collecting such an extensive dataset provides an illustrative view of the network's performance.

We compared the network's performance against CT images as the gold standard, since the final goal of this study was to predict the clinically routinely measured dimensions of the kidney. According to the results, our proposed model reliably reproduces all kidney dimensions, especially length and width. This was expected, because the sagittal images had higher quality and the kidney borders were more clearly defined than the sinus borders. In addition, the evaluation of the kidney volume shows acceptable values across all metrics. To our knowledge, the study by Kim et al.42 is the only paper comparing kidney volumes predicted from ultrasound images using artificial intelligence with CT images; their results show a 90% correlation between the predicted and reference values. While they reported promising results, their work suffers from a low number of training images and evaluation metrics. Moreover, they used 3D ultrasound, which provides more accurate images.

Another evaluation of the proposed method compared the results with two other CNNs on a public dataset43. Since nnUnet45 and Chaitanya et al.'s approach46 have been tested on this dataset, comparing our proposed model with them gives good insight into the model's performance. As the results show, the proposed model achieved better results in terms of DSC and HD. Although the MAD metric was calculated, it was not comparable because the authors did not provide this information in their paper. The main advantage of the proposed model, which underlies this superiority, is that it is less complicated than those two.

One of the most important limitations of this work is that the radiologist's skill in acquiring the input image directly influences the segmentation result. Some models have been introduced to overcome this issue. For instance, van den Heuvel et al.63 proposed a model to automatically detect the fetal head and then estimate the head circumference (HC) from a sequence of 2D ultrasound frames obtained using the obstetric sweep protocol (OSP). Their paper resulted from a program aimed at expanding ultrasound, with the help of artificial intelligence, in developing countries that lack trained sonographers. This approach could be implemented for kidney image biomarker prediction as well: the sonographer sweeps the ultrasound transducer over the patient's skin following a predefined protocol that fully covers the sagittal and axial views of the kidney, a CNN model analyzes the frames to detect the best sagittal and axial ones, and the image segmentation is then applied to the best frames.

The other limitation is the definition of the best frame for estimating kidney length and parenchymal thickness. By definition, kidney length is estimated in the frame showing the largest view of the kidney; however, this is not necessarily a good choice for predicting parenchymal thickness, which requires proper contrast between the sinus and the parenchyma.

In addition, some abnormal structures, such as polycystic kidneys (PKD) or tumors, can alter the kidney's appearance and make it more difficult for the model to identify the correct segmentation boundaries64. PKD is a genetic disorder characterized by the development of numerous fluid-filled cysts within the kidneys; these cysts vary in size and distribution, dramatically altering kidney morphology and complicating segmentation. Tumors, on the other hand, can have a variety of appearances, ranging from solid masses to cystic lesions, and their irregular shapes and heterogeneous echogenicity can make it difficult for the model to distinguish them from normal kidney tissue. To address these challenges, future studies may focus on utilizing prior knowledge about the characteristics of abnormal structures to guide the model's segmentation, and on employing ensemble methods that combine multiple models, each trained on different aspects of the image, such as texture, shape, and intensity.

This work can be extended by predicting the thickness of the renal medulla and cortex, which are strong indicators of kidney function and can reveal disorders such as chronic kidney disease (CKD)65. This would require a high-quality dataset in which the medulla appears clearly in sagittal images. However, it raises an important issue: differentiating between the medulla and kidney cysts, because the medulla appears hypoechoic in ultrasound images.

Conclusion

The proposed CNN architecture demonstrated robust kidney segmentation in both sagittal and axial frames. The model achieved promising values for the kidney image biomarkers (including length, width, thickness, and parenchymal thickness) and the kidney volume, particularly for sagittal frames. These results suggest that the model can be applied in clinical settings for kidney assessment. Future work should focus on improving the accuracy of parenchyma volume estimation and exploring the model's performance on a larger and more diverse dataset.