1 Introduction

In today’s healthcare landscape, artificial intelligence tools hold great promise for clinicians by enhancing breast cancer diagnostics and tailoring treatment strategies to the characteristics of the disease [1,2,3]. However, existing alternatives, such as command-line tools with shell scripts [4] and manual, semi-automated, or fully automated image-processing methods [5], are not user-friendly for specialists and researchers without a background in computer science. Furthermore, the available graphical interface tools are often task-specific [6, 7], focusing on contour delineation, segmentation, or classification.

In this context, radiomics is an emerging field in medical imaging that offers the potential to extract diagnostic and prognostic information from 2D grayscale images by analyzing lesion features [1, 2]. Graphical and mobile tools are therefore elevating the role of radiomics in biomedical research, potentially serving as a second opinion for radiologists in breast lesion detection. Specifically, computer-aided diagnosis (CAD) systems based on deep/machine learning (DL/ML) play a crucial role in addressing various computer vision challenges, such as medical image pre-processing with super-resolution [8,9,10,11] and denoising, data augmentation [12,13,14,15], medical image segmentation [16,17,18] (e.g., NiftyNet [6], MIScnn [16], and NiftySeg [17]), image classification [19], computer-assisted interventions [5], image recognition [20], and annotation [5].

In the context of cancer detection, several radiomic projects, DL/ML-based CAD systems, and studies propose different artificial intelligence techniques that provide decision support for many applications in patient care, such as lesion detection, characterization, cancer staging, and treatment planning. The major challenge in this field is to build a fully automatic CAD system that can analyze large quantities of images to provide an accurate diagnosis while remaining robust enough to handle biological variation among humans [21].

The most successful DL algorithms for processing medical images are convolutional neural networks (CNNs), generative adversarial networks (GANs), and recurrent neural networks (RNNs), each offering unique capabilities in data augmentation, pattern recognition, and feature extraction and playing a crucial role in improving healthcare outcomes through accurate and efficient analysis [22]. In the early detection of breast cancer, CAD systems have several stages: (i) image collection, (ii) annotation and detection of tumors based on the region of interest (ROI), (iii) segmentation, (iv) classification based on ROI shape using deep learning models, and (v) performance evaluation of the models [23, 24].

Image collection and annotation are the main challenges in performing large-scale medical image analysis using DL algorithms. CNN-based options for mask segmentation of detected tumors in medical images include You Only Look Once (YOLO) [25] and region-based convolutional neural networks (R-CNN) [26, 27] with their variants (Mask R-CNN [26] and Faster R-CNN [27]). In addition, deep neural networks for natural language processing (NLP) can help automatically identify and extract relevant information from radiology clinical reports and images [28].

Although a variety of CAD systems have been developed for breast cancer, it is also important to mention systems deployed as mobile applications for use on smartphones; e.g., in [29], an automated breast cancer diagnosis system running on mobile phones that photographs ultrasound reports was implemented. The authors combine the automatic extraction of intricate image features by convolutional neural networks (CNNs) with the precise classification of breast masses, eliminating the need for manual feature engineering and reducing human error. These applications streamline the diagnostic process, increase efficiency, and, most importantly, enhance patient outcomes by providing reliable, consistent, and accessible tools for early breast cancer detection and treatment.

2 Related work

In this section, we briefly introduce NLP and CNN modeling as more recent neural network approaches and discuss how several authors have used these models in radiomics and biomedical applications.

One of the fundamentals of NLP is extracting image information using patterns such as the accession number, series number, and image number. Information about the imaging modality, magnetic resonance imaging (MRI), computed tomography (CT), positron emission tomography (PET), ultrasound (US), or mammography, can also be relevant; it can be extracted from the accession number and image number, while the patient identification number (ID) is appropriate when the patient’s history is of interest.

Linna et al. [30] indicate that NLP tools in radiology and other medical settings have been used for information retrieval and classification. NLP-based algorithms have opened more possibilities for medical image processing, detecting findings, and suggesting possible diagnoses [31]. Wang et al. [32] suggested that cancers are the most common subject area in NLP-assisted medical research on diseases, with breast cancers (23.30%) and lung cancers (14.56%) accounting for the highest proportion of studies. Also, Luo et al. [33] specified that NLP is useful for creating new automated tools that could improve clinical workflows and unlock unstructured textual information contained in radiology and clinical reports for the development of radiology and clinical artificial intelligence applications.

Prabadevi et al. [34] proposed a machine learning system using WEKA algorithms for cancer staging classification. Buckley et al. [35] used NLP to extract clinical information from > 76,000 breast pathology reports; the model demonstrated a sensitivity and specificity of 99.1% and 96.5%, respectively, compared to human experts. Chen et al. [36] proposed an NLP extraction pipeline that accepts scanned images of operative and pathology reports. The system achieved 91.9% (operative) and 95.4% (pathology) accuracy, and the pipeline accurately extracted outcome data pertinent to breast cancer tumor characteristics, prognostic factors, and treatment-related variables. Liu et al. [37] implemented an NLP program to accurately extract index lesions and their corresponding imaging features from the text of breast MRI reports.

The NLP system demonstrated 91% recall and 99.6% precision in correctly identifying and extracting image features from the index lesions. The recall and precision for correctly identifying the BI-RADS categories were 96.6% and 94.8%, respectively. Kirillov et al. [38] created the NLP-based segment anything model (SAM) for mask extraction and promptable segmentation tasks; thus, it can transfer zero-shot [39] to new image distributions.

Likewise, in a CAD system, the classification task is an important step after the segmentation process. The most widely used deep learning-based algorithms for image classification are CNN models (ResNet [40], DenseNet [41], NasNet [42, 43], VGG-16 [44], GoogLeNet [45], and Inception-V3 [46]).

Several authors [47,48,49,50,51] have used models for benign and malignant breast mass classification. The CNNs used for breast classification fall into two main categories: (i) models trained from scratch (de novo) and (ii) transfer learning-based models that exploit previously trained networks (e.g., AlexNet, VGG-Net, GoogLeNet, and ResNet) [47]. In [48], the ResNet model was used for classification training on an original and a synthetic mammography (DDSM) dataset, obtaining performances of 67.6% and 72.5%, respectively.

In [49], several CNN models (GoogLeNet, Visual Geometry Group Network (VGGNet), and ResNet) were proposed to classify malignant and benign cells using average-pooling classification. The results outperformed all other deep learning architectures in terms of accuracy (97.67%). However, the choice of architecture depends on the specific problem and involves trade-offs between factors such as model size, computational efficiency, and accuracy.

Despite the extensive availability of research on medical radiomic tools and deep learning-based CAD systems [52, 53], this technology has limited support within mobile app infrastructure for 2D breast medical image analysis. Consequently, BraNet’s workflow has two main phases, off-line and on-line, to achieve the following aims: (i) to develop a mobile app based on deep learning models for segmenting and classifying 2D breast images into benign and malignant lesions and (ii) to implement statistical metrics as a prediction performance evaluation tool.

3 Methods

3.1 Data collection

We collected seven open-access breast image databases, including three datasets of breast ultrasound (US) images and four datasets of mammography images.

  (i) Breast Ultrasound Images Dataset (BUSI): gathered by [43], this dataset comprises 780 images (133 normal, 437 benign, and 210 malignant).

  (ii) Dataset A: collected by Rodrigues et al. [54] and available at https://data.mendeley.com/datasets/wmy84gzngw/1, this dataset contains 250 breast US images (100 benign and 150 malignant).

  (iii) Dataset B: comprising 163 US images, these data were acquired from the UDIAT Diagnostic Centre of the Parc Tauli Corporation, Sabadell, Spain [55].

  (iv) CBIS-DDSM (Curated Breast Imaging Subset of the Digital Database for Screening Mammography): accessible at https://n9.cl/qtl48, this database comprises 2620 cases [56].

  (v) mini-MIAS (Mammographic Image Analysis Society): available at http://peipa.essex.ac.uk/info/mias.html, this dataset includes 322 Medio-Lateral Oblique (MLO) mammograms (208 normal, 63 benign, and 51 malignant) from 161 patients [57, 58].

  (vi) INbreast: this dataset comprises a total of 115 images and can be found at https://biokeanos.com/source/INBreast [59].

  (vii) VinDr-Mammo: a large-scale full-field digital mammography dataset of 5,000 four-view exams (https://physionet.org/content/vindr-mammo/1.0.0/) [60].

3.2 Pretraining models in the off-line phase

3.2.1 Data normalization and automatic ROI annotation

ROI annotation of a large dataset of US and mammography images from the above public databases is needed to improve the previously trained GAN and ResNet models and their computational performance. The breast images vary in size (see Table 1).

Table 1 US and mammography ROIs

Thus, it is necessary to transform and standardize images acquired at different sizes to a single dimension (128 × 128 × 1 pixels), convert them to a single channel (grayscale), and normalize them to the range [−1, 1] with a mean of 0.5 and a standard deviation of 0.5. The torchvision (PyTorch) library and a Jupyter notebook (crop_vindr_images.ipynb) were used for the image annotation process to identify ROIs that may contain lesions.
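A minimal torchvision sketch of this preprocessing, under the stated settings (128 × 128 pixels, a single grayscale channel, normalization with mean 0.5 and std 0.5), might look as follows; the exact pipeline of crop_vindr_images.ipynb is not reproduced here.

```python
import torchvision.transforms as T

# Assumed preprocessing pipeline (PIL image in, normalized tensor out):
# single grayscale channel, 128 x 128 resize, then map [0, 1] -> [-1, 1].
preprocess = T.Compose([
    T.Grayscale(num_output_channels=1),
    T.Resize((128, 128)),
    T.ToTensor(),                        # scales pixel values to [0, 1]
    T.Normalize(mean=[0.5], std=[0.5]),  # (x - 0.5) / 0.5 -> range [-1, 1]
])
```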

Figure 1 details the overall process followed in this study.

Fig. 1
figure 1

The BraNet workflow has two main phases. Off-line: (i) breast data collection from public databases and (ii) data preparation, app testing, and evaluation. On-line: (iii) system architecture design and (iv) statistical metrics comparison between the mobile application and human experts

3.2.2 User ROI extraction and segmentation

As other studies have pointed out [61], to improve detection accuracy, smaller patches (i.e., ROIs) that contain all breast masses and small findings (e.g., cysts and calcifications) are generated from the original mammogram. In most mammogram images, 32 to 56% of the pixels are background, which does not contribute to breast cancer diagnosis.

In this research, the segment anything model (SAM) [38] (an encoder-decoder architecture based on NLP prompt-based learning) [35] was trained for automatic ROI segmentation before being implemented in Module 5 (BraNet on-line phase). SAM is open-source software, and the quality of its segmentation masks had previously been rigorously evaluated, with automatic masks deemed high quality and effective for training models, leading to the decision to include automatically generated masks.

NLP tasks include sentence boundary detection, tokenization, and problem-specific segmentations; the SamAutomaticMaskGenerator function was used for automatic mask extraction [28]. The SAM model is available under a permissive open license (Apache 2.0) at https://segment-anything.com.

The predefined SAM hyperparameters are as follows: points_per_side (32), points_per_batch (64), pred_iou_thresh (0.88), stability_score_thresh (0.95), stability_score_offset (1.0), box_nms_thresh (0.7), crop_n_layers (0), crop_nms_thresh (0.7), crop_overlap_ratio (512/1500), crop_n_points_downscale_factor (1), point_grids (Null), min_mask_region_area (0), and output_mode (‘binary_mask’). SAM ROI segmentation accuracy is evaluated with the intersection-over-union (Jaccard index) metric [18] using the calculate_stability_score function.
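A hedged sketch of this configuration with the segment-anything package is shown below; the checkpoint file name and the vit_h backbone are assumptions for illustration only.

```python
from segment_anything import sam_model_registry, SamAutomaticMaskGenerator

# Checkpoint path and backbone are illustrative assumptions.
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h.pth")

mask_generator = SamAutomaticMaskGenerator(
    model=sam,
    points_per_side=32,
    points_per_batch=64,
    pred_iou_thresh=0.88,
    stability_score_thresh=0.95,
    stability_score_offset=1.0,
    box_nms_thresh=0.7,
    crop_n_layers=0,
    crop_nms_thresh=0.7,
    crop_overlap_ratio=512 / 1500,
    crop_n_points_downscale_factor=1,
    point_grids=None,
    min_mask_region_area=0,
    output_mode="binary_mask",
)

# image: H x W x 3 uint8 RGB array; returns a list of mask dictionaries.
masks = mask_generator.generate(image)
```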

The model was trained on a large and diverse set of masks from mammography and US images. These ROIs were previously extracted from the Mini-MIAS, INbreast, and VinDr-Mammo databases (the corresponding annotated bounding boxes are available in a .csv file). However, the authors of the CBIS-DDSM dataset had already classified its ROIs, so no additional ROI extraction was required. Regarding US images, ROIs were extracted from BUSI and Dataset B (UDIAT), excluding Dataset A because it already contained ROIs.

A total of 6592 breast ROI images were used to pre-train the SAM and GAN models, with 4463 mammography images (benign and malignant) and 1041 US images, as shown in Table 1.

3.2.3 Data augmentation

This technique was employed in the off-line phase to effectively mitigate the risk of overfitting. To generate new realistic images and improve BraNet’s classification performance, all ROIs were augmented by a GAN using the spectral normalization technique (SNGAN). SNGAN introduces a novel weight normalization technique, spectral normalization, to enhance the training stability of the discriminator network [62, 63], serving as the foundation for synthetic image generation, and uses the hinge loss function (see Eq. 1).

$$\begin{aligned} L_{D} &= -\mathbb{E}_{(x,y)\sim P_{\mathrm{data}}}\left[\min\left(0,-1+D(x,y)\right)\right] - \mathbb{E}_{z\sim P_{z},\, y\sim P_{\mathrm{data}}}\left[\min\left(0,-1-D\left(G(z),y\right)\right)\right] \\ L_{G} &= -\mathbb{E}_{z\sim P_{z},\, y\sim P_{\mathrm{data}}}\, D\left(G(z),y\right) \end{aligned}$$
(1)

where \(P_{\mathrm{data}}\) is the real data distribution, \(P_{z}\) is a prior distribution on the noise vector z, \(D(x)\) denotes the probability that x comes from the real data rather than from generated data, \(\mathbb{E}_{x\sim P_{\mathrm{data}}}\) represents the expectation over x drawn from the real data distribution \(P_{\mathrm{data}}\), and \(\mathbb{E}_{z\sim P_{z}}\) is the expectation over z sampled from the noise distribution.
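For concreteness, a minimal PyTorch sketch of the hinge losses in Eq. 1 is shown below, written in unconditional form (the label y is omitted); discriminator layers can be wrapped with spectral normalization as in SNGAN. The layer sizes are placeholders.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Spectral normalization as used in SNGAN: wrap each discriminator layer.
sn_conv = nn.utils.spectral_norm(nn.Conv2d(1, 64, kernel_size=3, padding=1))

def discriminator_hinge_loss(d_real: torch.Tensor, d_fake: torch.Tensor) -> torch.Tensor:
    # -E[min(0, -1 + D(x))] - E[min(0, -1 - D(G(z)))]  (discriminator term of Eq. 1)
    return F.relu(1.0 - d_real).mean() + F.relu(1.0 + d_fake).mean()

def generator_hinge_loss(d_fake: torch.Tensor) -> torch.Tensor:
    # -E[D(G(z))]  (generator term of Eq. 1)
    return -d_fake.mean()
```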

The clean-fid library, together with the TensorFlow and PyTorch libraries, was used to obtain the FID value; some original implementations of the metric were taken from Parmar et al. [64]. The GAN model was trained using a cross-validation technique on Google Colab Pro (one V100 GPU with CUDA cores), with the hyperparameters detailed in Table 2.
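A minimal usage sketch of the clean-fid library is shown below; the directory names are placeholders for folders of real and SNGAN-generated ROIs.

```python
from cleanfid import fid

# Placeholder directories of real and synthetic ROI images.
fid_score = fid.compute_fid("real_rois/", "synthetic_rois/")
kid_score = fid.compute_kid("real_rois/", "synthetic_rois/")
print(f"FID: {fid_score:.2f}  KID: {kid_score:.3f}")
```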

Table 2 Hyperparameter tuning of deep learning models

3.2.4 Cross-validation analysis

This technique divides the dataset into multiple folds (DM: training I (4463) and training II (6592); US: training I (1041)) and trains the model on different subsets while validating on the remaining fold, which provides a more robust estimate of the model’s performance and effectively mitigates the risk of overfitting. It helps detect overfitting early and tune the model accordingly. A total of 6933 benign and malignant ROIs were split into 80% training and 20% validation using the scikit-learn library (see Table 3).
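A minimal scikit-learn sketch of the 80/20 split described above follows; rois and labels are placeholder arrays, and the stratification and random seed are assumptions not stated in the text.

```python
from sklearn.model_selection import train_test_split

# rois: array of ROI images; labels: benign/malignant labels (placeholders).
X_train, X_val, y_train, y_val = train_test_split(
    rois, labels, test_size=0.20, stratify=labels, random_state=42
)
```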

Table 3 Training (80%) and validation (20%) breast image datasets: DM 5273/1319 and US 832/209

3.2.5 ROI classification process

Before implementing Module 6 (ROI image classification) in the BraNet mobile interface, the ResNet model was pre-trained on a large set of generated mammography and US ROIs, also using the cross-validation technique.

ResNet18 training model

The ResNet18 CNN deep learning-based classification model has been widely used in medical image classification, especially in breast lesion diagnosis and detection, and was chosen for its effectiveness in transfer learning, offering reduced training time and automatic feature extraction [40]. This approach effectively mitigates the vanishing or exploding gradients that can arise from increasing neural network depth, ultimately leading to improved accuracy [65,66,67,68].

Thus, to train the ResNet model to distinguish between malignant and benign breast lesions, the datasets were divided into two categories: (i) dataset A (original + synthetic ROIs) and (ii) dataset B (synthetic ROIs). The model consists of three convolutional layers and two fully connected layers. The kernel size of the first convolutional layer is 5 × 5 and 3 × 3 for the rest. The sizes of the first and second fully connected layers are 128 and 2 (the number of classes), with a dropout of 0.5. The ReLU activation function follows the flattening and the first fully connected layer, i.e., all layers except the output layer, where softmax was used. The model was pre-trained with the PyTorch library on Google Colab Pro (one V100 GPU with CUDA cores); the training hyperparameters are outlined in Table 2.
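For illustration, a minimal PyTorch sketch of the small classification network described above is given below; the channel counts, pooling layers, and the 128 × 128 × 1 input size are assumptions not specified in the text, and softmax is applied at inference time (cross-entropy loss handles it during training).

```python
import torch.nn as nn

class BreastROIClassifier(nn.Module):
    """Sketch: three conv layers (5x5 then 3x3 kernels) and two FC layers (128, 2)."""

    def __init__(self, num_classes: int = 2):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(64 * 16 * 16, 128),  # 128x128 input pooled three times -> 16x16
            nn.ReLU(),
            nn.Dropout(0.5),
            nn.Linear(128, num_classes),   # softmax applied at inference time
        )

    def forward(self, x):
        return self.classifier(self.features(x))
```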

3.3 System architecture in the on-line phase

The system architecture consists of two primary components facilitating scalability and system maintenance: (i) the mobile application and (ii) the backend server, following a client–server architecture.

The mobile application is a client that communicates with the backend server to request services and image analysis. The backend server processes these requests and returns the results to the client for display (see Fig. 2). The mobile client was developed using React Native, while the backend server was implemented in the Python programming language.

Fig. 2
figure 2

Client–server architecture to BraNet App
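As an illustration of this client–server exchange, a hedged backend sketch is shown below. The paper does not specify the web framework or endpoint names; Flask, the /classify route, and the classify_roi helper are illustrative assumptions only.

```python
from flask import Flask, request, jsonify

app = Flask(__name__)
app.config["MAX_CONTENT_LENGTH"] = 10 * 1024 * 1024  # 10 MB upload limit (Module 2)

@app.route("/classify", methods=["POST"])
def classify():
    image_file = request.files["image"]                    # PNG/JPG/JPEG sent by the app
    label, probability = classify_roi(image_file.read())  # hypothetical helper wrapping segmentation + classification
    return jsonify({"class": label, "probability": probability})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=8000)
```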

The mobile application comprises several interrelated components:

  • Module 1: Registration, Login Synchronization, and User Profile Information. Data generated by the application, such as images and metadata, is stored in Firebase (a mobile and web application development platform). Firebase is also used for user authentication, mobile application registration, and log in.

  • Module 2: Image Import allows users to upload breast ultrasound and mammography images in PNG, JPG, and JPEG formats, with a maximum file size of 10 MB.

  • Module 3: Visualization Area (a history of image analysis results, image analysis capabilities, and user assistance).

  • Module 4: User ROI extraction.

  • Module 5: ROI segmentation (see Section 3.2.2).

  • Module 6: ROI Image Classification (see Section 3.2.5).

3.4 Evaluation metrics in the off-line and on-line phases

3.4.1 Quality of synthetic image

The FID and KID quantitative feature-based metrics were applied to evaluate the quality of real and synthesized ROIs generated by GAN models and to compute the distance between the vector representations of the synthesized and authentic images.

Fréchet inception distance (FID): FID compares the distributions of the original and synthetic images to assess how well the generated images represent the training dataset. Lower FID scores indicate better quality images [69], and it is calculated as shown in Eq. 2:

$${\text{FID}}={\Vert {\mu }_{r}-{\mu }_{g}\Vert }^{2}+Tr({\sum }_{r}+{\sum }_{g}-2{({\sum }_{r}{\sum }_{g})}^{1/2})$$
(2)

Here, µr represents the mean of the feature vector calculated from the real images, µg is the mean of the feature vector calculated from the fake images, Σr is the covariance of the feature vector from the real images, and Σg is the covariance of the feature vector from the fake images.
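As a reference, the following hedged sketch implements Eq. 2 directly on pre-extracted Inception feature vectors (NumPy/SciPy); feature extraction itself is assumed to be handled elsewhere, e.g., by the clean-fid library mentioned above.

```python
import numpy as np
from scipy.linalg import sqrtm

def frechet_inception_distance(feat_real: np.ndarray, feat_fake: np.ndarray) -> float:
    """Eq. 2 on feature matrices (rows = images, columns = Inception features)."""
    mu_r, mu_g = feat_real.mean(axis=0), feat_fake.mean(axis=0)
    sigma_r = np.cov(feat_real, rowvar=False)
    sigma_g = np.cov(feat_fake, rowvar=False)
    covmean = sqrtm(sigma_r @ sigma_g)
    if np.iscomplexobj(covmean):        # discard small imaginary residue from sqrtm
        covmean = covmean.real
    return float(np.sum((mu_r - mu_g) ** 2)
                 + np.trace(sigma_r + sigma_g - 2.0 * covmean))
```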

Kernel inception distance (KID)

KID employs a cubic kernel to compare the skewness, mean, and variance [69]. A lower KID value signifies a higher visual similarity between the actual and generated images. The cubic polynomial kernel is defined as shown in Eq. 3:

$$k\left(x,y\right)={\left(\frac{1}{d} {x}^{T}y+1\right)}^{3}$$
(3)

where d represents the dimension of the feature space for vectors x and y.
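A hedged sketch of the KID computation implied by Eq. 3 follows: an unbiased squared maximum mean discrepancy (MMD) estimate with the cubic polynomial kernel, applied to pre-extracted feature vectors (the block averaging used in practice is omitted for brevity).

```python
import numpy as np

def kid_mmd2(feat_real: np.ndarray, feat_fake: np.ndarray) -> float:
    """Unbiased squared MMD with the cubic kernel of Eq. 3 (the core of KID)."""
    d = feat_real.shape[1]
    kernel = lambda a, b: (a @ b.T / d + 1.0) ** 3   # cubic polynomial kernel
    k_rr = kernel(feat_real, feat_real)
    k_ff = kernel(feat_fake, feat_fake)
    k_rf = kernel(feat_real, feat_fake)
    m, n = feat_real.shape[0], feat_fake.shape[0]
    return ((k_rr.sum() - np.trace(k_rr)) / (m * (m - 1))
            + (k_ff.sum() - np.trace(k_ff)) / (n * (n - 1))
            - 2.0 * k_rf.mean())
```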

3.4.2 BraNet’s classification performance evaluation

To assess the BraNet classifier’s performance, we employed the confusion matrix, F1 score (Dice), accuracy (Acc), precision (Prec), sensitivity (Sen), recall, and specificity (Spec) metrics [24] (see Tables 4 and 5). The accuracy of the model was calculated using statistical scoring utilities such as the classification report and confusion matrix from the Python scikit-learn module.
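A minimal scikit-learn sketch of this evaluation is shown below; y_true and y_pred are placeholder arrays of ground-truth and predicted labels, and the 0 = malignant / 1 = benign encoding is an assumption.

```python
from sklearn.metrics import classification_report, confusion_matrix

# y_true, y_pred: placeholder 1-D arrays of ground-truth and predicted labels.
print(confusion_matrix(y_true, y_pred))
print(classification_report(y_true, y_pred,
                            target_names=["malignant", "benign"], digits=3))
```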

Table 4 Confusion matrix to distinguish between two classes (benign and malignant)
Table 5 Validation assessment metrics

3.4.3 Human expert evaluation

Two senior radiologists were asked to assess, annotate, and classify images independently to ensure that BI-RADS categories were correctly assigned. Representative original ROI images for each breast type are available at https://drive.google.com/drive/folders/1HMeqPfI8qL58hAqwVpZupH6uq4W_kHrI.

A comparison between the images tested by human experts and those annotated in public databases was conducted using an independent test set of 212 mammography images (47 malignant and 165 non-malignant) and 78 US images (24 malignant and 54 non-malignant).

Two diagnostic radiologists (reader 1 with 20 years of experience and reader 2 with 13 years of experience) were given a reading test consisting of a total of 290 original ROI images to assign the perceived breast tissue type. The readers rated each image as (1) benign or (0) malignant.

Kappa coefficient and overall accuracy

Furthermore, the agreement between the two readers’ answers (considering all elements of the error matrix) was assessed by determining the kappa coefficient (K), which ranges from 0 (no agreement) to 1 (perfect agreement) and is calculated using Eq. 4. The error matrix was calculated by comparing the two readers’ choices from five possibilities and was interpreted as follows: < 0.2 slight; 0.21–0.40 fair; 0.41–0.60 moderate; 0.61–0.80 high; and 0.81–1.0 almost perfect [70].

$$k=\frac{{P}_{0}-{P}_{e}}{1-{P}_{e}}$$
(4)

Po is the proportion of correctly allocated samples (observed agreement cases), and Pe is the hypothetical probability of random agreement.

The overall accuracy (Eq. 5) describes model performance and is calculated by dividing the total number of correctly classified samples by the total number of samples.

$${\text{Acc}}= \frac{{C}_{s}}{{N}_{s}}$$
(5)

Cs is the number of correct samples classified, and Ns is the total number of samples.
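For reference, a hedged scikit-learn sketch of Eqs. 4 and 5 is shown below; reader_1, reader_2, and reference_labels are placeholder arrays of per-image 0/1 ratings and ground-truth labels.

```python
from sklearn.metrics import cohen_kappa_score, accuracy_score

# Inter-reader agreement (Eq. 4) and overall accuracy (Eq. 5).
kappa = cohen_kappa_score(reader_1, reader_2)
overall_acc = accuracy_score(reference_labels, reader_1)  # Cs / Ns for reader 1
print(f"kappa = {kappa:.2f}, overall accuracy = {overall_acc:.2%}")
```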

4 Results

The main results are categorized into two phases, off-line and on-line. First, we present the preprocessing and pre-trained models with the data augmentation (GAN), segmentation (SAM), and classification (ResNet) algorithms. Then, we present the on-line phase with the practical utility of BraNet’s user interface and its modules.

4.1 Preprocessing and pretrained models

4.1.1 Synthetic data to feed the classification network

A significant number of synthetic ROIs (10,000 (training I) and 2000 (training II) mammography ROIs and 4000 US ROIs (training I)) were generated by the SNGAN to feed the classifier. The loss function and accuracy of the generator and discriminator play a crucial role in assessing the training stability and performance of GANs. A stable GAN is characterized by a discriminator loss around 0.5 or higher than 0.7, while the generator loss typically ranges from 1.0 to higher values such as 1.5 or 2.0. The accuracy of the discriminator, on both real and synthetic images, is expected to hover around 70 to 80%. Appendix Table 9 presents the accuracy plot for the SNGAN.

The average FID and KID values for the SNGAN are 58.80/0.052 and 52.34/0.051 for mammography training I and training II, respectively, and 116.85/0.06 for US training I (see Appendix Fig. 6). The low values indicate that SNGAN-generated synthetic images closely resemble the original mammography and US images in clinical characteristics, suggesting their potential utility in clinical data augmentation and training, particularly for enhancing diagnostic skills in breast imaging.

4.1.2 ResNet training model

The model shows the highest accuracy in US image classification (see Table 6) compared with the mammography dataset, even though the network received more mammography images (6592; Mini-MIAS, INbreast, CBIS-DDSM, and VinDr-Mammo) as input than US images (1041; BUSI, UDIAT, and Dataset A). This means that the amount of data is not the only factor in training deep learning algorithms. It is also important to consider the variety of abnormalities, especially in the mammography data, such as microcalcifications, nodules, masses, asymmetry, and dense breasts, because it can improve the accuracy of the ResNet training model.

Table 6 ResNet statistical performance evaluation in US and DM image classification

Therefore, it is essential to monitor the evolution and performance of the models using the training and validation datasets (see Fig. 3a–d). Figure 3a, c displays the loss and accuracy values per epoch during ResNet training and validation with mammography and US images, while Fig. 3b, d shows the accuracy, F1 score, recall, and precision per epoch for both image types. Appendix Table 10 shows the details of the network training and validation datasets.

Fig. 3
figure 3

Training II and testing plots for mammography images: a loss vs. acc (real ROI data), b loss vs. acc (real + data augmentation), c evaluation metrics (real ROI data), and d evaluation metrics (real + data augmentation)

The BraNet mobile app exhibited the highest accuracy in classifying benign and malignant US images (94.7%/93.6%) compared to DM during training I (80.9%/76.9%) and training II (73.7%/72.3%). Moreover, the ResNet model did not improve the accuracy of benign and malignant ROI lesion classification during training II compared to the previous training I.

4.2 The BraNet and its graphical user interface (GUI)

The mobile app’s user interface was developed using Python v3.11 and React Native, a JavaScript framework for creating native mobile applications compatible with iOS and Android platforms. The interface is composed of several modules, each serving a distinct purpose:

  • Module 1: Registration, Login Synchronization, and User Profile Information: This module handles user registration and login functionalities, synchronizing user data and providing access to user profile information.

  • Module 2: Image Import: Users can import images in standard picture formats, such as JPG, JPEG, and PNG, with a maximum size limit of 10 MB.

  • Module 3: Visualization Area: This area displays loaded images. The selected image is displayed in grayscale, preserving the original image's aspect ratio (see Fig. 4a).

  • Module 4: Manual ROI Extraction: This module allows users to manually or semi-automatically create masks and define ROIs within the selected image. Masks are represented as binary matrices with the same dimensions as the loaded mammography image, where true values indicate the ROIs. Users can customize the size and sampling method for ROIs.

  • Module 5: ROI Segmentation: Users can segment a subset or the entire set of features from the segmentation section (as shown in Fig. 4b). Before performing calculations on the image, the user must add at least one ROI in the “Regions and Masks.”

  • Module 6: ROI Image Classification: This module employs the ResNet18 model to classify ROI images into benign and malignant classes. Example output classes are provided in Fig. 4b.

Fig. 4
figure 4

BraNet user interface within the toolbox. a Upload the breast image type. b US breast ROI selection and classification as benign class

The BraNet’s graphical user interface enhances the user experience by providing intuitive image analysis and classification tools, making it a valuable resource for medical professionals and researchers. The practical usage of the tool can be accessed via the following link: https://drive.google.com/file/d/1d1vnjQ6LqOd0fdz65eaVg791d7cFRPWO/view

4.2.1 Comparison of the BraNet with human experts’ evaluation

The accuracy percentages of correct classification between benign and malignant images for readers 1 and 2 were 29% and 42%, respectively. Reader agreement was assessed using the kappa coefficient, whose values were 70% and 71% for mammography and US classification, respectively. Table 7 indicates fair agreement (0.3) for mammography images and moderate agreement (0.4) for ultrasound images between the readers, with a change in prevalence from the lowest in US images to the highest in mammography images, resulting in a corresponding change from sensitivity (19.2/60) to specificity (51/84.4) percentage points. This effect was statistically significant (p < 0.05) for either sensitivity or specificity in both image types.

Table 7 Interrater reliability, Cohen’s kappa, and statistical values for 2 raters in both classes

5 Discussion

The pressing need to transition automatic medical image classification by CAD systems from research laboratories into practical clinical applications is evident. BraNet aims to provide an API for setting up a breast image classification pipeline with ROI mask extraction and segmentation capabilities. The tool offers an open-source solution for processing US and mammography images, complete with statistical metrics for evaluating model performance.

There have been many published examples of AI algorithms that demonstrate excellent performance in cancer detection for screening mammography. These include several algorithms trained and evaluated on private and public data sets. Table 8 compares BraNet’s performance against other state-of-the-art medical image classification applications.

Table 8 Comparison of the BraNet’s performance against other deep learning applications

However, there is a significant gap in understanding how these AI applications will perform with multimodal images in the real world when radiologists use them in clinical practice [75, 76].

The BraNet Mobile App is an open interface for classifying specific 2D breast image types using deep learning models. It is believed to be the first system for breast cancer diagnosis deployed on mobile phones for both image types. The API’s development comprises two main phases: (i) off-line, to pre-train the deep learning algorithms, and (ii) on-line, to release the app, which includes several modules: model selection, ROI extraction (by a human expert), segmentation (SAM model), classification (ResNet18 model), and model evaluation.

During the off-line phase, the pre-trained GAN algorithm was implemented for synthetic image generation, and image quality was evaluated by two feature-based metrics, FID and KID. It is widely acknowledged that image preprocessing and the quality and diversity of the training dataset greatly impact the training of GAN and CNN deep learning models [77,78,79]. Lower FID and KID values mean a higher visual similarity between the real and generated images. The results (Appendix Fig. 6) indicate that the SNGAN model is suitable for mammography and US synthetic data generation, with average values of FID = 52.4/KID = 0.051 for mammography and FID = 116.85/KID = 0.06 for US.

With these datasets, Dataset A (original + synthetic ROIs) and Dataset B (real ROIs), the classification model was trained. Table 6 and Fig. 3 show the average accuracy results of BraNet ROI classification: (i) training I in US (94.7% (B)/93.6% (M)) and DM (80.9% (B)/76.9% (M)) and (ii) training II in DM (73.7% (B)/72.3% (M)). The results demonstrate that the ResNet model during training II with original + synthetic images (where the VinDr-Mammo database was added) did not improve the accuracy (73.7%/72.3%) with respect to training I (80.9%/76.9%). In comparison, the radiological experts’ accuracy in DM was 29%, compared with 70% agreement in DM for both readers. These results show that both the API and the readers obtained a better percentage of accuracy in classifying ROIs from mammography images than from US images.

A final comparison between BraNet and the radiological experts’ evaluation demonstrates that, across all breast image types, reader accuracy was higher with US images (75%) than with original ROI images from public databases. Reader agreement was 70% and 71% for mammography and US classification, respectively. The kappa value indicates fair agreement (0.3) for mammography images and moderate agreement (0.4) for ultrasound images between the readers (Table 7). This can be contrasted with BraNet classification accuracy (Table 8), where the API shows the highest accuracy in US image classification (Table 6) compared with the mammography dataset, even though the network received more mammography images (5892) than US images (1041). This means that the amount of data is not the only factor in training deep learning algorithms. It is also important to consider the variety of abnormalities, especially in the mammography data, where several BI-RADS categories are present (microcalcifications, nodules, masses, asymmetry, and dense breasts), which can affect the accuracy of the ResNet training model.

According to the previous results, some limitations in implementing BraNet must be addressed in future work. One is the need to classify and characterize images based on different abnormalities, such as architectural distortion, asymmetry, mass, and microcalcification. BraNet was not trained using different breast tissue types and variations in mammography and US imaging techniques; the ROI classification process was performed using only two classes: 1 (benign, BI-RADS 1–3) and 0 (malignant, BI-RADS 4–6). Oyelade et al. [80] indicate that it is better to focus on previously classified and characterized abnormalities (architectural distortion, asymmetry, mass, and microcalcification) so that training distinctively learns the features of each abnormality, generating sufficient images for each category before training a GAN model.

Thus, in future work, we plan to annotate the datasets with more fine-grained classes to obtain more targeted training of the GAN and CNN models. Moving forward, we should consider pre-processing with denoising and super-resolution, improving overall image quality and reducing blur and artifacts. Prior classification of breast tissue types is also needed to obtain a diverse range of synthetic data, resulting in a more accurate image generation and classification process using GAN and convolutional algorithms. We must also compare our image classification with other transfer learning models, such as NasNet and DenseNet, to ensure we use the most effective techniques.

An updated version of the BraNet application could be implemented to prospectively explore real AI/human interaction and to recognize full 2D images rather than only resized 128 × 128 pixel images. The app could also undergo performance and load testing to assess how it processes many images simultaneously, simulating an increasing number of users or requests to see how the application performs under progressively higher loads.

We also plan to deploy the app as a web server and perform scalability testing: incrementally increasing the resources (CPU, GPU, and memory) available to the application and measuring the performance improvement to determine how efficiently it scales; making full use of available CPU/GPU cores to process images in parallel, enhancing throughput; and utilizing image compression techniques to reduce the size of high-resolution breast images without losing critical details necessary for analysis.

Finally, the use of AI in medical diagnosis brings a range of ethical considerations that must be carefully navigated to ensure that the integration of these technologies benefits patients, healthcare providers, and the broader healthcare system responsibly and equitably. It is essential to highlight ethical considerations regarding the use of artificial intelligence in developing CAD systems in healthcare.

The patient’s well-being is paramount, necessitating a comprehensive approach to protecting their data privacy and confidentiality [81, 82]. This project ensures patient privacy through the anonymization and coding of training image databases during the application’s first and second modules, which are also publicly available.

Another ethical consideration is the fairness of AI models [83], which requires providing equitable healthcare outcomes across various patient demographics. Thus, the developed application aims to contribute to medical service equity, particularly by facilitating pathology diagnosis in rural groups and sectors typically deprioritized in healthcare, especially in developing countries.

Finally, transparency regarding the capabilities and limitations of CAD systems is fundamental [84], ensuring that medical staff and patients know that decisions and outcomes adhere to ethical standards. In this context, the developed application is merely a test prototype that aspires to achieve the necessary maturity for use in a real healthcare setting, ensuring the requisite medical reliability.

6 Conclusions

In this paper, we have introduced BraNet, a mobile app for breast image classification based on deep learning algorithms. The API enables the rapid construction of breast image classification workflows, encompassing data input/output, ROI mask extraction, segmentation, and evaluation metrics. The client–server architecture, coupled with its open interface, empowers users to customize the pipeline and swiftly establish comprehensive medical image classification setups using Python libraries and the React Native framework for creating native mobile applications on iOS and Android. We have demonstrated the functionality of the BraNet app by conducting automatic cross-validation on data augmentation, ROI segmentation, and classification using public ultrasound and mammography datasets, resulting in a preclinical tool. After implementing some improvements and future updates, BraNet will facilitate the migration of medical image segmentation and classification from research laboratories to practical applications. Ensuring that the app complies with all regulations and standards governing data privacy and security in healthcare is also essential; since the tool is still in a preclinical testing phase, work remains to be done in this area. BraNet currently offers a pipeline for breast image segmentation and classification and will continue to receive regular updates and extensions in the future. Its results must be rigorously analyzed, reported, and, where appropriate, published in scientific journals to ensure their accuracy and reliability.