An aseptic approach towards skin lesion localization and grading using deep learning and harris hawks optimization

Balaha, Hossam Magdy; Hassan, Asmaa El-Sayed; El-Gendy, Eman M.; ZainEldin, Hanaa; Saafan, Mahmoud M.

doi:10.1007/s11042-023-16201-3

An aseptic approach towards skin lesion localization and grading using deep learning and harris hawks optimization

Open access
Published: 28 July 2023

Volume 83, pages 19787–19815, (2024)
Cite this article

Download PDF

You have full access to this open access article

Multimedia Tools and Applications Aims and scope Submit manuscript

An aseptic approach towards skin lesion localization and grading using deep learning and harris hawks optimization

Download PDF

Hossam Magdy Balaha¹,
Asmaa El-Sayed Hassan²,
Eman M. El-Gendy¹,
Hanaa ZainEldin¹ &
…
Mahmoud M. Saafan¹

832 Accesses
3 Citations
Explore all metrics

Abstract

Skin cancer is the most common form of cancer. It is predicted that the total number of cases of cancer will double in the next fifty years. It is an expensive procedure to discover skin cancer types in the early stages. Additionally, the survival rate reduces as cancer progresses. The current study proposes an aseptic approach toward skin lesion detection, classification, and segmentation using deep learning and Harris Hawks Optimization Algorithm (HHO). The current study utilizes the manual and automatic segmentation approaches. The manual segmentation is used when the dataset has no masks to use while the automatic segmentation approach is used, using U-Net models, to build an adaptive segmentation model. Additionally, the meta-heuristic HHO optimizer is utilized to achieve the optimization of the hyperparameters of 5 pre-trained CNN models, namely VGG16, VGG19, DenseNet169, DenseNet201, and MobileNet. Two datasets are used, namely "Melanoma Skin Cancer Dataset of 10000 Images" and "Skin Cancer ISIC" dataset from two publicly available sources for variety purpose. For the segmentation, the best-reported scores are 0.15908, 91.95%, 0.08864, 0.04313, 0.02072, 0.20767 in terms of loss, accuracy, Mean Absolute Error, Mean Squared Error, Mean Squared Logarithmic Error, and Root Mean Squared Error, respectively. For the "Melanoma Skin Cancer Dataset of 10000 Images" dataset, from the applied experiments, the best reported scores are 97.08%, 98.50%, 95.38%, 98.65%, 96.92% in terms of overall accuracy, precision, sensitivity, specificity, and F1-score, respectively by the DenseNet169 pre-trained model. For the "Skin Cancer ISIC" dataset, the best reported scores are 96.06%, 83.05%, 81.05%, 97.93%, 82.03% in terms of overall accuracy, precision, sensitivity, specificity, and F1-score, respectively by the MobileNet pre-trained model. After computing the results, the suggested approach is compared with 9 related studies. The results of comparison proves the efficiency of the proposed framework.

Skin cancer diagnosis based on deep transfer learning and sparrow search algorithm

Article Open access 23 September 2022

Robust optimization of SegNet hyperparameters for skin lesion segmentation

Article 29 May 2021

Detection and classification of dermatoscopic images using segmentation and transfer learning

Article 21 February 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Nowadays, cancer is considered one of the killing diseases worldwide [28]. Over the next fifty years, it is predicted that the total number of cases of cancer will double [60]. However, early detection of cancer helps in minimizing the number of deaths caused by it [58]. Skin cancer is considered the most popular form of cancer. In addition, melanoma is the most aggressive type of Skin Lesion (SL). It is a malignant tumor that develops from the pigment-containing cells called melanocytes [21]. Also, it has the most rapidly increasing mortality rate among skin cancers. According to the Skin Cancer Foundation, over 9,500 people in the US are diagnosed with skin cancer daily. For every hour, skin cancer causes the death of 2 people [58]. Additionally, in the United States, about 197,700 new melanoma cases are estimated to be diagnosed and about 7,650 people are expected to die of it in the year 2022. According to the statistics [58], the risk of getting melanoma for whites is about 2.6%, for blacks about 0.1%, and for Hispanics about 0.4%. Moreover, 90% of the deaths associated with cutaneous tumors are caused by melanomas [30]. Melanomas occur in different shapes, colors, and sizes. This is the reason behind the difficulty to deliver a comprehensive set of warning signs. Since detecting this type of cancer early is so crucial, the common signs, symptoms, and early detection strategies are discussed. Many moles, growths, and brown spots on the skin are harmless but it is not always the case. The ABCDEs and the Ugly Duckling sign can help in detecting melanoma. The first five letters of the alphabet (i.e., ABCDE) [62] can be explained as follows:

A is for Asymmetry: Most melanomas are asymmetrical (i.e., the two halves don’t match).
B is for Border: The borders usually are uneven and may have notched or scalloped edges.
C is for Color: Multiple colors in melanomas are a warning sign. The moles that have different shades of brown, tan, or black tend to be melanomas.
D is for Diameter: If a lesion is the size of an eraser (i.e., about 6 mm in diameter) or larger, it tends to be a melanoma.
E is for Evolving: Any change in shape, size, color, or elevation of a spot on the skin, is considered a warning sign of melanoma.

Another warning sign of melanoma is the Ugly Duckling [59]. The concept behind this strategy is based on the thought that most normal moles resemble one another, whereas melanomas stand out like ugly ducklings in comparison. Several factors increase the risk of developing melanoma. These factors include fair skin, excessive ultraviolet (UV) light exposure, the existence of many moles or unusual moles, a family history of melanoma, and a weakened immune system [22].

As mentioned before, accurate recognition of melanoma in its early stages can increase the survival rate significantly. Nevertheless, the manual detection of melanoma is time-consuming, complicated, and error-prone [3]. Additionally, it suffers from inter-observer variations and creates a huge demand for well-trained specialists. Manual scanning is considered a challenging task for many reasons [2]. Dermoscopy images might contain hair, blood vessels, lubricants, bubbles, as well as other disturbances that make the identification difficult. Low contrast between the outer tissue and the SL region makes the segmentation difficult. Different sizes, forms, and colors of SL might restrict the effectiveness of approaches in obtaining greater levels of accuracy.

Moreover, manual detection of melanoma is difficult for dermatologists, hence, it is worthwhile to develop a reliable automatic system for melanoma detection, advancing the accuracy and efficiency of dermatologists and pathologists. Computer-Aided Diagnostic (CAD) doctors save time and diagnose skin cancer more accurately. Recently, many publications based on Convolution Neural Network (CNN) are proposed to deliver fully automated systems [1, 18, 31]. Generally, the automatic detection can be classified into five steps (i.e., preprocessing, segmentation, feature extraction, classification, and deployment stages) [9]. In preprocessing, various image preprocessing techniques (e.g., color space correction, contrast augmentation, and noise removal) are applied to enhance skin cancer diagnosis [40]. Then, image segmentation is applied to extract regions of interest by segmenting cancerous areas from healthy areas [63]. It is used to provide accurate detection by isolating healthy tissue before extracting characteristics from lesions. Typically, the feature extraction is performed after the segmentation. This step lowers the size of data to make it more manageable. Moreover, preserving relevant data will make processing data faster and easier. Also, the detecting accuracy will significantly rise when the feature extraction is executed properly [49]. The purpose of the classification is to allocate a region of interest (ROI) from a picture of a specific class. Many melanoma detection reports employed feedforward and recurrent NNs. Different machine and deep learning approaches had been proposed to perform the task of skin cancer detection and segmentation (e.g., Fuzzy C-means [56], support vector machines SVM [20], Deep Neural Networks [24], and Recurrent Neural Networks [54]). The traditional machine learning (ML) methods require extracting the features manually [19]. Then, classification and segmentation methods are utilized to classify and segment the tumor using those features. Deep learning (DL) is a popular ML research branch that can effectively capture complex relations [8, 12].

The current study focuses on melanoma skin cancer detection. The classification is accomplished using 5 pre-trained convolution neural network (CNN) models. They are VGG16, VGG19, DenseNet169, DenseNet201, and MobileNet. Additionally, the Harris Hawks Optimization (HHO) algorithm is used to accomplish to gain the performance metrics of the state-of-the-art (SOTA). Segmentation is performed in two ways, namely manual segmentation and automatic segmentation. The manual segmentation is used when the dataset has no masks to use while the automatic segmentation approach is used, using U-Net models, to build an adaptive segmentation model. A novel manual segmentation approach called “HMB-MAS” is proposed in the current study.

This study proposes a hybrid approach using deep learning-based algorithms and Harris Hawks optimization algorithm (HHO) for skin lesion detection, classification, and segmentation. HHO algorithm has been used to optimize the hyperparameters of the applied models to enhance the performance of these models. Deep learning (DL) has been integrated with HHO to make it more robust and adaptive utilizing a proper balance between exploration and exploitation of the search space.

The main contributions of the current study can be recapped in the next points:

Proposing a hybrid approach using Harris hawks optimization algorithm and deep learning approach for melanoma detection.
Proposing a novel manual segmentation approach called “HMB-MAS” for extracting regions of interest.
Applying transfer learning using 5 pre-trained CNN models.
Applying Harris hawks’ optimization algorithm to optimize the hyperparameters and acquire the optimal configurations for each model.
Reporting the state-of-the-art performance metrics and comparing the current performance with the reported related studies.

The rest of this paper is organized as follows: A brief review of some publications introduced for melanoma detection is discussed in Section 2. The proposed methodology is described in detail in Section 3. Section 4 shows the experiments and reports the results. Conclusion, limitation, and future works are presented in Section 5.

2 Related studies

Recently, many pieces of research to detect melanoma automatically based on deep learning techniques are conducted. Several approaches and techniques were proposed in publications to aid in computer- aided diagnosis. Some of these are retrieved in terms of the newness and the highest performance. The discussed studies can be classified into A) Deep Learning-based techniques and B) hybrid techniques.

2.1 Deep learning-based techniques

Recently, a shift from the handcrafted feature vector to extract the features from the input automatically by the computer has become prevalent. This concept is the idea of numerous DL-based algorithms. Vani et. al. [64] presented a new technique to detect melanoma as well as a prediction tool. The suggested technique utilized DL techniques to predict the lesion of the afflicted region. Additionally, multiple performance metrics such as precision, accuracy, recall, and F1-score are used to assess the proposed approach. For improving the quality of images, pre-processing algorithms were applied. The segmenting between normal and diseased skin areas was done using an active contour segmentation procedure. The detection was done using Self-Organizing Map (SOM) and CNN classifiers. The suggested approach showed significant efficiency in melanoma detection with improved accuracy of 90% using a randomly generated collection of 500 pictures, 350 pictures as the training dataset, and 150 pictures as the validation dataset. To address the task of skin lesion segmentation, Wu et. al. [65] developed a Feature Adaptive Transformers Network (FAT-Net). It is an effective and innovative two-stream net with a feature adaptation transformer. Unlike the typical CNN-based encoders, the proposed transformer encoder achieves segmentation using a new sequence-to-sequence predicting technique. Testing is performed on four public datasets (i.e., the International Skin Imaging Collaboration (ISIC) 2016, ISIC 2017, ISIC 2018, and PH2 datasets). Also, extensive experiments were performed to verify the achieved accuracy. Their reported experiments and results showed the superiority of the proposed FAT-Net in terms of accuracy and testing speed.

A modified U-Net approach for segmenting skin lesions in medical imagery to perform melanoma detection is presented by Anand et. al. [5]. The PH2 dataset was used for validating the proposed network. The suggested U-Net was tested by utilizing the Stochastic gradient descent (SGD), Adadelta, and Adam optimization techniques. The suggested approach achieved an accuracy of 96.27%, a Jaccard Score of 96.35%, and a Dice Coefficient of 89.01%. Agrahari et. al. presented a multi-class classification for skin cancer [4] with high efficiency equivalent to a dermatologist. The model is constructed using a pre-trained MobileNet network. The ISIC dataset HAM10000 (Human Against Machine with 10000 training images) was used for training and evaluating the proposed method. A category accuracy of 80.81%, a top-2 accuracy of 91.25%, and a top-3 accuracy of 96.26% were reported in detecting skin lesions.

Nersisson et. al. in [46] proposed a You Only Look Once (YOLO) based-Convolutional Neural Network (CNN) technique to detect skin lesions. The characteristics such as texture and color information collected from the lesion region were combined with the features acquired from the CNN. Then, these characteristics were sent to a Fully Connected Deep Neural Network (FCDNN) that was trained with the international symposium on biomedical imaging (ISBI) Melanoma dataset. The experimental results showed that the suggested approach could enhance the classification accuracy of skin lesions compared to state-of-the-art techniques. The performance improvements included accuracy of 94%, a precision of 85%, recall of 88%, and AUC of 95%.

Kaur et. al. [42] proposed a Deep CNN (DCNN) with an efficient melanoma classification technique to detect benign and malignant melanoma. Dermoscopy images were gathered from the International Skin Imaging Collaboration datastores (e.g., ISIC 2016, ISIC2017, and ISIC 2020). Accuracy, recall, specificity, precision, and F1-score were used to assess the classifier performance. For the ISIC 2020, ISIC 2017, and ISIC 2016 datasets, the suggested classifier achieved an accuracy rate of 90.42%, 88.23%, and 81.41%, respectively. A novel Residual DCNN for melanoma detection is proposed by Hosny and Kassem [34]. Six well-known melanoma datasets (e.g., PH2, ISIC2016, ISIC2018, ISIC2017, MED-NODE (MElanoma Diagnosis from NOn-DErmoscopic images), and DermIS and Quest) were used to train and evaluate the proposed model. Three separate experiments were used to evaluate the proposed neural network accuracy. The first one was conducted using the original dataset pictures with no segmentation or pre-processing. For the second one, the segmented pictures were used to evaluate the suggested model. Finally, the output training model of the second experiment is preserved and utilized as a pre-trained model in the third one. The suggested RDCNN surpassed the current DCNN in terms of overall performance.

Shorfuzzaman et. al. [55] presented a CNN-based stacked ensemble approach for the early detection of melanoma. The transfer learning principle was applied in stacking ensemble learning, where many CNN sub-models were constructed. The classification results were generated by a new model called a meta-learner, which incorporated all the sub-model results. Experimental results demonstrate the performance of the proposed approach with an accuracy of 95.76%, a sensitivity of 96.67%, and an AUC of 95.7%. Even though several publications have focused on identifying melanoma, Kumar et. al. [44] narrows the scope in determining the levels of skin cancer by applying DL techniques. Their proposed algorithm enhanced the accuracy of skin lesion detection and could provide suitable therapy based on the cancer level. State-of-the- art methods such as SVM, Random Forest (RF), and Artificial NN (Neural Networks), as well as the suggested fusion-based DL approach, were tested. Different evaluation criteria such as Accuracy, Mean Square Error (MSE), Precision, Peak Signal to Noise Ratio (PSNR), and Recall were used to evaluate the suggested approach and monitor its validity. The proposed approach achieved an accuracy of 97%. Elansary et. al. [26] presented a CNN-based skin cancer classification method. The proposed model was tested using ISIC 2020. Furthermore, the dataset was extremely imbalanced, with fewer than 2% aggressive instances. For addressing imbalanced data, random oversampling and data augmenting were applied. Furthermore, to construct a classifier that could learn from all classes identically while focusing more on the minority class, the class weight approach was utilized to assign a weight value to each category. EfficientNet-B6 was proposed for melanoma detection. The reported findings revealed that the recommended system accuracy rate was 97.84%.

2.2 Hybrid techniques

Generally, combining two different machine learning techniques resulted in a hybrid one. For example, a hybrid classification model can consist of one unsupervised learner to preprocess the training data and one supervised classifier to learn the clustering result. Additionally, many hybrid techniques integrate CNN with different classifiers to enhance melanoma detection accuracy. A structured scheme for analyzing and assessing the possibilities of melanoma is proposed by Srividhya [61]. The proposed approach performs skin lesion segmentation and feature extraction followed by good performance machine learning techniques to accomplish intensity level adjustment during the pre- processing phase. The correlation parameters were significant in determining the success of recognizing skin tumors and acted as a metric to define the distinct feature set for training CNN to detect melanoma. Sensitivity and Identifying Efficiency (IE) were determined to be 93.3% and 95%, respectively. Gazioglu and Kamasak [31] aimed to study the impact of exterior items (i.e., ruler and hair) and visual quality (i.e., noise, blurring, and brightness) in influencing the result of melanoma detection. Four pre-trained CNN models, namely, AlexNet, ResNet50, DenseNet121, and VGG16, were employed by the authors. DenseNet obtained the best accuracy in blurred and noisy datasets. Accuracies of 89.22% for hair set, 86% for ruler set, and 88.81% for none set were reported.

Cao et. al. [23] proposed a Mixed Skin Lesion MSL image using Mask R-CNN (MSLP-MR) model to perform melanoma detection. Furthermore, they established a Mask-DenseNet melanoma detection system. Their approach merged the concept of ensemble learning with MSLP-CNN to add mask segmentation and integrate many classifiers for weighted predictions. Their proposed approach was validated using the ISIC dataset. The reported accuracy was 90.61%, sensitivity was 78%, specificity was 93.43%, and the Area Under the Curve (AUC) was 95.02%.

In Ilkin et. al. [38], the SVM classifier and a bacterial colony optimization algorithm were integrated to create a hybrid classification to detect melanoma accurately. The proposed technique was tested on two separate datasets, ISIC and PH2. According to ISIC and PH2 results, AUC values of 98%, and 97%, respectively, were achieved. A Fully Transformer Network (FTN) was proposed by He et. al. [32] to learn long-range context data for skin lesion diagnosis. FTN is a hierarchy transformer that uses the Spatial Pyramid Transformer to compute features. Since it integrated a spatial pyramid pooling module with multi-head attention, it requires computing costs. Training and testing were done on ISIC 2018 dataset. Comparing FTN with state-of-the-art techniques, FTN was more efficient and capable of achieving superior results. Patil and Bellary [47] presented two approaches for categorizing the phases of melanoma. In the first approach, melanoma cancer was classified into two stages, namely, stage 1 and stage 2. In the second one, melanoma was classified into three stages, namely, stage 1, stage 2, and stage 3. The suggested framework utilized a CNN method with a loss function of Similarity Measure for Text Processing. Experiments with several loss functions were presented and compared to the suggested loss function. The reported results showed that the suggested technique outperformed many different loss functions.

An approach for classifying skin lesions as either benign or malignant was provided by Sayed et. al. [53]. The ISIC 2020 dataset was used to evaluate the suggested framework. Their research provided a strategy to overcome the extreme category imbalance that existed in the used dataset based on data augmentation and random over-sampling. In addition, a novel hybrid form of CNN architecture with bald eagle search optimization was suggested. The optimization method identified the best settings for the hyperparameters of SqueezeNet. The suggested melanoma cancer predictive algorithm had a 98.37% accuracy rate, 96.47% specificity, 100% sensitivity, 98.40% f-score, and 99% AUC. Gaonkar et. al. [29] proposed a hybrid technique using SVM and Radial Basis Function Network (RBFN) to minimize the complexity and processing time of the classification process. Three main steps (i.e., lesion segmentation, extraction of features, and classification) were applied. SVM and RBFN achieved an accuracy of 87% and 91%, respectively. Specificity and Sensitivity for SVM were 82% and 92%, respectively. While for RBFN, specificity and sensitivity were 90% and 93% respectively.

2.3 Plan of solution

Skin melanoma detection and segmentation is an important and difficult task in medical imaging applications. In the current study, different CNN architectures are utilized to perform the task of skin melanoma classification and segmentation. After evaluating these architectures, transfer learning and Harris Hawks optimization algorithm are used to tone and optimize training parameters and hyperparameters. Finally, different experiments using different performance metrics are performed to report the best architectures.

3 Methodology and suggested approach

The proposed framework for the classification and segmentation phases is shown in Fig. 1. As mentioned in the introduction, this study proposes a hybrid algorithm to perform melanoma detection using different CNN architectures (i.e., VGG16, VGG19, DenseNet169, DenseNet201, and MobileNet) and a metaheuristic optimizer named Harris Hawks optimization algorithm (HHO). In summary, the images are accepted by the input layer. Then, dataset augmentation, scaling, and balancing are employed to preprocess the images. The images can be classified after using the proposed models. The transfer learning and meta- heuristic optimization phase occur. In the following subsections, a discussion about these phases is presented.

3.1 Materials

In the current study, two public datasets are acquired and used from Kaggle (a public dataset repository). For each dataset, a summary of the number of classes, number of images, and the number of images per class are discussed in Table 1. Additionally, Fig. 2 shows samples from the used datasets. The first dataset is named “Melanoma Skin Cancer Dataset of 10000 Images” [39]. It is composed of 10,605 images and partitioned into 2 classes. The second one is named “Skin Cancer ISIC” [41]. It is composed of 2,248 images and partitioned into 9 classes.

Table 1 Summarization of the utilized datasets

An aseptic approach towards skin lesion localization and grading using deep learning and harris hawks optimization

Abstract

Similar content being viewed by others

Skin cancer diagnosis based on deep transfer learning and sparrow search algorithm

Robust optimization of SegNet hyperparameters for skin lesion segmentation

Detection and classification of dermatoscopic images using segmentation and transfer learning

1 Introduction

2 Related studies

2.1 Deep learning-based techniques

2.2 Hybrid techniques

2.3 Plan of solution

3 Methodology and suggested approach

3.1 Materials

3.2 Data scaling

3.3 Segmentation phase using “HMB-MAS”

3.4 Classification phase

3.4.1 VGG model architecture

3.4.2 DenseNet model architecture

3.4.3 MobileNet model architecture

3.5 Learning and optimization

3.5.1 Harris hawks optimization algorithm

3.5.2 Exploration phase

3.5.3 Transition from exploration to exploitation

3.5.4 Exploitation phase

Soft besieg

Hard besiege

Soft besiege with progressive rapid dives

Hard besieges with progressive rapid dives

3.6 Pseudocode of the proposed model

4 Experiments and discussions

4.1 Segmentation experiments

4.2 Learning, classification, and optimization experiments

4.3 Time complexity

4.4 Related studies comparisons

5 Conclusions, limitations and future work

Data Availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interest

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation