DF-dRVFL: A novel deep feature based classifier for breast mass classification

Yu, Xiang; Ren, Zeyu; Guttery, David S.; Zhang, Yu-Dong

doi:10.1007/s11042-023-15864-2

DF-dRVFL: A novel deep feature based classifier for breast mass classification

Open access
Published: 11 July 2023

Volume 83, pages 14393–14422, (2024)
Cite this article

Download PDF

You have full access to this open access article

Multimedia Tools and Applications Aims and scope Submit manuscript

DF-dRVFL: A novel deep feature based classifier for breast mass classification

Download PDF

Xiang Yu¹,
Zeyu Ren¹,
David S. Guttery² &
…
Yu-Dong Zhang ORCID: orcid.org/0000-0002-4870-1493¹

886 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

Amongst all types of cancer, breast cancer has become one of the most common cancers in the UK threatening millions of people’s health. Early detection of breast cancer plays a key role in timely treatment for morbidity reduction. Compared to biopsy, which takes tissues from the lesion for further analysis, image-based methods are less time-consuming and pain-free though they are hampered by lower accuracy due to high false positivity rates. Nevertheless, mammography has become a standard screening method due to its high efficiency and low cost with promising performance. Breast mass, as the most palpable symptom of breast cancer, has received wide attention from the community. As a result, the past decades have witnessed the speeding development of computer-aided systems that are aimed at providing radiologists with useful tools for breast mass analysis based on mammograms. However, the main issues of these systems include low accuracy and require enough computational power on a large scale of datasets. To solve these issues, we developed a novel breast mass classification system called DF-dRVFL. On the public dataset DDSM with more than 3500 images, our best model based on deep random vector functional link network showed promising results through five-cross validation with an averaged AUC of 0.93 and an average accuracy of $81.71\%$. Compared to sole deep learning based methods, average accuracy has increased by 0.38. Compared with the state-of-the-art methods, our method showed better performance considering the number of images for evaluation and the overall accuracy.

The Automated Learning of Deep Features for Breast Mass Classification from Mammograms

Fully convolutional network for automated detection and diagnosis of mammographic masses

Article 09 May 2023

Breast mass density categorisation using deep transferred EfficientNet with support vector machines

Article 13 February 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Breast cancer is the most frequent cancer among women in the UK. While most breast cancer diagnoses occur in women over 50, and younger women can also develop breast cancer. About 1 in 8 women are diagnosed with breast cancer during their lifetime, but there is a good chance of recovery if detected early [34]. Therefore, there are lots of researchers who focus on how to early detect breast cancer in the early stage by computer-aided (CAD) systems. CAD systems can be subdivided into computer-aided detection (CADe) systems and computer-aided diagnosis (CADx) systems. The CADe systems are mainly utilized for extracting ROIs from medical images for further analysis tasks. Based on the obtained ROIs, the CADx systems focus on extracting features from the obtained ROIs and make predictions of severity based on the extracted features. There are several challenges to the early detection of breast cancer by CAD systems. Firstly, compared to natural images, mammograms are usually with higher resolutions and larger sizes. The high resolutions are challenging for both the hardware and the performance of the algorithms for diagnosis. Secondly, the anatomical architectures of the organs and the tissues in mammograms are more difficult to recognize and detect than the natural images. Traditional CAD systems cannot easily obtain these features of anatomical architectures.

In recent years, with the development of deep learning, deep learning-based CAD systems achieved great results in solving existing challenges of CAD systems. However, deep learning-based CAD systems also have some limitations for breast mass classification tasks. Firstly, the performance of the deep learning model highly relies on differentiating malignant masses from benign masses. The main differences between benign and malignant masses are the shape and margins of the masses [1]. The shape of a mass can likely be irregular, round, and lobular, while benign masses are more likely to have circumscribed oval and round shapes, and malignant masses tend to have irregular shapes. The margins of the masses can also be subdivided into categories, including microlobulated, obscured, ill-defined, and spiculated. The microlobulated margins describe the scalloped appearance of the mass that is distinct from the breast tissues. The obscured margins indicate the margins of the masses that were partially blocked by adjacent tissue. As a result, the masses in this situation may fail to be differentiated from the breast tissue. The ill-defined margins refer to the margins that are indistinct from the breast tissue. The reason why margins are ill-defined can be the low contrast of the images and the high breast density. The spiculated margins are shown in the form of radiating lines from the breast masses and are usually shown in malignant breast masses. However, malignant masses can also have circumscribed margins with a low possibility that follow-up examinations are required to distinguish those masses. The second limitation is that deep learning models always need sufficient computational power, and the improvements of well-trained models are hard to obtain, although there are powerful computational resources.

Based on the challenges and limitations mentioned above, it is still of great value to design a deep learning-based CADx system for breast mass classification as they can advise radiologists in a relatively short time and facilitate the diagnosis procedures instead of forcing patients to go through painful tissue extraction procedures for biopsy. Aimed at developing a breast mass classification system with promising performance, we proposed to develop a novel deep feature-based model called DF-dRVFL. The main contributions of this study can be concluded as follows:

1.
We developed a high-performance CADx system for breast mass classification based on DF-dRVFL for mammography images. The developed system works on the extracted ROIs using a previously developed breast mass detection system. The developed system consists of three components: model training, feature extraction, and feature classification. In model training, we transferred the state-of-the-art deep models that were pre-trained on ImageNet instead of training the models from scratch. We first removed the top layers of the deep models as they were initially trained for 1,000-class classification and added new fully connected layers for the classification task here. We also added the dropout layers to prevent the trained models from overfitting. After training, we then extracted the features from the trained models as the input of our classifier DF-dRVFL. The experimental results on the public DDSM dataset showed the effectiveness of the developed system and therefore validated the plausibility between the combinations of breast mass feature extraction and classification.
2.
We developed a novel strategy for breast mass classification by introducing a novel hybrid deep learning-based model. In this study, we proposed a VGG19-DF to deploy trained deep learning models as feature extractor instead of relying on hand-crafted features. As mentioned before, the differences between benign masses and malignant masses mainly lie in the shapes and the margins. However, these hand-crafted features are not reliable, and they can mislead the diagnosis results. Instead, we proposed to use deep features that are extracted by a trained deep learning model for mass classification. Compared to hand-crafted features, deep features are more representative and robust, and the models trained with deep features are likely to show higher performance.
3.
We found an efficient method for breast mass classification performance improvement with low computational cost. If well fine-tuned, the performance of deep learning models can be improved step by step. However, the process pf optimizing the settings of hyper-parameters for model optimization can be lengthy. Therefore, there is an unmet need to improve the performance of the classifiers at a minimal cost. Toward this, we proposed to introduce DF-dRVFL for fast performance improvement. Experiments on the public dataset DDSM showed that the novel classifiers could be trained on the dataset consisting of more than 2,000 samples within only a few seconds. More importantly, the performance of the classifiers has also been improved. Considering that breast mass classification is only a special case of classification, we believe the proposed method for efficient performance improvement can also be extended to other scenarios.

The remainder of this paper will be arranged as follows. In Section 2, we will briefly revisit the related works. Then we will present the details of the developed system in Section 3. As mentioned before, we utilized deep learning models as feature extractors in DF-dRVFL. However, some adjustments have to be made to adapt the models for the classification task. The classification task is implemented by a novel classifier called deep random vector functional link network (dRVFL). For comparison, we also take machine learning models, including ELM, RVFLN, and spiking neural network (SNN), as the classifiers. We will present the details of these machine learning models and examine the overall classification performance of these models, where the results will be shown in Section 4. At the beginning of the experiment section, we will introduce the details of the dataset used in this chapter. We then move to parameter settings, followed by the ablation for model refinement. The experiment results will be shown in the last part of this section. We then discuss some related issues in Section 5 and end this paper with the conclusion and future work in Section 6. The abbreviations used in this paper are listed in Table 1.

Table 1 Symbols and meaning

DF-dRVFL: A novel deep feature based classifier for breast mass classification

Abstract

Similar content being viewed by others

The Automated Learning of Deep Features for Breast Mass Classification from Mammograms

Fully convolutional network for automated detection and diagnosis of mammographic masses

Breast mass density categorisation using deep transferred EfficientNet with support vector machines

1 Introduction

2 Related works

3 Methodology

3.1 VGG19-DF: deep feature extractor

3.2 Design of classifiers

3.2.1 ELM classifier

3.2.2 RVFLN classifier

3.2.3 dRVFLN classifier

3.2.4 SNN classifier

4 Experiment

4.1 Dataset

4.2 Settings of the experiment

4.3 Performance of feature extractors

4.4 Model ablation

4.4.1 \(Fea_{24}\)-based mass classification

4.4.2 \(Fea_{64}\)-based mass classification

4.4.3 \(Fea_{24+64}\)-based mass classification

4.5 Method comparison

5 Discussion

6 Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation