Abstract
Breast cancer is one of the leading causes of death by cancer among women. The high mortality rates and the occurrence of this cancer worldwide show the importance of the investigation and development of means for the detection and early diagnosis of this disease. Computer-Aided Detection and Diagnosis systems have been developed to improve diagnostic accuracy by radiologists. This work proposes a method for discriminating patterns of malignancy and benignity of masses in digitized mammography images through the analysis of local features. The method comparatively applies the Scale-Invariant Feature Transform (SIFT), Speed Up Robust Feature (SURF), Oriented Fast and Rotated BRIEF (ORB) and Local Binary Pattern (LBP) descriptors for local feature extraction. These features are represented by a Bag of Features (BoF) model, applied to provide new representations of the data and to reduce its dimensionality. Finally, the features are used as input for the Support Vector Machine (SVM), Adaptive Boosting (Adaboost) and Random Forests (RF) classifiers to differentiate malignant and benign masses. The method obtained significant results, reaching 100% sensitivity, 99.65% accuracy and 99.24% specificity for benign and malignant mass classification.
Similar content being viewed by others
References
Abbas Q (2016) Deepcad: A computer-aided diagnosis system for mammographic masses using deep invariant features. Computers 5(4):28. https://doi.org/10.3390/computers5040028
Abdel-Zaher AM, Eldeib AM (2016) Breast cancer classification using deep belief networks. Expert Syst Appl 46:139–144. https://doi.org/10.1016/j.eswa.2015.10.015
Alahi A, Ortiz R, Vandergheynst P (2012) Freak: fast retina keypoint. In: Conference on IEEE computer vision and pattern recognition (CVPR) 2012. IEEE, pp 510–517
Antani S, Kasturi R, Jain R (2002) A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video. Pattern Recogn 35(4):945–965
Arevalo J, Gonzalez FA, Ramos-Pollan R, Oliveira JL, Lopez MAG (2015) Convolutional neural networks for mammography mass lesion classification.. In: Proceedings of the annual international conference of the IEEE engineering in medicine and biology society, EMBS, pp 797–800. https://doi.org/10.1109/EMBC.2015.7318482
Avila S (2013) Extended bag-of-words formalism for image classification. PhD thesis, Université, Pierre et Marie Curie-Paris VI
Ayed NGB, Masmoudi AD, Masmoudi DS (2014) A new automated CAD system for classification of malignant and benign lesions. Asian J Inf Technol 13(9):477–484
Azar AT, Elshazly HI, Hassanien AE, Elkorany AM (2014) A random forest classifier for lymph diseases. Comput Methods Prog Biomed 113(2):465–473
Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Speeded-up robust features (surf). Comput Vis Image Underst 110(3):346–359
Beura S, Majhi B, Dash R (2015) Mammogram classification using two dimensional discrete wavelet transform and gray-level co-occurrence matrix for detection of breast cancer. Neurocomputing 154:1–14
Braz JG, Cardoso dPA, Corrêa SA, Cesar MdOA (2009) Classification of breast tissues using Moran’s index and Geary’s coefficient as texture signatures and SVM. Comput Biol Med 39(12):1063–1072
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Breiman L, Friedman J, Stone CJ, Olshen RA (1984) Classification and regression trees. CRC Press, Flórida
Brown M, Lowe DG (2002) Invariant features from interest point groups. In: BMVC
Calonder M, Lepetit V, Strecha C, Fua P (2010) Brief: binary robust independent elementary features. Computer Vision–ECCV 2010, vol 6314, pp 778–792
Campos L, Silva A, Barros A (2005) Diagnosis of breast cancer in digital mammograms using independent component analysis and neural networks. Progress in Pattern Recognition, Image Analysis and Applications, vol 3773, pp 460–469
Chatfield K, Lempitsky V, Vedaldi A, Zisserman A (2011) The devil is in the details: an evaluation of recent feature encoding methods. In: British machine vision conference
da Rocha SV, Junior GB, Silva AC, de Paiva AC, Gattass M (2016) Texture analysis of masses malignant in mammograms images using a combined approach of diversity index and local binary patterns distribution. Expert Syst Appl 66:7–19
Dhahbi S, Barhoumi W, Zagrouba E (2015) Breast cancer diagnosis in digitized mammograms using curvelet moments. Comput Biol Med 64:79–90. https://doi.org/10.1016/j.compbiomed.2015.06.012
Don S, Chung D, Revathy K, Choi E, Min D (2012) A new approach for mammogram image classification using fractal properties. Cybern Inf Technol 12 (2):69–83
Everitt B (1998) The Cambridge dictionary of statistics. Cambridge University Press, Cambridge
Fawcett T (2006) An introduction to roc analysis. Pattern Recogn Lett 27 (8):861–874
Freund Y, Schapire RE (1995) A desicion-theoretic generalization of on-line learning and an application to boosting. In: European conference on computational learning theory, Springer, pp 23–37
Freund Y, Schapire RE, et al. (1996) Experiments with a new boosting algorithm. In: Icml, vol 96, pp 148–156
Giger ML (2000) Computer-aided diagnosis of breast lesions in medical images. Comput Sci Eng 2(5):39–45
Görgel P, Sertbas A, Ucan ON (2013) Mammographical mass detection and classification using Local Seed Region Growing-Spherical Wavelet Transform (LSRG-SWT) hybrid scheme. Comput Biol Med 43(6):765–774. https://doi.org/10.1016/j.compbiomed.2013.03.008
Haralick RM, Shanmugam K, et al. (1973) Textural features for image classification. IEEE Trans Syst Man Cybern SMC 3(6):610–621
Harris C, Stephens M (1988) A combined corner and edge detector. In: Alvey vision conference, Citeseer, vol 15, p 50
Harris ZS (1954) Distributional structure. Word 10(2-3):146–162
Hartigan JA, Wong MA (1979) Algorithm as 136: a k-means clustering algorithm. Appl Stat pp 100–108
Haykin S (1994) Neural networks: a comprehensive foundation. Prentice Hall PTR
Heath M, Bowyer K, Kopans D, Kegelmeyer P Jr, Moore R, Chang K, Munishkumaran S (1998) Current status of the digital database for screening mammography. In: Digital mammography. Springer, pp 457–460
Ho TK (1998) The random subspace method for constructing decision forests. IEEE Trans Pattern Anal Mach Intell 20(8):832–844
Hussain M, Khan S, Muhammad G, Ahmad I, Bebis G (2014) Effective extraction of gabor features for false positive reduction and mass classification in mammography. Appl Math Inf Sci 8(1 L):397–412
Kaur P (2016) Mammogram image nucleus segmentation and classification using convolution neural network classifier. Int J Adv Res Ideas Innov Technol 2(5):1–12
Klein G, Murray D (2007) Parallel tracking and mapping for small ar workspaces. In: 6th IEEE and ACM international symposium on mixed and augmented reality, 2007. ISMAR 2007. IEEE, pp 225–234
Lindeberg T (1998) Feature detection with automatic scale selection. Int J Comput Vis 30(2):79–116
Liu X, Tang J (2014) Mass classification in mammograms using selected geometry and texture features, and a new SVM-based feature selection method. IEEE Systems Journal. https://doi.org/10.1109/JSYST.2013.2286539
Lowe DG (1999) Object recognition from local scale-invariant features. In: The proceedings of the 7th IEEE international conference on computer vision, 1999, vol 2. IEEE, pp 1150–1157
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Lu F, Huang J, Zhan K (2015) Boosting classifiers for scene category recognition. J Inf Hiding Multimedia Signal Process 6(4):708–717
Martins LdO, Santos AMd, Silva AC, Paiva AC (2006) Classification of normal, benign and malignant tissues using co-occurrence matrix and bayesian neural network in mammographic images. In: 9th Brazilian symposium on neural networks, 2006. SBRN’06. IEEE, pp 24–29
Massich J, Meriaudeau F, Sentís M, Ganau S, Pérez E, Puig D, Martí R, Oliver A, Martí J (2014) Sift texture description for understanding breast ultrasound images. In: Breast imaging. Springer, pp 681–688
Mikolajczyk K, Schmid C (2001) Indexing based on scale invariant interest points. In: Proceedings of the 8th IEEE international conference on computer vision, 2001. ICCV 2001, vol 1. IEEE, Florida, pp 525–531
Moayedi F, Azimifar Z, Boostani R, Katebi S (2010) Contourlet-based mammography mass classification using the SVM family. Comput Biol Med. https://doi.org/10.1016/j.compbiomed.2009.12.006
Ojala T, Pietikäinen M, Harwood D (1996) A comparative study of texture measures with classification based on featured distributions. Pattern Recogn 29(1):51–59
Peng X, Wang L, Wang X, Qiao y (2016) Bag of visual words and fusion methods for action recognition: comprehensive study and good practice. Comput Vis Image Underst
Rijsbergen CJV (1979) Information retrieval, 2nd edn. Butterworth-Heinemann, Newton
Rocha SVd, Braz Junior G, Corrêa A, Cardoso A, Paiva D, Gattass M (2016) Texture analysis of masses malignant in mammograms images using a combined approach of diversity index and local binary patterns distribution. Expert Syst Appl 66:7–19
Rosin PL (1999) Measuring corner properties. Comput Vis Image Underst 73 (2):291–307
Rosten E, Drummond T (2006) Machine learning for high-speed corner detection. In: Computer vision–ECCV 2006. Springer, pp 430–443
Rothacker L, Rusinol M, Fink GA (2013) Bag-of-features hmms for segmentation-free word spotting in handwritten documents. In: 12th international conference on document analysis and recognition (ICDAR), 2013. IEEE, pp 1305–1309
Rouhi R, Jafari M, Kasaei S, Keshavarzian P (2015) Benign and malignant breast tumors classification based on region growing and CNN segmentation. Expert Syst Appl 42(3):990–1002. https://doi.org/10.1016/j.eswa.2014.09.020
Rublee E, Rabaud V, Konolige K, Bradski G (2011) Orb: an efficient alternative to sift or surf. In: International conference on computer vision (ICCV), 2011. IEEE, pp 2564–2571
Siegel RL, Miller KD, Jemal A (2016) Cancer statistics, 2016. CA: a cancer journal for clinicians 66(1):7–30
Sun H, Sun X, Wang H, Li Y, Li X (2012) Automatic target detection in high-resolution remote sensing images using spatial sparse coding bag-of-words model. IEEE Geosci Remote Sens Lett 9(1):109–113
Thornton C, Hutter F, Hoos HH, Leyton-Brown K (2013) Auto-weka: combined selection and hyperparameter optimization of classification algorithms. In: Proceedings of the 19th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, pp 847–855
Vapnik VN, Vapnik V (1998) Statistical learning theory, vol 1. Wiley, New York
Wajid SK, Hussain A (2015) Local energy-based shape histogram feature extraction technique for breast cancer diagnosis. Expert Syst Appl 42(20):6990–6999. https://doi.org/10.1016/j.eswa.2015.04.057
Wan J, Ruan Q, Li W, Deng S (2013) One-shot learning gesture recognition from rgb-d data using bag of features. J Mach Learn Res 14(1):2549–2582
Wang J, Li Y, Zhang Y, Xie H, Wang C (2011) Bag-of-features based classification of breast parenchymal tissue in the mammogram via jointly selecting and weighting visual words. In: 6th international conference on image and graphics (ICIG), 2011. IEEE, pp 622–627
WHO (2014) Who position paper on mammography screening
Witkin AP (1987) Scale-space filtering. US Patent 4,658,372
Xie X, Li B, Chai X (2015) Kernel-based nonparametric fisher classifier for hyperspectral remote sensing imagery. Algorithms 19(20):21
Xu S, Fang T, Li D, Wang S (2010) Object classification of aerial images with bag-of-visual words. IEEE Geosci Remote Sens Lett 7(2):366–370
Yang J, Jiang YG, Hauptmann AG, Ngo CW (2007) Evaluating bag-of-visual-words representations in scene classification. In: Proceedings of the international workshop on workshop on multimedia information retrieval. ACM, pp 197–206
Yang Y, Newsam S (2010) Bag-of-visual-words and spatial extensions for land-use classification. In: Proceedings of the 18th SIGSPATIAL international conference on advances in geographic information systems. ACM, San Jose, pp 270–279
Zhou L, Zhou Z, Hu D (2013) Scene classification using a multi-resolution bag-of-features model. Pattern Recogn 46(1):424–433
Acknowledgments
The authors thanks CNPq and FAPEMA for the financial support.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Matos, C.E.F., Souza, J.C., Diniz, J.O.B. et al. Diagnosis of breast tissue in mammography images based local feature descriptors. Multimed Tools Appl 78, 12961–12986 (2019). https://doi.org/10.1007/s11042-018-6390-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-6390-x