Abstract
Breast cancer has been recorded one of the most common disease among the females worldwide. It is the most common type of melanoma and responsible for the increasing mortality rate. The accurate and timely diagnosis of this fatal disease is particularly important to increase the chances of survival of the patients. There have been several implementations in this area, and numerous machine learning and soft computing algorithms have been proposed to analyze and detect the cancer and help the doctors to provide right medications on time. In this paper, a comparative analysis of prominent machine learning techniques is done, and their performance is evaluated. The techniques used for detecting and diagnosing the breast cancer are support vector machines (SVM), random forest, and k-nearest neighbor (k-NN). All experiments have been conducted with R programming software (data mining tool). The outcomes indicated that k-NN has achieved the highest accuracy (97.32%) in comparison with SVM, RF.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
World Health Organization. World Health Statistics 2018: Monitoring Health for the SDGs Sustainable Development Goals. World Health Organization (2018)
Mao, N., Yin, P., Wang, Q., et al.: Added value of radiomics on mammography for breast cancer diagnosis: a feasibility study. J. Am. Coll. Radiol. 16(4), 485–491 (2019)
Michie, E.D., Spiegelhalter, D.J., Taylor C.C.: Machine learning, neural and statistical classification. In: Proceeding (1994)
Han, J., Kamber, M.: Data mining: concepts and techniques, 3rd edn, pp. 5–6. Elsevier, USA (2012)
William, H., Wolberg, W., Street, N., Mangasarian, O.L.: Breast cancer Wisconsin (diagnostic) data set. UCI Machine Learning Repository. http://archive.ics.uci.edu/ml/ (1992)
Chaurasia, V., Tiwari, B.B., Pal, S.: Prediction of benign and malignant breast cancer using data mining techniques. J. Algorithms Comput. Technol.
Wang H, Yoon, S.W.: Breast cancer prediction using data mining method. In: Proceedings of the 2015 Industrial and Systems Engineering Research Conference
Sarvestan, A.S., Safavi, A.A., Parandeh, M.N., Salehi, M.: Predicting breast cancer survivability using data mining techniques. In: Software Technology and Engineering (ICSTE), 2nd International Conference, vol. 2, pp. 227–231 (2010)
Karabatak, M., Ince, M.C.: An expert system for detection of breast cancer based on association rules and neural network. Expert Syst. Appl. 36(2), 3465–3469 (2009)
Delen, D., Walker, G., Kadam, A.: Predicting breast cancer survivability: a comparison of three data mining methods. Artif. Intell. Med. 34(2), 113–127 (2005)
Pena-Reyes, C.A., Sipper, M.: A fuzzy-genetic approach to breast cancer diagnosis. Artif. Intell. Med. 17(2), 131–155 (1999)
Khan, M.U., Choi, J.P., Shin, H., Kim, M.: Predicting breast cancer survivability using fuzzy decision trees for personalized healthcare. In: Proceedings of IEEE Engineering in Medicine and Biology Society, pp. 48–51 (2008)
Mourya, A.K.: Performance and evaluation of support vector machine and artificial neural network over heterogeneous data. In: Santosh, K., Hegadi, R. (eds.) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2018. Communications in Computer and Information Science, vol. 1035. Springer, Singapore. https://doi.org/10.1007/978-981-13-9181-1_51 (2019)
Chen, H.L., Yang, B., Liu, J., Liu, D.Y.: A support vector machine classifier with rough setbased feature selection for breast cancer diagnosis. Expert Syst. Appl. 38(7), 9014–9022 (2011)
Lavanya, D.: Ensemble decision tree classifier for breast cancer data. Int. J. Inf. Technol. Convergence Serv. 2(1), 17–24 (2012)
Gandhi Rajiv, K., Marcus, K., Kannan, S.: Classification rule construction using particle swarm optimization algorithm for breast cancer datasets. In: Signal Acquisition and Processing. ICSAP, International Conference, pp. 233–237 (2010)
Masmoudi, Y., Chabchoub, H., Hanafi, S., Rebaï, A.: A mathematical programming based procedure for breast cancer classification. J. Math. Model. Algorithms 9(3), 247–255 (2010)
Pawlovsky, A.P., Nagahashi, M.: A method to select a good setting for the k-NN algorithm when using it for breast cancer prognosis. In: Proceedings of IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), Valencia (2014)
Speiser, J.L., Miller, M.E., Tooze, J., Ip, E.: A comparison of random forest variable selection methods for classification prediction modeling. Expert Syst. Appl. 134, 93–101 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kumar, P., Bhatnagar, A., Jameel, R., Mourya, A.K. (2021). Machine Learning Algorithms for Breast Cancer Detection and Prediction. In: Das, S., Mohanty, M.N. (eds) Advances in Intelligent Computing and Communication. Lecture Notes in Networks and Systems, vol 202. Springer, Singapore. https://doi.org/10.1007/978-981-16-0695-3_14
Download citation
DOI: https://doi.org/10.1007/978-981-16-0695-3_14
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-0694-6
Online ISBN: 978-981-16-0695-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)