Skip to main content

A Machine Learning-Based Traditional and Ensemble Technique for Predicting Breast Cancer

  • Conference paper
  • First Online:
Hybrid Intelligent Systems (HIS 2022)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 647))

Included in the following conference series:

  • 537 Accesses

Abstract

Breast cancer is a physical disease and increasing in recent years. The topic is known widely in the recent world. Most women are suffering from problem of breast cancer. The disease is measured by the differences between normal and affected area ratio and the rate of uncontrolled increase of the tissue. Many studies have been conducted in the past to predict and recognize breast cancer. We have found some good opportunities to improve the technique. We propose predicting the risks and making early awareness using effective algorithm models. Our proposed method can be easily implemented in real life and is suitable for easy breast cancer predictions. The dataset was collected from Kaggle. In our model, we have implemented some different classifiers named Random Forest (RF), Logistic Regression (LR), Gradient Boosting (GB), and K-Nearest Classifier algorithms. Logistic Regression and Random Forest Classifier were performed well with 98.245% testing accuracy. Other algorithms like Gradient Boosting 91.228%, and K-Nearest 92.105% testing accuracy. We also used some different ensemble models to justify the performances. We have used Bagging LRB 94.736%, RFB 94.736%, GBB 95.614%, and KNB 92.105% accuracy, Boosting LRBO 96.491%, RFBO 99.122%, and GBBO 98.218% accuracy, and Voting algorithm LRGK with 95.614% accuracy. We have used hyper-parameter tuning in each classifier to assign the best parameters. The experimental study indicates breast cancer predictions with a higher degree of accuracy and evaluated the findings of other current studies, RFBO with 99.122% accuracy being the best performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Yang, L., Shami, A.: On hyperparameter optimization of machine learning algorithms: theory and practice. Neurocomputing 415, 295–316 (2020)

    Article  Google Scholar 

  2. Khan, F., Kanwal, S., Alamri, S., Mumtaz, B.: Hyper-parameter optimization of classifiers, using an artificial immune network and its application to software bug prediction. IEEE Access 8, 20954–20964 (2020)

    Article  Google Scholar 

  3. Rani, V.M.K., Dhenakaran, S.S.: Classification of ultrasound breast cancer tumor images using neural learning and predicting the tumor growth rate. Multimedia Tools Appl. 79(23–24), 16967–16985 (2019). https://doi.org/10.1007/s11042-019-7487-6

    Article  Google Scholar 

  4. Li, Y., Liu, Y., Zhang, M., Zhang, G., Wang, Z., Luo, J.: Radiomics with attribute bagging for breast tumor classification using multimodal ultrasound images. J. Ultrasound Med. 39(2), 361–371 (2020)

    Article  Google Scholar 

  5. Gómez-Flores, W., Hernández-López, J.: Assessment of the invariance and discriminant power of morphological features under geometric transformations for breast tumor classification. Comput. Meth. Progr. Biomed. 185, article 105173 (2020)

    Google Scholar 

  6. Liu, Y., Ren, L., Cao, X., Tong, Y.: Breast tumors recognition based on edge feature extraction using support vector machine. Biomed. Signal Process. Control 58(101825), 1–8 (2020)

    Google Scholar 

  7. Irfan, R., Almazroi, A.A., Rauf, H.T., Damaševičius, R., Nasr, E.A., Abdelgawad, A.E.: Dilated semantic segmentation for breast ultrasonic lesion detection using parallel feature fusion. Diagnostics 11(7), 1212 (2021)

    Article  Google Scholar 

  8. Lahoura, H., Singh, A., Aggarwal et al.: Cloud computing-based framework for breast cancer diagnosis using extreme learning machine. Diagnostics 11(2), 241 (2021)

    Google Scholar 

  9. Breast Cancer Dataset. https://www.kaggle.com/datasets/yasserh/breast-cancer-dataset

  10. What is Correlation in Machine Learning? https://medium.com/analytics-vidhya/what-is-correlation-4fe0c6fbed47. Accessed: 6 Aug 2020

  11. Mary Gladence, L., Karthi, M., Maria Anu, V.: A statistical comparison of logistic regression and different bayes classification methods for machine learning. ARPN J. Eng. Appl. Sci. 10(14) (2015). ISSN 1819-6608

    Google Scholar 

  12. Logistic Regression for Machine Learning. https://www.capitalone.com/tech/machine-learning/what-is-logistic-regression/. Accessed 6 Aug 2021

  13. Ghosh, P., Karim, A., Atik, S.T., Afrin, S., Saifuzzaman, M.: Expert cancer model using supervised algorithms with a LASSO selection approach. Int. J. Electr. Comput. Eng. (IJECE) 11(3), 2631 (2021)

    Google Scholar 

  14. Nahar, N., Ara, F.: Liver disease prediction by using different decision tree techniques. Int. J. Data Mining Knowl. Manage. Process 8(2), 01–09 (2018)

    Article  Google Scholar 

  15. Aljahdali, S., Hussain, S.N.: Comparative prediction performance with support vector machine and random forest classification techniques. Int. J. Comput. Appl. 69(11) (2013)

    Google Scholar 

  16. Bentéjac, C., Csörgő, A., Martínez-Muñoz, G.: A comparative analysis of gradient boosting algorithms. Artif. Intell. Rev. 54(3), 1937–1967 (2020). https://doi.org/10.1007/s10462-020-09896-5

    Article  Google Scholar 

  17. Drucker, H., Cortes, C., Jackel, L.D., LeCun, Y., Vapnik, V.: Boosting and other ensemble methods. Neural Comput. 6(6), 1289–1301 (1994)

    Article  MATH  Google Scholar 

  18. Pasha, M., Fatima, M.: Comparative analysis of meta learning algorithms for liver disease detection. J. Softw. 12(12), 923–933 (2017)

    Article  Google Scholar 

  19. Wang, Y., Jha, S., Chaudhuri, K.: Analyzing the robustness of nearest neighbors to adversarial examples. In: International Conference on Machine Learning, pp. 5133–5142. PMLR (2018)

    Google Scholar 

  20. Sharma, A., Suryawanshi, A.: A novel method for detecting spam email using KNN classification with spearman correlation as distance measure. Int. J. Comput. Appl. 136(6), 28–35 (2016)

    Google Scholar 

  21. Hou, Z.-H.: Ensemble Methods: Foundations and Algorithms. CRC Press (2012)

    Google Scholar 

  22. Emmens, A., Croux, C.: Bagging and boosting classification trees to predict churn. J. Market. Res. 43(2), 276–286 (2006)

    Google Scholar 

  23. Islam, R., Beeravolu, A.R., Islam, M.A.H., Karim, A., Azam, S., Mukti, S.A.: a performance based study on deep learning algorithms in the efficient prediction of heart disease. In: 2021 2nd International Informatics and Software Engineering Conference (IISEC), pp. 1–6. IEEE (2021)

    Google Scholar 

  24. Tajmen, S., Karim, A., Mridul, A.H., Azam, S., Ghosh, P., Dhaly, A., Hossain, M.N.: A machine learning based proposition for automated and methodical prediction of liver disease. In: The 10th International Conference on Computer and Communications Management in Japan (2022)

    Google Scholar 

  25. Molla, S., et al.: A predictive analysis framework of heart disease using machine learning approaches. Bull. Electr. Eng. Informatics 11(5), 2705–2716 (2022)

    Article  Google Scholar 

  26. Afrin, S., et al.: Supervised machine learning based liver disease prediction approach with LASSO feature selection. Bull. Electr. Eng. Informatics 10(6), 3369–4337 (2021)

    Article  Google Scholar 

  27. Ghosh, P., et al.: Efficient prediction of cardiovascular disease using machine learning algorithms with relief and LASSO feature selection techniques. IEEE Access 9, 19304–19326 (2021)

    Article  Google Scholar 

  28. Jubier Ali, M., Chandra Das, B., Saha, S., Biswas, A.A., Chakraborty, P.: A comparative study of machine learning algorithms to detect cardiovascular disease with feature selection method. In: Skala, V., Singh, T.P., Choudhury, T., Tomar, R., Abul Bashar, M. (Eds.) Machine Intelligence and Data Science Applications. Lecture Notes on Data Engineering and Communications Technologies, vol. 132. Springer, Singapore (2022). https://doi.org/10.1007/978-981-19-2347-0_45

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Md. Jahidul Islam , Asifuzzaman Asif , Mushfiqur Rahman or Mohammad Jahangir Alam .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mridul, A.H., Islam, M.J., Asif, A., Rahman, M., Alam, M.J. (2023). A Machine Learning-Based Traditional and Ensemble Technique for Predicting Breast Cancer. In: Abraham, A., Hong, TP., Kotecha, K., Ma, K., Manghirmalani Mishra, P., Gandhi, N. (eds) Hybrid Intelligent Systems. HIS 2022. Lecture Notes in Networks and Systems, vol 647. Springer, Cham. https://doi.org/10.1007/978-3-031-27409-1_21

Download citation

Publish with us

Policies and ethics