Skip to main content

Using Feature Selection Techniques to Improve the Accuracy of Breast Cancer Classification

  • Conference paper
  • First Online:
Innovations in Smart Cities Applications Edition 2 (SCA 2018)

Abstract

Classification is a data mining process that aims to divide data into classes to facilitate decision-making; it is therefore an important task in medical field. In this paper we will try to improve the accuracy of the classification of six machines learning algorithms: Bayes Network (BN), Support Vector Machine (SVM), k-nearest neighbors algorithm (Knn), Artificial Neural Network (ANN), Decision Tree (C4.5) and Logistic Regression using feature selection techniques, for breast cancer classification and diagnosis. We examined those methods of classification and techniques of feature selection in WEKA Tool (The Waikato Environment for Knowledge Analysis) using two databases, Wisconsin breast cancer datasets original (WBC) and diagnostic (WBCD) available in UCI machine learning repository.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. «Breast cancer statistics»: World Cancer Research Fund, 22 Aug 2018. Available on: https://www.wcrf.org/dietandcancer/cancer-trends/breast-cancer-statistics

  2. Ganesan, K., Acharya, U.R., Chua, C.K., Min, L.C., Abraham, K.T., Ng, K.-H.: Computer-aided breast cancer detection using mammograms: a review. IEEE Rev. Biomed. Eng. 6, 77–98 (2013)

    Article  Google Scholar 

  3. Aalaei, S., Shahraki, H., Rowhanimanesh, A., Eslami, S.: «Feature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets». Iran J. Basic Med. Sci. 19(5), 7 (2016)

    Google Scholar 

  4. Saabith, A.L.S., Sundararajan, E., Bakar, A.A.: «Comparative Study on Different Classification Techniques for Breast Cancer Dataset», p. 8 (2014)

    Google Scholar 

  5. Gowri A.S. Ramar, D.K.: «A novel approach of feature selection techniques for image dataset». 3(2), 5

    Google Scholar 

  6. Abd El-Hafeez Ibrahim, A., Hashad, A.I., El-Deen Mohamed Shawky, N. Maher, A., Arab Academy for Science, Technology Maritime Transport, Cairo, Egypt: «Robust breast cancer diagnosis on four different datasets using multi-classifiers fusion». Int. J. Eng. Res., V4(03) (Mars 2015)

    Google Scholar 

  7. Hamsagayathri P., Sampath, P.: «Performance analysis of breast cancer classification using decision tree classifiers». Int. J. Curr. Pharm. Res. 9(2), 19 (Mars 2017)

    Article  Google Scholar 

  8. Lavanya, D., Usha Rani K.: «Analysis of feature selection with classification: breast cancer datasets». Indian J. Comput. Sci. Eng. (IJCSE) 2(5), 9 (2011)

    Google Scholar 

  9. Kaur, R.: «Study and comparison of feature selection approaches for intrusion detection». Int. J. Comput. Appl. 7

    Google Scholar 

  10. Mahmood, A.: «Structure Learning of Causal Bayesian Networks: A Survey», p. 6

    Google Scholar 

  11. Han, J., Kamber, M.: Data mining: concepts and techniques, 2nd ed., [Nachdr.]. Elsevier/Morgan Kaufmann, Amsterdam (2010)

    Google Scholar 

  12. Han, J., Kamber, M.: Data mining: concepts and techniques, 3rd edn. Elsevier, Burlington, MA (2011)

    MATH  Google Scholar 

  13. Yusuff, H., Mohamad, N., Ngah, U., Yahaya, A.: «Breast cancer analysis using logistic regression». Int. J. Res. Appl. Stud. 11 (2012)

    Google Scholar 

  14. Negnevitsky, M.: Artificial intelligence: a guide to intelligent systems. 2nd ed. Addison-Wesley, Harlow, England; New York: (2005)

    Google Scholar 

  15. «UCI Machine Learning Repository: Breast Cancer Wisconsin (Original) Data Set»: Available on: https://archive.ics.uci.edu/ml/datasets/breast+cancer+wisconsin+(original)

  16. «UCI Machine Learning Repository: Breast Cancer Wisconsin (Diagnostic) Data Set»: Available on: https://archive.ics.uci.edu/ml/datasets/breast+cancer+wisconsin+(Diagnostic)

  17. «Machine Learning Project at the University of Waikato in New Zealand»: Available in: https://www.cs.waikato.ac.nz/ml/index.html

  18. Saoud, H., Ghadi, A., Ghailani, M.: Analysis of evolutionary trends of incidence and mortality by cancers. In: Ben Ahmed M., Boudhir A. (eds.) Innovations in Smart Cities and Applications. SCAMS 2017. Lecture Notes in Networks and Systems, vol 37. Springer, Cham (2018)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hajar Saoud .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Saoud, H., Ghadi, A., Ghailani, M., Abdelhakim, B.A. (2019). Using Feature Selection Techniques to Improve the Accuracy of Breast Cancer Classification. In: Ben Ahmed, M., Boudhir, A., Younes, A. (eds) Innovations in Smart Cities Applications Edition 2. SCA 2018. Lecture Notes in Intelligent Transportation and Infrastructure. Springer, Cham. https://doi.org/10.1007/978-3-030-11196-0_28

Download citation

Publish with us

Policies and ethics