Skip to main content

Prediction of Cancer Diagnosis Patients from Fine-Needle Aspirates Using Machine Learning

  • Conference paper
  • First Online:
International Conference on Intelligent Computing and Smart Communication 2019

Part of the book series: Algorithms for Intelligent Systems ((AIS))

Abstract

An experimental study is conducted to predict the malignant diagnosis of patients from 1989 to 1995 donated by Olvi Mangasarian, Computer Science Department, University of Wisconsin, WI. A secondary dataset from the UCI machine library is analyzed using Support Vector Machine (SVM) and Artificial Neural Network (ANN) classifier with feature extraction applied by Principal Component Analysis (PCA). The dataset is trained and tested using hold out and K-fold cross-validation methods. PCA method is used to grab the essential features (7 out of 31) from the huge multivariate dataset and used for training and testing the SVM and ANN models. Model performance is measured on various metrics like accuracy, error, sensitivity, and specificity. All experiments are conducted using the R studio version 1.0.143. During the study, on the preliminary phase, many redundant features of datasets are removed with the data cleaning process and PCA is used to extract the 90% significant features in the prediction task. The finding of the paper shows that PCA played significant role in the enhancement of the prediction accuracy of the ANN and SVM. Also, the ANN classifier outperformed SVM in binary classification. We recommend this study to be used as a real-time prediction on Hospital official website in the future.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. UCI. https://archive.ics.uci.edu/ml/datasets/breast+cancer+wisconsin+%28Diagnostic%29 (2019)

  2. D. Kabakchieva, Student performance prediction by using data mining classification algorithms. Int. J. Comput. Sci. Manag. Res. 1(4), 686–690 (2012)

    Google Scholar 

  3. C. Maria Teresa, Noel R. Maria, Prediction of university student’s academic achievement by linear and logistic models. Spanish J. Psychol. 2(1), 275–288 (2015)

    Google Scholar 

  4. D. Kolo, A decision tree approach for predicting students’ academic performance. Int. J. Educ. Manag. Eng. 5, 12–19 (2015)

    Article  Google Scholar 

  5. C. Verma, Educational data mining to examine mindset of educators towards ICT knowledge. Int. J. Data Min. Emerg. Technol. 7, 53–60 (2017)

    Article  Google Scholar 

  6. B. Deshmukh, A. Patil, B. Pawar, Comparison of classification algorithms using weka on various datasets. Int. J. Comput. Sci. Inf. Technol. 4(2), 85–90 (2011)

    Google Scholar 

  7. R.L. Cheu, D. Srinivasan, E. Tian, Support vector machine models for freeway incident detection. In: Proceedings of the Intelligent Transportation Systems, vol. 1 (IEEE, 2003), pp. 238–243

    Google Scholar 

  8. C. Verma, S. Ahmad, V. Stoffová, Z. Illés, Forecasting residence state of indian student based on responses towards information and communication technology awareness: a primarily outcomes using machine learning. In: International Conference on Innovations in Engineering, Technology and Sciences (IEEE, India, 2018), In Press

    Google Scholar 

  9. C. Verma, V. Stoffová, Z. Illés, S. Dahiya, Binary logistic regression classifying the gender of student towards Computer Learning in European schools. In: The 11th Conference of Ph. D Students in Computer Science (Szeged University, Hungary, 2018), p. 45

    Google Scholar 

  10. C. Verma, V. Stoffová, Z. Illés, An ensemble approach to identifying the student gender towards information and communication technology awareness in European schools using machine learning, Int. J. Eng. Technol. 7, 3392–3396 (2018)

    Google Scholar 

  11. C. Verma, S. Dahiya, Gender difference towards information and communication technology awareness in indian universities. SpringerPlus 5, 1–7 (2016)

    Article  Google Scholar 

  12. C. Verma, Z. Illés, V. Stoffová, Attitude prediction towards ICT and mobile technology for the real-time: an experimental study using machine learning. In: The 15th International Scientific Conference eLearning and Software for Education (University Politehnica of Bucharest, Romania, 2019), In Press

    Google Scholar 

  13. C. Verma, Z. Illés, V. Stoffová, Real-time prediction of development and availability of ICT and mobile technology in Indian and Hungarian University. In: 2nd International Conference on Recent Innovations in Computing (J & K University, India, 2019), In Press

    Google Scholar 

  14. C. Verma, S. Ahmad, V. Stoffová, Z. Illés, M. Singh, National identity predictive models for the real time prediction of european schools students: preliminary results. In: International Conference on Automation, Computational and Technology Management (IEEE, London, 2019), In Press

    Google Scholar 

  15. C. Verma, S. Dahiya, D. Mehta, An analytical approach to investigate state diversity towards ICT: a study of six universities of Punjab and Haryana: Indian. J. Sci. Technol. 9, 1–5 (2016)

    Google Scholar 

  16. S.R. Kalmegh, Comparative analysis of weka data mining algorithm random forest, random tree and lad tree for classification of indigenous news data, Int. J. Emerg. Technol. Adv. Eng. 5(1), 507–517 (2015)

    Google Scholar 

  17. M. Minsky, S. Papert, Perceptrons: An Introduction to Computational Geometry, (MIT Press, 2017)

    Google Scholar 

  18. C. Verma, S. Ahmad, V. Stoffová, Z. Illés, S. Dahiya, Gender prediction of the european school’s teachers using machine learning: preliminary results. In: International Advance Computing Conference (IEEE, India, 2018), pp. 213–220

    Google Scholar 

  19. C. Verma, V. Stoffová, Z. Illés, Age group predictive models for the real-time prediction of the university students using machine learning: preliminary results. In: International Conference on Electrical, Computer and Communication (IEEE, India, 2019), In Press

    Google Scholar 

  20. C. Verma, V. Stoffová, Z. Illés, Rate-Monotonic vs early deadline first scheduling: a review. In: International Conference on Education Technology and Computer Science in Building Better Future (University of Technology and Humanities, Poland, 2018), pp. 188–193

    Google Scholar 

Download references

Acknowledgements

The corresponding author thank the UCI website to provide significant datasets to pursue this research. The authors’ institutions, Bharat Group of College and Eötvös Loránd, did not require ethical committee approval to be granted for this study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Deepak Mehta .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mehta, D., Verma, C. (2020). Prediction of Cancer Diagnosis Patients from Fine-Needle Aspirates Using Machine Learning. In: Singh Tomar, G., Chaudhari, N.S., Barbosa, J.L.V., Aghwariya, M.K. (eds) International Conference on Intelligent Computing and Smart Communication 2019. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-15-0633-8_33

Download citation

Publish with us

Policies and ethics