Abstract
An experimental study is conducted to predict the malignant diagnosis of patients from 1989 to 1995 donated by Olvi Mangasarian, Computer Science Department, University of Wisconsin, WI. A secondary dataset from the UCI machine library is analyzed using Support Vector Machine (SVM) and Artificial Neural Network (ANN) classifier with feature extraction applied by Principal Component Analysis (PCA). The dataset is trained and tested using hold out and K-fold cross-validation methods. PCA method is used to grab the essential features (7 out of 31) from the huge multivariate dataset and used for training and testing the SVM and ANN models. Model performance is measured on various metrics like accuracy, error, sensitivity, and specificity. All experiments are conducted using the R studio version 1.0.143. During the study, on the preliminary phase, many redundant features of datasets are removed with the data cleaning process and PCA is used to extract the 90% significant features in the prediction task. The finding of the paper shows that PCA played significant role in the enhancement of the prediction accuracy of the ANN and SVM. Also, the ANN classifier outperformed SVM in binary classification. We recommend this study to be used as a real-time prediction on Hospital official website in the future.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
UCI. https://archive.ics.uci.edu/ml/datasets/breast+cancer+wisconsin+%28Diagnostic%29 (2019)
D. Kabakchieva, Student performance prediction by using data mining classification algorithms. Int. J. Comput. Sci. Manag. Res. 1(4), 686–690 (2012)
C. Maria Teresa, Noel R. Maria, Prediction of university student’s academic achievement by linear and logistic models. Spanish J. Psychol. 2(1), 275–288 (2015)
D. Kolo, A decision tree approach for predicting students’ academic performance. Int. J. Educ. Manag. Eng. 5, 12–19 (2015)
C. Verma, Educational data mining to examine mindset of educators towards ICT knowledge. Int. J. Data Min. Emerg. Technol. 7, 53–60 (2017)
B. Deshmukh, A. Patil, B. Pawar, Comparison of classification algorithms using weka on various datasets. Int. J. Comput. Sci. Inf. Technol. 4(2), 85–90 (2011)
R.L. Cheu, D. Srinivasan, E. Tian, Support vector machine models for freeway incident detection. In: Proceedings of the Intelligent Transportation Systems, vol. 1 (IEEE, 2003), pp. 238–243
C. Verma, S. Ahmad, V. Stoffová, Z. Illés, Forecasting residence state of indian student based on responses towards information and communication technology awareness: a primarily outcomes using machine learning. In: International Conference on Innovations in Engineering, Technology and Sciences (IEEE, India, 2018), In Press
C. Verma, V. Stoffová, Z. Illés, S. Dahiya, Binary logistic regression classifying the gender of student towards Computer Learning in European schools. In: The 11th Conference of Ph. D Students in Computer Science (Szeged University, Hungary, 2018), p. 45
C. Verma, V. Stoffová, Z. Illés, An ensemble approach to identifying the student gender towards information and communication technology awareness in European schools using machine learning, Int. J. Eng. Technol. 7, 3392–3396 (2018)
C. Verma, S. Dahiya, Gender difference towards information and communication technology awareness in indian universities. SpringerPlus 5, 1–7 (2016)
C. Verma, Z. Illés, V. Stoffová, Attitude prediction towards ICT and mobile technology for the real-time: an experimental study using machine learning. In: The 15th International Scientific Conference eLearning and Software for Education (University Politehnica of Bucharest, Romania, 2019), In Press
C. Verma, Z. Illés, V. Stoffová, Real-time prediction of development and availability of ICT and mobile technology in Indian and Hungarian University. In: 2nd International Conference on Recent Innovations in Computing (J & K University, India, 2019), In Press
C. Verma, S. Ahmad, V. Stoffová, Z. Illés, M. Singh, National identity predictive models for the real time prediction of european schools students: preliminary results. In: International Conference on Automation, Computational and Technology Management (IEEE, London, 2019), In Press
C. Verma, S. Dahiya, D. Mehta, An analytical approach to investigate state diversity towards ICT: a study of six universities of Punjab and Haryana: Indian. J. Sci. Technol. 9, 1–5 (2016)
S.R. Kalmegh, Comparative analysis of weka data mining algorithm random forest, random tree and lad tree for classification of indigenous news data, Int. J. Emerg. Technol. Adv. Eng. 5(1), 507–517 (2015)
M. Minsky, S. Papert, Perceptrons: An Introduction to Computational Geometry, (MIT Press, 2017)
C. Verma, S. Ahmad, V. Stoffová, Z. Illés, S. Dahiya, Gender prediction of the european school’s teachers using machine learning: preliminary results. In: International Advance Computing Conference (IEEE, India, 2018), pp. 213–220
C. Verma, V. Stoffová, Z. Illés, Age group predictive models for the real-time prediction of the university students using machine learning: preliminary results. In: International Conference on Electrical, Computer and Communication (IEEE, India, 2019), In Press
C. Verma, V. Stoffová, Z. Illés, Rate-Monotonic vs early deadline first scheduling: a review. In: International Conference on Education Technology and Computer Science in Building Better Future (University of Technology and Humanities, Poland, 2018), pp. 188–193
Acknowledgements
The corresponding author thank the UCI website to provide significant datasets to pursue this research. The authors’ institutions, Bharat Group of College and Eötvös Loránd, did not require ethical committee approval to be granted for this study.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mehta, D., Verma, C. (2020). Prediction of Cancer Diagnosis Patients from Fine-Needle Aspirates Using Machine Learning. In: Singh Tomar, G., Chaudhari, N.S., Barbosa, J.L.V., Aghwariya, M.K. (eds) International Conference on Intelligent Computing and Smart Communication 2019. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-15-0633-8_33
Download citation
DOI: https://doi.org/10.1007/978-981-15-0633-8_33
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-0632-1
Online ISBN: 978-981-15-0633-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)