Abstract
Cervical cancer is the fourth most common and one of the deadliest types of cancer in the female population. However, it does not show any symptoms in the first stage. Therefore, early diagnosis is very difficult. On the other hand, there are some risk factors directly or indirectly associated with it. The effect of these risk factors on predicting the diagnosis of cervical cancer was investigated with machine learning-based algorithms and promising results were obtained. However, studies on which risk factors are more effective than others are scarce. This study focuses on investigating how the selection of promising factors associated with cervical cancer will affect the predictive ability of machine learning-based classification algorithms such as Gaussian Naive Bayes (GNB), k Nearest Neighbor and Decision Tree. ANOVA F-test is applied to evaluate each risk factor independently according to the desired class. In order to ensure the reliability of the results, K-fold cross validation technique is used at different K values. In the selected cervical cancer behavioral risk dataset, the GNB algorithm showed the highest performance (%94) with eight risk factors for all K values.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Alam, T.M., Khan, M.M.A., Iqbal, M.A., Abdul, W., Mushtaq, M.: Cervical cancer prediction through different screening methods using data mining. (IJACSA) Int. J. Adv. Comput. Sci. Appl. 10(2), 388–396 (2019)
Deng, X., Luo, Y., Wang, C.: Analysis of risk factors for cervical cancer based on machine learning methods. In: 5th IEEE International Conference on Cloud Computing and Intelligence Systems (CCIS), pp. 631–635. IEEE (2018)
Shetty, A., Shah, V.: Survey of cervical cancer prediction using machine learning: a comparative approach. In: 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT), pp. 1–6. IEEE (2018)
Ozaslan, I.N., Degirmenci, A., Karal, O.: Tourism demand forecasting for Turkey by using Adaboost algorithm. In: Innovations in Intelligent Systems and Applications Conference (ASYU), pp. 1–5. IEEE (2022)
Apaydin, M., Yumus, M., Degirmenci, A., Kesikburun, S., Karal, O.: Deep convolutional neural networks using U-net for automatic intervertebral disc segmentation in axial MRI. In: Innovations in Intelligent Systems and Applications Conference (ASYU), pp. 1–6. IEEE (2022)
Muttaqi, M., Degirmenci, A., Karal, O.: US accent recognition using machine learning methods. In: Innovations in Intelligent Systems and Applications Conference (ASYU), pp. 1–6. IEEE (2022)
Apaydin, M., Yumuş, M., Değirmenci, A., Karal, Ö.: Evaluation of air temperature with machine learning regression methods using Seoul City meteorological data. Pamukkale Üniversitesi Mühendislik Bilimleri Dergisi 28(5), 737–747 (2022)
Degirmenci, A., Karal, O.: Robust incremental outlier detection approach based on a new metric in data streams. IEEE Access 9, 160347–160360 (2021)
Degirmenci, A., Karal, O.: Efficient density and cluster based incremental outlier detection in data streams. Inf. Sci. 607, 901–920 (2022)
Degirmenci, A., Karal, O.: iMCOD: Incremental multi-class outlier detection model in data streams. Knowl. Based Syst. 258, 109950 (2022)
Machmud, R., Wijaya, A.: Behavior determinant based cervical cancer early detection with machine learning algorithm. Adv. Sci. Lett. 22(10), 3120–3123 (2016)
Oyelakin, A.M., Muhammed-Thani, S., Salau-Ibrahim, T.T., Rilwan, D.M.: Performance analysis of selected machine learning algorithms for the detection of cervical cancer based on behavioral risk dataset. Int. J. Inf. Secur. Priv. Digit. Forensics 5(1), 15–21 (2021)
Midyanti, D.M., Bahri, S., Midyanti, H.I.: ADALINE neural network for early detection of cervical cancer based on behaviour determinant. Sci. J. Inform. 8(2), 283–288 (2021)
Gamara, R.P.C., Neyra, R.Q., Recto, K.H.A.: Behavior-based early cervical cancer risk detection using artificial neural networks. In: 13th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM), pp. 1–6. IEEE (2021)
Nilnumpetch, C., Amornsamankul, S., Kraipeerapun, P.: Cancer prediction using cascade generalization and duo output neural network. In: Proceedings of the Sixth International Conference on Research in Intelligent and Computing, pp. 65–70 (2021)
Curia, F.: Cervical cancer risk prediction with robust ensemble and explainable black boxes method. Heal. Technol. 11(4), 875–885 (2021)
Alpan, K.: Performance evaluation of classification algorithms for early detection of behavior determinant based cervical cancer. In: 5th International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT), pp. 706–710. IEEE (2021)
Ratul, I.J., Al-Monsur, A., Tabassum, B., Ar-Rafi, A.M., Nishat, M.M., Faisal, F.: Early risk prediction of cervical cancer: A machine learning approach. In: 19th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), pp. 1–4. IEEE (2022)
Cicek, İB., İlhami, S.E.L., Yağin, F.H., Colak, C.: Development of a Python-based classification web interface for independent datasets. Balkan J. Electr. Comput. Eng. 10(1), 91–96 (2022)
Akter, L., Islam, M., Al-Rakhami, M.S., Haque, M.: Prediction of cervical cancer from behavior risk using machine learning techniques. SN Comput. Sci. 2(3), 1–10 (2021)
UCI Machine Learning Repository: Cervical Cancer Behavior Risk Data Set. https://archive.ics.uci.edu/ml/datasets/Cervical+Cancer+Behavior+Risk. Accessed 21 Jan 2022
Aggarwal, C. C., Hinneburg, A., Keim, D. A.: On the surprising behavior of distance metrics in high dimensional space. In: Van den Bussche, J., Vianu, V. (eds.) ICDT 2001. LNCS, vol. 1973, pp. 420–434. Springer, Heidelberg (2001). https://doi.org/10.1007/3-540-44503-X_27
Onder, Z., Degirmenci, A., Karal, O.: Estimating breakpoints in piecewise linear regression using machine learning methods. In: Innovations in Intelligent Systems and Applications Conference (ASYU), pp. 1–6. IEEE (2022)
Saritas, M.M., Yasar, A.: Performance analysis of ANN and Naive Bayes classification algorithm for data classification. Int. J. Intell. Syst. Appl. Eng. 7(2), 88–91 (2019)
Pintas, J.T., Fernandes, L.A.F., Garcia, A.C.B.: Feature selection methods for text classification: a systematic literature review. Artif. Intell. Rev. 54(8), 6149–6200 (2021)
St, L., Wold, S.: Analysis of variance (ANOVA). Chemom. Intell. Lab. Syst. 6(4), 259–272 (1989)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Çakır, M., Degirmenci, A., Karal, O. (2023). Exploring the Behavioural Factors of Cervical Cancer Using ANOVA and Machine Learning Techniques. In: Mirzazadeh, A., Erdebilli, B., Babaee Tirkolaee, E., Weber, GW., Kar, A.K. (eds) Science, Engineering Management and Information Technology. SEMIT 2022. Communications in Computer and Information Science, vol 1808. Springer, Cham. https://doi.org/10.1007/978-3-031-40395-8_18
Download citation
DOI: https://doi.org/10.1007/978-3-031-40395-8_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-40394-1
Online ISBN: 978-3-031-40395-8
eBook Packages: Computer ScienceComputer Science (R0)