Abstract
One of the major causes of infertility across the world is Polycystic Ovary Syndrome (PCOS) which has affected a large proportion of women. Its symptoms include hormone imbalance, irregular menstrual cycle, excess acne, weight gain and many more. If overlooked, the disease can lead to serious health problems like obstructive sleep apnea, type-2 diabetes, obesity, heart disease, mood disorders, etc. Thus, early detection of PCOS is important. This paper proposes a system for early prediction of PCOS using optimized machine learning classifiers. Extensive EDA was done on the dataset and out of 45 available features, 6 were selected using the Pearson statistical correlation coefficient. In addition to the EDA performed, oversampling was also done in order to handle the class imbalance problem by increasing the number of minority class samples. Optimized machine learning classifiers are classifiers with the best set of available hyperparameters that helps us achieve the highest prediction accuracy. Among the set of 11 optimized classifiers, Optimized KNN gave the best prediction accuracy of 0.995.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Umland EM, Weinstein LC, Buchanan EM (2011) Pharmacotherapy: A pathophysiologic approach
Mayo Clinic PCOS webpage. https://www.mayoclinic.org/diseases-conditions/pcos/symptoms-causes/syc-20353439. Last Accessed on 16 Feb 22
NICHD PCOS Factsheet. https://www.nichd.nih.gov/health/topics/factsheets/pcos. Last Accessed on 16 Feb 22
Lin LH, Baracat MC, Maciel GA, Soares JM Jr, Baracat EC (2013) Androgen receptor gene polymorphism and polycystic ovary syndrome. Int J Gynecol Obstet 120(2):115–118
Williams T, Mortada R, Porter S (2016) Diagnosis and treatment of polycystic ovary syndrome. Am Fam Physician 94(2):106–113
Chauhan P, Patil P, Rane N, Raundale P, Kanakia H (2021) Comparative analysis of machine learning algorithms for prediction of PCOS. In: 2021 International conference on communication information and computing technology (ICCICT). pp 1–7. https://doi.org/10.1109/ICCICT50803.2021.9510101
Prapty AS, Shitu TT (2020) An efficient decision tree establishment and performance analysis with different machine learning approaches on polycystic ovary syndrome. In: 2020 23rd International conference on computer and information technology (ICCIT). pp 1–5. https://doi.org/10.1109/ICCIT51783.2020.9392666
Abu Adla YA, Raydan DG, Charaf MZJ, Saad RA, Nasreddine J, Diab MO (2021) Automated detection of polycystic ovary syndrome using machine learning techniques. In: 2021 Sixth international conference on advances in biomedical engineering (ICABME). pp 208–212. https://doi.org/10.1109/ICABME53305.2021.9604905
Nabi N, Islam S, Khushbu SA, Masum AKM (2021) Machine learning approach: detecting polycystic ovary syndrome & its impact on Bangladeshi women. In: 2021 12th International conference on computing communication and networking technologies (ICCCNT). pp 1–7. https://doi.org/10.1109/ICCCNT51525.2021.9580143
Khan Inan MS, Ulfath RE, Alam FI, Bappee FK, Hasan R (2021) Improved sampling and feature selection to support extreme gradient boosting for pcos diagnosis. In: 2021 IEEE 11th annual computing and communication workshop and conference (CCWC). pp 1046–1050. https://doi.org/10.1109/CCWC51732.2021.9375994
Bharati S, Podder P, Hossain Mondal MR (2020) Diagnosis of polycystic ovary syndrome using machine learning algorithms. In: 2020 IEEE region 10 symposium (TENSYMP). pp 1486–1489. https://doi.org/10.1109/TENSYMP50017.2020.9230932
Denny A, Raj A, Ashok A, Ram CM, George R (2019) i-hOPE: Detection and prediction system for polycystic ovary syndrome (pcos) using machine learning techniques. In: TENCON 2019–2019 IEEE region 10 conference (TENCON). pp 673–678. https://doi.org/10.1109/TENCON.2019.8929674
Sajwan V, Ranjan R (2022) A novel feature descriptor: color texture description with diagonal local binary patterns using new distance metric for image retrieval. In: Congress on intelligent systems. Springer, Singapore, pp 17–26
Altaf I, Butt MA, Zaman M (2022) Machine learning techniques on disease detection and prediction using the hepatic and lipid profile panel data. In: Congress on intelligent systems. Springer, Singapore, pp 189–203
Oza A, Bokhare A (2022) Diabetes prediction using logistic regression and k-nearest neighbor. In: Congress on intelligent systems. Springer, Singapore, pp 407–418
Kaggle Dataset. https://www.kaggle.com/prasoonkottarathil/polycystic-ovary-syndrome-pcos. Last Accessed on 22 Oct 21
He H, Ma Y (2013) Imbalanced learning: Foundations, algorithms, and applications, 1st edn. Wiley-IEEE Press
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Vora, K., Shah, A., Shah, N., Verma, P. (2023). Prediction of Polycystic Ovary Syndrome (PCOS) Using Optimized Machine Learning Classifiers. In: Mathur, G., Bundele, M., Tripathi, A., Paprzycki, M. (eds) Proceedings of 3rd International Conference on Artificial Intelligence: Advances and Applications. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-19-7041-2_1
Download citation
DOI: https://doi.org/10.1007/978-981-19-7041-2_1
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-7040-5
Online ISBN: 978-981-19-7041-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)