Abstract
The behavior of a machine learning algorithm very much depends upon the structure of entities. There a pool of big data, which have lots of ambiguity, noises, granularity, unlabeled behavior that make it tough to find pattern and visualize an outcome. To solve this issue machine learning proposes an algorithm like clustering, which is mainly part of unsupervised learning. In the proposed model we used the unsupervised learning specifically the clustering algorithms like Expectation Maximization (EM), Make Density Based Analysis (MDBC) wrapped with the clustering algorithm like Kmeans, EM, GenClust++. The dataset has been chosen is a survey data on healthcare issues like chronic diseases, regular treatment etc. across various states and district of India. We have used Weka tool for formulation and training our dataset. This dataset is obtained from the Open Data Government Platform India Portal under the Government of Open Data License India, from the Department of Health and Family Welfare. The dataset contains ordinal type of values. These kinds of dataset are excellent examples of exploratory data analysis. The particle swarm optimization is applied on dataset for optimizing the attributes later trained with clustering algorithms which gave better log likelihood value comparable to the results on simple clustering algorithms. We even compare the performance speed among the algorithms to check how quickly they are able to perform on the variety of attributes.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
A. Samuel, Some studies in machine learning using the game of checkers. IBM J. Res. Dev. 3(3), 210–229 (1959). https://doi.org/10.1147/rd.33.0210
A. Turing, I—Computing machinery and intelligence. Mind LIX(236), 433–460 (1950). https://doi.org/10.1093/mind/lix.236.433
T.M. Mitchell, Machine Learning, vol. 45(37), pp. 870–877 (McGraw Hill, Burr Ridge, IL, 1997)
G. Hinton, T. Sejnowski, Unsupervised Learning: Foundations of Neural Computation (Computational neuroscience). (MIT Press, 1999)
T. Koch, K. Denike, Crediting his critics’ concerns: remaking John Snow’s map of Broad Street cholera, 1854. Soc. Sci. Med. 69(8), 1246–1251 (2009). https://doi.org/10.1016/j.socscimed.2009.07.046
Practical Guide to Clustering Algorithms & Evaluation in R Tutorials & Notes | Machine Learning | HackerEarth. (2019). Retrieved 10 October 2019, from https://www.hackerearth.com/practice/machine-learning/machine-learning-algorithms/clustering-algorithms-evaluation-r/tutorial/
M. Islam, V. Estivill-Castro, M. Rahman, T. Bossomaier, Combining K-Means and a genetic algorithm through a novel arrangement of genetic operators for high quality clustering. Expert Syst. Appl. 91, 402–417 (2018). https://doi.org/10.1016/j.eswa.2017.09.005
J. MacQueen, Some methods for classification and analysis of multivariate observations, in Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1 m No, 14 (1967, June), pp. 281–297
A. Dempster, N. Laird, D. Rubin, Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B (Methodological) 39(1), 1–22 (1977). https://doi.org/10.1111/j.2517-6161.1977.tb01600.x
C. Stover, Log-likelihood function—from Wolfram MathWorld (2019). Retrieved 17 October 2019, from http://mathworld.wolfram.com/Log-LikelihoodFunction.html
M. Rahman, M. Islam, A hybrid clustering technique combining a novel genetic algorithm with K-Means. Knowl.-Based Syst. 71, 345–365 (2014). https://doi.org/10.1016/j.knosys.2014.08.011
Error Sum of Squares (2019). Retrieved 17 October 2019, from https://hlab.stanford.edu/brian/error_sum_of_squares.html
D. van der Merwe, A. Engelbrecht, Data clustering using particle swarm optimization. In: The 2003 Congress on Evolutionary Computation, CEC ‘03 (2003). https://doi.org/10.1109/cec.2003.1299577
M. Alswaitti, M. Albughdadi, N. Isa, Density-based particle swarm optimization algorithm for data clustering. Expert Syst. Appl. 91, 170–186 (2018). https://doi.org/10.1016/j.eswa.2017.08.050
S. Singh, A. Singh, Web-spam features selection using CFS-PSO. Procedia Comput. Sci. 125, 568–575 (2018). https://doi.org/10.1016/j.procs.2017.12.07
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Firdaus, H., Hassan, S.I. (2020). Unsupervised Learning on Healthcare Survey Data with Particle Swarm Optimization. In: Jain, V., Chatterjee, J. (eds) Machine Learning with Health Care Perspective. Learning and Analytics in Intelligent Systems, vol 13. Springer, Cham. https://doi.org/10.1007/978-3-030-40850-3_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-40850-3_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-40849-7
Online ISBN: 978-3-030-40850-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)