Missing Value Imputation with Unsupervised Kohonen Self Organizing Map

Singh, Ninni; Javeed, Anum; Chhabra, Sheenu; Kumar, Pardeep

doi:10.1007/978-81-322-2550-8_7

Ninni Singh⁴,
Anum Javeed⁴,
Sheenu Chhabra⁴ &
…
Pardeep Kumar⁴

1072 Accesses
8 Citations

Abstract

Many data mining and data analysis techniques function with large datasets. These large data sets have missing values which result in biased estimates, imprecise statistical results or unacceptable conclusions. Data mining and data analysis techniques cannot be directly applied to datasets with missing values. For this purpose, different imputation techniques are proposed by different authors for both categorical and continuous variables. The existing imputation techniques have many limitations such as (a) methods like conditional mean imputation results in biased parameter estimation. (b) Too much variation is discovered in the inference of any single value or distance between particular samples in the case of random draw imputation. (c) In case of multiple imputations it is not easy to determine the posterior distribution of samples to draw from. In this paper, we present an unsupervised learning technique based on a Kohonen self-organizing map used for both categorical and numerical data values. In this paper, our aim is to achieve the highest accuracy. To achieve this, we trained our model by using the splitting approach to make the learning model and use this model to predict the accuracy. The proposed algorithm can map the missing values closed to original by adjusting the weights by improving accuracy when compared to classification without missing values and with missing values.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Influence of Data Distribution in Missing Data Imputation

Imputation of Multivariate Attribute Values in Big Data

kDMI: A Novel Method for Missing Values Imputation Using Two Levels of Horizontal Partitioning in a Data set

References

Farhangfar, A., Kurgan, L.A., Pedrycz, W.: A novel framework for imputation of missing values in databases. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 37(5), 692–709 (2007)
Article Google Scholar
Amit, G., Monica, L.: The weight decay back-propagation for generalizations with missing values. Springer Ann. Oper. Res. 78, 165–187 (1998)
Article MATH Google Scholar
Mohammad-sale, J., Shola, Brain: Improved neural network performance using principle component analysis on Mat lab. J. comput. internet manage. 6(2), 1–8 (2008)
Google Scholar
Ennett, C.M., Frize, M., Robin C., Walker, Influence of Missing Values on Artificial Neural Network Performance, In: Proceedings of Med info, pp. 449–453 (2001)
Google Scholar
Josse, J., Husson, F.: Handling missing values in exploratory multivariate data analysis methods. Int. Joint Conf. Neural Networks (IJCNN) 153(2), 1–10 (2012)
MathSciNet Google Scholar
Farhangfara, A.: Impact of imputation of missing values on classification error for discrete data mining. Elsevier J. Pattern Recognit. 41(12), 3692–3705 (2008)
Article Google Scholar
Shivnandam, S.N., Deepak, S.N.: Computing missing values using different clustering techniques. Int. J. Adv. Res. Artif. Intell. (IJARAI). 2(2), (2012)
Google Scholar
Sathya, R., Abraham, A.: Comparison of Supervised and Unsupervised Learning Algorithms for Pattern Classification. Int. J. Adv. Res. Artif. Int. (IJARAI) 2(2), 34–38 (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Jaypee University of Information Technology, Waknaghat, India
Ninni Singh, Anum Javeed, Sheenu Chhabra & Pardeep Kumar

Authors

Ninni Singh
View author publications
You can also search for this author in PubMed Google Scholar
Anum Javeed
View author publications
You can also search for this author in PubMed Google Scholar
Sheenu Chhabra
View author publications
You can also search for this author in PubMed Google Scholar
Pardeep Kumar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ninni Singh .

Editor information

Editors and Affiliations

Nitte Meenakshi Institute of Technology, Bangalore, Karnataka, India
N. R. Shetty
Dept of Master of Computer Application, Nitte Meenakshi Institute of Technology, Bangalore, Karnataka, India
N.H. Prasad
Nitte Meenakshi Institute of Technology, Bangalore, Karnataka, India
N. Nalini

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Singh, N., Javeed, A., Chhabra, S., Kumar, P. (2015). Missing Value Imputation with Unsupervised Kohonen Self Organizing Map. In: Shetty, N., Prasad, N., Nalini, N. (eds) Emerging Research in Computing, Information, Communication and Applications. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2550-8_7

Download citation

DOI: https://doi.org/10.1007/978-81-322-2550-8_7
Published: 18 July 2015
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2549-2
Online ISBN: 978-81-322-2550-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Missing Value Imputation with Unsupervised Kohonen Self Organizing Map

Abstract

Access this chapter

Similar content being viewed by others

Influence of Data Distribution in Missing Data Imputation

Imputation of Multivariate Attribute Values in Big Data

kDMI: A Novel Method for Missing Values Imputation Using Two Levels of Horizontal Partitioning in a Data set

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Missing Value Imputation with Unsupervised Kohonen Self Organizing Map

Abstract

Access this chapter

Similar content being viewed by others

Influence of Data Distribution in Missing Data Imputation

Imputation of Multivariate Attribute Values in Big Data

kDMI: A Novel Method for Missing Values Imputation Using Two Levels of Horizontal Partitioning in a Data set

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation