Abstract
Most machine learning algorithms are based on the assumption that available data are completely known, nevertheless, real world data sets are often incomplete. For this reason, the ability of handling missing values has become a fundamental requirement for statistical pattern recognition. In this article, a new proposal to impute missing values with deep networks is analyzed. Besides the real missing values, the method introduces a percentage of artificial missing (‘deleted values’) using the true values as targets. Empirical results over several UCI repository datasets show that this method is able to improve the final imputed values obtained by other procedures used as pre-imputation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
García-Laencina, P.J., Sancho-Gómez, J.L., Figueiras-Vidal, A.: Pattern classification with missing data: a review. Neural Comput. Appl. 9(1), 1–12 (2009)
Batista, G., Monard, M.C.: An analysis of four missing data treatment methods for supervised learning. Appl. Artif. Intell. 17, 519–533 (2003)
Luengo, J., García, S., Herrera, F.: On the choice of the best imputation methods for missing values considering three groups of classification methods. Knowl. Inf. Syst. 32(1), 77–108 (2012)
Vincent, P., Larochelle, H., Lajoie, I., Bengio, Y., Manzagol, P.-A.: Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. J. Mach. Learn. Res. 11(11), 3371–3408 (2010)
Bengio, Y.: Learning deep architectures for AI. Technical report, Dept. IRO, Universite de Montreal (2009)
Deng, L., Yu, D.: Deep learning: methods and applications. Found. Trends Sig. Process. 7(3–4), 197–387 (2014)
Bengio, Y., Lamblin, P., Popovici, D., Larochelle, H.: Greedy layer-wise training of deep networks. Adv. Neural Inf. Process. Syst. 19, 153 (2007). (NIPS06), (B. Scholkopf, J. Platt, and T. Hoffman, eds.)
Lichman, M.: UCI machine learning repository (2013). http://archive.ics.uci.edu/ml
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Sánchez-Morales, A., Sancho-Gómez, JL., Figueiras-Vidal, A.R. (2017). Values Deletion to Improve Deep Imputation Processes. In: Ferrández Vicente, J., Álvarez-Sánchez, J., de la Paz López, F., Toledo Moreo, J., Adeli, H. (eds) Biomedical Applications Based on Natural and Artificial Computing. IWINAC 2017. Lecture Notes in Computer Science(), vol 10338. Springer, Cham. https://doi.org/10.1007/978-3-319-59773-7_25
Download citation
DOI: https://doi.org/10.1007/978-3-319-59773-7_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59772-0
Online ISBN: 978-3-319-59773-7
eBook Packages: Computer ScienceComputer Science (R0)