Abstract
In some research situations, we often have to classify data with incomplete values which affect the learning performance of classifiers. Although various classification algorithms have been proposed, most of them are short of the ability to deal with incomplete data. This paper proposes a novel approach based on selective ensemble for classifying incomplete data. The method finds the local complete patterns for which the feature values are complete and trains multiple component learners for each local complete subset. Then, it combines the outputs of the classifiers. The method needs no assumption about the incomplete mechanism that is necessary for previous methods. The proposed method is evaluated by three datasets from the UCI Machine Learning Repository. The experiments results show that classification accuracy of the proposed method is superior to those of widely used imputations and deletion method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chen, J., Huang, H., Tian, F., Tian, S.: A Selective Bayes Classifier for Classifying Incomplete Data Based on Gain Ratio. Knowledge-Based Systems 21, 530–534 (2008)
Quinlan, J.R.: C4.5 Programs for Machine Learning. Morgan Kaufman, San Francisco (1993)
Qu, H., Mao, L., Wang, J.: Method for Self-Extracting Diagnostic Rules of Blood Stasis Syndrome Based on Decision Tree. Chinese Journal of Biomedical Engineering 24, 709–711, 727 (2005)
Boujelben, M.A., Smet, Y.D., Frikha, A., Chabchoub, H.: Building A Binary Outranking Relation in Uncertain, Imprecise and Multi-experts Contexts: The application of Evidence Theory. International Journal of Approximate Reasoning 50, 1259–1278 (2009)
Salama, A.S.: Topological Solution of Missing Attribute Values Problem in Incomplete Information Tables. Information Sciences 180, 631–639 (2010)
Huang, Z., Li, J., Su, H., Watts, G.S., Chen, H.: Large-scale Regulatory Network Analysis from Microarray Data: Modified Bayesian Network Learning and Association Rule Mining. Decision Support Systems 43, 1207–1225 (2007)
Williams, D., Liao, X., Xue, Y., Carin, L., Krishnapuram, B.: On Classification with Incomplete Data. IEEE Transactions on Pattern Analysis and Machine Intelligence 29, 427–436 (2007)
Ramoni, M., Sebastiani, P.: Robust Bayes Classifiers. Artificial Intelligence 125, 209–226 (2001)
Partalas, I., Tsoumakas, P.G., Vlahavas, I.: Pruning an Ensemble of Classifiers via Reinforcement Learning. Neurocomputing 72, 1900–1909 (2009)
Breiman, L.: Bagging Predictors. Machine Learning 24, 123–140 (1996)
Freund, Y., Schapire, R.E.: A Decision-Theoretic Generalize of On-Line Learning and An Application to Boosting. Journal of Computer and System Sciences 55, 119–139 (1997)
Witten, I.H., Frank, E.: Data Mining. Morgan Kaufmann Publishers, Elsevier (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, Y., Gao, Y., Shen, R., Yang, F. (2011). Selective Ensemble Approach for Classification of Datasets with Incomplete Values. In: Wang, Y., Li, T. (eds) Foundations of Intelligent Systems. Advances in Intelligent and Soft Computing, vol 122. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25664-6_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-25664-6_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25663-9
Online ISBN: 978-3-642-25664-6
eBook Packages: EngineeringEngineering (R0)