Journal of Intelligent Information Systems

, Volume 36, Issue 1, pp 73–98

A review and comparison of strategies for handling missing values in separate-and-conquer rule learning

Article

Abstract

In this paper, we review possible strategies for handling missing values in separate-and-conquer rule learning algorithms, and compare them experimentally on a large number of datasets. In particular through a careful study with data with controlled levels of missing values we get additional insights on the strategies’ different biases w.r.t. attributes with missing values. Somewhat surprisingly, a strategy that implements a strong bias against the use of attributes with missing values, exhibits the best average performance on 24 datasets from the UCI repository.

Keywords

Machine learning Inductive rule learning Missing values 

Copyright information

© Springer Science+Business Media, LLC 2010

Authors and Affiliations

  1. 1.Knowledge Engineering GroupTechnische Universität DarmstadtDarmstadtGermany

Personalised recommendations