Abstract
In many applications, an expert classifier system has access to statistical information about the prior probabilities of the different classes. Such information should in principle reduce the amount of additional information that the system must collect, e.g., from answers to questions, before it can make a correct classification. This paper examines how to make best use of such prior statistical information, sequentially updated by collection of additional costly information, to optimally reduce uncertainty about the class to which a case belongs, thus minimizing the cost or effort required to correctly classify new cases. Two approaches are introduced, one motivated by information theory and the other based on the idea of trying to prove class membership as efficiently as possible. It is shown that, while the general problem of cost-effective classification is NP-hard, both heuristics provide useful approximations on small to moderate sized problems. Moreover, a hybrid heuristic that chooses which approach to apply based on the characteristics of the classification problem (entropy of the class probability distribution and coefficient of variation of information collection costs) appears to give excellent results. The results of initial computational experiments are summarized in support of these conclusions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1994 Springer-Verlag New York, Inc.
About this paper
Cite this paper
Cox, L.A., Qiu, Y. (1994). Minimizing the expected costs of classifying patterns by sequential costly inspections. In: Cheeseman, P., Oldford, R.W. (eds) Selecting Models from Data. Lecture Notes in Statistics, vol 89. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-2660-4_35
Download citation
DOI: https://doi.org/10.1007/978-1-4612-2660-4_35
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-94281-0
Online ISBN: 978-1-4612-2660-4
eBook Packages: Springer Book Archive