Abstract
In this chapter we give an introduction to ROC (‘receiver operating characteristics’) analysis and its applications to data mining. We argue that ROC analysis provides decision support for data mining in several ways. For model selection, ROC analysis establishes a method to determine the optimal model once the operating characteristics for the model deployment context are known. We also show how ROC analysis can aid in constructing and refining models in the modeling stage.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Blocked, H. and Struyf, J. (2002). Deriving biased classifiers for improved ROC performance, Informatica, Vol. 26, No. 1, 77–84.
Fawcett, T. (2003). ROC Graphs: Notes and Practical Considerations for Data Mining Researchers, Hewlett-Packard Laboratories.
Ferri, C., Flach, P. and Hernändez-Orallo, J. (2002). Decision tree learning using the area under the ROC curve. Proc. 19th International Conference on Machine Learning (ICML’02). (eds. Sammut, C. and Hoffman, A.), Morgan Kaufmann, 139–146.
Flach, P. and Gamberger, D. (2001). Subgroup evaluation and decision support for a direct mailing marketing problem. Proc. ECML/PKDD-2001 Workshop Integrating Aspects of Data Mining, Decision Support and Meta-Learning (IDDM-2001). (eds. Giraud-Carrier, C., Lavrač, N., Moyle, S. A. and Kavšek, B.), Freiburg, Germany, 45–56.
Hand, D. J. and Till, R. J. (2001). A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems, Machine Learning, Vol. 45, No. 2, 171–186.
Lavrač, N., Flach, P., Kavšek, B. and Todorovski, L. (2002). Adapting classification rule induction to subgroup discovery. Proc. 2002 IEEE International Conference on Data Mining. IEEE Press, 266–273.
Provost, F. and Fawcett, T. (2001). Robust classification for imprecise environments, Machine Learning, Vol. 42, No. 3, 203–231.
Srinivasan, A. (1999). Note on the location of optimal classifiers in n-dimensional ROC space, Oxford University Computing Laboratory.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer Science+Business Media New York
About this chapter
Cite this chapter
Flach, P., Blockeel, H., Ferri, C., Hernández-Orallo, J., Struyf, J. (2003). Decision Support for Data Mining. In: Mladenić, D., Lavrač, N., Bohanec, M., Moyle, S. (eds) Data Mining and Decision Support. The Springer International Series in Engineering and Computer Science, vol 745. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-0286-9_7
Download citation
DOI: https://doi.org/10.1007/978-1-4615-0286-9_7
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-5004-0
Online ISBN: 978-1-4615-0286-9
eBook Packages: Springer Book Archive