Evaluating the Correlation Between Objective Rule Interestingness Measures and Real Human Interest
In the last few years, the data mining community has proposed a number of objective rule interestingness measures to select the most interesting rules, out of a large set of discovered rules. However, it should be recalled that objective measures are just an estimate of the true degree of interestingness of a rule to the user, the so-called real human interest. The latter is inherently subjective. Hence, it is not clear how effective, in practice, objective measures are. More precisely, the central question investigated in this paper is: “how effective objective rule interestingness measures are, in the sense of being a good estimate of the true, subjective degree of interestingness of a rule to the user?” This question is investigated by extensive experiments with 11 objective rule interestingness measures across eight real-world data sets.
KeywordsRank Number True Degree Exception Rule Rule Interestingness Minimum Generalization
- 2.Carvalho, D.R., Freitas, A.A., Ebecken, N.F.: A Critical Review of Rule Surprisingness Measures. In: Proc. 2003 Int. Conf. on Data Mining, pp. 545–556. WIT Press (2003)Google Scholar
- 5.Ohsaki, M., Kitaguchi, S., Okamoto, K., Yokoi, H., Yamaguchi, T.: Evaluation of rule interestingness measures with a clinical dataset on hepatitis. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, pp. 362–373. Springer, Heidelberg (2004)CrossRefGoogle Scholar
- 6.Quinlan, J.R.: C4.5: programs for machine learning. Morgan Kaufmann, San Francisco (1993)Google Scholar
- 7.Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: Proc. ACM SIGKDD KDD 2002. ACM Press, New York (2002)Google Scholar