Abstract
It is a common issue that KDD processes may generate a large number of patterns depending on the algorithm used, and its parameters. It is hence impossible for an expert to sustain these patterns. This may be the case with the well-known Apriori algorithm. One of the methods used to cope with such an amount of output depends on the use of interestingness measures. Stating that selecting interesting rules also means using an adapted measure, we present an experimental study of the behaviour of 20 measures on 10 datasets. This study is compared to a previous analysis of formal and meaningful properties of the measures, by means of two clusterings. One of the goals of this study is to enhance our previous approach. Both approaches seem to be complementary and could be profitable for the problem of a user’s choice of a measure.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Fayyad, U., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.): Advances in KDD. AAAI/MIT Press (1996)
Lenca, P., Meyer, P., Vaillant, B., Picouet, P.: Aide multicritère à la décision pour évaluer les indices de qualité des connaissances – modélisation des préférences de l’utilisateur. In: EGC 2003, vol. 1, pp. 271–282 (2003)
Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: ACM SIGKDD Int. Conf. on KDD, pp. 32–41 (2002)
Lallich, S., Teytaud, O.: Évaluation et validation de l’intérêt des règles d’association. RNTI-E-1, 193–217 (2004)
Lenca, P., Meyer, P., Picouet, P., Vaillant, B., Lallich, S.: Critères d’évaluation des mesures de qualité en ecd. In: RNTI, pp. 123–134 (2003)
Lenca, P., Meyer, P., Vaillant, B., Lallich, S.: A multicriteria decision aid for interestingness measure selection. Technical Report LUSSI-TR-2004-01-EN, Dpt. LUSSI, ENST Bretagne (2004)
Azé, J., Kodratoff, Y.: A study of the effect of noisy data in rule extraction systems. In: EMCSR, pp. 781–786 (2002)
Vaillant, B., Picouet, P., Lenca, P.: An extensible platform for rule quality measure benchmarking. In: Bisdorff, R. (ed.) HCP 2003, pp. 187–191 (2003)
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Buneman, P., Jajodia, S. (eds.) ACM SIGMOD Int. Conf. on Management of Data, pp. 207–216 (1993)
Hajek, P., Havel, I., Chytil, M.: The guha method of automatic hypotheses determination. Computing 1, 293–308 (1966)
Rauch, J., Simunek, M.: Mining for 4ft association rules by 4ft-miner. In: Int. Conf. on Applications of Prolog, pp. 285–294 (2001)
Giakoumakis, V., Monjardet, B.: Coefficients d’accord entre deux préordres totaux. Statistique et Analyse des Données 12, 46–99 (1987)
Freitas, A.: On rule interestingness measures. KBSJ, 309–315 (1999)
Le Saux, E., Lenca, P., Picouet, P.: Dynamic adaptation of rules bases under cognitive constraints. EJOR 136, 299–309 (2002)
Pearson, K.: Mathematical contributions to the theory of evolution. regression, heredity and panmixia. Philosophical Trans. of the Royal Society A (1896)
Brin, S., Motwani, R., Ullman, J.D., Tsur, S.: Dynamic itemset counting and implication rules for market basket data. In: Peckham, J. (ed.) ACM SIGMOD 1997 Int. Conf. on Management of Data, pp. 255–264 (1997)
Piatetsky-Shapiro, G.: Discovery, analysis and presentation of strong rules. In: Piatetsky-Shapiro, G., Frawley, W. (eds.) KDD, pp. 229–248. AAAI/MIT Press (1991)
Loevinger, J.: A systemic approach to the construction and evaluation of tests of ability. Psychological monographs 61 (1947)
Church, K.W., Hanks, P.: Word association norms, mutual information an lexicography. Computational Linguistics 16, 22–29 (1990)
Sebag, M., Schoenauer, M.: Generation of rules with certainty and confidence factors from incomplete and incoherent learning bases. In: Boose, J., Gaines, B., Linster, M. (eds.) EKAW 1988, pp. 28–1–28–20 (1988)
Brin, S., Motwani, R., Silverstein, C.: Beyond market baskets: generalizing association rules to correlations. In: ACM SIGMOD/PODS 1997, pp. 265–276 (1997)
Good, I.J.: The estimation of probabilities: An essay on modern bayesian methods. The MIT Press, Cambridge (1965)
Azé, J., Kodratoff, Y.: Evaluation de la résistance au bruit de quelques mesures d’extraction de règles d’assocation. EGC 2002 1, 143–154 (2002)
Cohen, J.: A coefficient of agreement for nominal scale. Educational and Psychological Measurement 20, 37–46 (1960)
Terano, T., Liu, H., Chen, A.L.P.: Association Rules. In: Terano, T., Chen, A.L.P. (eds.) PAKDD 2000. LNCS, vol. 1805, Springer, Heidelberg (2000)
Lerman, I., Gras, R., Rostam, H.: Elaboration d’un indice d’implication pour les données binaires, i et ii. Mathématiques et Sciences Humaines, 5–35, 5–47 (1981)
Gras, R., Ag. Almouloud, S., Bailleuil, M., Larher, A., Polo, M., Ratsimba-Rajohn, H., Totohasina, A.: L’implication Statistique, Nouvelle Méthode Exploratoire de Données. Application à la Didactique, Travaux et Thèses. La Pensée Sauvage (1996)
Gras, R., Kuntz, P., Couturier, R., Guillet, F.: Une version entropique de l’intensité d’implication pour les corpus volumineux. EGC 2001 1, 69–80 (2001)
Lerman, I., Azé, J.: Une mesure probabiliste contextuelle discriminante de qualité des règles d’association. EGC 2003 1, 247–262 (2003)
Borgelt, C., Kruse, R.: Induction of association rules: Apriori implementation. In: 15th Conf. on Computational Statistics (2002)
Vaillant, B., Lenca, P., Lallich, S.: Association rule interestingness measures: an experimental study. Technical Report LUSSI-TR-2004-02-EN, Dpt. LUSSI, ENST Bretagne (2004)
Lallich, S.: Mesure et validation en extraction des connaissances à partir des données. Habilitation à Diriger des Recherches – Université Lyon 2 (2002)
Chauchat, J.H., Risson, A.: 3. In: Blasius, J., Greenacre, M. (eds.) Visualization of Categorical Data, pp. 37–45. Academic Press, New York (1998)
Gras, R., Couturier, R., Blanchard, J., Briand, H., Kuntz, P., Peter, P.: Quelques critères pour une mesure de qualité de règles d’association. RNTI-E-1, 3–31 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vaillant, B., Lenca, P., Lallich, S. (2004). A Clustering of Interestingness Measures. In: Suzuki, E., Arikawa, S. (eds) Discovery Science. DS 2004. Lecture Notes in Computer Science(), vol 3245. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30214-8_23
Download citation
DOI: https://doi.org/10.1007/978-3-540-30214-8_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23357-2
Online ISBN: 978-3-540-30214-8
eBook Packages: Springer Book Archive