Skip to main content

A Clustering of Interestingness Measures

  • Conference paper
Discovery Science (DS 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3245))

Included in the following conference series:

Abstract

It is a common issue that KDD processes may generate a large number of patterns depending on the algorithm used, and its parameters. It is hence impossible for an expert to sustain these patterns. This may be the case with the well-known Apriori algorithm. One of the methods used to cope with such an amount of output depends on the use of interestingness measures. Stating that selecting interesting rules also means using an adapted measure, we present an experimental study of the behaviour of 20 measures on 10 datasets. This study is compared to a previous analysis of formal and meaningful properties of the measures, by means of two clusterings. One of the goals of this study is to enhance our previous approach. Both approaches seem to be complementary and could be profitable for the problem of a user’s choice of a measure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Fayyad, U., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.): Advances in KDD. AAAI/MIT Press (1996)

    Google Scholar 

  2. Lenca, P., Meyer, P., Vaillant, B., Picouet, P.: Aide multicritère à la décision pour évaluer les indices de qualité des connaissances – modélisation des préférences de l’utilisateur. In: EGC 2003, vol. 1, pp. 271–282 (2003)

    Google Scholar 

  3. Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: ACM SIGKDD Int. Conf. on KDD, pp. 32–41 (2002)

    Google Scholar 

  4. Lallich, S., Teytaud, O.: Évaluation et validation de l’intérêt des règles d’association. RNTI-E-1, 193–217 (2004)

    Google Scholar 

  5. Lenca, P., Meyer, P., Picouet, P., Vaillant, B., Lallich, S.: Critères d’évaluation des mesures de qualité en ecd. In: RNTI, pp. 123–134 (2003)

    Google Scholar 

  6. Lenca, P., Meyer, P., Vaillant, B., Lallich, S.: A multicriteria decision aid for interestingness measure selection. Technical Report LUSSI-TR-2004-01-EN, Dpt. LUSSI, ENST Bretagne (2004)

    Google Scholar 

  7. Azé, J., Kodratoff, Y.: A study of the effect of noisy data in rule extraction systems. In: EMCSR, pp. 781–786 (2002)

    Google Scholar 

  8. Vaillant, B., Picouet, P., Lenca, P.: An extensible platform for rule quality measure benchmarking. In: Bisdorff, R. (ed.) HCP 2003, pp. 187–191 (2003)

    Google Scholar 

  9. Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Buneman, P., Jajodia, S. (eds.) ACM SIGMOD Int. Conf. on Management of Data, pp. 207–216 (1993)

    Google Scholar 

  10. Hajek, P., Havel, I., Chytil, M.: The guha method of automatic hypotheses determination. Computing 1, 293–308 (1966)

    Article  MATH  Google Scholar 

  11. Rauch, J., Simunek, M.: Mining for 4ft association rules by 4ft-miner. In: Int. Conf. on Applications of Prolog, pp. 285–294 (2001)

    Google Scholar 

  12. Giakoumakis, V., Monjardet, B.: Coefficients d’accord entre deux préordres totaux. Statistique et Analyse des Données 12, 46–99 (1987)

    MathSciNet  Google Scholar 

  13. Freitas, A.: On rule interestingness measures. KBSJ, 309–315 (1999)

    Google Scholar 

  14. Le Saux, E., Lenca, P., Picouet, P.: Dynamic adaptation of rules bases under cognitive constraints. EJOR 136, 299–309 (2002)

    Article  MATH  Google Scholar 

  15. Pearson, K.: Mathematical contributions to the theory of evolution. regression, heredity and panmixia. Philosophical Trans. of the Royal Society A (1896)

    Google Scholar 

  16. Brin, S., Motwani, R., Ullman, J.D., Tsur, S.: Dynamic itemset counting and implication rules for market basket data. In: Peckham, J. (ed.) ACM SIGMOD 1997 Int. Conf. on Management of Data, pp. 255–264 (1997)

    Google Scholar 

  17. Piatetsky-Shapiro, G.: Discovery, analysis and presentation of strong rules. In: Piatetsky-Shapiro, G., Frawley, W. (eds.) KDD, pp. 229–248. AAAI/MIT Press (1991)

    Google Scholar 

  18. Loevinger, J.: A systemic approach to the construction and evaluation of tests of ability. Psychological monographs 61 (1947)

    Google Scholar 

  19. Church, K.W., Hanks, P.: Word association norms, mutual information an lexicography. Computational Linguistics 16, 22–29 (1990)

    Google Scholar 

  20. Sebag, M., Schoenauer, M.: Generation of rules with certainty and confidence factors from incomplete and incoherent learning bases. In: Boose, J., Gaines, B., Linster, M. (eds.) EKAW 1988, pp. 28–1–28–20 (1988)

    Google Scholar 

  21. Brin, S., Motwani, R., Silverstein, C.: Beyond market baskets: generalizing association rules to correlations. In: ACM SIGMOD/PODS 1997, pp. 265–276 (1997)

    Google Scholar 

  22. Good, I.J.: The estimation of probabilities: An essay on modern bayesian methods. The MIT Press, Cambridge (1965)

    MATH  Google Scholar 

  23. Azé, J., Kodratoff, Y.: Evaluation de la résistance au bruit de quelques mesures d’extraction de règles d’assocation. EGC 2002 1, 143–154 (2002)

    Google Scholar 

  24. Cohen, J.: A coefficient of agreement for nominal scale. Educational and Psychological Measurement 20, 37–46 (1960)

    Article  Google Scholar 

  25. Terano, T., Liu, H., Chen, A.L.P.: Association Rules. In: Terano, T., Chen, A.L.P. (eds.) PAKDD 2000. LNCS, vol. 1805, Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  26. Lerman, I., Gras, R., Rostam, H.: Elaboration d’un indice d’implication pour les données binaires, i et ii. Mathématiques et Sciences Humaines, 5–35, 5–47 (1981)

    Google Scholar 

  27. Gras, R., Ag. Almouloud, S., Bailleuil, M., Larher, A., Polo, M., Ratsimba-Rajohn, H., Totohasina, A.: L’implication Statistique, Nouvelle Méthode Exploratoire de Données. Application à la Didactique, Travaux et Thèses. La Pensée Sauvage (1996)

    Google Scholar 

  28. Gras, R., Kuntz, P., Couturier, R., Guillet, F.: Une version entropique de l’intensité d’implication pour les corpus volumineux. EGC 2001 1, 69–80 (2001)

    Google Scholar 

  29. Lerman, I., Azé, J.: Une mesure probabiliste contextuelle discriminante de qualité des règles d’association. EGC 2003 1, 247–262 (2003)

    Google Scholar 

  30. Borgelt, C., Kruse, R.: Induction of association rules: Apriori implementation. In: 15th Conf. on Computational Statistics (2002)

    Google Scholar 

  31. Vaillant, B., Lenca, P., Lallich, S.: Association rule interestingness measures: an experimental study. Technical Report LUSSI-TR-2004-02-EN, Dpt. LUSSI, ENST Bretagne (2004)

    Google Scholar 

  32. Lallich, S.: Mesure et validation en extraction des connaissances à partir des données. Habilitation à Diriger des Recherches – Université Lyon 2 (2002)

    Google Scholar 

  33. Chauchat, J.H., Risson, A.: 3. In: Blasius, J., Greenacre, M. (eds.) Visualization of Categorical Data, pp. 37–45. Academic Press, New York (1998)

    Chapter  Google Scholar 

  34. Gras, R., Couturier, R., Blanchard, J., Briand, H., Kuntz, P., Peter, P.: Quelques critères pour une mesure de qualité de règles d’association. RNTI-E-1, 3–31 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Vaillant, B., Lenca, P., Lallich, S. (2004). A Clustering of Interestingness Measures. In: Suzuki, E., Arikawa, S. (eds) Discovery Science. DS 2004. Lecture Notes in Computer Science(), vol 3245. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30214-8_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30214-8_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23357-2

  • Online ISBN: 978-3-540-30214-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics