A Clustering of Interestingness Measures

Vaillant, Benoît; Lenca, Philippe; Lallich, Stéphane

doi:10.1007/978-3-540-30214-8_23

Benoît Vaillant²⁰,
Philippe Lenca²⁰ &
Stéphane Lallich²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3245))

Included in the following conference series:

International Conference on Discovery Science

958 Accesses
36 Citations

Abstract

It is a common issue that KDD processes may generate a large number of patterns depending on the algorithm used, and its parameters. It is hence impossible for an expert to sustain these patterns. This may be the case with the well-known Apriori algorithm. One of the methods used to cope with such an amount of output depends on the use of interestingness measures. Stating that selecting interesting rules also means using an adapted measure, we present an experimental study of the behaviour of 20 measures on 10 datasets. This study is compared to a previous analysis of formal and meaningful properties of the measures, by means of two clusterings. One of the goals of this study is to enhance our previous approach. Both approaches seem to be complementary and could be profitable for the problem of a user’s choice of a measure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Fayyad, U., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.): Advances in KDD. AAAI/MIT Press (1996)
Google Scholar
Lenca, P., Meyer, P., Vaillant, B., Picouet, P.: Aide multicritère à la décision pour évaluer les indices de qualité des connaissances – modélisation des préférences de l’utilisateur. In: EGC 2003, vol. 1, pp. 271–282 (2003)
Google Scholar
Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: ACM SIGKDD Int. Conf. on KDD, pp. 32–41 (2002)
Google Scholar
Lallich, S., Teytaud, O.: Évaluation et validation de l’intérêt des règles d’association. RNTI-E-1, 193–217 (2004)
Google Scholar
Lenca, P., Meyer, P., Picouet, P., Vaillant, B., Lallich, S.: Critères d’évaluation des mesures de qualité en ecd. In: RNTI, pp. 123–134 (2003)
Google Scholar
Lenca, P., Meyer, P., Vaillant, B., Lallich, S.: A multicriteria decision aid for interestingness measure selection. Technical Report LUSSI-TR-2004-01-EN, Dpt. LUSSI, ENST Bretagne (2004)
Google Scholar
Azé, J., Kodratoff, Y.: A study of the effect of noisy data in rule extraction systems. In: EMCSR, pp. 781–786 (2002)
Google Scholar
Vaillant, B., Picouet, P., Lenca, P.: An extensible platform for rule quality measure benchmarking. In: Bisdorff, R. (ed.) HCP 2003, pp. 187–191 (2003)
Google Scholar
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Buneman, P., Jajodia, S. (eds.) ACM SIGMOD Int. Conf. on Management of Data, pp. 207–216 (1993)
Google Scholar
Hajek, P., Havel, I., Chytil, M.: The guha method of automatic hypotheses determination. Computing 1, 293–308 (1966)
Article MATH Google Scholar
Rauch, J., Simunek, M.: Mining for 4ft association rules by 4ft-miner. In: Int. Conf. on Applications of Prolog, pp. 285–294 (2001)
Google Scholar
Giakoumakis, V., Monjardet, B.: Coefficients d’accord entre deux préordres totaux. Statistique et Analyse des Données 12, 46–99 (1987)
MathSciNet Google Scholar
Freitas, A.: On rule interestingness measures. KBSJ, 309–315 (1999)
Google Scholar
Le Saux, E., Lenca, P., Picouet, P.: Dynamic adaptation of rules bases under cognitive constraints. EJOR 136, 299–309 (2002)
Article MATH Google Scholar
Pearson, K.: Mathematical contributions to the theory of evolution. regression, heredity and panmixia. Philosophical Trans. of the Royal Society A (1896)
Google Scholar
Brin, S., Motwani, R., Ullman, J.D., Tsur, S.: Dynamic itemset counting and implication rules for market basket data. In: Peckham, J. (ed.) ACM SIGMOD 1997 Int. Conf. on Management of Data, pp. 255–264 (1997)
Google Scholar
Piatetsky-Shapiro, G.: Discovery, analysis and presentation of strong rules. In: Piatetsky-Shapiro, G., Frawley, W. (eds.) KDD, pp. 229–248. AAAI/MIT Press (1991)
Google Scholar
Loevinger, J.: A systemic approach to the construction and evaluation of tests of ability. Psychological monographs 61 (1947)
Google Scholar
Church, K.W., Hanks, P.: Word association norms, mutual information an lexicography. Computational Linguistics 16, 22–29 (1990)
Google Scholar
Sebag, M., Schoenauer, M.: Generation of rules with certainty and confidence factors from incomplete and incoherent learning bases. In: Boose, J., Gaines, B., Linster, M. (eds.) EKAW 1988, pp. 28–1–28–20 (1988)
Google Scholar
Brin, S., Motwani, R., Silverstein, C.: Beyond market baskets: generalizing association rules to correlations. In: ACM SIGMOD/PODS 1997, pp. 265–276 (1997)
Google Scholar
Good, I.J.: The estimation of probabilities: An essay on modern bayesian methods. The MIT Press, Cambridge (1965)
MATH Google Scholar
Azé, J., Kodratoff, Y.: Evaluation de la résistance au bruit de quelques mesures d’extraction de règles d’assocation. EGC 2002 1, 143–154 (2002)
Google Scholar
Cohen, J.: A coefficient of agreement for nominal scale. Educational and Psychological Measurement 20, 37–46 (1960)
Article Google Scholar
Terano, T., Liu, H., Chen, A.L.P.: Association Rules. In: Terano, T., Chen, A.L.P. (eds.) PAKDD 2000. LNCS, vol. 1805, Springer, Heidelberg (2000)
Chapter Google Scholar
Lerman, I., Gras, R., Rostam, H.: Elaboration d’un indice d’implication pour les données binaires, i et ii. Mathématiques et Sciences Humaines, 5–35, 5–47 (1981)
Google Scholar
Gras, R., Ag. Almouloud, S., Bailleuil, M., Larher, A., Polo, M., Ratsimba-Rajohn, H., Totohasina, A.: L’implication Statistique, Nouvelle Méthode Exploratoire de Données. Application à la Didactique, Travaux et Thèses. La Pensée Sauvage (1996)
Google Scholar
Gras, R., Kuntz, P., Couturier, R., Guillet, F.: Une version entropique de l’intensité d’implication pour les corpus volumineux. EGC 2001 1, 69–80 (2001)
Google Scholar
Lerman, I., Azé, J.: Une mesure probabiliste contextuelle discriminante de qualité des règles d’association. EGC 2003 1, 247–262 (2003)
Google Scholar
Borgelt, C., Kruse, R.: Induction of association rules: Apriori implementation. In: 15th Conf. on Computational Statistics (2002)
Google Scholar
Vaillant, B., Lenca, P., Lallich, S.: Association rule interestingness measures: an experimental study. Technical Report LUSSI-TR-2004-02-EN, Dpt. LUSSI, ENST Bretagne (2004)
Google Scholar
Lallich, S.: Mesure et validation en extraction des connaissances à partir des données. Habilitation à Diriger des Recherches – Université Lyon 2 (2002)
Google Scholar
Chauchat, J.H., Risson, A.: 3. In: Blasius, J., Greenacre, M. (eds.) Visualization of Categorical Data, pp. 37–45. Academic Press, New York (1998)
Chapter Google Scholar
Gras, R., Couturier, R., Blanchard, J., Briand, H., Kuntz, P., Peter, P.: Quelques critères pour une mesure de qualité de règles d’association. RNTI-E-1, 3–31 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

GET ENST Bretagne / Dpt. LUSSI, CNRS TAMCIC, France
Benoît Vaillant & Philippe Lenca
Laboratoire ERIC, Univ. Lumière, Lyon 2, France
Stéphane Lallich

Authors

Benoît Vaillant
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Lenca
View author publications
You can also search for this author in PubMed Google Scholar
Stéphane Lallich
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Informatics, Graduate School of Information Science and Electrical Engineering, Kyushu University, 744 Motooka, Nishi, 819-0395, Fukuoka, Japan
Einoshin Suzuki
Kyushu University, 6–10–1 Hakozaki Higashi-ku, 812–8581, Fukuoka, Japan
Setsuo Arikawa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vaillant, B., Lenca, P., Lallich, S. (2004). A Clustering of Interestingness Measures. In: Suzuki, E., Arikawa, S. (eds) Discovery Science. DS 2004. Lecture Notes in Computer Science(), vol 3245. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30214-8_23

Download citation

DOI: https://doi.org/10.1007/978-3-540-30214-8_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23357-2
Online ISBN: 978-3-540-30214-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics