Pattern discovery algorithms typically produce many interesting patterns. In most cases, patterns are reported based on their individual merits, and little attention is given to the interestingness of a pattern in the context of other patterns reported. In this paper, we propose filtering the returned set of patterns based on a number of quality measures for pattern sets. We refer to a small subset of patterns that optimises such a measure as a pattern team. A number of quality measures, both supervised and unsupervised, is proposed. We analyse to what extent each of the measures captures a number of ‘intuitions’ users may have concerning effective and informative pattern teams. Such intuitions involve qualities such as independence of patterns, low overlap, and combined predictiveness.


Quality Measure Association Rule Mining Decision Table Pattern Discovery Interestingness Measure 


  1. 1.
    Fürnkranz, J., Flach, P.: ROC ‘n’ Rule Learning – Towards a Better Understanding of Covering Algorithms. Machine Learning 58, 39–77 (2005)MATHCrossRefGoogle Scholar
  2. 2.
    Guyon, I., Elisseeff, A.: An Introduction to Variable and Feature Selection. Journal of Machine Learning Research 3, 1157–1182 (2003)MATHCrossRefGoogle Scholar
  3. 3.
    Knobbe, A.J.: Multi-Relational Data Mining, Ph.D. dissertation (2004), http://www.kiminkii.com/thesis.pdf
  4. 4.
    Knobbe, A.J., Ho, E.K.Y.: Maximally Informative k-Itemsets and their Efficient Discovery. In: Proceedings of KDD 2006 (2006)Google Scholar
  5. 5.
    Knobbe, A.J., Ho, E.K.Y.: Pattern Teams, long version (2006), http://www.kiminkii.com/publications.html
  6. 6.
    Kohavi, R.: The Power of Decision Tables. In: Proceedings of ECML 1995 (1995)Google Scholar
  7. 7.
    Mielikäinen, T., Mannila, H.: The Pattern Ordering Problem. In: Lavrač, N., Gamberger, D., Todorovski, L., Blockeel, H. (eds.) PKDD 2003. LNCS (LNAI), vol. 2838, pp. 327–338. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  8. 8.
    Pfahringer, B.: Compression-Based Feature Subset Selection. In: Proceedings of IJCAI 1995 (1995)Google Scholar
  9. 9.
    Safarii Multi-Relational Data Mining Environment (2006), http://www.kiminkii.com/safarii.html
  10. 10.
    Scheffer, T., Wrobel, S.: Finding the Most Interesting Patterns in a Database Quickly by Using Sequential Sampling. Machine Learning Research 3 (2002)Google Scholar
  11. 11.
    Yan, X., Cheng, H., Han, J., Xin, D.: Summarizing Itemset Patterns: A Profile-Based Approach. In: Proceedings KDD 2005 (2005)Google Scholar
  12. 12.
    Zimmermann, A., De Raedt, L.: CorClass: Correlated Association Rule Mining for Classification. In: Suzuki, E., Arikawa, S. (eds.) DS 2004. LNCS (LNAI), vol. 3245, pp. 60–72. Springer, Heidelberg (2004)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Arno J. Knobbe
    • 1
    • 2
  • Eric K. Y. Ho
    • 1
  1. 1.KiminkiiHoutenThe Netherlands
  2. 2.Utrecht UniversityUtrechtThe Netherlands

Personalised recommendations