Abstract

Association rules are rules of the kind “70% of the customers who buy vine and cheese also buy grapes”. While the traditional field of application is market basket analysis, association rule mining has been applied to various fields since then, which has led to a number of important modifications and extensions. We discuss the most frequently applied approach that is central to many extensions, the Apriori algorithm, and briefly review some applications to other data types, well-known problems of rule evaluation via support and confidence, and extensions of or alternatives to the standard framework.

Keywords

Association Rules Apriori 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal R. and Srikant R. Fast Algorithms for mining association rules. Proc. Int. Conf. on Very Large Databases, 487–499, 1994Google Scholar
  2. Agrawal R. and Srikant R. Mining Sequential Patterns. Proc. Int. Conf. on Data Engineering, 3–14, 1995.Google Scholar
  3. Agrawal R., Mannila H., Srikant R., Toivonen H., Verkamo A. I. Fast Discovery of Association Rules. In: Advances in Knowledge Discovery and Data Mining, Fayyad U.M., Piatetsky-Shapiro G., Smyth P., Uthurusamy R. (eds)., AAAI Press / The MIT Press, 307–328, 1996Google Scholar
  4. Aumann Y., Lindell, Y. A Statistical Theory for Quantitative Association Rules, Journal of Intelligent Information Systems, 20(3):255–283, 2003CrossRefGoogle Scholar
  5. Bayardo R.J., Agrawal R. Mining the Most Interesting Rules. Proc. ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, 145–154, 1999Google Scholar
  6. Bayardo R.J., Agrawal R., Gunopulos D. Constrained-Based Rule Mining in Large, Dense Databases. Proc. 15th Int. Conf. on Data Engineering, 188–197, 1999Google Scholar
  7. Berzal F., Blanco L., Sanchez D., Vila M.A. A New Framework to Assess Association Rules. Proc. Symp. on Intelligent Data Analysis, LNCS 2189, 95–104, Springer, 2001Google Scholar
  8. Bolton R., Hand D.J., Adams, N.M. Determining Hit Rate in Pattern Search, Proc. Pattern Detection and Discovery, LNAI 2447, 36–48, Springer, 2002Google Scholar
  9. Borgelt C, Berthold M. Mining Molecular Fragments: Finding Relevant Substructures of Molecules, Proc. Int. Conf. on Data Mining, 51–58, 2002Google Scholar
  10. Breiman L., Friedman J., Olshen R., Stone C. Classification and Regression Trees. Chapman & Hall, New York, 1984.Google Scholar
  11. Brijs T., Swinnen G., Vanhoof K. and Wets G. Using Association Rules for Product Assortment Decisions: A Case Study. Proc. ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, 254–260, 1999Google Scholar
  12. Brin S., Motwani R., and Silverstein C. Beyond market baskets: Generalizing association rules to correlations. Proc. ACM SIGMOD Int. Conf. Management of Data, 265–276, 1997Google Scholar
  13. Brin S., Motwani R., Ullman, J.D., Tsur, S. Dynamic itemset counting and implication rules for market basket data. SIGMOD Record 26(2):255–264, 1997CrossRefGoogle Scholar
  14. Castelo R., Feelders A., Siebes A. Mambo: Discovering Association Rules. Proc. Symp. on Intelligent Data Analysis. LNCS 2189, 289–298, Springer, 2001Google Scholar
  15. Clark P., Boswell R. Rule Induction with CN2: Some recent improvements. In Proc. European Working Session on Learning EWSL-91, 151–163, 1991Google Scholar
  16. Cohen E., Datar M., Fujiwara S., Gionis A., Indyk P., Motwani R., Ullman J., Yang C. Finding Interesting Associations without Support Pruning, IEEE Transaction on Knowledge Discovery 13(1):64–78, 2001CrossRefGoogle Scholar
  17. Dehaspe L., Toivonen H. Discovery of Frequent Datalog Patterns. Data Mining and Knowledge Discovery, 3(1):7–38, 1999CrossRefGoogle Scholar
  18. Delgado M., Martin-Bautista M.J., Sanchez D., Vila M.A. Mining Text Data: Features and Patterns. Proc. Pattern Detection and Discovery, LNAI 2447, 140–153, Springer, 2002Google Scholar
  19. Dong G., Li J. Interestingness of Discovered Association Rules in Terms of Neighbourhood-Based Unexpectedness. Proc. Pacific Asia Conf. on Knowledge Discovery in Databases, LNAI 1394, 72–86, 1998Google Scholar
  20. Friedman J.H., Fisher N.I. Bump Hunting in High-Dimensional Data. Statistics and Computing 9(2), 123–143, 1999CrossRefGoogle Scholar
  21. Gamberger D., Lavrac N., Jovanoski, V. High Confidence Association Rules for Medical Diagnosis, Proc. of Intelligent Data Analysis in Medical Applications (IDAMAP), 42–51, 1999Google Scholar
  22. Goethals B., Van den Bussche, J. On Supporting Interactive Association Rule Mining, Proc. Int. Conf. Data Warehousing and Knowledge Discovery, LNCS 1874, 307–316, Springer, 2000Google Scholar
  23. Han J., Fu Y. Discovery of Multiple-Level Association Rules from Large Databases Proc. Int. Conf. on Very Large Databases, 420–431, 1995Google Scholar
  24. Hilderman R. J. and Hamilton H. J. Knowledge Discovery and Measures of Interest. Kluwer Academic Publishers, 2001Google Scholar
  25. Höppner F, Klawonn F. Finding Informative Rules in Interval Sequences. Intelligent Data Analysis, 6(3):237–256, 2002Google Scholar
  26. Höppner F. Discovery of Core Episodes from Sequences. Proc. Pattern Detection and Discovery, LNAI2447, 199–213, Springer, 2002Google Scholar
  27. Klemettinen M., Mannila H., Ronkainen P., Toivonen, H. and Verkamo A. I. Finding Interesting Rules from Large Sets of Discovered Association Rules. Proc. Int. Conf. on Information and Knowledge Managament, 401–407, 1994.Google Scholar
  28. Koperski K., Han J. Discovery of Spatial Association Rules in Geographic Information Databases. Proc. Int. Symp Advances in Spatial Databases, LNCS 951, 47–66, 1995Google Scholar
  29. Kryszkiewicz M. Concise Representation of Frequent Patterns based on Disjunction-free Generators. Proc. Int. Conf. on Data Mining, 305–312, 2001.Google Scholar
  30. Lee C. H., Lin C. R., Chen M. S. On Mining General Temporal Association Rules in a Publication Database. Proc. Int. Conf. on Data Mining, 337–344, 2001Google Scholar
  31. Li J., Zhang X., Dong G., Ramamohanarao K., Sun Q. Efficient Mining of High Confidence Association Rules without Support Thresholds. Proc. Principles of Data Mining and Knowledge Discovery, 406–411, 1999Google Scholar
  32. Liu B., Hsu W., Ma Y. Pruning and Summarizing the Discovered Associations. Proc. ACM SIGKDD Conf. Knowledge Discovery and Data Mining, 125–134, 1999Google Scholar
  33. Mannila H., Toivonen H. Multiple uses of frequent sets and condensed representations. Proc. ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, 189–194, 1996Google Scholar
  34. Mannila H., Toivonen H., Verkamo A. I. Discovery of Frequent Episodes in Event Sequences. Data Mining and Knowledge Discovery, 1(3):259–289, 1997CrossRefGoogle Scholar
  35. Miller R. J., Yang Y. Association Rules over Interval Data. Proc. Int. Conf. on Management of Data, 452–461, 1997Google Scholar
  36. Mitchell T, Machine Learning, McGraw-Hill, 1997Google Scholar
  37. Mobasher B., Dai H., Luo T, Nakagawa, M. Discovery and Evaluation of Aggregate Usage Profile for Web Personalization, Data Mining and Knowledge Discovery 6:61–82, 2002MathSciNetCrossRefGoogle Scholar
  38. Padmanabhan B., Tuzhilin A. A Belief-Driven Method for Discovering Unexpected Patterns. Int. Conf. on Knowledge Discovery and Data Mining, 94–100, 1998Google Scholar
  39. Pasquier N., Bastide Y., Taouil R., Lakhal L. Efficient Mining of association rules using closed itemset lattices. Information Systems, 24(1):25–46, 1999MathSciNetCrossRefGoogle Scholar
  40. Piatetsky-Shapiro, G. Discovery, Analysis, and Presentation of Strong Rules. Proc. Knowledge Discovery in Databases, 229–248, 1991Google Scholar
  41. Smyth P. and Goodman, R.M. An Information Theoretic Approach to Rule Induction from Databases. IEEE Trans. Knowledge Discovery and Data Engineering, 4(4):301–316, 1992CrossRefGoogle Scholar
  42. Srikant R., Agrawal R. Mining Generalized Association Rules. Proc. Int. Conf. on Very Large Databases, 407–419, 1995Google Scholar
  43. Srikant R., Agrawal R. Mining Quantitative Association Rules in Large Relational Tables. Proc. ACM SIGMOD Conf. on Management of Data, 1–12, 1996Google Scholar
  44. Srikant R., Vu Q., Agrawal R. Mining Association Rules with Constraints. Proc. Int. Conf. Knowledge Discovery and Data Mining, 66–73, 1997Google Scholar
  45. Tan P. N., Kumar V. Selecting the Right Interestingness Measure for Association Patterns, Proc. ACM SIGKDD Conf. Knowledge Discovery and Data Mining, 32–41, 2002Google Scholar
  46. Tsoukatos, I. and Gunopulos, D. Efficient Mining of Spatiotemporal Patterns. Proc. Int. Symp. Spatial and Temporal Databases, LNCS 2121, pages 425–442, 2001Google Scholar
  47. Wang J., Han J., Pei J. CLOSET+: Searching for the Best Strategies for mining Frequent Closed Itemsets, Proc. ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, 236–245, 2003Google Scholar
  48. Webb G.I. Efficient Search for Association Rules. Proc. ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, 99–107, 2000Google Scholar
  49. Webb G.I. Discovering Associations with numeric variables. Proc. ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, 383–388, 2001Google Scholar
  50. Zaki M. J. Generating non-redundant Association Rules. In Proc. ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, 34–43, 2000Google Scholar
  51. Zaki M. J. SPADE: An Efficient Algorithm for Mining Frequent Sequences. Machine Learning 42(1):31–60, 2001MATHCrossRefGoogle Scholar
  52. Zelenko D. Optimizing Disjunctive Association Rules. Proc. of Int. Conf. on Principles of Data Mining and Knowledge Discovery, LNAI1704, 204–213, 1999Google Scholar

Copyright information

© Springer Science+Business Media, Inc. 2005

Authors and Affiliations

  • Frank Höppner
    • 1
  1. 1.Department of Information SystemsUniversity of Applied Sciences Braunschweig/WolfenbiittelGermany

Personalised recommendations