Axiomatization of Frequent Sets
In data mining association rules are very popular. Most of the algorithms in the literature for finding association rules start by searching for frequent itemsets. The itemset mining algorithms typically interleave brute force counting of frequencies with a meta-phase for pruning parts of the search space. The knowledge acquired in the counting phases can be represented by frequent set expressions. A frequent set expression is a pair containing an itemset and a frequency indicating that the frequency of that itemset is greater than or equal to the given fre-quency. A system of frequent sets is a collection of such expressions. We give an axiomatization for these systems. This axiomatization characterizes complete systems. A system is complete when it explicitly contains all information that it logically implies. Every system of frequent sets has a unique completion. The completion of a system actually represents the knowledge that maximally can be derived in the meta-phase.
Unable to display preview. Download preview PDF.
- 1.R. Agrawal, T. Imilienski, and A. Swami. Mining association rules between sets of items in large databases. In Proc. ACM SIGMOD, 1993Google Scholar
- 2.R. Agrawal, R. Srikant. Fast Algorithms for Mining Association Rules. In Proc. VLDB, 1994Google Scholar
- 3.T. Calders, and J. Paredaens. A Theoretical Framework for Reasoning about Frequent Itemsets. Technical Report 006, Universiteit Antwerpen, Belgium, http://win-www.uia.ac.be/u/calders/download/axiom.ps, June 2000.Google Scholar
- 8.J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In Proc. ACM SIGMOD, 2000Google Scholar
- 9.P. Hansen, B. Jaumard, G.-B. D. Nguetsé, M. P. de Aragäo. Models and Algorithms for Probabilistic and Bayesian Logic. In Proc. IJCAI, 1995Google Scholar
- 10.P. Hansen, B. Jaumard. Probabilistic Satisfiability. Les Cahiers du GERAD G-96-31, 1996Google Scholar
- 11.L. V.S. Laksmanan, R.T. Ng, J. Han, and A. Pang. Optimization of Constrained Frequent Set Queries with 2-variable Constraints. Proc. ACM SIGMOD, 1999Google Scholar