Abstract
Frequent Itemset Mining, or just pattern mining, plays an important role in data mining, aiming for the discovery of frequent co-occurrences in data. However, existing techniques still suffer from two bottlenecks that difficult the analysis and actual application of their results: they usually return a large number of patterns, and these patterns usually do not reflect user expectations. The most accepted and common approach to minimize these drawbacks is to define the user needs through constraints, and use them to filter and return less but more interesting patterns. Several types of constraints have been proposed in the literature, along with some algorithms that are able to incorporate them. However, there is no unified algorithm able to push any type of constraint. In this work we propose to push constraints into pattern mining through the use of a pattern-tree structure to efficiently store, check and prune the patterns. We define in detail a set of strategies to push each type of constraint, and a generic algorithm that is able to combine these strategies and incorporate any constraint into a pattern-tree.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: VLDB 1994: Proc. of the 20th Intern. Conf. on Very Large Data Bases, pp. 487–499. Morgan Kaufmann, San Francisco (1994)
Bayardo, R.J.: The hows, whys, and whens of constraints in itemset and rule discovery. In: Boulicaut, J.-F., De Raedt, L., Mannila, H. (eds.) Constraint-Based Mining. LNCS (LNAI), vol. 3848, pp. 1–13. Springer, Heidelberg (2006)
Boulicaut, J.-F., Jeudy, B.: Constraint-based data mining. In: The Data Mining and Knowledge Discovery Handbook, pp. 399–416. Springer (2005)
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Proc. of the 2000 ACM SIGMOD, pp. 1–12. ACM, New York (2000)
Leung, C.K.-S., Lakshmanan, L.V.S., Ng, R.T.: Exploiting succinct constraints using fp-trees. SIGKDD Explor. Newsl. 4(1), 40–49 (2002)
Leung, C.K.-S., Sun, L.: A new class of constraints for constrained frequent pattern mining. In: Proc. of the 27th Annual ACM Symposium on Applied Computing (SAC 2012), pp. 199–204. ACM (2012)
Ng, R., Lakshmanan, L., Han, J., Pang, A.: Exploratory mining and pruning optimizations of constrained associations rules. In: Proc. of the 1998 ACM SIGMOD Int. Conf. on Management of Data, pp. 13–24. ACM (1998)
Pei, J., Han, J.: Constrained frequent pattern mining: a pattern-growth view. SIGKDD Explor. Newsl. 4(1), 31–39 (2002)
Pei, J., Han, J., Lakshmanan, L.V.S.: Mining frequent itemsets with convertible constraints. In: Proc. of the 17th Int. Conf. on Data Engineering (ICDE 2001), pp. 433–442. IEEE, Washington (2001)
Srikant, R., Agrawal, R.: Mining generalized association rules. In: Proc. of the 21th Int. Conf. on Very Large Data Bases (VLDB 1995), pp. 407–419. Morgan Kaufmann Publishers Inc., San Francisco (1995)
Srikant, R., Vu, Q., Agrawal, R.: Mining association rules with item constraints. In: Proc. of the 3rd ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD 1997), pp. 67–73 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Silva, A., Antunes, C. (2013). Pushing Constraints into a Pattern-Tree. In: Torra, V., Narukawa, Y., Navarro-Arribas, G., Megías, D. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2013. Lecture Notes in Computer Science(), vol 8234. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41550-0_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-41550-0_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41549-4
Online ISBN: 978-3-642-41550-0
eBook Packages: Computer ScienceComputer Science (R0)