IntelliSys 2016: Proceedings of SAI Intelligent Systems Conference (IntelliSys) 2016 pp 974-983 | Cite as
A Review of Frequent Pattern Mining Algorithms for Uncertain Data
Abstract
Frequent pattern mining plays an essential role in data mining (e.g. finding association rules, correlation, sequences, and episodes). Uncertainty in data is generally caused by factors like data randomness and incompleteness, limitations of measuring equipment, delayed data updates, etc. Some of the important algorithms in finding all the frequent patterns from probabilistic datasets of uncertain data are U-Apriori, UF-growth, CUF-growth, PUF-growth. When users are only interested in some of the frequent patterns rather than all, these interests are expressed in terms of constraints and are pushed into mining process there by reducing search space. Finally big data era has brought challenges as well as tools for the problem of frequent pattern mining of uncertain data. In this paper all the above issues are briefly discussed and summarized.
Keywords
Frequent itemsets Uncertain data Constrained mining Big dataReferences
- 1.Chui, C.-K., Kao, B., Hung, E.: Mining frequent itemsets from uncertain data. In: Proceedings of the PAKDD 2007, pp. 47–58. Springer (2007)Google Scholar
- 2.Leung, C.K.-S., Tanbeer, S.K.: Fast tree-based mining of frequent itemsets from uncertain data. In: Proceedings of the DASFAA 2012, Part I, pp. 272–287. Springer (2012)Google Scholar
- 3.Leung, C.K.-S., Tanbeer, S.K.: PUF-tree: a compact tree structure for frequent pattern mining of uncertain data. In: Proceedings of the PAKDD 2013, Part I, pp. 13–25. Springer (2013)Google Scholar
- 4.Chi, Y., Muntz, R.R., Nijssen, S., Kok, J.N.: Frequent subtree mining- an overview. In: Proceedings of the PAKDD 2013 66(1–2), 161–198 (2004)Google Scholar
- 5.Cuzzocrea, A., Leung, C.K.-S., MacKinnon, R.K.: Mining constrained frequent itemsets from distributed uncertain data. In: Future Generation Computer Systems. Elsevier (2014) Google Scholar
- 6.Lakshmanan, L.V.S., Leung, C.K.-S., Ng, R.T.: Efficient dynamic mining of constrained frequent sets. ACM Trans. Database Syst. (TODS) 28(4), 337–389 (2003)CrossRefGoogle Scholar
- 7.Leung, C.K.-S.: Frequent itemset mining with constraints. In: Encyclopedia of Database Systems, pp. 1179–1183. Springer (2009)Google Scholar
- 8.Leung, C.K.-S., Brajczuk, D.A.: Mining uncertain data for constrained frequent sets. In: Proceedings of the IDEAS 2009, pp. 109–120. ACM (2009)Google Scholar
- 9.Leung, C.K.-S., Brajczuk, D.A.: uCFS2: an enhanced system that mines uncertain data for constrained frequent sets. In: Proceedings of the IDEAS 2010, pp. 32–37. ACM (2010)Google Scholar
- 10.Leung, C.K.-S., Brajczuk, D.A.: Efficient algorithms for the mining of constrained frequent patterns from uncertain data. ACM SIGKDD Explor. 11(2), 123–130 (2009)CrossRefGoogle Scholar
- 11.Leung, C.K.-S., Hao, B., Brajczuk, D.A.: Mining uncertain data for frequent itemsets that satisfy aggregate constraints. In: Proceedings of the ACM SAC 2010, pp. 1034–1038 (2010)Google Scholar
- 12.Madden, S.: From databases to big data. IEEE Internet Comput. 16(3), 4–6 (2012)CrossRefGoogle Scholar
- 13.Leung, C.K.-S., Hayduk, Y.: Mining frequent patterns from uncertain data with MapReduce for Big Data analytics. In: Proceedings of the DASFAA 2013, Part I, pp. 440–455. Springer (2013)Google Scholar
- 14.
- 15.Leung, C.K.-S., Mateo, M.A.F., Brajczuk, D.A.: A tree-based approach for frequent pattern mining from uncertain data. In: Washio, T., et al. (eds.) PAKDD 2008. LNAI, vol. 5012, pp. 653–661 (2008)Google Scholar
- 16.Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. J. Comput. Sci. Technol. 15, 487–499 (1994)Google Scholar
- 17.Lukoianova, T., Rubin, V.: Veracity Roadmap: is big data objective, truthful and credible? Adv. Classif. Res. Online 24(1) (2014). doi: 10.7152/acro.v24i1.14671
- 18.Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proceedings of the ACM SIGMOD 1993, pp. 207–216 (1993)Google Scholar
- 19.Green, T., Tannen, V.: Models for incomplete and probabilistic information. Bull. Tech. Committee Data Eng. 29(1), 17–24 (2006)Google Scholar
- 20.
- 21.Wienhofen, L.W.M., Mathisen, B.M., Roman, D.: Empirical Big Data Research: A Systematic Literature Mapping. http://arxiv.org/pdf/1509.03045.pdf