A Review of Frequent Pattern Mining Algorithms for Uncertain Data

Conference paper
Part of the Lecture Notes in Networks and Systems book series (LNNS, volume 16)

Abstract

Frequent pattern mining plays an essential role in data mining (e.g. finding association rules, correlation, sequences, and episodes). Uncertainty in data is generally caused by factors like data randomness and incompleteness, limitations of measuring equipment, delayed data updates, etc. Some of the important algorithms in finding all the frequent patterns from probabilistic datasets of uncertain data are U-Apriori, UF-growth, CUF-growth, PUF-growth. When users are only interested in some of the frequent patterns rather than all, these interests are expressed in terms of constraints and are pushed into mining process there by reducing search space. Finally big data era has brought challenges as well as tools for the problem of frequent pattern mining of uncertain data. In this paper all the above issues are briefly discussed and summarized.

Keywords

Frequent itemsets Uncertain data Constrained mining Big data 

References

  1. 1.
    Chui, C.-K., Kao, B., Hung, E.: Mining frequent itemsets from uncertain data. In: Proceedings of the PAKDD 2007, pp. 47–58. Springer (2007)Google Scholar
  2. 2.
    Leung, C.K.-S., Tanbeer, S.K.: Fast tree-based mining of frequent itemsets from uncertain data. In: Proceedings of the DASFAA 2012, Part I, pp. 272–287. Springer (2012)Google Scholar
  3. 3.
    Leung, C.K.-S., Tanbeer, S.K.: PUF-tree: a compact tree structure for frequent pattern mining of uncertain data. In: Proceedings of the PAKDD 2013, Part I, pp. 13–25. Springer (2013)Google Scholar
  4. 4.
    Chi, Y., Muntz, R.R., Nijssen, S., Kok, J.N.: Frequent subtree mining- an overview. In: Proceedings of the PAKDD 2013 66(1–2), 161–198 (2004)Google Scholar
  5. 5.
    Cuzzocrea, A., Leung, C.K.-S., MacKinnon, R.K.: Mining constrained frequent itemsets from distributed uncertain data. In: Future Generation Computer Systems. Elsevier (2014) Google Scholar
  6. 6.
    Lakshmanan, L.V.S., Leung, C.K.-S., Ng, R.T.: Efficient dynamic mining of constrained frequent sets. ACM Trans. Database Syst. (TODS) 28(4), 337–389 (2003)CrossRefGoogle Scholar
  7. 7.
    Leung, C.K.-S.: Frequent itemset mining with constraints. In: Encyclopedia of Database Systems, pp. 1179–1183. Springer (2009)Google Scholar
  8. 8.
    Leung, C.K.-S., Brajczuk, D.A.: Mining uncertain data for constrained frequent sets. In: Proceedings of the IDEAS 2009, pp. 109–120. ACM (2009)Google Scholar
  9. 9.
    Leung, C.K.-S., Brajczuk, D.A.: uCFS2: an enhanced system that mines uncertain data for constrained frequent sets. In: Proceedings of the IDEAS 2010, pp. 32–37. ACM (2010)Google Scholar
  10. 10.
    Leung, C.K.-S., Brajczuk, D.A.: Efficient algorithms for the mining of constrained frequent patterns from uncertain data. ACM SIGKDD Explor. 11(2), 123–130 (2009)CrossRefGoogle Scholar
  11. 11.
    Leung, C.K.-S., Hao, B., Brajczuk, D.A.: Mining uncertain data for frequent itemsets that satisfy aggregate constraints. In: Proceedings of the ACM SAC 2010, pp. 1034–1038 (2010)Google Scholar
  12. 12.
    Madden, S.: From databases to big data. IEEE Internet Comput. 16(3), 4–6 (2012)CrossRefGoogle Scholar
  13. 13.
    Leung, C.K.-S., Hayduk, Y.: Mining frequent patterns from uncertain data with MapReduce for Big Data analytics. In: Proceedings of the DASFAA 2013, Part I, pp. 440–455. Springer (2013)Google Scholar
  14. 14.
  15. 15.
    Leung, C.K.-S., Mateo, M.A.F., Brajczuk, D.A.: A tree-based approach for frequent pattern mining from uncertain data. In: Washio, T., et al. (eds.) PAKDD 2008. LNAI, vol. 5012, pp. 653–661 (2008)Google Scholar
  16. 16.
    Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. J. Comput. Sci. Technol. 15, 487–499 (1994)Google Scholar
  17. 17.
    Lukoianova, T., Rubin, V.: Veracity Roadmap: is big data objective, truthful and credible? Adv. Classif. Res. Online 24(1) (2014). doi: 10.7152/acro.v24i1.14671
  18. 18.
    Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proceedings of the ACM SIGMOD 1993, pp. 207–216 (1993)Google Scholar
  19. 19.
    Green, T., Tannen, V.: Models for incomplete and probabilistic information. Bull. Tech. Committee Data Eng. 29(1), 17–24 (2006)Google Scholar
  20. 20.
  21. 21.
    Wienhofen, L.W.M., Mathisen, B.M., Roman, D.: Empirical Big Data Research: A Systematic Literature Mapping. http://arxiv.org/pdf/1509.03045.pdf

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  1. 1.Computer Science and EngineeringRamdeobaba College of Engineering and ManagementNagpurIndia

Personalised recommendations