Skip to main content

Advertisement

Log in

Analysis of tree-based uncertain frequent pattern mining techniques without pattern losses

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Various large-scale data have been generated in a variety of application fields, since the Internet began to be widely used. Accordingly, researchers have developed various data mining methods for pervasive human-centric computing to deal with the data and discover interesting knowledge. Frequent pattern mining is one of the main issues in data mining, which finds meaningful pattern information from databases. In this area, not only precise data but also uncertain data can be generated depending on environments of data generation. Since the concept of uncertain frequent pattern mining was proposed to overcome the limitations of traditional approaches that cannot deal with uncertain data with existential probabilities of items, several relevant methods have been developed. In this paper, we introduce and analyze state-of-the-art methods based on tree structures, and propose a new uncertain frequent pattern mining approach. We also compare algorithm performance and discuss characteristics of them.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  1. Aggarwal CC, Li Y, Wang J, Wang J (2009) Frequent pattern mining with uncertain data. In: 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp 29–37

  2. Agrawal R, Srikant R (1994) Fast Algorithms for mining association rules. In: 20th International Conference on Very Large Data Bases, pp 487–499

  3. Aykroyd RG, Barber S, Miller LR (2016) Classification of multiple time signals using localized frequency characteristics applied to industrial process monitoring. Comput Stat Data Anal 94:351–362

    Article  MathSciNet  Google Scholar 

  4. Cai J, Zhao X, Xun Y (2013) Association rule mining method based on weighted frequent pattern tree in mobile computing environment. Int J Wirel Mob Comput 6(2):193–199

    Article  Google Scholar 

  5. Cheng R, Kalashnikov DV, Prabhakar S (2004) Querying imprecise data in moving object environments. IEEE Trans Knowl Data Eng 16(9):1112–1127

    Article  Google Scholar 

  6. Chui C, Kao B, Hung E (2007) Mining frequent itemsets from uncertain data. In: 11th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, pp 47–58

  7. Duong T, Tran D (2015) A fusion of data mining techniques for predicting movement of mobile users. J Commun Netw 17(6):568–581

    Article  Google Scholar 

  8. Fang G, Deng Z, Ma H (2009) Network traffic monitoring based on mining frequent patterns. Fuzzy Syst Knowl Discov 7:571–575

    Google Scholar 

  9. Han J, Pei J, Yin T, Mao R (2004) Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Min Knowl Discov 8(1):53–87

    Article  MathSciNet  Google Scholar 

  10. Lee G, Yun U, Ryang H (2015) An uncertainty-based approach: frequent itemset mining from uncertain data with different item importance. Knowl Based Syst 90:239–256

    Article  Google Scholar 

  11. Lee G, Yun U, Ryang H, Kim D (2015) Multiple minimum support-based rare graph pattern mining considering symmetry feature-based growth technique and the differing importance of graph elements. Symmetry 7(3):1151–1163

    Article  MathSciNet  Google Scholar 

  12. Lee G, Yun U, Ryang H (2015) Mining weighted erasable patterns by using underestimated constraint-based pruning technique. J Intell Fuzzy Syst 28(3):1145–1157

    MathSciNet  Google Scholar 

  13. Lee G, Yun U, Ryu K (2016) Mining frequent weighted itemsets without storing transaction ids and generating candidates. Int J Uncertain Fuzziness Knowl Based Syst (In press)

  14. Lee G, Yun U, Ryang H, Kim D (2016) Approximate maximal frequent pattern mining with weight conditions and error tolerance. Int J Pattern Recogn Artif Intell (In press)

  15. Leung CK, Mateo MAF, Brajczuk DA (2008) A tree-based approach for frequent pattern mining from uncertain data. In: 12th Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp 653–661

  16. Leung CK, Carmichael CL, Hao B (2007) Efficient mining of frequent patterns from uncertain data. In: International Conference on Data Mining Workshops, pp 489–494

  17. Lin C, Hong T (2012) A new mining approach for uncertain databases using CUFP trees. Expert Syst Appl 39(4):4084–4093

    Article  MathSciNet  Google Scholar 

  18. Luo C, Chung SM (2012) Parallel mining of maximal sequential patterns using multiple samples. J Supercomput 59(2):852–881

    Article  Google Scholar 

  19. Ryang H, Yun U, Ryu K (2016) Fast algorithm for high utility pattern mining with the sum of item quantities. Intell Data Anal 20(2):395–415

    Article  Google Scholar 

  20. Ryang H, Yun U (2015) Top-k high utility pattern mining with effective threshold raising strategies. Knowl Based Syst 76:109–126

    Article  Google Scholar 

  21. Sallaberry A, Pecheur N, Bringay S, Roche M, Teisseire M (2011) Sequential patterns mining and gene sequence visualization to discover novelty from microarray data. J Biomed Inf 44:760–774

    Article  Google Scholar 

  22. Su MY, Yu GJ, Lin CY (2009) A real-time network intrusion detection system for large-scale attacks based on an incremental mining approach. Comput Secur 28(5):301–309

    Article  Google Scholar 

  23. Sun X, Lim L, Wang S (2012) An approximation algorithm of mining frequent itemsets from uncertain dataset. Int J Adv Comput Technol 4(3):42–49

    Google Scholar 

  24. Yun U, Pyun G, Yoon E (2015) Efficient mining of robust closed weighted sequential patterns without information loss. Int J Artif Intell Tools 24(1):1550007:1–1550007:28

    Article  Google Scholar 

  25. Yun U, Kim J (2015) A fast perturbation algorithm using tree structure for privacy preserving utility mining. Expert Syst Appl 42(3):1149–1165

    Article  MathSciNet  Google Scholar 

  26. Yun U, Ryang H (2015) Incremental high utility pattern mining with static and dynamic databases. Appl Intell 42(2):323–352

    Article  Google Scholar 

  27. Yun U, Lee G (2016) Incremental mining of weighted maximal frequent itemsets from dynamic databases. Expert Syst Appl (In press)

  28. Wang L, Feng L, Wu M (2013) AT-mine: an efficient algorithm of frequent itemset mining on uncertain dataset. J Comput 8(6):1417–1426

    Article  Google Scholar 

  29. Wang L, Cheung DW, Cheng R, Lee S, Yang X (2012) Efficient mining of frequent itemsets on large uncertain databases. IEEE Trans Knowl Data Eng 24(12):2170–2183

    Article  Google Scholar 

  30. Xu K, Zou K, Huang Y, Yu X, Zhang X (2016) Mining community and inferring friendship in mobile social networks. Neurocomputing 174:605–616

    Article  Google Scholar 

  31. Yiu M, Mamoulis N, Dai X, Tao Y, Vaitis M (2009) Efficient evaluation of probabilistic advanced spatial queries on existentially uncertain data. IEEE Trans Knowl Data Eng 21(1):108–122

    Article  Google Scholar 

  32. Zhang Y, Cheng R, Chen J (2010) Evaluating continuous probabilistic queries over imprecise sensor data. In: 15th International Conference on Database Systems for Advanced Applications, pp 535–549

  33. Zhao W, Chellappa R, Phillips PJ, Rosenfeld A (2003) Face recognition: a literature survey. ACM Comput Surv 35:399–458

    Article  Google Scholar 

  34. Zhao Z, Yan D, Ng W (2014) Mining probabilistically frequent sequential patterns in large uncertain databases. IEEE Trans Knowl Data Eng 26(5):1171–1184

    Article  Google Scholar 

  35. Zhang F, Zhang Y, Bakos JD (2013) Accelerating frequent itemset mining on graphics processing units. J Supercomput 66(1):94–117

    Article  Google Scholar 

Download references

Acknowledgments

This research was supported by the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (NRF No. 20152062051 and NRF No. 20155054624).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Unil Yun.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lee, G., Yun, U. & Lee, KM. Analysis of tree-based uncertain frequent pattern mining techniques without pattern losses. J Supercomput 72, 4296–4318 (2016). https://doi.org/10.1007/s11227-016-1847-z

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-016-1847-z

Keywords

Navigation