Abstract
Various large-scale data have been generated in a variety of application fields, since the Internet began to be widely used. Accordingly, researchers have developed various data mining methods for pervasive human-centric computing to deal with the data and discover interesting knowledge. Frequent pattern mining is one of the main issues in data mining, which finds meaningful pattern information from databases. In this area, not only precise data but also uncertain data can be generated depending on environments of data generation. Since the concept of uncertain frequent pattern mining was proposed to overcome the limitations of traditional approaches that cannot deal with uncertain data with existential probabilities of items, several relevant methods have been developed. In this paper, we introduce and analyze state-of-the-art methods based on tree structures, and propose a new uncertain frequent pattern mining approach. We also compare algorithm performance and discuss characteristics of them.
Similar content being viewed by others
References
Aggarwal CC, Li Y, Wang J, Wang J (2009) Frequent pattern mining with uncertain data. In: 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp 29–37
Agrawal R, Srikant R (1994) Fast Algorithms for mining association rules. In: 20th International Conference on Very Large Data Bases, pp 487–499
Aykroyd RG, Barber S, Miller LR (2016) Classification of multiple time signals using localized frequency characteristics applied to industrial process monitoring. Comput Stat Data Anal 94:351–362
Cai J, Zhao X, Xun Y (2013) Association rule mining method based on weighted frequent pattern tree in mobile computing environment. Int J Wirel Mob Comput 6(2):193–199
Cheng R, Kalashnikov DV, Prabhakar S (2004) Querying imprecise data in moving object environments. IEEE Trans Knowl Data Eng 16(9):1112–1127
Chui C, Kao B, Hung E (2007) Mining frequent itemsets from uncertain data. In: 11th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, pp 47–58
Duong T, Tran D (2015) A fusion of data mining techniques for predicting movement of mobile users. J Commun Netw 17(6):568–581
Fang G, Deng Z, Ma H (2009) Network traffic monitoring based on mining frequent patterns. Fuzzy Syst Knowl Discov 7:571–575
Han J, Pei J, Yin T, Mao R (2004) Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Min Knowl Discov 8(1):53–87
Lee G, Yun U, Ryang H (2015) An uncertainty-based approach: frequent itemset mining from uncertain data with different item importance. Knowl Based Syst 90:239–256
Lee G, Yun U, Ryang H, Kim D (2015) Multiple minimum support-based rare graph pattern mining considering symmetry feature-based growth technique and the differing importance of graph elements. Symmetry 7(3):1151–1163
Lee G, Yun U, Ryang H (2015) Mining weighted erasable patterns by using underestimated constraint-based pruning technique. J Intell Fuzzy Syst 28(3):1145–1157
Lee G, Yun U, Ryu K (2016) Mining frequent weighted itemsets without storing transaction ids and generating candidates. Int J Uncertain Fuzziness Knowl Based Syst (In press)
Lee G, Yun U, Ryang H, Kim D (2016) Approximate maximal frequent pattern mining with weight conditions and error tolerance. Int J Pattern Recogn Artif Intell (In press)
Leung CK, Mateo MAF, Brajczuk DA (2008) A tree-based approach for frequent pattern mining from uncertain data. In: 12th Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp 653–661
Leung CK, Carmichael CL, Hao B (2007) Efficient mining of frequent patterns from uncertain data. In: International Conference on Data Mining Workshops, pp 489–494
Lin C, Hong T (2012) A new mining approach for uncertain databases using CUFP trees. Expert Syst Appl 39(4):4084–4093
Luo C, Chung SM (2012) Parallel mining of maximal sequential patterns using multiple samples. J Supercomput 59(2):852–881
Ryang H, Yun U, Ryu K (2016) Fast algorithm for high utility pattern mining with the sum of item quantities. Intell Data Anal 20(2):395–415
Ryang H, Yun U (2015) Top-k high utility pattern mining with effective threshold raising strategies. Knowl Based Syst 76:109–126
Sallaberry A, Pecheur N, Bringay S, Roche M, Teisseire M (2011) Sequential patterns mining and gene sequence visualization to discover novelty from microarray data. J Biomed Inf 44:760–774
Su MY, Yu GJ, Lin CY (2009) A real-time network intrusion detection system for large-scale attacks based on an incremental mining approach. Comput Secur 28(5):301–309
Sun X, Lim L, Wang S (2012) An approximation algorithm of mining frequent itemsets from uncertain dataset. Int J Adv Comput Technol 4(3):42–49
Yun U, Pyun G, Yoon E (2015) Efficient mining of robust closed weighted sequential patterns without information loss. Int J Artif Intell Tools 24(1):1550007:1–1550007:28
Yun U, Kim J (2015) A fast perturbation algorithm using tree structure for privacy preserving utility mining. Expert Syst Appl 42(3):1149–1165
Yun U, Ryang H (2015) Incremental high utility pattern mining with static and dynamic databases. Appl Intell 42(2):323–352
Yun U, Lee G (2016) Incremental mining of weighted maximal frequent itemsets from dynamic databases. Expert Syst Appl (In press)
Wang L, Feng L, Wu M (2013) AT-mine: an efficient algorithm of frequent itemset mining on uncertain dataset. J Comput 8(6):1417–1426
Wang L, Cheung DW, Cheng R, Lee S, Yang X (2012) Efficient mining of frequent itemsets on large uncertain databases. IEEE Trans Knowl Data Eng 24(12):2170–2183
Xu K, Zou K, Huang Y, Yu X, Zhang X (2016) Mining community and inferring friendship in mobile social networks. Neurocomputing 174:605–616
Yiu M, Mamoulis N, Dai X, Tao Y, Vaitis M (2009) Efficient evaluation of probabilistic advanced spatial queries on existentially uncertain data. IEEE Trans Knowl Data Eng 21(1):108–122
Zhang Y, Cheng R, Chen J (2010) Evaluating continuous probabilistic queries over imprecise sensor data. In: 15th International Conference on Database Systems for Advanced Applications, pp 535–549
Zhao W, Chellappa R, Phillips PJ, Rosenfeld A (2003) Face recognition: a literature survey. ACM Comput Surv 35:399–458
Zhao Z, Yan D, Ng W (2014) Mining probabilistically frequent sequential patterns in large uncertain databases. IEEE Trans Knowl Data Eng 26(5):1171–1184
Zhang F, Zhang Y, Bakos JD (2013) Accelerating frequent itemset mining on graphics processing units. J Supercomput 66(1):94–117
Acknowledgments
This research was supported by the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (NRF No. 20152062051 and NRF No. 20155054624).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lee, G., Yun, U. & Lee, KM. Analysis of tree-based uncertain frequent pattern mining techniques without pattern losses. J Supercomput 72, 4296–4318 (2016). https://doi.org/10.1007/s11227-016-1847-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-016-1847-z