Abstract
Finding all frequent itemsets (patterns) in a given database is a challenging process that in general consumes time and space. Time is measured in terms of the number of database scans required to produce all frequent itemsets. Space is consumed by the number of potential frequent itemsets which will end up classified as not frequent. To overcome both limitations, namely space and time, we propose a novel approach for generating all possible frequent itemsets by introducing a new representation of items into groups of four items and within each group, items are assigned one of four prime numbers, namely 2, 3, 5, and 7. The reported results demonstrate the applicability and effectiveness of the proposed approach. Our approach satisfies scalability in terms of number of transactions and number of items.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agarwal, R., Aggarwal, C., Prasad, V.: A Tree Projection Algorithm for Generation of Frequent Item Sets. Journal of Parallel and Distributed Computing 61(3), 350–371 (2001)
Agrawal, R., Imieliski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proc. of ACM SIGMOD, pp. 207–216 (1993)
Amir, A., Feldman, R., Kashi, R.: A new and versatile method for association generation. Information Systems 22(6-7), 333–347 (1997)
Bialecki, A., Cafarella, M., Cutting, D., Malley, O.: Hadoop: a framework for running applications on large clusters built of commodity hardware, http://hadoop.apache.org/
Boyd, D., Crawford, K.: Critical Questions for Big Data. Information, Communication and Society 15(5), 662–679 (2012)
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Proc. of ACM SIGMOD, pp. 1–12 (2000)
Kang, U., Chau, D.H., Faloutsos, C.: Big graph mining: Algorithms and discoveries. SIGKDD Explorations 14(2) (2012)
Leung, C.K.-S., Hayduk, Y.: Mining Frequent Patterns from Uncertain Data with MapReduce for Big Data Analytics. In: Meng, W., Feng, L., Bressan, S., Winiwarter, W., Song, W. (eds.) DASFAA 2013, Part I. LNCS, vol. 7825, pp. 440–455. Springer, Heidelberg (2013)
Lin, J., Ryaboy, D.: Scaling big data mining infrastructure: The twitter experience. SIGKDD Explorations 14(2) (2012)
Marz, N., Warren, J.: Big Data: Principles and best practices of scalable realtime data systems. Manning Publications (2013)
Shenoy, P., Haritsa, J.R., Sudarshan, S., Bhalotia, G., Bawa, M., Shah, D.: Turbo-charging vertical mining of large databases. In: ACM SIGMOD (2000)
Zaki, M.J.: Parallel and distributed association mining: a survey. IEEE Concurrency 7(4) (2002)
Tohidi, H., Hamidah, I.: Using Unique-Prime-Factorization Theorem to Mine Frequent Patterns without Generating Tree. American Journal of Economics and Business Administration 3(1), 58–65 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Xylogiannopoulos, K.F., Addam, O., Karampelas, P., Alhajj, R. (2014). Fast Frequent Pattern Detection Using Prime Numbers. In: Corchado, E., Lozano, J.A., Quintián, H., Yin, H. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2014. IDEAL 2014. Lecture Notes in Computer Science, vol 8669. Springer, Cham. https://doi.org/10.1007/978-3-319-10840-7_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-10840-7_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10839-1
Online ISBN: 978-3-319-10840-7
eBook Packages: Computer ScienceComputer Science (R0)