Fast Frequent Pattern Detection Using Prime Numbers

Xylogiannopoulos, Konstantinos F.; Addam, Omar; Karampelas, Panagiotis; Alhajj, Reda

doi:10.1007/978-3-319-10840-7_12

Konstantinos F. Xylogiannopoulos¹⁸,
Omar Addam¹⁸,
Panagiotis Karampelas^19,21 &
…
Reda Alhajj^18,20

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8669))

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning

1559 Accesses

Abstract

Finding all frequent itemsets (patterns) in a given database is a challenging process that in general consumes time and space. Time is measured in terms of the number of database scans required to produce all frequent itemsets. Space is consumed by the number of potential frequent itemsets which will end up classified as not frequent. To overcome both limitations, namely space and time, we propose a novel approach for generating all possible frequent itemsets by introducing a new representation of items into groups of four items and within each group, items are assigned one of four prime numbers, namely 2, 3, 5, and 7. The reported results demonstrate the applicability and effectiveness of the proposed approach. Our approach satisfies scalability in terms of number of transactions and number of items.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agarwal, R., Aggarwal, C., Prasad, V.: A Tree Projection Algorithm for Generation of Frequent Item Sets. Journal of Parallel and Distributed Computing 61(3), 350–371 (2001)
Article MATH Google Scholar
Agrawal, R., Imieliski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proc. of ACM SIGMOD, pp. 207–216 (1993)
Google Scholar
Amir, A., Feldman, R., Kashi, R.: A new and versatile method for association generation. Information Systems 22(6-7), 333–347 (1997)
Article MATH Google Scholar
Bialecki, A., Cafarella, M., Cutting, D., Malley, O.: Hadoop: a framework for running applications on large clusters built of commodity hardware, http://hadoop.apache.org/
Boyd, D., Crawford, K.: Critical Questions for Big Data. Information, Communication and Society 15(5), 662–679 (2012)
Article Google Scholar
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Proc. of ACM SIGMOD, pp. 1–12 (2000)
Google Scholar
Kang, U., Chau, D.H., Faloutsos, C.: Big graph mining: Algorithms and discoveries. SIGKDD Explorations 14(2) (2012)
Google Scholar
Leung, C.K.-S., Hayduk, Y.: Mining Frequent Patterns from Uncertain Data with MapReduce for Big Data Analytics. In: Meng, W., Feng, L., Bressan, S., Winiwarter, W., Song, W. (eds.) DASFAA 2013, Part I. LNCS, vol. 7825, pp. 440–455. Springer, Heidelberg (2013)
Chapter Google Scholar
Lin, J., Ryaboy, D.: Scaling big data mining infrastructure: The twitter experience. SIGKDD Explorations 14(2) (2012)
Google Scholar
Marz, N., Warren, J.: Big Data: Principles and best practices of scalable realtime data systems. Manning Publications (2013)
Google Scholar
Shenoy, P., Haritsa, J.R., Sudarshan, S., Bhalotia, G., Bawa, M., Shah, D.: Turbo-charging vertical mining of large databases. In: ACM SIGMOD (2000)
Google Scholar
Zaki, M.J.: Parallel and distributed association mining: a survey. IEEE Concurrency 7(4) (2002)
Google Scholar
Tohidi, H., Hamidah, I.: Using Unique-Prime-Factorization Theorem to Mine Frequent Patterns without Generating Tree. American Journal of Economics and Business Administration 3(1), 58–65 (2011)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Science, University of Calgary, Calgary, Alberta, Canada
Konstantinos F. Xylogiannopoulos, Omar Addam & Reda Alhajj
Dept. of Information Technology, Hellenic American University, Manchester, NH, USA
Panagiotis Karampelas
Dept. of Computer Science, Global University, Beirut, Lebanon
Reda Alhajj
Dept. of Informatics & Computers, Hellenic Air Force Academy, Attica, Greece
Panagiotis Karampelas

Authors

Konstantinos F. Xylogiannopoulos
View author publications
You can also search for this author in PubMed Google Scholar
Omar Addam
View author publications
You can also search for this author in PubMed Google Scholar
Panagiotis Karampelas
View author publications
You can also search for this author in PubMed Google Scholar
Reda Alhajj
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Salamanca, Plaza de la Merced S/N, 37008, Salamanca, Spain
Emilio Corchado & Héctor Quintián &
University of the Basque Country, Pasco Manuel de Lardizábal 1, 20018, San Sebastián, Spain
José A. Lozano
The University of Manchester, Sackville Street, M13 9PL, Manchester, UK
Hujun Yin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xylogiannopoulos, K.F., Addam, O., Karampelas, P., Alhajj, R. (2014). Fast Frequent Pattern Detection Using Prime Numbers. In: Corchado, E., Lozano, J.A., Quintián, H., Yin, H. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2014. IDEAL 2014. Lecture Notes in Computer Science, vol 8669. Springer, Cham. https://doi.org/10.1007/978-3-319-10840-7_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-10840-7_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10839-1
Online ISBN: 978-3-319-10840-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics