The Journal of Supercomputing

, Volume 71, Issue 4, pp 1318–1344 | Cite as

GPU-based bees swarm optimization for association rules mining

  • Youcef Djenouri
  • Ahcene Bendjoudi
  • Malika Mehdi
  • Nadia Nouali-Taboudjemat
  • Zineb Habbas
Article

Abstract

Association rules mining (ARM) is a well-known combinatorial optimization problem aiming at extracting relevant rules from given large-scale datasets. According to the state of the art, the bio-inspired methods proved their efficiency by generating acceptable solutions in a reasonable time when dealing with small and medium size instances. Unfortunately, to cope with large instances such as the webdocs benchmark, these methods require more and more powerful processors and are time expensive. Nowadays, computing power is no longer a real issue. It can be provided by the power of emerging technologies such as graphics processing units (GPUs) that are massively multi-threaded processors. In this paper, we investigate the use of GPUs to speed up the computation. We propose two GPU-based bees swarm algorithms for association rules mining (single evaluation in GPU, SE-GPU and multiple evaluation in GPU, ME-GPU). SE-GPU aims at evaluating one rule at a time where each thread is associated with one transaction, whereas ME-GPU evaluates multiple rules in parallel on GPU where each thread is associated with several transactions. To validate our approaches, the two algorithms have been executed to solve well-known large ARM instances. Real experiments have been carried out on an Intel Xeon 64 bit quad-core processor E5520 coupled to an Nvidia Tesla C2075 GPU device. The results show that our approaches improve the execution time up to 100\(\times \) over the sequential mono-core bees swarm optimization-ARM algorithm. Moreover, the proposed approaches have been compared with CPU multi-core ones (1–8 cores). The results show that they are faster than the multi-core versions whatever is the number of used cores.

Keywords

Bees swarm optimization (BSO) Association rule mining (ARM) Massively parallel algorithms GPU computing 

References

  1. 1.
    Adil SH, Qamar S (2009) Implementation of association rule mining using CUDA. In: International conference onemerging technologies (ICET 2009). IEEE, New YorkGoogle Scholar
  2. 2.
    Agrawal R, Shafer JC (1996) Parallel mining of association rules. IEEE Trans Knowl Data Eng 8(6):962–969CrossRefGoogle Scholar
  3. 3.
    Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. ACM SIGMOD Rec 22(2) (ACM)Google Scholar
  4. 4.
    Arabnia HR, Oliver MA (1989) A transputer network for fast operations on digitised images. Int J Eurograph Assoc (Comput Graph Forum) 8(1):3–12CrossRefGoogle Scholar
  5. 5.
    Bendjoudi A, Chekini M, Gharbi M, Mehdi M, Benatchba K, Sitayeb-Benbouzid F, Melab N (2013) Parallel B&B algorithm on hybrid multicore/GPU architecture. In: IEEE international conference on high performance computing and communications, vol 15. IEEE, New YorkGoogle Scholar
  6. 6.
    Boley D et al (1999) Partitioning-based clustering for web document categorization. Decis Support Syst 27(3):329–341CrossRefGoogle Scholar
  7. 7.
    Brin S et al (1997) Dynamic itemset counting and implication rules for market basket data. ACM SIGMOD Rec 26(2) (ACM)Google Scholar
  8. 8.
    Cano A, Luna JM, Ventura S (2013) High performance evaluation of evolutionary-mined association rules on GPUs. J Supercomput 1–24Google Scholar
  9. 9.
    Cecilia JM, García JM, Nisbet A, Amos M, Ujaldón M (2013) Enhancing data parallelism for ant colony optimization on gpus. J Parallel Distrib Comput 73(1):42–51CrossRefGoogle Scholar
  10. 10.
    Cui Q, Xiaobo G (2013) Research on parallel association rules mining on GPU. In: Proceedings of the 2nd international conference on green communications and networks 2012 (GCN 2012), vol 2. Springer, BerlinGoogle Scholar
  11. 11.
    Djenouri Y, Drias H, Habbas Z (2014) Bees swarm optimisation using multiple strategies for association rule mining. Int J Bio-Inspir Comput 6(4):239–249CrossRefGoogle Scholar
  12. 12.
    Fang W et al (2009) Frequent itemset mining on graphics processors. In: Proceedings of the fifth international workshop on data management on new hardware. ACM, New YorkGoogle Scholar
  13. 13.
    Gada FV, Yadav KR, Shah RD (2013) Data mining in medical application for better detection of disease. IJCER 2(2):173–176Google Scholar
  14. 14.
    Garg R, Mishra PK (2011) Exploiting parallelism in association rule mining algorithms. Int J Adv Technol 2.2:222–232Google Scholar
  15. 15.
    Goethals B, Zaki MJ (2003) Frequent itemset mining implementations repository. This site contains a wide-variety of algorithms for mining frequent, closed, and maximal itemsets. http://fimi.cs.helsinki.fi
  16. 16.
    Guvenir HA, Uysal I (2000) Bilkent university function approximation repository. http://funapp.cs.bilkent.edu.tr/DataSets. Accessed 12 March 2012
  17. 17.
    Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. ACM SIGMOD Rec 29(2) (ACM)Google Scholar
  18. 18.
    Han J, Kamber M, Pei J (2006) Data mining: concepts and techniques. Morgan Kaufmann, BurlingtonGoogle Scholar
  19. 19.
    Kacprzyk J, Zadrozny S (2013) Derivation of linguistic summaries is inherently difficult: can association rule mining help? In: Towards advanced data analysis by combining soft computing and statistics, pp 291–303. Springer, BerlinGoogle Scholar
  20. 20.
    Khademolghorani F, Baraani A, Zamanifar K (2014) Efficient mining of association rules based on gravitational search algorithm. Int J Comput Sci 8Google Scholar
  21. 21.
    Khabzaoui M, Dhaenens C, Talbi E-G (2005) Parallel genetic algorithms for multi-objective rule mining. In: The 6th MIC 2005Google Scholar
  22. 22.
    Kozawa Y, Amagasa T, Kitagawa H (2014) Probabilistic frequent itemset mining on a GPU cluster. IEICE Trans Inf Syst 97(4):779–789CrossRefGoogle Scholar
  23. 23.
    Kuo RJ, Chao CM, Chiu YT (2011) Application of particle swarm optimization to association rule mining. Appl Soft Comput 11(1):326–336CrossRefGoogle Scholar
  24. 24.
    Li H, Zhang N (2010) Mining maximal frequent itemsets on graphics processors. In: 2010 seventh international conference on fuzzy systems and knowledge discovery (FSKD), vol 3. IEEE, New YorkGoogle Scholar
  25. 25.
    Lin KW, Deng DJ (2010) A novel parallel algorithm for frequent pattern mining with privacy preserved in cloud computing environments. Int J Ad Hoc Ubiquitous Comput 6(4):205–215CrossRefGoogle Scholar
  26. 26.
    Liu K, Hogan WR, Crowley RS (2011) Natural language processing methods and systems for biomedical ontology learning. J Biomed Inform 44(1):163–179CrossRefGoogle Scholar
  27. 27.
    Liu Z, Chen S, Cai J, Zhang E, Lan L, Zheng J, Du J (2013) Traditional Chinese medicine syndrome-related herbal prescriptions in treatment of malignant tumors. J Tradit Chin Med 33(1):19–26CrossRefGoogle Scholar
  28. 28.
    Luke S (2011) Essentials of meta-heuristics, pp 11–52Google Scholar
  29. 29.
    Luper D, Cameron D, Miller JA, Arabnia HR (2007) Spatial and temporal target association through semantic analysis and GPS data mining. In: International conference on information and knowledge engineering (IKE’07), USA, pp 251–257 (ISBN 1-60132-050-7)Google Scholar
  30. 30.
    Martínez-Ballesteros M, Nepomuceno-Chamorro IA, Riquelme JC (2013) Discovering gene association networks by multi-objective evolutionary quantitative association rules. J Comput Syst SciGoogle Scholar
  31. 31.
    Mata J, José-Luis A, Riquelme J-C (2002) An evolutionary algorithm to discover numeric association rules. In: Proceedings of the 2002 ACM symposium on applied computing. ACM, New YorkGoogle Scholar
  32. 32.
    Melab N, Talbi E-G (2001) A parallel genetic algorithm for rule mining. In: Proceedings of the 15th international parallel and distributed processing symposium. IEEE Computer Society, LondonGoogle Scholar
  33. 33.
    Melab N, Chakroun I, Bendjoudi A (2013) Graphics processing unit-accelerated bounding for branch-and-bound applied to a permutation problem using data access optimization. Concurr Comput Pract ExpGoogle Scholar
  34. 34.
    Moslehi P et al (2014) Multi-objective numeric association rules mining via ant colony optimization for continuous domains without specifying minimum support and minimum confidence. Int J Comput Sci 8Google Scholar
  35. 35.
    Mishra BSP et al (2011) Parallel multi-objective genetic algorithms for associative classification rule mining. In: Proceedings of the 2011 international conference on communication, computing and security. ACM, New YorkGoogle Scholar
  36. 36.
    Olafsson S, Li X, Wu S (2008) Operations research and data mining. Eur J Oper Res 187(3):1429–1448CrossRefMATHMathSciNetGoogle Scholar
  37. 37.
    Orlando S et al (2002) Adaptive and resource-aware mining of frequent sets. In: Proceedings of the 2002 IEEE international conference on data mining, ICDM 2003. IEEE, New YorkGoogle Scholar
  38. 38.
    Park JS, Chen M-S, Yu PS (1995) An effective hash-based algorithm for mining association rules, vol 24(2). ACM, New YorkGoogle Scholar
  39. 39.
    Park JS, Chen M-S, Yu PS (1995) Efficient parallel data mining for association rules. In: Proceedings of the fourth international conference on information and knowledge management. ACM, New YorkGoogle Scholar
  40. 40.
    Parthasarathy S et al (2001) Parallel data mining for association rules on shared-memory systems. Knowl Inf Syst 3.1:1–29Google Scholar
  41. 41.
    Rahbarinia B, Pedram MM, Arabnia HR, Alavi Z (2010) A multi-objective scheme to hide sequential patterns. In: The proceedings of the 2010 international conference on computer and automation engineering, ICCAE, Singapore, 26–28 February 2010, pp 153–158 (E-ISBN 978-1-4244-5586-7, doi:10.1109/ICCAE.2010.5451977)
  42. 42.
    Romero C et al (2012) Association rule mining using genetic programming to provide feedback to instructors from multiple-choice quiz data. Expert SystGoogle Scholar
  43. 43.
    Silvestri C, Orlando S (2012) gpuDCI: exploiting GPUs in frequent itemset mining. In: 20th euromicro international conference on parallel, distributed and network-based processing (PDP). IEEE, New YorkGoogle Scholar
  44. 44.
    Sun J, Pan W (2009) Parallel association rule mining for image data. In: International conference on photonics and image in agriculture engineering (PIAGENG 2009). International Society for Optics and Photonics, LondonGoogle Scholar
  45. 45.
    Talia D, Trunfio P, Verta O (2005) Weka4ws: a wsrf-enabled weka toolkit for distributed data mining on grids. In: Knowledge discovery in databases: PKDD 2005, pp 309–320. Springer, BerlinGoogle Scholar
  46. 46.
    Wang ZG, Wang CS (2012) A parallel association-rule mining algorithm. In: Web information systems and mining, pp 125–129. Springer, BerlinGoogle Scholar
  47. 47.
    Weiss RM (2010) GPU-accelerated data mining with swarm intelligence. Doctoral dissertation, Honors thesis. Department of Computer Science, Macalester College. http://metislogic.net/thesis.pdf
  48. 48.
    Yan X, Zhang C, Zhang S (2009) Genetic algorithm-based strategy for identifying association rules without specifying actual minimum support. Expert Syst Appl 36(2):3066–3076CrossRefGoogle Scholar
  49. 49.
    Yang J, Yang Y (2010) A parallel algorithm for mining association rules. In: 2nd international conference on networking and digital society (ICNDS), vol 1. IEEE, New YorkGoogle Scholar
  50. 50.
    Zaïane OR, El-Hajj M, Lu P (2001) Fast parallel association rule mining without candidacy generation. In: Proceedings of the IEEE international conference on data mining, ICDM 2001. IEEE, New YorkGoogle Scholar
  51. 51.
    Zhang F, Zhang Y, Bakos J (2011) GPApriori: GPU-accelerated frequent itemset mining. In: IEEE international conference on cluster computing (CLUSTER). IEEE, New YorkGoogle Scholar
  52. 52.
    Zhou J, Yu K-M, Wu B-C (2010) Parallel frequent patterns mining algorithm on GPU. In: IEEE international conference on systems man and cybernetics (SMC). IEEE, New YorkGoogle Scholar
  53. 53.
    Zhou Y, Tan Y (2009) GPU-based parallel particle swarm optimization. In: IEEE congress on evolutionary computation, 2009, CEC’09, pp 1493–1500. IEEE, New YorkGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2015

Authors and Affiliations

  • Youcef Djenouri
    • 1
  • Ahcene Bendjoudi
    • 1
  • Malika Mehdi
    • 2
  • Nadia Nouali-Taboudjemat
    • 1
  • Zineb Habbas
    • 3
  1. 1.DTISICERIST Research CenterAlgiersAlgeria
  2. 2.LSIUSTHBAlgiersAlgeria
  3. 3.LCOMSUniversity of Lorraine Ile du SaulcyMetz CedexFrance

Personalised recommendations