Advertisement

Metaheuristics for Frequent and High-Utility Itemset Mining

  • Youcef DjenouriEmail author
  • Philippe Fournier-Viger
  • Asma Belhadi
  • Jerry Chun-Wei Lin
Chapter
Part of the Studies in Big Data book series (SBD, volume 51)

Abstract

Metaheuristics are often used to solve combinatorial problems. They can be viewed as general purpose problem-solving approaches based on stochastic methods, which explore very large search spaces to find near-optimal solutions in a reasonable time. Some metaheuristics are inspired by biological and physical phenomenons. During the last two decades, two population-based methods named evolutionary algorithms and swarm intelligence have shown high efficiency compared to many other metaheuristics. Frequent Itemset Mining (FIM) and High Utility Itemset Mining (HUIM) are the process of extracting useful frequent and high utility itemsets from a given transactional database. Solving FIM and HUIM problems can be very time consuming, especially when dealing with large-scale data. To deal with this issue, different metaheuristic-based methods were developed. In this chapter, we study the application of metaheuristics to FIM and HUIM. Several metaheuristics have been presented, based on evolutionary or swarm intelligence algorithms, such as genetic algorithms, particle swarm optimization, ant colony optimization and bee swarm optimization.

References

  1. 1.
    Djenouri, Y., Belhadi, A., Belkebir, R.: Bees swarm optimization guided by data mining techniques for document information retrieval. Expert. Syst. Appl. 94, 126–136 (2018)Google Scholar
  2. 2.
    Djenouri, Y., Belhadi, A., Fournier-Viger, P.: Extracting useful knowledge from event logs: a frequent itemset mining approach. Knowl.-Based Syst. 139, 132–148 (2018)Google Scholar
  3. 3.
    Djenouri, Y., Habbas, Z., Djenouri, D., Fournier-Viger, P.: Bee swarm optimization for solving the MAXSAT problem using prior knowledge. Soft Comput. 1–18 (2017)Google Scholar
  4. 4.
    Djenouri, Y., Habbas, Z., Djenouri, D.: Data mining-based decomposition for solving the MAXSAT problem: toward a new approach. IEEE Intell. Syst. 32(4), 48–58 (2017)Google Scholar
  5. 5.
    Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: ACM SIGMOD Record, vol. 22, No. 2, pp. 207–216. ACM (1993)Google Scholar
  6. 6.
    Djenouri, Y., Comuzzi, M., Djenouri, D.: SS-FIM: single scan for frequent itemsets mining in transactional databases. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 644–654. Springer, Cham (2017)Google Scholar
  7. 7.
    Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: ACM SIGMOD Record, vol. 29, No. 2, pp. 1–12. ACM (2000)Google Scholar
  8. 8.
    Djenouri, Y., Belhadi, A., Fournier-Viger, P., Lin, J. C. W.: An hybrid multi-core/gpu-based mimetic algorithm for big association rule mining. In: International Conference on Genetic and Evolutionary Computing, pp. 59–65. Springer, Singapore (2017)Google Scholar
  9. 9.
    Djenouri, Y., Habbas, Z., Djenouri, D., Comuzzi, M.: Diversification heuristics in bees swarm optimization for association rules mining. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 68–78. Springer, Cham (2017)Google Scholar
  10. 10.
    Djenouri, Y., Comuzzi, M.: Combining Apriori euristic and bio-inspired algorithms for solving the frequent itemsets mining problem. Inf. Sci. 420, 1–15 (2017)Google Scholar
  11. 11.
    Gheraibia, Y., Moussaoui, A., Djenouri, Y., Kabir, S., Yin, P.Y.: Penguins search optimisation algorithm for association rules mining. J. Comput. Inf. Technol. 24(2), 165–179 (2016)Google Scholar
  12. 12.
    Brin, S., Motwani, R., Ullman, J.D., Tsur, S.: Dynamic itemset counting and implication rules for market basket data. In: ACM SIGMOD Record, vol. 26, No. 2, pp. 255–264. ACM (1997)Google Scholar
  13. 13.
    Mueller, A.: Fast sequential and parallel algorithms for association rule mining: A comparison (1998)Google Scholar
  14. 14.
    Zaki, M.J., Parthasarathy, S., Ogihara, M., Li, W.: New algorithms for fast discovery of association rules. In: International Conference on Knowledge Discovery and Data Mining, vol. 97, pp. 283–286. ACM (1997)Google Scholar
  15. 15.
    Amphawan, K., Lenca, P., Surarerks, A.: Efficient mining top-k regular-frequent itemset using compressed tidsets. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 124–135. Springer, Heidelberg (2011)Google Scholar
  16. 16.
    Cerf, L., Besson, J., Robardet, C., Boulicaut, J. F.: Closed patterns meet n-ary relations. ACM Trans. Knowl. Discov. Data 3(1) (2009). Article 3Google Scholar
  17. 17.
    Leung, C.K.S., Mateo, M.A.F., Brajczuk, D.A.: A tree-based approach for frequent pattern mining from uncertain data. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 653–661. Springer, Heidelberg (2008)Google Scholar
  18. 18.
    Grahne, G., Zhu, J.: Fast algorithms for frequent itemset mining using fp-trees. IEEE Trans. Knowl. Data Eng. 17(10), 1347–1362 (2005)Google Scholar
  19. 19.
    Fournier-Viger, P., Lin, J. C.-W., Vo, B, Chi, T.T., Zhang, J., Le, H.B.: A survey of itemset mining. WIREs Data Min. Knowl. Discov. e1207 (2017).  https://doi.org/10.1002/widm.1207.. WileyGoogle Scholar
  20. 20.
    Hong, T.P., Lin, C.W., Wu, Y.L.: Incrementally fast updated frequent pattern trees. Expert. Syst. Appl. 34(4), 2424–2435 (2008)Google Scholar
  21. 21.
    Hong, T.P., Lin, C.W., Wu, Y.L.: Maintenance of fast updated frequent pattern trees for record deletion. Comput. Stat. Data Anal. 53(7), 2485–2499 (2009)MathSciNetzbMATHGoogle Scholar
  22. 22.
    Hong, T.P., Lin, C.W., Wu, Y.L.: An efficient FUFP-tree maintenance algorithm for record modification. Int. J. Innov. Comput., Inf. Control. 4(11), 2875–2887 (2008)Google Scholar
  23. 23.
    Lin, C.W., Hong, T.P., Lu, W.H.: The Pre-FUFP algorithm for incremental mining. Expert. Syst. Appl. 36(5), 9498–9505 (2009)Google Scholar
  24. 24.
    Lin, C.W., Hong, T.P., Lu, W.H.: Using the structure of prelarge trees to incrementally mine frequent itemsets. New Gener. Comput. 28(1), 5–20 (2010)zbMATHGoogle Scholar
  25. 25.
    Lin, C.W., Hong, T. P., Lu, W.H.: Maintenance of fast updated frequent trees for record deletion based on prelarge concepts. In: The International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, pp. 675–684. Springer, Heidelberg (2007)Google Scholar
  26. 26.
    Lin, C.W., Hong, T.P., Lu, W.H.: Efficient modification of fast updated FP-trees based on pre-large concepts. Int. J. Innov. Comput., Inf. Control. 6(12), 5163–5177 (2010)Google Scholar
  27. 27.
    Lin, C.W., Gan, W.S., Hong, T.P.: Efficiently maintaining the fast updated sequential pattern trees with sequence deletion. IEEE Access 2, 1374–1383 (2014)Google Scholar
  28. 28.
    Lin, C.W., Gan, W.S., Hong, T.P., Zhang, J.: Updating the built prelarge fast updated sequential pattern trees with sequence modification. Int. J. Data Warehous. Min. 1(1), 1–21 (2015)Google Scholar
  29. 29.
    Zhang, B., Lin, C.W., Gan, W.S., Hong, T.P.: Maintaining the discovered sequential patterns for sequence insertion in dynamic databases. Eng. Appl. Artif. Intell. 35, 131–142 (2014)Google Scholar
  30. 30.
    Djenouri, Y., Drias, H., Habbas, Z.: Bees swarm optimisation using multiple strategies for association rule mining. Int. J. Bio-Inspired Comput. 6(4), 239–249 (2014)Google Scholar
  31. 31.
    Mata J., Alvarez J., Riquelme J.: An evolutionary algorithm to discover numeric association rules. In: Proceedings of the ACM symposium on Applied computing SAC, pp. 590–594 (2002)Google Scholar
  32. 32.
    Romero, C., Zafra, A., Luna, J.M., Ventura, S.: Association rule mining using genetic programming to provide feedback to instructors from multiple-choice quiz data. Expert. Syst. 30(2), 162–172 (2013)Google Scholar
  33. 33.
    Djenouri, Y., Bendjoudi, A., Nouali-Taboudjemat, N.: Association rules mining using evolutionary algorithms. In: The 9th International Conference on Bio-inspired Computing: Theories and Applications (BIC-TA 2014). LNCS (2014)Google Scholar
  34. 34.
    Martinez-Ballesteros, M., Bacardit, J., Troncoso, A., Riquelme, J.C.: Enhancing the scalability of a genetic algorithm to discover quantitative association rules in large-scale datasets. Integr. Comput.-Aided Eng. 22(1), 21–39 (2015)Google Scholar
  35. 35.
    Martin, D., AlcaliFdez, J., Rosete, A., Herrera, F.: NICGAR: a niching genetic algorithm to mine a diverse set of interesting quantitative association rules. Inf. Sci. 355, 208–228 (2016)Google Scholar
  36. 36.
    Wang, B., Merrick, K.E., Abbass, H.A.: Co-operative coevolutionary neural networks for mining functional association rules. IEEE Trans. Neural Netw. Learn. Syst. 28(6), 1331–1344 (2017)Google Scholar
  37. 37.
    Ting, C.K., Liaw, R.T., Wang, T.C., Hong, T.P.: Mining fuzzy association rules using a mimetic algorithm based on structure representation. Memetic Comput. 1–14 (2017)Google Scholar
  38. 38.
    Mukhopadhyay, A., Maulik, U., Bandyopadhyay, S., Coello, C.A.C.: A survey of multiobjective evolutionary algorithms for data mining: Part I. IEEE Trans. Evol. Comput. 18(1), 4–19 (2014)Google Scholar
  39. 39.
    Kuo, R.J., Chao, C.M., Chiu, Y.T.: Application of particle swarm optimization to association rule mining. Appl. Soft Comput. 11(1), 326–336 (2011)Google Scholar
  40. 40.
    Sarath, K.N.V.D., Ravi, V.: Association rule mining using binary particle swarm optimization. Eng. Appl. Artif. Intell. 26(8), 1832–1840 (2013)Google Scholar
  41. 41.
    Beiranvand, V., Mobasher-Kashani, M., Bakar, A.A.: Multi-objective PSO algorithm for mining numerical association rules without a priori discretization. Expert. Syst. Appl. 41(9), 4259–4273 (2014)Google Scholar
  42. 42.
    Agrawal, J., Agrawal, S., Singhai, A., Sharma, S.: SET-PSO-based approach for mining positive and negative association rules. Knowl. Inf. Syst. 45(2), 453–471 (2015)Google Scholar
  43. 43.
    Djenouri, Y., Drias, H., Habbas, Z., Mosteghanemi, H.: Bees swarm optimization for web association rule mining. In: IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, vol. 3, pp. 142–146. IEEE (2012)Google Scholar
  44. 44.
    Djenouri, Y., Drias, H., Chemchem, A.: A hybrid bees swarm optimization and tabu search algorithm for association rule mining. In: World Congress on Nature and Biologically Inspired Computing, pp. 120–125. IEEE (2013)Google Scholar
  45. 45.
    Djenouri, Y., Drias, H., Habbas, Z.: Hybrid intelligent method for association rules mining using multiple strategies. Int. J. Appl. Metaheuristic Comput. 5(1), 46–64 (2014)Google Scholar
  46. 46.
    Heraguemi, K.E., Kamel, N., Drias, H.: Multi-swarm bat algorithm for association rule mining using multiple cooperative strategies. Appl. Intell. 45(4), 1021–1033 (2016)Google Scholar
  47. 47.
    Song, A., Ding, X., Chen, J., Li, M., Cao, W., Pu, K.: Multi-objective association rule mining with binary bat algorithm. Intell. Data Anal. 20(1), 105–128 (2016)Google Scholar
  48. 48.
    Sheikhan, M., Rad, M.S.: Gravitational search algorithm optimized neural misuse detector with selected features by fuzzy grids based association rules mining. Neural Comput. Appl. 23(7–8), 2451–2463 (2013)Google Scholar
  49. 49.
    Mlakar, U., Zorman, M., Fister Jr., I., Fister, I.: Modified binary cuckoo search for association rule mining. J. Intell. Fuzzy Syst. 32(6), 4319–4330 (2017)Google Scholar
  50. 50.
    Ahmed, C.F., Tanbeer, S.K., Jeong, B.S., Lee, Y.K.: Efficient tree structures for high utility pattern mining in incremental databases. IEEE Trans. Knowl. Data Eng. 21(12), 1708–1721 (2009)Google Scholar
  51. 51.
    Liu, Y., Liao, W. K., Choudhary, A. N.: A two-phase algorithm for fast discovery of high utility itemsets. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, vol. 3518, pp. 689–695 (2005)Google Scholar
  52. 52.
    Lin, C.W., Hong, T.P., Lu, W.H.: An effective tree structure for mining high utility itemsets. Expert. Syst. Appl. 38(6), 7419–7424 (2011)Google Scholar
  53. 53.
    Tseng, V.S., Wu, C.W., Shie, B.E., Yu, P.S.: UP-Growth: an efficient algorithm for high utility itemset mining. In: International Conference on Knowledge Discovery and Data Mining, pp. 253–262. ACM (2010)Google Scholar
  54. 54.
    Yun, U., Ryang, H., Ryu, K.H.: High utility itemset mining with techniques for reducing overestimated utilities and pruning candidates. Expert. Syst. Appl. 41(8), 3861–3878 (2014)Google Scholar
  55. 55.
    Yun, U., Ryang, H.: Incremental high utility pattern mining with static and dynamic databases. Appl. Intell. 42(2), 323–352 (2015)Google Scholar
  56. 56.
    Liu, M., Qu, J.: Mining high utility itemsets without candidate generation. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 55–64. ACM (2012)Google Scholar
  57. 57.
    Liu, J., Wang, K., Fung, B.C.: Direct discovery of high utility itemsets without candidate generation. In: IEEE 12th International Conference on Data Mining, pp. 984–989. IEEE (2012)Google Scholar
  58. 58.
    Fournier-Viger, P., Wu, C.W., Zida, S., Tseng, V.S.: FHM: faster high-utility itemset mining using estimated utility co-occurrence pruning. In: International Symposium on Methodologies for Intelligent Systems, pp. 83–92. Springer, Cham (2014)Google Scholar
  59. 59.
    Zida, S., Fournier-Viger, P., Lin, J.C.W., Wu, C.W., Tseng, V.S.: EFIM: a fast and memory efficient algorithm for high-utility itemset mining. Knowl. Inf. Syst. 51(2), 595–625 (2017)Google Scholar
  60. 60.
    Lin, C.W., Hong, T.P., Lan, G.C., Wong, J.W., Lin, W.Y.: Incrementally mining high utility patterns based on pre-large concept. Appl. Intell. 40(2), 343–357 (2014)Google Scholar
  61. 61.
    Lin, J. C.W., Gan, W.S., Hong, T.P.: A fast maintenance algorithm of the discovered high-utility itemsets with transaction deletion. Intell. Data Anal. 20(4), 891–913 (2016)Google Scholar
  62. 62.
    Lin, J.C.W., Gan, W., Hong, T.P.: A fast updated algorithm to maintain the discovered high-utility itemsets for transaction modification. Adv. Eng. Inf. 29(3), 562–574 (2015)Google Scholar
  63. 63.
    Lin, C.W., Gan, W., Hong, T.P.: Maintaining the discovered high-utility itemsets with transaction modification. Appl. Intell. 44(1), 166–178 (2016)Google Scholar
  64. 64.
    Zihayat, M., Hut, Z.Z., An, A., Hut, Y.: Distributed and parallel high utility sequential pattern mining. In: 2016 IEEE International Conference on Big Data (Big Data), pp. 853–862. IEEE (2016)Google Scholar
  65. 65.
    Lin, Y.C., Wu, C.W., Tseng, V.S.: Mining high utility itemsets in big data. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 649–661. Springer, Berlin (2015)Google Scholar
  66. 66.
    Chen, Y., An, A.: Approximate parallel high utility itemset mining. Big Data Res. 6, 26–42 (2016)Google Scholar
  67. 67.
    Zhang, L., Fu, G., Cheng, F., Qiu, J., Su, Y.: A multi-objective evolutionary approach for mining frequent and high utility itemsets. Appl. Soft Comput. 62, 974–986 (2017)Google Scholar
  68. 68.
    Zhang, Q., Li, H.: MOEA/D: a multiobjective evolutionary algorithm based on decomposition. IEEE Trans. Evol. Comput. 11(6), 712–731 (2007)Google Scholar
  69. 69.
    Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.A.M.T.: A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 6(2), 182–197 (2002)Google Scholar
  70. 70.
    Zitzler, E.: SPEA2: improving the strength pareto evolutionary algorithm for multiobjective optimization. In: EUROGEN 2001, Evolutionary Methods for Design, Optimization and Control with Applications to Industrial Problems, Athens, Greece (2001)Google Scholar
  71. 71.
    Cai, X., Li, Y., Fan, Z., Zhang, Q.: An external archive guided multiobjective evolutionary algorithm based on decomposition for combinatorial optimization. IEEE Trans. Evol. Comput. 19(4), 508–523 (2015)Google Scholar
  72. 72.
    Miettinen, K.: Nonlinear Multiobjective Optimization, vol. 12. Springer Science & Business Media (2012)Google Scholar
  73. 73.
    Zhang, X., Tian, Y., Cheng, R., Jin, Y.: An efficient approach to nondominated sorting for evolutionary multiobjective optimization. IEEE Trans. Evol. Comput. 19(2), 201–213 (2015)Google Scholar
  74. 74.
    Lin, J.C.W., Yang, L., Fournier-Viger, P., Wu, M.T., Hong, T.P., Wang, S.L., Zhan, J.: Mining high-utility itemsets based on particle swarm optimization. Eng. Appl. Artif. Intell. 55, 320–330 (2016)Google Scholar
  75. 75.
    Lin, J.C.W., Yang, L., Fournier-Viger, P., Hong, T.P., Voznak, M.: A binary PSO approach to mine high-utility itemsets. Soft Comput. 21(17), 5103–5121 (2017)Google Scholar
  76. 76.
    Wu, J.M.T., Zhan, J., Lin, J.C.W.: An ACO-based approach to mine high-utility itemsets. Knowl.-Based Syst. 116, 102–113 (2017)Google Scholar
  77. 77.
    Fournier-Viger, P., Gomariz, A., Gueniche, T., Soltani, A., Wu, C.W., Tseng, V.S.: SPMF: a java open-source pattern mining library. J. Mach. Learn. Res. 15, 3389–3393 (2014)zbMATHGoogle Scholar
  78. 78.
    Fournier-Viger, P, Lin, J.C.W., Dinh, T, Le, H.B.: Mining correlated high-utility itemsets using the bond measure. In: Proceedings of International Conference Hybrid Artificial Intelligence Systems, pp. 53–65. Seville, Spain, 18–20 April 2016Google Scholar
  79. 79.
    Fournier-Viger, P, Lin, C.W, Duong, Q.H., Dam, T.L.: PHM: mining periodic high-utility itemsets. In: Proceedings of 16th Industrial Conference on Data Mining, pp. 64–79. New York, USA, 13–17 July 2016Google Scholar
  80. 80.
    Lin, C.-W., Ren, S., Fournier-Viger, P., Hong, T.-P.: EHAUPM: efficient high average-utility pattern mining with tighter upper-bounds . IEEE Access 14(8), 13 (2016). IEEEGoogle Scholar
  81. 81.
    Truong, T., Duong, H., Le, B., Fournier-Viger, P.: Efficient vertical mining of high average-utility itemsets based on novel upper-bounds. In: IEEE Transactions on Knowledge and Data Engineering (TKDE) (2018).  https://doi.org/10.1109/TKDE.2018.2833478.Google Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Youcef Djenouri
    • 1
    Email author
  • Philippe Fournier-Viger
    • 2
  • Asma Belhadi
    • 3
  • Jerry Chun-Wei Lin
    • 4
  1. 1.IMADA, Southern Denmark UniversityOdenseDenmark
  2. 2.School of Humanities and Social SciencesHarbin Institute of Technology (Shenzhen)ShenzhenChina
  3. 3.RIMA, University of Science and Technology Houari Boumediene (USTHB)AlgiersAlgeria
  4. 4.Department of Computing Mathematics and PhysicsWestern Norway University of Applied Sciences (HVL)BergenNorway

Personalised recommendations