Abstract
Data mining is one of the noticeable researches. Data mining is one of the most outstanding areas of research, striving to extract meaningful information and useful models from databases and to help managers to make better decisions. Mining association rules is a persistent and popular data mining process. It aims to find frequent models, association correlations, or causal structures between a set of objects in large transactional or relational databases and other data repositories. This paper provides an improvement of the Apriori algorithm, a classic rule extraction algorithm by finding appropriate minimum threshold values for the support automatically using different statistical measures depending on each dataset. The experiments on baseline datasets show a comparative analysis between different proposed methods of generation association rules using the automatic minimum support threshold.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhou S, Zhang S, Karypis G (eds) (2012) Advanced data mining and applications. In: 8th International conference, ADMA 2012, December 15–18, 2012, Proceedings vol. 7713. Springer Science & Business Media, Nanjing, China
Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceeding VLDB ‘94 proceedings of 20th international conference very large data bases, vol. 1215, pp 487–499
Liu B, Hsu W, Ma Y (1999) Mining association rules with multiple minimum supports. In: Knowledge discovery and databases, pp 337–241
Lee YC, Hong TP, Lin WY (2005) Mining association rules with multiple minimum supports using maximum constraints. Int J Approximate Reasoning 40(1):44–54
Salam A, Khayal MSH (2012) Mining top-k frequent patterns without minimum support threshold. Knowl Inf Syst 30(1):57–86
Fournier-Viger P, Wu C-W, Tseng VS (2012) Mining top-k association rules. In: Proceedings of the 25th Canadian conference on artificial intelligence (AI 2012). Springer, Canada, pp 61–73
Fournier-Viger P, Tseng VS (2012) Mining top-k non-redundant association rules. In: Proceedings of 20th international symposium, ISMIS 2012, LNCS, vol. 7661. Springer, Macau, China, pp 31–40
Kuo RJ, Chao CM, Chiu YT (2011) Application of particle swarm optimization to association rule mining. In: Proceeding of applied soft computing. Elsevier, pp 326–336
Bouker S, Saidi R, Ben Yahia S, Mephu Nguifo E (2014) Mining undominated association rules through interestingness measures. Int J Artif Intell Tools 23(04):1460011
Dahbi A, Jabri S, Ballouki Y, Gadi T (2017) A new method to select the interesting association rules with multiple criteria. Int J Intell Eng Syst 10(5):191–200
Dahbi A, Balouki Y, Gadi T (2017) Using multiple minimum support to auto-adjust the threshold of support in apriori algorithm. In: Abraham A, Haqiq A, Muda A, Gandhi N (eds) Proceedings of the ninth international conference on soft computing and pattern recognition (SoCPaR 2017). SoCPaR 2017. Advances in intelligent systems and computing, vol 737. Springer, Cham
Pei J, Han J, Mao R (2000) CLOSET: an efficient algorithm for mining frequent closed itemsets. In: Proceeding of the 2000 ACM-SIGMOD international workshop data mining and knowledge discovery (DMKD’00), 2000, TX. ACM, Dallas, pp 11–20
Zaki MJ, Hsiao CJ (2002) CHARM: An efficient algorithm for closed itemset mining. In: SDM’02. Arlington, VA, pp 457–473
Tamhane AC, Dunlop DD (2000) Statistics and data analysis. Prentice Hall
UCI machine learning repository, https://archive.ics.uci.edu/ml/index.php. Last Accessed 31 March 2020
Frequent Itemset Mining Implementations Repository, http://fimi.ua.ac.be/data/. Last Accessed 31 March 2020
An Open-Source Data Mining Library, http://www.philippe-fournier-viger.com/index.?li-nk=datasets.php. Last Accessed 31 March 2020
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Dahbi, A., Jabri, S., Balouki, Y., Gadi, T. (2021). Finding Suitable Threshold for Support in Apriori Algorithm Using Statistical Measures. In: Hassanien, A.E., Darwish, A., Abd El-Kader, S.M., Alboaneen, D.A. (eds) Enabling Machine Learning Applications in Data Science. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-33-6129-4_7
Download citation
DOI: https://doi.org/10.1007/978-981-33-6129-4_7
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-6128-7
Online ISBN: 978-981-33-6129-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)