Skip to main content

Finding Suitable Threshold for Support in Apriori Algorithm Using Statistical Measures

  • Conference paper
  • First Online:
Enabling Machine Learning Applications in Data Science

Abstract

Data mining is one of the noticeable researches. Data mining is one of the most outstanding areas of research, striving to extract meaningful information and useful models from databases and to help managers to make better decisions. Mining association rules is a persistent and popular data mining process. It aims to find frequent models, association correlations, or causal structures between a set of objects in large transactional or relational databases and other data repositories. This paper provides an improvement of the Apriori algorithm, a classic rule extraction algorithm by finding appropriate minimum threshold values for the support automatically using different statistical measures depending on each dataset. The experiments on baseline datasets show a comparative analysis between different proposed methods of generation association rules using the automatic minimum support threshold.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Zhou S, Zhang S, Karypis G (eds) (2012) Advanced data mining and applications. In: 8th International conference, ADMA 2012, December 15–18, 2012, Proceedings vol. 7713. Springer Science & Business Media, Nanjing, China

    Google Scholar 

  2. Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceeding VLDB ‘94 proceedings of 20th international conference very large data bases, vol. 1215, pp 487–499

    Google Scholar 

  3. Liu B, Hsu W, Ma Y (1999) Mining association rules with multiple minimum supports. In: Knowledge discovery and databases, pp 337–241

    Google Scholar 

  4. Lee YC, Hong TP, Lin WY (2005) Mining association rules with multiple minimum supports using maximum constraints. Int J Approximate Reasoning 40(1):44–54

    Article  Google Scholar 

  5. Salam A, Khayal MSH (2012) Mining top-k frequent patterns without minimum support threshold. Knowl Inf Syst 30(1):57–86

    Article  Google Scholar 

  6. Fournier-Viger P, Wu C-W, Tseng VS (2012) Mining top-k association rules. In: Proceedings of the 25th Canadian conference on artificial intelligence (AI 2012). Springer, Canada, pp 61–73

    Google Scholar 

  7. Fournier-Viger P, Tseng VS (2012) Mining top-k non-redundant association rules. In: Proceedings of 20th international symposium, ISMIS 2012, LNCS, vol. 7661. Springer, Macau, China, pp 31–40

    Google Scholar 

  8. Kuo RJ, Chao CM, Chiu YT (2011) Application of particle swarm optimization to association rule mining. In: Proceeding of applied soft computing. Elsevier, pp 326–336

    Google Scholar 

  9. Bouker S, Saidi R, Ben Yahia S, Mephu Nguifo E (2014) Mining undominated association rules through interestingness measures. Int J Artif Intell Tools 23(04):1460011

    Google Scholar 

  10. Dahbi A, Jabri S, Ballouki Y, Gadi T (2017) A new method to select the interesting association rules with multiple criteria. Int J Intell Eng Syst 10(5):191–200

    Google Scholar 

  11. Dahbi A, Balouki Y, Gadi T (2017) Using multiple minimum support to auto-adjust the threshold of support in apriori algorithm. In: Abraham A, Haqiq A, Muda A, Gandhi N (eds) Proceedings of the ninth international conference on soft computing and pattern recognition (SoCPaR 2017). SoCPaR 2017. Advances in intelligent systems and computing, vol 737. Springer, Cham

    Google Scholar 

  12. Pei J, Han J, Mao R (2000) CLOSET: an efficient algorithm for mining frequent closed itemsets. In: Proceeding of the 2000 ACM-SIGMOD international workshop data mining and knowledge discovery (DMKD’00), 2000, TX. ACM, Dallas, pp 11–20

    Google Scholar 

  13. Zaki MJ, Hsiao CJ (2002) CHARM: An efficient algorithm for closed itemset mining. In: SDM’02. Arlington, VA, pp 457–473

    Google Scholar 

  14. Tamhane AC, Dunlop DD (2000) Statistics and data analysis. Prentice Hall

    Google Scholar 

  15. UCI machine learning repository, https://archive.ics.uci.edu/ml/index.php. Last Accessed 31 March 2020

  16. Frequent Itemset Mining Implementations Repository, http://fimi.ua.ac.be/data/. Last Accessed 31 March 2020

  17. An Open-Source Data Mining Library, http://www.philippe-fournier-viger.com/index.?li-nk=datasets.php. Last Accessed 31 March 2020

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dahbi, A., Jabri, S., Balouki, Y., Gadi, T. (2021). Finding Suitable Threshold for Support in Apriori Algorithm Using Statistical Measures. In: Hassanien, A.E., Darwish, A., Abd El-Kader, S.M., Alboaneen, D.A. (eds) Enabling Machine Learning Applications in Data Science. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-33-6129-4_7

Download citation

Publish with us

Policies and ethics