Multimedia Tools and Applications

, Volume 75, Issue 9, pp 4887–4912 | Cite as

Updating high-utility pattern trees with transaction modification

  • Chun-Wei Lin
  • Binbin Zhang
  • Wensheng Gan
  • Bo-Wei Chen
  • Seungmin Rho
  • Tzung-Pei Hong
Article

Abstract

Traditional association-rule mining only concerns the occurrence frequencies of the items in a binary database. In real-world applications, customers may buy several copies of the purchased items. Other factors such as profit, quantity, or price should be concerned to measure the utilities of the purchased items. High-utility itemsets mining was thus proposed to consider the factors of quantity and profit. Two-phase model was the most commonly way to keep the transaction-weighted utilization downward closure property, thus reducing the numerous candidates in utility mining. Most methods for finding high-utility itemsets are used to handle a static database. In practical applications, transactions are changed whether insertion, deletion, or modification. Some itemsets may arise as the new high-utility itemsets or become invalid knowledge in the updated database. In this paper, a maintenance Fast Updated High Utility Pattern tree for transaction MODification (FUP-HUP-tree-MOD) algorithm is thus proposed to effective maintain and update the built HUP tree for mining high-utility itemsets in dynamic databases without candidate generation. Experiments are conducted to show better performance of the proposed algorithm compared to the two-phase algorithm and the HUP tree algorithm in batch mode.

Keywords

Modification Utility mining Two-phase algorithm Dynamic databases 

References

  1. 1.
    Abdullah Z, Herawan T, Deris M (2010) Mining significant least association rules using fast slp-growth algorithm. Lect Notes Comput Sci 6059:324–336CrossRefGoogle Scholar
  2. 2.
    Agrawal R, Imielinski T, Swami A (1993) Database mining: a performance perspective. IEEE Trans Knowl Data Eng 5:914–925CrossRefGoogle Scholar
  3. 3.
    Agrawal R, Srikant R (1994) “Fast algorithms for mining association rules in large databases.” The Int Conf Very Large Data Bases: 487–499Google Scholar
  4. 4.
    Chan R, Yang Q, Shen YD (2003) “Mining high utility itemsets,” IEEE Int Conf Data Min: 19–26Google Scholar
  5. 5.
    Chen MS, Han J, Yu PS (1996) Data mining: an overview from a database perspective. IEEE Trans Knowl Data Eng 8:866–883CrossRefGoogle Scholar
  6. 6.
    Cheung DWL, Han J, Ng V, Wong CY (1996) “Maintenance of discovered association rules in large databases: An incremental updating technique.” Int Conf Data Eng:106–114Google Scholar
  7. 7.
    Cheung DWL, Lee SD, Kao B (1997) “A general incremental technique for maintaining discovered association rules.” The Int Conf Database Syst Adv Appl: 185–194Google Scholar
  8. 8.
    Frequent itemset mining dataset repository. Available: http://fimi.ua.ac.be/data/ (2012)
  9. 9.
    Gharib TF, Nassar H, Taha M, Abrahamd A (2010) An efficient algorithm for incremental mining of temporal association rules. Data Knowl Eng 69:800–815CrossRefGoogle Scholar
  10. 10.
    Han J, Pei J, Yin Y, Mao R (2004) Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Min Knowl Disc 8:53–87MathSciNetCrossRefGoogle Scholar
  11. 11.
    Hong TP, Lin CW, Wu YL (2008) Incrementally fast updated frequent pattern trees. Expert Syst Appl 34:2424–2435CrossRefGoogle Scholar
  12. 12.
    Hong TP, Lin CW, Wu YL (2008) “An efficient fufp-tree mainteance algorithm for record modification,”. Int J Innov Comput Inf Control 4:2875–2887Google Scholar
  13. 13.
    Li YC, Yeh JS, Chang CC (2005) Direct candidates generation: a novel algorithm for discovering complete share-frequent itemsets. Fuzzy Syst Knowl Discov 3614:551–560CrossRefGoogle Scholar
  14. 14.
    Li YC, Yeh JS, Chang CC (2005) “Direct candidates generation: A novel algorithm for discovering complete share-frequent itemsets.” Lect Notes Comput Sci: 551–560Google Scholar
  15. 15.
    Li YC, Yeh JS, Chang CC (2005) “A fast algorithm for mining share-frequent itemsets.” Lect Notes Comput Sci: 417–428Google Scholar
  16. 16.
    Lin CW, Hong TP, Lu WH (2009) The pre-fufp algorithm for incremental mining. Expert Syst Appl 36:9498–9505CrossRefGoogle Scholar
  17. 17.
    Lin CW, Hong TP, Lu WH (2010) “Maintaining high utility pattern trees in dynamic databases.” Int Conf Comput Eng Appl: 304–308Google Scholar
  18. 18.
    Lin CW, Hong TP, Lu WH (2011) An effective tree structure for mining high utility itemsets. Expert Syst Appl 38:7419–7424CrossRefGoogle Scholar
  19. 19.
    Lin CW, Lan GC, Hong TP, Kong L (2014) Mining high utility itemsets based on transaction deletion. Lect Notes Electr Eng 260:983–990CrossRefGoogle Scholar
  20. 20.
    Lin CW, Lan GC, Hong TP (2012) An incremental mining algorithm for high utility itemsets. Expert Syst Appl 39:7173–7180CrossRefGoogle Scholar
  21. 21.
    Liu Y, Liao WK, Choudhary A (2005) “A two-phase algorithm for fast discovery of high utility itemsets.” Adv Knowl Discov Data Min: 689–695Google Scholar
  22. 22.
    Liu M, Qu J (2012) “Mining high utility itemsets without candidate generation>” ACM Int Conf Inf Knowl Manag: 55–64Google Scholar
  23. 23.
    Liu J, Wang K, Fung BCM (2012) “Direct discovery of high utility itemsets without candidate generation.” IEEE Int Conf Data Min: 984–989Google Scholar
  24. 24.
    Microsoft. Example database foodmart of microsoft analysis services. Available: http://msdn.microsoft.com/en-us/library/aa217032(SQL.80).aspx
  25. 25.
    Nath B, Bhattacharyya DK, Ghosh A (2013) “Incremental association rule mining: A survey,” WIREs Data Mining Knowledge Discovery, vol. 3Google Scholar
  26. 26.
    Song W, Liu Y, Li J (2013) “Mining high utility itemsets by dynamically pruning the tree structure.” Appl Intell: 1–15Google Scholar
  27. 27.
    Tseng VS, Bai-En S, Cheng-Wei W, Yu PS (2013) Efficient algorithms for mining high utility itemsets from transactional databases. IEEE Trans Knowl Data Eng 25:1772–1786CrossRefGoogle Scholar
  28. 28.
    Wu CW, Lin YF, Yu PS, Tseng VS (2013) “Mining high utility episodes in complex event sequences.” ACM Int Conf Knowl Discov Data Min: 536–544Google Scholar
  29. 29.
    Yao H, Hamilton HJ (2006) Mining itemset utilities from transaction databases. Data Knowl Eng 59:603–626CrossRefGoogle Scholar
  30. 30.
    Yao H, Hamilton HJ, Butz CJ (2004) “A foundational approach to mining itemset utilities from databases.” SIAM Int Conf Data Min: 211–225Google Scholar
  31. 31.
    Yuna U, Ryanga H, Ryub KH (2014) High utility itemset mining with techniques for reducing overestimated utilities and pruning candidates. Expert Syst Appl 41:3861–3878CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2014

Authors and Affiliations

  • Chun-Wei Lin
    • 1
    • 2
  • Binbin Zhang
    • 3
  • Wensheng Gan
    • 1
  • Bo-Wei Chen
    • 4
  • Seungmin Rho
    • 5
  • Tzung-Pei Hong
    • 6
    • 7
  1. 1.Innovative Information Industry Research Center (IIIRC), School of Computer Science and TechnologyHarbin Institute of Technology Shenzhen Graduate SchoolShenzhenChina
  2. 2.Shenzhen Key Laboratory of Internet Information Collaboration, School of Computer Science and TechnologyHarbin Institute of Technology Shenzhen Graduate SchoolShenzhenChina
  3. 3.Medical SchoolShenzhen UniversityShenzhenChina
  4. 4.Department of Electrical EngineeringNational Cheng Kung UniversityTainanTaiwan
  5. 5.Department of MultimediaSungkyul UniversityAnyang-siSouth Korea
  6. 6.Department of Computer Science and Information EngineeringNational University of KaohsiungKaohsiungRepublic of China
  7. 7.Department of Computer Science and EngineeringNational Sun Yat-sen UniversityKaohsiungRepublic of China

Personalised recommendations