Skip to main content

Generalizing Itemset Mining in a Constraint Programming Setting

  • Chapter
  • First Online:
Inductive Databases and Constraint-Based Data Mining

Abstract

In recent years, a large number of algorithms have been proposed for finding set patterns in boolean data. This includes popular mining tasks based on, for instance, frequent (closed) itemsets. In this chapter, we develop a common framework in which these algorithms can be studied thanks to the principles of constraint programming. We show how such principles can be applied both in specialized and general solvers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Agrawal, H. Mannila, R. Srikant, H. Toivonen, and A. Verkamo. Fast discovery of association rules. In Advances in Knowledge Discovery and Data Mining, pages 307–328. AAAI Press, 1996.

    Google Scholar 

  2. S. Basu, I. Davidson, and K.Wagstaff. Constrained Clustering: Advances in Algorithms, Theory and Applications. Chapman & Hall/CRC Press, Data Mining and Knowledge Discovery Series, 2008.

    Google Scholar 

  3. J Besson, C. Robardet, and J-F. Boulicaut. Mining a new fault-tolerant pattern type as an alternative to formal concept discovery. In ICCS’06: Proc. Int. Conf. on Conceptual Structures, volume 4068 of LNCS. Springer, 2006.

    Google Scholar 

  4. S. Bistarelli and F. Bonchi. Interestingness is not a dichotomy: Introducing softness in constrained pattern mining. In PKDD’05: Proc. 9th European Conf. on Principles and Practice of Knowledge Discovery in Databases, volume 3721 of LNCS, pages 22–33. Springer, 2005.

    Google Scholar 

  5. Francesco Bonchi, Fosca Giannotti, Claudio Lucchese, Salvatore Orlando, Raffaele Perego, and Roberto Trasarti. A constraint-based querying system for exploratory pattern discovery. Information Systems, 34(1):3–27, 2009.

    Article  Google Scholar 

  6. Francesco Bonchi, Fosca Giannotti, Alessio Mazzanti, and Dino Pedreschi. Adaptive constraint pushing in frequent pattern mining. In PKDD’03: Proc. 7th European Conf. on Principles and Practice of Knowledge Discovery in Databases, volume 2838 of LNCS, pages 47–58. Springer, 2003.

    Google Scholar 

  7. Francesco Bonchi, Fosca Giannotti, Alessio Mazzanti, and Dino Pedreschi. Examiner: Optimized level-wise frequent pattern mining with monotone constraint. In ICDM 2003: Proc. 3rd International Conf. on Data Mining, pages 11–18. IEEE Computer Society, 2003.

    Google Scholar 

  8. Jean-François Boulicaut, Luc De Raedt, and Heikki Mannila, editors. Constraint-Based Mining and Inductive Databases, volume 3848 of LNCS. Springer, 2005.

    Google Scholar 

  9. C. Bucila, J. E. Gehrke, D. Kifer, and W. White. Dualminer: A dual-pruning algorithm for itemsets with constraints. Data Mining and Knowledge Discovery Journal, 7(4):241–272, Oct. 2003.

    Article  MathSciNet  Google Scholar 

  10. Toon Calders, Christophe Rigotti, and Jean-François Boulicaut. A survey on condensed representations for frequent sets. In Constraint-based Mining and Inductive Databases, volume 3848 of LNCS, pages 64–80. Springer, 2005.

    Google Scholar 

  11. L. Cerf, J. Besson, C. Robardet, and J.-F. Boulicaut. Closed patterns meet n-ary relations. ACM Trans. on Knowledge Discovery from Data, 3(1), March 2009.

    Google Scholar 

  12. Hong Cheng, Philip S. Yu, and Jiawei Han. Ac-close: Efficiently mining approximate closed itemsets by core pattern recovery. In ICDM, pages 839–844, 2006.

    Google Scholar 

  13. Luc De Raedt, Tias Guns, and Siegfried Nijssen. Constraint programming for itemset mining. In KDD’08: Proc. 14th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pages 204–212, 2008.

    Google Scholar 

  14. B. Ganter. Two basic algorithms in concept analysis. Technical report, Germany Darmstadt : Technisch Hochschule Darmstadt, Preprint 831, 1984.

    Google Scholar 

  15. Bernhard Ganter, Gerd Stumme, and Rudolph Wille. Formal Concept Analysis, Foundations and Applications, volume 3626 of LNCS. Springer, 2005.

    Google Scholar 

  16. B. Goethals and M. J. Zaki, editors. Frequent Itemset Mining Implementations, volume 90. CEUR-WS.org, Melbourne, Florida, USA, December 2003.

    Google Scholar 

  17. Jiawei Han, Jian Pei, Yiwen Yin, and Runying Mao. Mining frequent patterns without candidate generation: A frequent-pattern tree approach. Data Mining and Knowledge Discovery, 8(1):53–87, 2004.

    Article  MathSciNet  Google Scholar 

  18. David J. Hand, Niall M. Adams, and Richard J. Bolton, editors. Pattern Detection and Discovery, ESF Exploratory Workshop Proceedings, volume 2447 of LNCS. Springer, 2002.

    Google Scholar 

  19. Daniel Kifer, Johannes E. Gehrke, Cristian Bucila, andWalker M. White. How to quickly find a witness. In Constraint-Based Mining and Inductive Databases, pages 216–242, 2004.

    Google Scholar 

  20. Jinze Liu, Susan Paulsen, Xing Sun, Wei Wang, Andrew B. Nobel, and Jan Prins. Mining approximate frequent itemsets in the presence of noise: Algorithm and analysis. In SDM, 2006.

    Google Scholar 

  21. H. Mannila and H. Toivonen. Levelwise search and borders of theories in knowledge discovery. In Data Mining and Knowledge Discovery journal, volume 1(3), pages 241–258. Kluwer Academic Publishers, 1997.

    Google Scholar 

  22. Katharina Morik, Jean-François Boulicaut, and Arno Siebes, editors. Local Pattern Detection, International Dagstuhl Seminar Revised Selected Papers, volume 3539 of LNCS. Springer, 2005.

    Google Scholar 

  23. Nicolas Pasquier, Yves Bastide, Rafik Taouil, and Lotfi Lakhal. Efficient mining of association rules using closed itemset lattices. Information Systems, 24(1):25–46, 1999.

    Article  Google Scholar 

  24. Jian Pei, Anthony K. H. Tung, and Jiawei Han. Fault-tolerant frequent pattern mining: Problems and challenges. In DMKD. Workshop, 2001.

    Google Scholar 

  25. A. Soulet and B. Crémilleux. An efficient framework for mining flexible constraints. In PaKDD’05: Pacific-Asia Conf. on Knowledge Discovery and Data Mining, volume 3518 of LNCS, pages 661–671. Springer, 2005.

    Google Scholar 

  26. T. Uno, M. Kiyomi, and H. Arimura. Lcm ver. 2: Efficient mining algorithms for frequent/closed/maximal itemsets. In FIMI’04, Proceedings of the IEEE ICDM Workshop on Frequent Itemset Mining Implementations, volume 126 of CEUR Workshop Proceedings. CEURWS. org, 2004.

    Google Scholar 

  27. T. Uno, M. Kiyomi, and H. Arimura. LCM ver.3: collaboration of array, bitmap and prefix tree for frequent itemset mining. In OSDM’05: Proc. 1st Int. Workshop on Open Source Data Mining, pages 77–86. ACM Press, 2005.

    Google Scholar 

  28. C. Yang, U. Fayyad, and P. S. Bradley. Efficient discovery of error-tolerant frequent itemsets in high dimensions. In SIGKDD, pages 194–203, San Francisco, California, USA, August 2001. ACM Press.

    Google Scholar 

  29. Mohammed J. Zaki. Scalable algorithms for association mining. IEEE Trans. Knowl. Data Eng., 12(3):372–390, 2000.

    Article  MathSciNet  Google Scholar 

  30. Mengsheng Zhang, Wei Wang, and Jinze Liu. Mining approximate order preserving clusters in the presence of noise. In ICDE, pages 160–168, 2008.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jérémy Besson .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Besson, J., Boulicaut, JF., Guns, T., Nijssen, S. (2010). Generalizing Itemset Mining in a Constraint Programming Setting. In: Džeroski, S., Goethals, B., Panov, P. (eds) Inductive Databases and Constraint-Based Data Mining. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-7738-0_5

Download citation

  • DOI: https://doi.org/10.1007/978-1-4419-7738-0_5

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4419-7737-3

  • Online ISBN: 978-1-4419-7738-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics