Compression-Based Measures for Mining Interesting Rules

Suzuki, Einoshin

doi:10.1007/978-3-642-02568-6_75

Einoshin Suzuki²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5579))

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

1603 Accesses
3 Citations

Abstract

An interestingness measure estimates the degree of interestingness of a discovered pattern and has been actively studied in the past two decades. Several pitfalls should be avoided in the study such as a use of many parameters and a lack of systematic evaluation in the presence of noise. Compression-based measures have advantages in this respect as they are typically parameter-free and robust to noise. In this paper, we present J-measure and a measure based on an extension of the Minimum Description Length Principle (MDLP) as compression-based measures for mining interesting rules.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Mining and Using Sets of Patterns through Compression

The minimum description length principle for pattern mining: a survey

Article Open access 04 July 2022

Interesting Patterns

References

Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I., et al.: Fast discovery of association rules. In: Fayyad, U.M., et al. (eds.) Advances in Knowledge Discovery and Data Mining, pp. 307–328. AAAI/MIT Press, Menlo Park (1996)
Google Scholar
Barthélemy, J. P., Legrain, A., Lenca, P., Vaillant, B.: Aggregation of Valued Relations Applied to Association Rule Interestingness Measures. In: Torra, V., Narukawa, Y., Valls, A., Domingo-Ferrer, J. (eds.) MDAI 2006. LNCS, vol. 3885, pp. 203–214. Springer, Heidelberg (2006)
Chapter Google Scholar
Bayardo, R.J., Agrawal, R.: Mining the Most Interesting Rules. In: Proc. Fifth ACM SIGKDD Int’l Conf. on Knowledge Discovery and Data Mining, pp. 145–154 (1999)
Google Scholar
Brin, S., Motwani, R., Silverstein, C.: Beyond Market Baskets: Generalizing Association Rules to Correlations. In: SIGMOD 1997, Proc. ACM SIGMOD Int’l Conf. on Management of Data, pp. 265–276 (1997)
Google Scholar
Carvalho, D.R., Freitas, A.A., Ebecken, N.F.F.: Evaluating the Correlation Between Objective Rule Interestingness Measures and Real Human Interest. In: Jorge, A.M., Torgo, L., Brazdil, P.B., Camacho, R., Gama, J. (eds.) PKDD 2005. LNCS, vol. 3721, pp. 453–461. Springer, Heidelberg (2005)
Chapter Google Scholar
Gras, R.: L’ Implication Statistique. La Pensée Sauvage (1996) (in French)
Google Scholar
Jaroszewicz, S., Simovici, D.A.: Interestingness of Frequent Itemsets Using Bayesian Networks as Background Knowledge. In: Proc. Tenth ACM SIGKDD Int’l Conf. on Knowledge Discovery and Data Mining, pp. 178–186 (2004)
Google Scholar
Lenca, P., Meyer, P., Vaillant, B., Lallich, S.: On Selecting Interestingness Measures for Association Rules: User Oriented Description and Multiple Criteria Decision Aid. European Journal of Operational Research 184(2), 610–626 (2008)
Article MATH Google Scholar
Liu, B., Hsu, W., Chen, S.: Using General Impressions to Analyze Discovered Classification Rules. In: Proc. Third Int’l Conf. on Knowledge Discovery and Data Mining (KDD), pp. 31–36 (1997)
Google Scholar
Liu, B., Hsu, W., Mun, L.-F., Lee, H.-Y.: Finding Interesting Patterns Using User Expectations. IEEE Trans. Knowledge and Data Eng. 11(6), 817–832 (1999)
Article Google Scholar
Padmanabhan, B., Tuzhilin, A.: A Belief-Driven Method for Discovering Unexpected Patterns. In: Proc. Fourth Int’l Conf. Knowledge Discovery and Data Mining (KDD), pp. 94–100 (1998)
Google Scholar
Piatetsky-Shapiro, G.: Discovery, Analysis, and Presentation of Strong Rules. In: Knowledge Discovery in Databases, pp. 229–248. AAAI/MIT Press, Menlo Park (1991)
Google Scholar
Silberschatz, A., Tuzhilin, A.: On Subjective Measures of Interestingness in Knowledge Discovery. In: Proc. First Int’l Conf. Knowledge Discovery and Data Mining (KDD), pp. 275–281 (1995)
Google Scholar
Silberschatz, A., Tuzhilin, A.: What Makes Patterns Interesting in Knowledge Discovery Systems. IEEE Trans. Knowledge and Data Eng. 8(6), 970–974 (1996)
Article Google Scholar
Smyth, P., Goodman, R.M.: An Information Theoretic Approach to Rule Induction from Databases. IEEE Trans. Knowledge and Data Engineering 4(4), 301–316 (1992)
Article Google Scholar
Suzuki, E.: Autonomous Discovery of Reliable Exception Rules. In: Proc. Third Int’l Conf. on Knowledge Discovery and Data Mining (KDD), pp. 259–262 (1997)
Google Scholar
Suzuki, E.: Undirected Discovery of Interesting Exception Rules. Int’l Journal of Pattern Recognition and Artificial Intelligence 16(8), 1065–1086 (2002)
Article Google Scholar
Suzuki, E.: Pitfalls for Categorizations of Objective Interestingness Measures for Rule Discovery. In: Gras, R., Suzuki, E., Guillet, F., Spagnolo, F. (eds.) Statistical Implicative Analysis: Theory and Applications, pp. 383–395. Springer, Heidelberg (2008)
Chapter Google Scholar
Suzuki, E.: Negative Encoding Length as a Subjective Interestingness Measure for Groups of Rules. In: Proc. Thirteenth Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD (2009) (accepted for publication)
Google Scholar
Suzuki, E., Shimura, M.: Exceptional Knowledge Discovery in Databases Based on Information Theory. In: Proc. Second Int’l Conf. Knowledge Discovery and Data Mining (KDD), pp. 275–278 (1996)
Google Scholar
Tan, P.N., Kumar, V., Srivastava, J.: Selecting the Right Interestingness Measure for Association Patterns. In: Proc. Eighth ACM SIGKDD Int’l Conf. on Knowledge Discovery and Data Mining (KDD), pp. 32–41 (2002)
Google Scholar
Vaillant, B., Lallich, S., Lenca, P.: On the Behavior of the Generalizations of the Intensity of Implication: A Data-driven Comparative Study. In: Gras, R., Suzuki, E., Guillet, F., Spagnolo, F. (eds.) Statistical Implicative Analysis: Theory and Applications, pp. 421–447. Springer, Heidelberg (2008)
Chapter Google Scholar
Suzuki, E.: Interestingness Measures - Limits, Desiderata, and Recent Results. In: Proc. Quality Issues, Measures of Interestingness and Evaluation of Data Mining Models, QIMIE (2009) (keynote talk, accepted for publication)
Google Scholar
Suzuki, E.: Evaluation Scheme for Exception Rule/Group Discovery. In: Intelligent Technologies for Information Analysis, pp. 89–108. Springer, Heidelberg (2004)
Chapter Google Scholar
Keogh, E.J., Lonardi, S., Ratanamahatana, C.A.: Towards Parameter-free Data Mining. In: Proc. Tenth ACM SIGKDD Int’l Conf. on Knowledge Discovery and Data Mining (KDD), pp. 206–215 (2004)
Google Scholar
Feldman, R., Dagan, I.: Knowledge Discovery in Textual Databases (KDT). In: Proc. First International Conference on Knowledge Discovery and Data Mining (KDD), pp. 112–117 (1995)
Google Scholar
Keogh, E.J., Pazzani, M.J.: Scaling up Dynamic Time Warping to Massive Dataset. In: Żytkow, J.M., Rauch, J. (eds.) PKDD 1999. LNCS (LNAI), vol. 1704, pp. 1–11. Springer, Heidelberg (1999)
Chapter Google Scholar
Blachman, N.M.: The Amount of Information That y Gives About X. IEEE Transactions on Information Theory IT-14(1), 27–31 (1968)
Article MathSciNet MATH Google Scholar
Hájek, P., Havel, C.M.: The GUHA Method of Automatic Hypotheses Determination. Computing 1, 293–308 (1966)
Article MATH Google Scholar
Quinlan, J.R., Rivest, R.L.: Inferring Decision Trees Using the Minimum Description Length Principle. Information and Computation 80(3), 227–248 (1989)
Article MathSciNet MATH Google Scholar
Wallace, C.S., Patrick, J.D.: Coding Decision Trees. Machine Learning 11(1), 7–22 (1993)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Kyushu University, Fukuoka, 819-0395, Japan
Einoshin Suzuki

Authors

Einoshin Suzuki
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Information Engineering, National University of Tainan, 700, Tainan, Taiwan
Been-Chian Chien
Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan
Tzung-Pei Hong
Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan
Shyi-Ming Chen
Department of Computer Science, Texas State University-San Marcos, 601 University Drive, 78666-4616, San Marcos, TX, USA
Moonis Ali

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Suzuki, E. (2009). Compression-Based Measures for Mining Interesting Rules. In: Chien, BC., Hong, TP., Chen, SM., Ali, M. (eds) Next-Generation Applied Intelligence. IEA/AIE 2009. Lecture Notes in Computer Science(), vol 5579. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02568-6_75

Download citation

DOI: https://doi.org/10.1007/978-3-642-02568-6_75
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02567-9
Online ISBN: 978-3-642-02568-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Compression-Based Measures for Mining Interesting Rules

Abstract

Access this chapter

Preview

Similar content being viewed by others

Mining and Using Sets of Patterns through Compression

The minimum description length principle for pattern mining: a survey

Interesting Patterns

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Compression-Based Measures for Mining Interesting Rules

Abstract

Access this chapter

Preview

Similar content being viewed by others

Mining and Using Sets of Patterns through Compression

The minimum description length principle for pattern mining: a survey

Interesting Patterns

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation