Abstract
In this paper, we extend monotone monomials as large itemsets in association rule mining to monotone DNF formulas. First, we introduce not only the minimum support but also the maximum overlap, which is a new measure how much all pairs of two monomials in a monotone DNF formula commonly cover data. Next, we design the algorithm dnf_cover to extract coverings as monotone DNF formulas satisfying both the minimum support and the maximum overlap. In the algorithm dnf_cover, first we collect the monomials of which support value is not only more than the minimum support but also less than the minimum support as seeds. Secondly we construct the coverings as monotone DNF formulas, by combining monomials in seeds under the minimum support and the maximum overlap. Finally, we apply the algorithm dnf_cover to bacterial culture data.
This work is partially supported by Japan Society for the Promotion of Science, Grants-in-Aid for Encouragement of Young Scientists (B) 15700137 and for Scientific Research (B) 13558036.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adamo, J.-M.: Data mining for association rules and sequential patterns: Sequential and parallel algorithms. Springer, Heidelberg (2001)
Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I.: Fast discovery of association rules. In: [7], pp. 307–328
Agrawal, R., Srikant, R.: algorithms for mining association rules in large databases. In: Proc. of 20th VLDB, pp. 487–499 (1994)
Angluin, D.: Queries and concept learning. Machine Learning 2, 319–342 (1988)
Arimura, H., Shinohara, T., Otsuki, S.: Finding minimal generalizations for unions of pattern languages and its application to inductive inference from positive data. In: Enjalbert, P., Mayr, E.W., Wagner, K.W. (eds.) STACS 1994. LNCS, vol. 775, pp. 649–660. Springer, Heidelberg (1994)
Džeroski, S., Lavrač, N. (eds.): Relational data mining. Springer, Heidelberg (2001)
Fayyed, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.): Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press (1996)
Matsuoka, K., Fukunami, M., Yokoyama, S., Ichiyama, S., Harao, M., Yamakawa, T., Tsumoto, S., Sugawara, K.: Study on the relationship of patients’ diseases and the occurrence of Anaerobes by using data mining techniques. In: Proc. International Congress of the Confederation of Anaerobes Societies, vol. 186, pp. (1Xa-P2) (2000)
Suzuki, E.: Mining bacterial test data with scheduled discovery of exception rules. In: [10], pp. 34–40
Suzuki, E. (ed.): Proc. International Workshop of KDD Challenge on Real-World Data (KDD Challenge 2000) (2000)
Tsumoto, S.: Guide to the bacteriological examination data set. In: [10], pp. 8–12, Also available at http://www.slab.dnj.ynu.ac.jp/challenge2000
Zhang, C., Zhang, S.: Association Rule Mining. LNCS (LNAI), vol. 2307, p. 25. Springer, Heidelberg (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hirata, K., Nagazumi, R., Harao, M. (2003). Extraction of Coverings as Monotone DNF Formulas. In: Grieser, G., Tanaka, Y., Yamamoto, A. (eds) Discovery Science. DS 2003. Lecture Notes in Computer Science(), vol 2843. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39644-4_15
Download citation
DOI: https://doi.org/10.1007/978-3-540-39644-4_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20293-6
Online ISBN: 978-3-540-39644-4
eBook Packages: Springer Book Archive