Abstract
In this paper we describe a way to discover attribute associations and a way to present them to users using Bayesian networks. We describe a three-dimensional visualization to present them effectively to users. Furthermore we discuss two applications of attribute associations to the KDD process. One application involves using them to support feature selection. The result of our experiment shows that feature selection using visualized attribute associations works well in 17 data sets out of the 24 that were used. The other application uses them to support the selection of data mining methods. We discuss the possibility of using attribute associations to help in deciding if a given data set is suited to learning decision trees. We found 3 types of structural characteristics in Bayesian networks obtained from the data. The characteristics have strong relevance to the results of learning decision trees.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Cheng, J., Bell, D.A., Liu, W.: Learning belief networks from data: an information theory based approach. In: Proceedings of the Sixth ACM International Conference on Information and Knowledge Management (1997)
Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian Network Classifiers. Machine Learning 29, 131–163 (1997)
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, H.: Advances in Knowledge discovery and data mining. AAAI/MIT Press, Cambridge (1996)
Heckerman, D., Geiger, D., Chickering, D.: Learning Bayesian networks: The Combination of Knowledge and Statistical Data, Technical Report MSR-TR-94-09, Microsoft Research (1994)
John, G., Kohave, R., Pfleger, K.: Irrelevant Features and the Subset Selection Problem. In: Machine Learning: Proceedings of the Eleventh International Conference, pp. 121–129. Morgan Kaufmann Publisher, San Francisco (1994)
Liu, H., Setiono, R.: Feature Selection and Classification-A Probabilistic Wrapper Approach. In: Proc. 9th Int. Conf. on IEA/AIE, pp. 419–424 (1996)
Masuda, G., Sakamoto, N., Ushijima, K.: A Practical Object-Oriented Concept Learning System in Clinical Medicine. In: Proc. 9th Int. Conf. on IEA/AIE, pp. 449–454 (1996)
Masuda, G., Sakamoto, N., Ushijima, K.: Applying Design Patterns to Decision Tree Learning System. In: Proc. of ACM SIGSOFT Sixth International Symposium on the Foundations of Software Engineering, pp. 111–120 (1998)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Mateo (1993)
Blake, C., Keogh, E., Merz, C.J.: UCI Repository of machine learning databases, Department of Information and Computer Science. University of California, Irvine (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Masuda, G., Yano, R., Sakamoto, N., Ushijima, K. (1999). Discovering and Visualizing Attribute Associations Using Bayesian Networks and Their Use in KDD. In: Żytkow, J.M., Rauch, J. (eds) Principles of Data Mining and Knowledge Discovery. PKDD 1999. Lecture Notes in Computer Science(), vol 1704. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-48247-5_7
Download citation
DOI: https://doi.org/10.1007/978-3-540-48247-5_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66490-1
Online ISBN: 978-3-540-48247-5
eBook Packages: Springer Book Archive