Classification for Multi-Relational Data Mining Using Bayesian Belief Network
Multi-Relational Data Mining is an active area of research from last decade. Relational database is an important source of structured data, hence richest source of knowledge. Most of the commercial and application oriented data uses a relational database scheme in which multiple relations are linked through primary key, foreign key relationship. Multi-Relational Data Mining (MRDM) deals with extraction of information from a relational database containing multiple tables related to each other. In order to extract important information or knowledge, it is required to apply Data Mining algorithms on this relational database but most of these algorithms work only on a single table. Generating a single table from multiple tables may result in loss of important information, like the relation between tuples, also it is a not efficient in terms of time and space. In this paper, we proposed a Probabilistic Graphical Model, Bayesian Belief Network (BBN), based approach that considers not only attributes of the table but also the relation between tables. The conditional dependencies between tables are derived from Semantic Relationship Graph (SRG) of the relational database, whereas Tuple Id propagation helps to derive the conditional probability of tables. Our model not only predicts class label of unknown samples, but also gives the value of sample if class label is known.
KeywordsMulti-Relational Data Mining Relational database Bayesian Belief Network Data Mining Probabilistic graphical model Semantic Relationship Graph Tuple Id propagation
Unable to display preview. Download preview PDF.
- 1.Sašo, D., Lavrač, N.: An introduction to inductive logic programming. In: Relational Data Mining, pp. 48–73 (2001)Google Scholar
- 2.Lavrac, N., Dzeroski, S.: Inductive Logic Programming: Techniques and Applications. Ellis Horwood (1994)Google Scholar
- 6.Muggleton, S., Feng, C.: Efficient Induction of Logic Programs. In: Proceedings of Conference on Algorithmic Learning Theory (1990)Google Scholar
- 7.Pompe, U., Kononenko, I.: Naive Bayesian classifier within ILP-R. In: Proceedings of the 5th International Workshop on Inductive Logic Programming, pp. 417–436 (1995)Google Scholar
- 11.Neville, J., Jensen, D., Gallagher, B., Fairgrieve, R.: Simple Estimators for Relational Bayesian Classifiers. In: International Conference on Data Mining (2003)Google Scholar
- 13.Quinlan, J.R., Cameron-Jones, R.M.: FOIL: A Midterm Report. In: Proceedings of 1993 European Conference on Machine Learning (1993)Google Scholar
- 14.Yin, X., Han, J., Yang, J.: Efficient Multi-relational Classification by Tuple ID Propagation. In: Proceedings of the KDD-2003 Workshop on Multi-Relational Data Mining (2003)Google Scholar