Combining Different Data Mining Techniques to Improve Data Analysis
In this paper we propose combining inductive and deductive techniques to improve the data analysis process. Inductive techniques generate hypotheses from data, whereas deductive techniques derive knowledge and verify hypotheses. To guide users through the analysis process, we have developed a system that integrates deductive tools, data mining tools (such as classification algorithms and feature selection algorithms), visualization tools, and tools for the easy manipulation of data sets. The system is currently used in a large project whose aim is to integrate information sources containing data on the socio-economic aspects of Calabria and to analyze the integrated data. Several experiments on socio-economic indicators of Calabrian cities have shown that the combined use of different techniques improves both the comprehensibility and the accuracy of models.
Keywords: Data Mining Technique, Logical Rule, Feature Selection Algorithm, Data Mining Tool, Decision Tree Induction Algorithm