Advertisement

Rough-set inspired approach to knowledge discovery in business databases

  • W. Kowalczyk
  • Z. Piasta
Papers
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1394)

Abstract

We present an approach to knowledge discovery in databases that is based on the idea of the attribute space partition. An inspiration for this approach was the methodology of the rough set theory. Two systems, ProbRough and TRANCE, which are representative of this approach are capable of inducing decision rules from databases with practically unlimited number of objects. The beam search strategy of ProbRough is guided by the global cost criterion and leads to inducing rough classifiers which are mainly intended for making decisions concerning new unseen objects. In case of TRANCE, the exhaustive search strategy in a space of user pre-specified models is guided by the criterion expressed in terms of local properties of the model. The relationships between values of attributes and decisions, detected by both systems, are presented in the form of rules that are easily understood by humans. The presented approach to knowledge discovery is illustrated on two real-world examples from database marketing. The rules induced by ProbRough and TRANCE provided a lot of useful information on customer behavior patterns and about the phenomenon of customer retention.

Key words

induction of rules rough set theory discretization of continuous data noise handling KDD applications in business 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Deogun, J. S., Raghavan, V. V., Sarkar, A. and Sever, H. (1997). Data mining: trends in research and developments. In: Lin, T. Y. and Cercone, N. (eds), Rough Sets and Data Mining, Kluwer Academic Publishers, Boston/ London/ Dordrecht, pp. 9–45.CrossRefGoogle Scholar
  2. 2.
    Fayyad, U. M., Piatetsky-Shapiro, G., Smyth, P. and Uthurusamy, R. (1996). Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, London.Google Scholar
  3. 3.
    Kowalczyk, W. (1998). Rough Data Modelling: a new technique for analyzing data. In: L. Polkowski and A. Skowron (eds.) Rough Sets in Knowledge Discovery, Physica-Verlag (to appear).Google Scholar
  4. 4.
    Kowalczyk, W. and Slisser, F., (1997). Analyzing customer retention with rough data models. In Proceedings of the 1st European Symposium on Principles of Data Mining and Knowledge Discovery, PKDD'97, Trondheim, Norway, Lecture Notes in AI 1263, Springer, pp. 4–13.Google Scholar
  5. 5.
    Kowalczyk, W. (1996). TRANCE: A tool for rough data analysis, classification and clustering. In S. Tsumoto, S. Kobayashi, T. Yokomori, H. Tanaka and A. Nakamura (eds.), Proceedings of the Fourth International Workshop on Rough Sets, Fuzzy Sets, and Machine Discovery, RSFD'96, Tokyo University, pp. 269–275.Google Scholar
  6. 6.
    Kowalczyk, W. (1996). Analyzing temporal patterns with rough sets. In Proceedings of the Fourth European Congress on Intelligent Techniques and Soft Computing, EUFIT'96, Volume I, Aachen, Germany, pp. 139–143.Google Scholar
  7. 7.
    Lenarcik, A. and Piasta, Z. (1994). Rough classifiers. In: Ziarko, W. (ed.), Rough Sets, Fuzzy Sets and Knowledge Discovery, Springer-Verlag, London, pp. 298–316.CrossRefGoogle Scholar
  8. 8.
    Lenarcik, A. and Piasta, Z. (1994). Deterministic rough classifiers. In Conference Proceedings of the Third International Workshop on Rough Sets and Soft Computing, San Jose State University, CA, pp. 434–441.Google Scholar
  9. 9.
    Lenarcik, A. and Piasta, Z. (1997). Probabilistic rough classifiers with mixtures of discrete and continuous condition attributes. In: Lin, T. Y. and Cercone, N. (eds), Rough Sets and Data Mining, Kluwer Academic Publishers, Boston/ London/ Dordrecht, pp. 373–383.CrossRefGoogle Scholar
  10. 10.
    Lin, T. Y. and Wildberger, A. M., (eds) (1995). Soft Computing: Rough Sets, Fuzzy Logic, Neural Networks, Uncertainty Management and Knowledge Discovery, Simulation Councils, Inc., San Diego CA.Google Scholar
  11. 11.
    Michalski, R. S., Bratko, I. and Kubat, M. (eds) (1997), Machine Learning and Data Mining: Methods and Applications, Wiley, New York.Google Scholar
  12. 12.
    Nguyen, H.S. and Nguyen, S.H. (1998). Discretization Methods in Data Mining. In: L. Polkowski and A. Skowron (eds.) Rough Sets in Knowledge Discovery, Physica-Verlag (to appear).Google Scholar
  13. 13.
    Pawlak, Z. (1991). Rough Sets: Theoretical Aspects of Reasoning About Data, Kluwer Academic Publishers, Dordrecht.CrossRefMATHGoogle Scholar
  14. 14.
    Piasta, Z. and Lenarcik, A. (1996). Rule induction with probabilistic rough classifiers, ICS Research Report 24/96, Warsaw University of Technology, to appear in Machine Learning.Google Scholar
  15. 15.
    Piasta, Z. and Lenarcik, A. (1998). Learning rough classifiers from large databases with missing values. In: L. Polkowski and A. Skowron (eds.) Rough Sets in Knowledge Discovery, Physica-Verlag (to appear).Google Scholar
  16. 16.
    Piatetsky-Shapiro, G. (1996). Data mining and knowledge discovery in business databases, In: Ras, Z. W. and Michalewicz M.(eds), Foundations of Intelligent Systems, Springer-Verlag, Berlin Heidelberg, pp. 56–67.CrossRefGoogle Scholar
  17. 17.
    Provost, F. and Fawcett, T. (1997). Analysis and visualization of classifier performance: comparison under imprecise class and cost distributions. In Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, AAAI Press.Google Scholar
  18. 18.
    Quinlan, R. (1986). Induction of decision trees, Machine Learning 1, 81–106.Google Scholar
  19. 19.
    Slowinski, R. (ed.) (1992). Intelligent Decision Support: Handbook of Applications and Advances of the Rough Sets Theory, Kluwer Academic Publishers, Dordrecht.MATHGoogle Scholar
  20. 20.
    Tsumoto, S., Kobayashi, S, Yokomori, T, Tanaka, H and Nakamura, A., (1996) Proceedings of the Fourth International Workshop on Rough Sets, Fuzzy Sets, and Machine Discovery, University of Tokyo.Google Scholar
  21. 21.
    Ziarko, W., (ed.) (1994). Rough Sets, Fuzzy Sets and Knowledge Discovery, Springer-Verlag, London.MATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1998

Authors and Affiliations

  • W. Kowalczyk
    • 1
  • Z. Piasta
    • 2
  1. 1.Department of Mathematics and Computer ScienceVrije Universiteit AmsterdamAmsterdamThe Netherlands
  2. 2.Mathematics DepartmentKielce University of TechnologyKielcePoland

Personalised recommendations