Mining Direct Marketing Data by Ensembles of Weak Learners and Rough Set Methods

  • Jerzy Błaszczyński
  • Krzysztof Dembczyński
  • Wojciech Kotłowski
  • Mariusz Pawłowski
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4081)


This paper describes a prediction problem based on direct marketing data from the Nationwide Products and Services Questionnaire (NPSQ), prepared by the Polish division of Acxiom Corporation. The task is to predict access to the Internet. The unit of analysis is a group of individuals in a given age category living in a given building located in Poland. We built prediction models with several machine learning methods; in particular, we applied ensembles of weak learners and the ModLEM algorithm, which is based on the rough set approach. The paper compares the results produced by these methods and reports some of the problems we encountered during the analysis.
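To illustrate what an ensemble of weak learners means in practice, the following is a minimal sketch and not the authors' implementation. The abstract does not say which ensemble scheme was used, so the sketch assumes AdaBoost over decision stumps as one common choice. The function names and the toy data (a single hypothetical "age category" feature with an access label of +1 or -1) are made up for illustration and are unrelated to the NPSQ data.

```python
import math

def train_stump(X, y, w):
    """Return (weighted_error, feature, threshold, sign) of the best stump."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            for sign in (1, -1):
                # The stump predicts `sign` when the feature is <= t, else -sign.
                preds = [sign if row[j] <= t else -sign for row in X]
                err = sum(wi for wi, p, yi in zip(w, preds, y) if p != yi)
                if best is None or err < best[0]:
                    best = (err, j, t, sign)
    return best

def stump_predict(stump, row):
    _, j, t, sign = stump
    return sign if row[j] <= t else -sign

def adaboost(X, y, rounds):
    """Fit `rounds` stumps, reweighting misclassified examples each round."""
    n = len(X)
    w = [1.0 / n] * n
    ensemble = []
    for _ in range(rounds):
        stump = train_stump(X, y, w)
        # Clamp the error so the logarithm below stays finite.
        err = min(max(stump[0], 1e-10), 1 - 1e-10)
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, stump))
        w = [wi * math.exp(-alpha * yi * stump_predict(stump, row))
             for wi, row, yi in zip(w, X, y)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble

def predict(ensemble, row):
    """Weighted vote of all stumps in the ensemble."""
    score = sum(alpha * stump_predict(s, row) for alpha, s in ensemble)
    return 1 if score >= 0 else -1

# Hypothetical toy data: only the middle categories have access (+1).
X = [[1], [2], [3], [4], [5], [6]]
y = [-1, -1, 1, 1, -1, -1]

single = train_stump(X, y, [1.0 / len(X)] * len(X))
ensemble = adaboost(X, y, rounds=3)
predictions = [predict(ensemble, row) for row in X]
print("single stump weighted error:", single[0])
print("ensemble predictions:", predictions)
```

No single threshold separates the middle categories from both ends, so one stump necessarily misclassifies part of this toy set. The weighted vote of several stumps can represent the interval, which is the basic motivation for combining weak learners.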


Keywords: Support Vector Machine; Weak Learner; Rule Induction; Decision Class; Direct Marketing





Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Jerzy Błaszczyński (1)
  • Krzysztof Dembczyński (1)
  • Wojciech Kotłowski (1)
  • Mariusz Pawłowski (2)
  1. Institute of Computing Science, Poznań University of Technology, Poznań, Poland
  2. Acxiom Polska, Warszawa, Poland
