Some Probabilistic Modelling Ideas for Boolean Classification in Genetic Programming

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1802))

Abstract

We discuss the problem of Boolean classification via Genetic Programming. When predictors are numeric, the standard approach classifies according to the sign of the value returned by the evaluated function. We consider an alternative approach in which the magnitude of that value also plays a role in prediction and evaluation. Specifically, the original, unconstrained value is transformed into a probability, which is then used to determine the classification. This idea stems from the well-known logistic regression paradigm and can be seen as an attempt to exploit all the information in each individual function. We investigate the empirical behaviour of these variants and discuss a third evaluation measure, likewise based on probabilistic ideas. To put these ideas in perspective, we present comparative results obtained by alternative methods, namely recursive splitting and logistic regression.
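
The contrast between the sign-based rule and the probability-based view can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy outputs, and the choice of mean log-loss as the probabilistic score are assumptions borrowed from standard logistic regression practice, and the paper's own fitness measures may differ.

```python
import math

def sign_classify(f_value):
    """Sign rule: predict class 1 when the evolved function's output is positive."""
    return 1 if f_value > 0 else 0

def logistic(f_value):
    """Map an unconstrained real value to a probability in (0, 1)."""
    # Two branches keep math.exp from overflowing for large |f_value|.
    if f_value >= 0:
        return 1.0 / (1.0 + math.exp(-f_value))
    e = math.exp(f_value)
    return e / (1.0 + e)

def error_rate(outputs, labels):
    """Fraction of cases misclassified by the sign rule (magnitude is ignored)."""
    wrong = sum(sign_classify(f) != y for f, y in zip(outputs, labels))
    return wrong / len(labels)

def mean_log_loss(outputs, labels, eps=1e-12):
    """Average negative Bernoulli log-likelihood (assumed score, not the paper's)."""
    total = 0.0
    for f, y in zip(outputs, labels):
        # Clip to avoid log(0) when the probability saturates.
        p = min(max(logistic(f), eps), 1.0 - eps)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(labels)

# Hypothetical outputs of two evolved functions on the same four cases.
labels = [1, 0, 1, 0]
outputs_a = [0.1, -0.1, 0.1, -0.1]   # correct signs, small magnitudes
outputs_b = [3.0, -3.0, 3.0, -3.0]   # correct signs, large magnitudes

print(error_rate(outputs_a, labels), error_rate(outputs_b, labels))
print(mean_log_loss(outputs_a, labels), mean_log_loss(outputs_b, labels))
```

Both hypothetical individuals have the same sign-based error rate, but the probabilistic score separates them because it rewards confident correct outputs. Thresholding the probability at 0.5 reproduces the sign rule, so the two views differ in how individuals are evaluated rather than in the final class assigned at that threshold.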




Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Muruzábal, J., Cotta-Porras, C., Fernández, A. (2000). Some Probabilistic Modelling Ideas for Boolean Classification in Genetic Programming. In: Poli, R., Banzhaf, W., Langdon, W.B., Miller, J., Nordin, P., Fogarty, T.C. (eds) Genetic Programming. EuroGP 2000. Lecture Notes in Computer Science, vol 1802. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-46239-2_10

  • DOI: https://doi.org/10.1007/978-3-540-46239-2_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67339-2

  • Online ISBN: 978-3-540-46239-2

  • eBook Packages: Springer Book Archive
