Bloat Free Genetic Programming versus Classification Trees for Identification of Burned Areas in Satellite Imagery

  • Sara Silva
  • Maria J. Vasconcelos
  • Joana B. Melo
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6024)

Abstract

This paper compares Genetic Programming and Classification Trees on a problem of identification of burned areas in satellite imagery. Additionally, it studies how the most recently recognized bloat control technique, Operator Equalisation, affects the quality of the solutions provided by Genetic Programming. The merit of each approach is assessed not only by its classification accuracy, but also by the ability to predict the correctness of its own classifications, and the ability to provide solutions that are human readable and robust to data inaccuracies. The results reveal that both approaches achieve high accuracy with no overfitting, and that Genetic Programming can reveal some surprises and offer interesting advantages even on a simple problem so easily tackled by the popular Classification Trees. Operator Equalisation proved to be crucial.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and regression trees, Wadsworth (1984)Google Scholar
  2. 2.
    Dignum, S., Poli, R.: Generalisation of the limiting distribution of program sizes in tree-based genetic programming and analysis of its effects on bloat. In: Thierens, D., et al. (eds.) Proceedings of GECCO 2007, pp. 1588–1595. ACM Press, New York (2007)CrossRefGoogle Scholar
  3. 3.
    Dignum, S., Poli, R.: Operator equalisation and bloat free GP. In: O’Neill, M., Vanneschi, L., Gustafson, S., Esparcia Alcázar, A.I., De Falco, I., Della Cioppa, A., Tarantino, E. (eds.) EuroGP 2008. LNCS, vol. 4971, pp. 110–121. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  4. 4.
    Dignum, S., Poli, R.: Crossover, sampling, bloat and the harmful effects of size limits. In: O’Neill, M., Vanneschi, L., Gustafson, S., Esparcia Alcázar, A.I., De Falco, I., Della Cioppa, A., Tarantino, E. (eds.) EuroGP 2008. LNCS, vol. 4971, pp. 158–169. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  5. 5.
    Kohavi, R., Quinlan, J.R.: Decision-tree discovery. In: Klosgen, W., Zytkow, J.M. (eds.) Handbook of Data Mining and Knowledge Discovery, ch. 16.1.3, pp. 267–276. Oxford University Press, Oxford (2002)Google Scholar
  6. 6.
    Koza, J.R.: Genetic programming – on the programming of computers by means of natural selection. MIT Press, Cambridge (1992)MATHGoogle Scholar
  7. 7.
    Langdon, W.B., Poli, R.: Foundations of Genetic Programming. Springer, Heidelberg (2002)MATHGoogle Scholar
  8. 8.
    Pereira, J.M.C., Sá, A.C.L., Sousa, A.M.O., Silva, J.M.N., Santos, T.N., Carreiras, J.M.B.: Spectral characterisation and discrimination of burnt areas. In: Chuvieco, E. (ed.) Remote Sensing of Large Wildfires in the European Mediterranean Basin, pp. 123–138. Springer, Heidelberg (1999)Google Scholar
  9. 9.
    Poli, R., Langdon, W.B., Dignum, S.: On the limiting distribution of program sizes in tree-based genetic programming. In: Ebner, M., O’Neill, M., Ekárt, A., Vanneschi, L., Esparcia-Alcázar, A.I. (eds.) EuroGP 2007. LNCS, vol. 4445, pp. 193–204. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  10. 10.
    Poli, R., Langdon, W.B., McPhee, N.F.: A field guide to genetic programming, http://www.gp-field-guide.org.uk (2008) (With contributions by J. R. Koza), http://lulu.com
  11. 11.
    Poli, R., McPhee, N.F., Vanneschi, L.: The impact of population size on code growth in GP: analysis and empirical validation. In: Keijzer, M., et al. (eds.) Proceedings of GECCO 2008, pp. 1275–1282. ACM Press, New York (2008)CrossRefGoogle Scholar
  12. 12.
    Silva, S.: Controlling bloat: individual and population based approaches in genetic programming. PhD thesis, Dep. Informatics Engineering, Univ. Coimbra (2008)Google Scholar
  13. 13.
    Silva, S., Costa, E.: Dynamic limits for bloat control in genetic programming and a review of past and current bloat theories. Genet. Program. Evolvable Mach. 10(2), 141–179 (2009)CrossRefMathSciNetGoogle Scholar
  14. 14.
    Silva, S., Dignum, S.: Extending operator equalisation: Fitness based self adaptive length distribution for bloat free GP. In: Vanneschi, L., Gustafson, S., Moraglio, A., De Falco, I., Ebner, M. (eds.) EuroGP 2009. LNCS, vol. 5481, pp. 159–170. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  15. 15.
    Silva, S., Vanneschi, L.: Operator equalisation, bloat and overfitting - a study on human oral bioavailability prediction. In: Rothlauf, F., et al. (eds.) Proceedings of GECCO 2009, pp. 1115–1122. ACM Press, New York (2009)CrossRefGoogle Scholar
  16. 16.
    Vanneschi, L., Silva, S.: Using operator equalisation for prediction of drug toxicity with genetic programming. In: Lopes, L.S., Lau, N., Mariano, P., Rocha, L.M., et al. (eds.) EPIA 2009. LNCS, vol. 5816, pp. 65–76. Springer, Heidelberg (2009)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Sara Silva
    • 1
    • 2
  • Maria J. Vasconcelos
    • 3
  • Joana B. Melo
    • 3
    • 4
  1. 1.INESC-ID LisboaPortugal
  2. 2.Center for Informatics and Systems of the University of CoimbraPortugal
  3. 3.Tropical Research InstituteLisbonPortugal
  4. 4.Instituto Superior de AgronomiaUTLPortugal

Personalised recommendations