Evolving Simple Symbolic Regression Models by Multi-Objective Genetic Programming
In this chapter we examine how multi-objective genetic programming can be used to perform symbolic regression and compare its performance to single-objective genetic programming. Multi-objective optimization is implemented by using a slightly adapted version of NSGA-II, where the optimization objectives are the model’s prediction accuracy and its complexity. As the model complexity is explicitly defined as an objective, the evolved symbolic regression models are simpler and more parsimonious when compared to models generated by a single-objective algorithm. Furthermore, we define a new complexity measure that includes syntactical and semantic information about the model, while still being efficiently computed, and demonstrate its performance on several benchmark problems. As a result of the multi-objective approach the appropriate model length and the functions included in the models are automatically determined without the necessity to specify them a-priori.
KeywordsSymbolic regression Complexity measures Multi-objective optimization Genetic programming NSGA-II
The work described in this paper was done within the COMET Project Heuristic Optimization in Production and Logistics (HOPL), #843532 funded by the Austrian Research Promotion Agency (FFG).
- Affenzeller M, Winkler S, Kronberger G, Kommenda M, Burlacu B, Wagner S (2014) Gaining deeper insights in symbolic regression. In: Riolo R, Moore JH, Kotanchek M (eds) Genetic programming theory and practice XI. Genetic and evolutionary computation. Springer, New YorkGoogle Scholar
- Luke S, Panait L (2002) Lexicographic Parsimony Pressure. In: Langdon WB, Cantu ′-Paz E, Mathias K, Roy R, Davis D, Poli R, Balakrishnan K, Honavar V, Rudolph G, Wegener J, Bull L, Potter MA, Schultz AC, Miller JF, Burke E, Jonoska N (eds) Proceedings of the genetic and evolutionary computation conference (GECCO’2002). Morgan Kaufmann Publishers, San Francisco, CA, pp 829–836Google Scholar
- Vladislavleva EJ, Smits GF, Den Hertog D (2009) Order of nonlinearity as a complexity measure for models generated by symbolic regression via Pareto genetic programming. IEEE Trans Evol Comput 13(2):333–349Google Scholar
- Wagner S (2009) Heuristic optimization software systems - modeling of heuristic optimization algorithms in the heuristiclab software environment. Ph.D. thesis, Institute for Formal Models and Verification, Johannes Kepler University, LinzGoogle Scholar