Abstract
This chapter asserts that accuracy is poor in current state-of-the-art symbolic regression engines. That is, these engines return a champion with good fitness, but a champion with the correct formula is not forthcoming, even in cases with only one basis function and minimally complex grammar depth. Ideally, for test problems created with no noise, using only functions in the specified grammar, with only one basis function and some minimal grammar depth, users expect state-of-the-art symbolic regression systems to return the exact formula (or at least an isomorph) used to create the test data. Unfortunately, this expectation cannot currently be met using published state-of-the-art symbolic regression techniques. Several classes of test formulas that prove intractable are examined, and an understanding of why they are intractable is developed. Techniques in Abstract Expression Grammars are employed to render these problems tractable, including manipulation of the epigenome during the evolutionary process, together with breeding of multiple targeted epigenomes in separate population islands. A selected set of currently intractable problems is shown to be solvable using these techniques, and a proposal is put forward for a discipline-wide program of improving accuracy in state-of-the-art symbolic regression systems.
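The distinction the abstract draws between good fitness and the correct formula can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the chapter: the target formula, the candidate champion, the sampling ranges, and the use of normalized mean squared error as the fitness measure. The candidate is a truncated Taylor polynomial, which fits the noise-free training data almost perfectly yet is not the formula (or an isomorph of the formula) that generated the data.

```python
import math
import random


def target(x):
    # Hypothetical noise-free test formula with a single basis function.
    return 2.0 + 3.0 * math.cos(x)


def candidate(x):
    # Hypothetical champion: a truncated Taylor polynomial for cos(x).
    # It is a different formula, not an isomorph of the target.
    return 2.0 + 3.0 * (1.0 - x**2 / 2.0 + x**4 / 24.0 - x**6 / 720.0)


def nmse(f, g, xs):
    # Normalized mean squared error of g against f on the sample xs
    # (assumed fitness measure; lower is better).
    ys = [f(x) for x in xs]
    mean = sum(ys) / len(ys)
    var = sum((y - mean) ** 2 for y in ys) / len(ys)
    mse = sum((y - g(x)) ** 2 for x, y in zip(xs, ys)) / len(ys)
    return mse / var


rng = random.Random(0)
train_xs = [rng.uniform(-1.0, 1.0) for _ in range(1000)]   # training range
wide_xs = [rng.uniform(-10.0, 10.0) for _ in range(1000)]  # wider check range

train_error = nmse(target, candidate, train_xs)
wide_error = nmse(target, candidate, wide_xs)

print(f"NMSE on training range: {train_error:.3e}")
print(f"NMSE on wider range:    {wide_error:.3e}")
```

On the training range the candidate's error is negligible, so a fitness-only comparison would accept it. On the wider range the error is large, which shows that the candidate is not the generating formula. Evaluating outside the training range is only a heuristic check for a wrong formula; it does not prove two expressions are isomorphic.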
Copyright information
© 2011 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Korns, M.F. (2011). Accuracy in Symbolic Regression. In: Riolo, R., Vladislavleva, E., Moore, J. (eds) Genetic Programming Theory and Practice IX. Genetic and Evolutionary Computation. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-1770-5_8
DOI: https://doi.org/10.1007/978-1-4614-1770-5_8
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-1769-9
Online ISBN: 978-1-4614-1770-5
eBook Packages: Computer Science, Computer Science (R0)