Abstract
In this chapter, we explore how Genetic Programming can assist and augment the expert-driven process of developing data-driven models. In our use case, modelers must develop hundreds of models that represent individual properties of parts, components, assets, systems and meta-systems like power plants. Each of these models is developed with an objective in mind, like estimating the useful remaining life or detecting anomalies. As such, the modeler uses their expert judgment, as well as available data to select the most appropriate method. In this initial paper, we examine the most basic example of when the experts select a kind of regression modeling approach and develop models from data. We then use that captured domain knowledge from their processes, as well as end models to determine if Genetic Programming can augment, assist and improve their final results. We show that while Genetic Programming can indeed find improved solutions according to an error metric, it is much harder for Genetic Programming to find models that do not increase complexity. Also, we find that one approach in particular shows promise as a way to incorporate domain knowledge.
All authors were employed at GE Global Research, Niskayuna, NY, during the preparation of this chapter.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Akbarzadeh-T., M.R., Jamshidi, M.: Incorporating a-priori expert knowledge in genetic algorithms. In: 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation, 1997. CIRA’97. Proceedings, pp. 300–305 (1997)
Bravo, A., Li, T., Su, A.I., Good, B.M., Furlong, L.: Combining machine learning, crowdsourcing and expert knowledge to detect chemical-induced diseases in text. In: Proceedings of the Fifth BioCreative Challenge Evaluation Workshop, pp. 266–273 (2015)
Breazeal, C., Thomaz, A.L.: Learning from human teachers with socially guided exploration. In: IEEE International Conference on Robotics and Automation, 2008. ICRA 2008, pp. 3539–3544 (2008)
Crapo, A., Gustafson, S.: Semantics: revolutionary breakthrough or just another way of doing things? In: Semantic Web: Implications for Technologies and Business Practices, pp. 85–118. Springer International Publishing, Cham (2016)
Crapo, A., Moitra, A.: Toward a unified english-like representation of semantic models, data, and graph patterns for subject matter experts. Int. J. Semantic Comput. 07(03), 215–236 (2013). https://doi.org/10.1142/S1793351X13500025
Fortin, F.A., De Rainville, F.M., Gardner, M.A., Parizeau, M., Gagné, C.: DEAP: evolutionary algorithms made easy. J. Mach. Learn. Res. 13, 2171–2175 (2012)
GE Global Research: Semantic Application Design Language (SADL): Open Source Project on Source Forge (2011). http://sadl.sourceforge.net/sadl.html
Kommenda, M., Kronberger, G., Winkler, S., Affenzeller, M., Wagner, S.: Effects of constant optimization by nonlinear least squares minimization in symbolic regression. In: Proceedings of the 15th Annual Conference Companion on Genetic and Evolutionary Computation, GECCO ’13 Companion, pp. 1121–1128. ACM, New York (2013)
La Cava, W.G., Danai, K.: Gradient-based adaptation of continuous dynamic model structures. Int. J. Syst. Sci. 47(1), 249–263 (2016)
Lu, Q., Ren, J., Wang, Z.: Using genetic programming with prior formula knowledge to solve symbolic regression problem. Comput. Intell. Neurosci. 2016, 17 (2016)
Moore, J.H., White, B.C.: Exploiting expert knowledge in genetic programming for genome-wide genetic analysis. In: Runarsson, T.P., et al. (eds.) Parallel Problem Solving from Nature - PPSN IX: 9th International Conference, Reykjavik, September 9–13, 2006, Proceedings, pp. 969–977. Springer, Berlin (2006)
Sathyanarayanan, S., Joseph, K.S., Jayakumar, S.K.V.: A hybrid population seeding technique based genetic algorithm for stochastic multiple depot vehicle routing problem. In: 2015 International Conference on Computing and Communications Technologies (ICCCT), pp. 119–127 (2015)
Schmidt, M.D., Lipson, H.: Incorporating expert knowledge in evolutionary search: a study of seeding methods. In: Proceedings of the 11th Annual Conference on Genetic and Evolutionary Computation, GECCO ’09, pp. 1091–1098. ACM, New York (2009)
Settles, B.: Active learning literature survey. Tech. Rep. Report 1648, University of Wisconsin, Madison (2010)
W3C OWL Working Group: OWL Web Ontology Language Reference. W3C Recommendation (2004). http://www.w3.org/TR/owl-ref/
Williams, J.W., Cuddihy, P., McHugh, J., Aggour, K.S., Menon, A., Gustafson, S., Healy, T.: Semantics for big data access integration: improving industrial equipment design through increased data usability. In: 2015 IEEE International Conference on Big Data (Big Data), pp. 1103–1112 (2015)
Wu, H., Sun, H., Fang, Y., Hu, K., Xie, Y., Song, Y., Liu, X.: Combining machine learning and crowdsourcing for better understanding commodity reviews. In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, AAAI’15, pp. 4220–4221. AAAI Press, San Francisco (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Gustafson, S., Subramaniyan, A., Yousuf, A. (2018). Assisting Asset Model Development with Evolutionary Augmentation. In: Riolo, R., Worzel, B., Goldman, B., Tozier, B. (eds) Genetic Programming Theory and Practice XIV. Genetic and Evolutionary Computation. Springer, Cham. https://doi.org/10.1007/978-3-319-97088-2_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-97088-2_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-97087-5
Online ISBN: 978-3-319-97088-2
eBook Packages: Computer ScienceComputer Science (R0)