Abstract
Discovering knowledge from data is a quantum jump from quantity to quality, which is the characteristic and the spirit of the development of science. Symbolic regression (SR) is playing a greater role in the discovery of knowledge from data, specifically in this era of exponential data growth, because SRs are able to discover mathematical formulas from data. These formulas may provide scientifically meaningful models, especially when combined with domain knowledge. This article provides an overview of SR applications in the field of materials science and engineering. Integrating domain knowledge with SR is the key and a crucial approach, which allows gaining knowledge from data quickly, accurately, and scientifically. In the data-driven paradigm, SR allows for uncovering the underlying mechanisms of materials behavior, properties, and functions, in a wide range of areas from basic academic research to industrial applications, including experiments and computations, by providing explicit interpretable models from data, in comparison with other machine-learning “black-box” models. SR will be a powerful tool for rational and automatic materials development.
Similar content being viewed by others
References
E.B. Goldstein, G. Coco, Front. Environ. Sci. 3, 1 (2015).
P. Langley, “Rediscovering Physics with BACON.3,” Proc. 6th Int. Jt. Conf. Artif. Intell.—Vol. 1 (Morgan Kaufmann Publishers, 1979), pp. 505–507, http:// dl.acm.org/citation.cfm?id=1624861.1624976.
M. Schmidt, H. Lipson, Science 324, 81 (2009).
H. Schaeffer, S.G. McCalla, Phys. Rev. E 96, 023302 (2017).
S.H. Rudy, S.L. Brunton, J.L. Proctor, J.N. Kutz, Sci. Adv. 3, e1602614 (2017).
L.M. Ghiringhelli, J. Vybiral, E. Ahmetcik, R. Ouyang, S.V. Levchenko, C. Draxl, M. Scheffler, New J. Phys. 19, 023017 (2017).
R. Ouyang, S. Curtarolo, E. Ahmetcik, M. Scheffler, L.M. Ghiringhelli, Phys. Rev. Mater. 2, 083802 (2018).
S. Ramakrishna, T.-Y. Zhang, W.-C. Lu, Q. Qian, J.S.C. Low, J.H.R. Yune, D.Z.L. Tan, S. Bressan, S. Sanvito, S.R. Kalidindi, J. Intell. Manuf. (2018).
A. Agrawal, A. Choudhary, APL Mater. 4, 053208 (2016).
Y. Wang, N. Wagner, J.M. Rondinelli, “Symbolic Regression in Materials Science,” submitted arXiv:1901.04136 (2019), http://arxiv.org/abs/1901.04136 (accessed March 26, 2019).
P. Praks, D. Brkić, Water 10, 1175 (2018).
E.B. Goldstein, G. Coco, A.B. Murray, M.O. Green, Earth Surf. Dyn. 2, 67 (2014).
M.P. Hinchliffe, M.J. Willis, Comput. Chem. Eng. 27, 1841 (2003).
D.L. Ly, H. Lipson, J. Mach. Learn. Res. 13, 3585 (2012).
T.W. Cornforth, H. Lipson, Genet. Program. Evolvable Mach. 14, 155 (2013).
J. Gout, M. Quade, K. Shafi, R.K. Niven, M. Abel, Nonlinear Dyn. 91, 1001 (2018).
M. Quade, M. Abel, K. Shafi, R.K. Niven, B.R. Noack, Phys. Rev. E 94, 012214 (2016).
K.A. De Jong, Evolutionary Computation: A Unified Approach (MIT Press, Cambridge, MA, 2006).
J.R. Koza, Genetic Programming: On the Programming of Computers by Means of Natural Selection (MIT Press, Cambridge, MA, 1992).
R.I. McKay, N.X. Hoai, P.A. Whigham, Y. Shan, M. O’Neill, Genet. Program. Evolvable Mach. 11, 365 (2010).
C. Ryan, J.J. Collins, M.O. Neill, “Grammatical Evolution: Evolving Programs for an Arbitrary Language,” Eur. Conf. Genet. Program. (Springer, 1998), pp. 83–96.
J. Miller, Ed., Cartesian Genetic Programming (Springer, Heidelberg, Germany, 2011).
R. Poli, W.B. Langdon, N.F. McPhee, J.R. Koza, A Field Guide to Genetic Programming (Lulu Press, Morrisville, NC, 2008).
S. Sette, L. Boullart, Eng. Appl. Artif. Intell. 14, 727 (2001).
D.J. Gunaratnam, T. Degroff, J.S. Gero, Appl. Soft Comput. 2, 283 (2003).
I.M. Jamadar, D.P. Vakharia, Measurement 94, 177 (2016).
P.J. Angeline, K.E. Kinnear, “On Using Syntactic Constraints with Genetic Programming,” in Advances in Genetic Programming (MIT Press, Cambridge, MA, 1996), https://ieeexplore.ieee.org/document/6277529 (accessed March 23, 2019).
A. Ratle, M. Sebag, Appl. Soft Comput. 1, 105 (2001).
C. Ryan, M. O’Neill, J. Collins, Eds., Handbook of Grammatical Evolution (Springer International Publishing, Cham, Switzerland, 2018).
A.H. Gandomi, A.H. Alavi, C. Ryan, Eds., Handbook of Genetic Programming Applications (Springer International Publishing, Cham, 2015).
K. Sastry, D.D. Johnson, D.E. Goldberg, P. Bellon, Phys. Rev. B 72, 085438 (2005).
K. Sastry, D.D. Johnson, D.E. Goldberg, P. Bellon, Int. J. Multiscale Comput. Eng. 2, 239 (2004).
H.A. Padilla, S.F. Harnish, B.E. Gore, A.J. Beaudoin, J.A. Dantzig, I.M. Robertson, H. Weiland, “High Temperature Deformation and Hot Rolling of AA7055,” Metallurgical Modeling for Aluminum Alloys, Proc. Mater. Solutions Conf. 2001: 1st Int. Symp. Metall. Model. Alum. Alloys, M. Tiryakioglu, L.A. Lalli, Eds. (ASM International, Materials Park, OH, 2003), pp. 1–8.
J. Behler, J. Chem. Phys. 145, 170901 (2016).
W. Li, Y. Ando, E. Minamitani, S. Watanabe, J. Chem. Phys. 147, 214106 (2017).
S.K. Natarajan, J. Behler, Phys. Chem. Chem. Phys. 18, 28704 (2016).
P. Wang, Y. Shao, H. Wang, W. Yang, Extreme Mech. Lett. 24, 1 (2018).
J. Behler, Int. J. Quantum Chem. 115, 1032 (2015).
P.E. Dolgirev, I.A. Kruglov, A.R. Oganov, AIP Adv. 6, 085318 (2016).
F. Fracchia, G. Del Frate, G. Mancini, W. Rocchia, V. Barone, J. Chem. Theory Comput. 14, 255 (2017).
V.L. Deringer, G. Csányi, Phys. Rev. B 95, 094203 (2017).
A. Glielmo, P. Sollich, A. De Vita, Phys. Rev. B 95, 214302 (2017).
Z. Li, J.R. Kermode, A. De Vita, Phys. Rev. Lett. 114, 096405 (2015).
W.M. Brown, A.P. Thompson, P.A. Schultz, J. Chem. Phys. 132, 024108 (2010).
A. Kenoufi, K.T. Kholmurodov, Biol. Chem. Res. 2, 1 (2015).
D.E. Makarov, H. Metiu, J. Chem. Phys. 108, 590 (1998).
A. Slepoy, M.D. Peters, A.P. Thompson, J. Comput. Chem. 28, 2465 (2007).
A. Hernandez, A. Balasubramanian, F. Yuan, S. Mason, T. Mueller, “Fast, Accurate, and Transferable Many-Body Interatomic Potentials by Genetic Programming,” submitted arXiv:1904.01095 (2019), http://arxiv.org/abs/1904.01095 (accessed April 10, 2019).
A.A. Javadi, M. Rezania, Adv. Eng. Inform. 23, 442 (2009).
A. Faramarzi, A.M. Alani, A.A. Javadi, Comput. Struct. 137, 63 (2014).
A.H. Gandomi, S. Sajedi, B. Kiani, Q. Huang, Autom. Constr. 70, 89 (2016).
A.H. Gandomi, A.H. Alavi, Neural Comput. Appl. 21, 171 (2012).
D. Versino, A. Tonda, C.A. Bronkhorst, Comput. Methods Appl. Mech. Eng. 318, 981 (2017).
D.L. Preston, D.L. Tonks, D.C. Wallace, J. Appl. Phys. 93, 211 (2003).
P.S. Follansbee, U.F. Kocks, Acta Metall. 36, 81 (1988).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Sun, S., Ouyang, R., Zhang, B. et al. Data-driven discovery of formulas by symbolic regression. MRS Bulletin 44, 559–564 (2019). https://doi.org/10.1557/mrs.2019.156
Published:
Issue Date:
DOI: https://doi.org/10.1557/mrs.2019.156