Abstract
Most machine learning (ML) platforms used in materials science provide prediction models built using a computational database. However, to provide more practical and accurate material property predictions, it is advantageous to build a prediction model with user’s own data. Here, we present a web-based ML platform, SimPL-ML (https://www.simpl-ml.org) that enables the user to build an ML prediction model through a simple process using their own data. Our platform, SimPL-ML, comprises four main parts: a dataset editor for dataset preprocessing, a model generator to perform actual model training, a predictor to provide a predicted target value for an arbitrary input, and a band-gap predictor (as an example case study) to predict the band gap of inorganic materials through several optimized band gap prediction models. In addition to its core functions, SimPL-ML provides additional functions such as atomic feature generation and hyper-parameter optimization for efficient ML research. We expect our platform to facilitate more accurate and efficient materials research through ML.
Similar content being viewed by others
References
Butler KT, Davies DW, Cartwright H, Isayev O, Walsh A (2018) Machine learning for molecular and materials science. Nature 559:547–555
Xie T, Grossman JC (2018) Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties. Phys Rev Lett 120:145301
Ward L, Agrawal A, Choudhary A, Wolverton C (2016) A general-purpose machine learning framework for predicting properties of inorganic materials. NPJ Comput Mater 2:201628
Cai J, Chu X, Xu K, Li H, Wei J (2020) Machine learning-driven new material discovery. Nanoscale Adv 2:3115–3130
Jha D, Choudhary K, Tavazza F, Liao W, Choudhary A, Campbell C, Agrawal A (2019) Enhancing materials property prediction by leveraging computational and experimental data using deep transfer learning. Nat Commun 10:5316
Clement CL, Kauwe SK, Sparks TD (2020) Benchmark AFLOW data sets for machine learning. Integr Mater Manuf Innov 9:153–156
Hansen K, Biegler F, Ramakrishnan R, Pronobis W, Lilienfeld OA, Müller KR, Tkatchenko A (2015) Machine learning predictions of molecular properties: accurate many-body potentials and nonlocality in chemical space. J Phys Chem Lett 6:2326–2331
Binder A, Bockmayr M, Hägele M, Wienert S, Heim D, Hellweg K, Ishii M, Stenzinger A, Hocke A, Denkert C, Müller KR, Klauschen F (2021) Morphological and molecular breast cancer profiling through explainable machine learning. Nat Mach Intell 3:355–366
Heinen S, Rudorff GF, Lilienfeld OA (2021) Toward the design of chemical reactions: Machine learning barriers of competing mechanisms in reactant space. J Chem Phys 155:064105
Tkatchenko A (2020) Machine learning for chemical discovery. Nat Commun 11:4125
Deringer VL, Caro MA, Csányi G (2019) Machine learning interatomic potentials as emerging tools for materials science. Adv Mater 31:1902765
Na GS, Jang S, Chang H (2021) Predicting thermoelectric properties from chemical formula with explicitly identifying dopant effects. NPJ Comput Mater 7:106
Na GS, Jang S, Lee YL, Chang H (2020) Tuplewise material representation based machine learning for accurate band gap prediction. J Phys Chem A 124:10616–10623
Artrith N, Urban A, Ceder G (2017) Efficient and accurate machine-learning interpolation of atomic energies in compositions with many species. Phys Rev B 96:014112
Bartel CJ, Trewartha A, Wang Q, Dunn A, Jain A, Ceder G (2020) A critical examination of compound stability predictions from machine-learned formation energies. NPJ Comput Mater 6:97
Duan C, Liu F, Nandy A, Kulik HJ (2021) Putting density functional theory to the test in machine-learning-accelerated materials discovery. J Phys Chem Lett 12:4628–4637
Kim HW, Lee SW, Na GS, Han SJ, Kim SK, Shin JH, Chang H, Kim YT (2021) Reaction condition optimization for non-oxidative conversion of methane using artificial intelligence. React Chem Eng 6:235–243
AFLOW-ML http://aflow.org/aflow-ml. Accessed Oct 2021
JARVIS ML https://jarvis.nist.gov/jarvisml. Accessed Oct 2021
Kresse G, Furthmuller J (1996) Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys Rev B 54:11169–11186
Olson RS, Sipper M, Cava WL, Tartarone S, Vitale S, Fu W, Orzechowski P, Urbanowicz RJ, Holmes JH, Moore JH (2017) A system for accessible artificial intelligence. arXiv 1705:00594v2
Amazon SageMaker (2021) https://docs.aws.amazon.com/sagemaker/latest/dg/whatis.html. Accessed Dec 2021
Rapidminer Go (2021) https://rapidminer.com/why-rapidminer/. Accessed Dec 2021
Li Z, Chard R, Ward L, Chard K, Skluzacek TJ, Babuji Y, Woodard A, Tuecke S, Blaiszik B, Franklin MJ, Foster I (2021) DLHub: simplifying publication, discovery, and use of machine learning models in science. J Parallel Distrib Comput 147:64–76
Algorithmia (2021) https://algorithmia.com/. Accessed Dec 2021
Crankshaw D, Wang X, Zhou G, Franklin MJ, Gonzalez JE, Stoica I (2017) Clipper: a low-latency online prediction serving system. In: 14th USENIX symposium on networked systems design and implementation (NSDI), pp 613–627
Jang S SimPL-ML. https://simpl-ml.org. Accessed Oct 2021
Chen T, Guestrin C (2016) XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 785–794
Gu T, Lu W, Bao X, Chen N (2006) Using support vector regression for the prediction of the band gap and melting point of binary and ternary compound semiconductors. Solid State Sci 8:129–136
Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 29:1189–1232
Perdew JP, Burke K, Ernzerhof M (1996) Generalized gradient approximation made simple. Phys Rev Lett 77:3865
Morales-García Á, Valero R, Illas F (2017) An empirical, yet practical way to predict the band gap in solids by using density functional band structure calculations. J Phys Chem C 121:18862–18866
Zhuo Y, Tehrani AM, Brgoch J (2018) Predicting the band gaps of inorganic solids by machine learning. J Phys Chem Lett 9:1668–1673
Kaggle https://www.kaggle.com/datasets. Accessed Oct 2021
Acknowledgements
This research was supported by the core KRICT project from the Korea Research Institute of Chemical Technology (SI2151-10) and the Nano-Material Technology Development Program through the National Research Foundation of Korea (NRF-2016M3A7B4025408).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Jang, S., Na, G.S., Lee, J. et al. An Easy, Simple, and Accessible Web-based Machine Learning Platform, SimPL-ML. Integr Mater Manuf Innov 11, 85–94 (2022). https://doi.org/10.1007/s40192-022-00250-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40192-022-00250-x