Abstract
Neural networks (NNs), as one of the most robust and efficient machine learning methods, have been commonly used in solving several problems. However, choosing proper hyperparameters (e.g. the numbers of layers and neurons in each layer) has a significant influence on the accuracy of these methods. Therefore, a considerable number of studies have been carried out to optimize the NN hyperparameters. In this study, the genetic algorithm is applied to NN to find the optimal hyperparameters. Thus, the deep energy method, which contains a deep neural network, is applied first on a Timoshenko beam and a plate with a hole. Subsequently, the numbers of hidden layers, integration points, and neurons in each layer are optimized to reach the highest accuracy to predict the stress distribution through these structures. Thus, applying the proper optimization method on NN leads to significant increase in the NN prediction accuracy after conducting the optimization in various examples.
Abstract
目的
证明超参数优化对深度能量方法 (DEM) 精度的影响以及DEM在预测不同荷载作用下梁和板等结构的应力分布方面的能力。
创新点
1. 为了提高DEM的准确性, 各种超参数组合被输入遗传算法 (GA) 并找到最佳组合。2. 为了防止重复计算以及提高这种元启发式算法的效率, GA过程中还考虑了超参数组合的禁忌列表。
方法
1. 实施非均匀有理样条 (NURBS) 以生成穿过结构体和边界的积分点。2. 采用DEM计算位移和应力分布。3. 利用遗传算法优化DEM的超参数, 以对模型在预测结构内应力和位移传播的准确性方面具有显着影响。
结论
1. 在不同的优化器和激活函数中, Adam和L-BFGS-B方法以及ReLU2函数的组合使得DEM模型的准确率最高。2. 其他对模型预测准确性有影响的超参数包括隐藏层的数量, 每层神经元的数量以及通过上述结构集成的点数。3. 优化DEM的超参数可以使相对应变能误差降低近50%, 提高了DEM模型对应力和位移分布的预测能力。
Similar content being viewed by others
References
Alhichri H, Alajlan N, Bazi Y, et al., 2018. Multi-scale convolutional neural network for remote sensing scene classification. IEEE International Conference on Electro/Information Technology, p.1–5. https://doi.org/10.1109/EIT.2018.8500107
Anitescu C, Hossain MN, Rabczuk T, 2018. Recovery-based error estimation and adaptivity using high-order splines over hierarchical T-meshes. Computer Methods in Applied Mechanics and Engineering, 328:638–662. https://doi.org/10.1016/j.cma.2017.08.032
Augarde CE, Deeks AJ, 2008. The use of Timoshenko’s exact solution for a cantilever beam in adaptive analysis. Finite Elements in Analysis and Design, 44(9–10):595–601. https://doi.org/10.1016/j.finel.2008.01.010
Bacanin N, Bezdan T, Tuba E, et al., 2020. Optimizing convolutional neural network hyperparameters by enhanced swarm intelligence metaheuristics. Algorithms, 13(3):67. https://doi.org/10.3390/a13030067
Bani-Hani D, Khan N, Alsultan F, et al., 2018. Classification of leucocytes using convolutional neural network optimized through genetic algorithm. Proceedings of the 7th Annual World Conference of the Society for Industrial and Systems Engineering.
Bergstra J, Bengio Y, 2012. Random search for hyperparameter optimization. The Journal of Machine Learning Research, 13(1):281–305.
Dalto M, Matuško J, Vašak M, 2015. Deep neural networks for ultra-short-term wind forecasting. IEEE International Conference on Industrial Technology, p.1657–1663. https://doi.org/10.1109/ICIT.2015.7125335
Goswami S, Anitescu C, Chakraborty S, et al., 2020. Transfer learning enhanced physics informed neural network for phase-field modeling of fracture. Theoretical and Applied Fracture Mechanics, 106:102447. https://doi.org/10.1016/j.tafmec.2019.102447
Guo BS, Hu JW, Wu WW, et al., 2019. The Tabu_genetic algorithm: a novel method for hyper-parameter optimization of learning algorithms. Electronics, 8(5):579. https://doi.org/10.3390/electronics8050579
Jo Y, Min K, Jung D, et al., 2019. Comparative study of the artificial neural network with three hyper-parameter optimization methods for the precise LP-EGR estimation using in-cylinder pressure in a turbocharged GDI engine. Applied Thermal Engineering, 149:1324–1334. https://doi.org/10.1016/j.applthermaleng.2018.12.139
Junior FEF, Yen GG, 2019. Particle swarm optimization of deep neural networks architectures for image classification. Swarm and Evolutionary Computation, 49:62–74. https://doi.org/10.1016/j.swevo.2019.05.010
Kanada Y, 2016. Optimizing neural-network learning rate by using a genetic algorithm with per-epoch mutations. International Joint Conference on Neural Networks, p.1472–1479. https://doi.org/10.1109/IJCNN.2016.7727372
Kaur S, Aggarwal H, Rani R, 2020. Hyper-parameter optimization of deep learning model for prediction of Parkinson’s disease. Machine Vision and Applications, 31(5):32. https://doi.org/10.1007/s00138-020-01078-1
Liashchynskyi P, Liashchynskyi P, 2019. Grid search, random search, genetic algorithm: a big comparison for NAS. https://arxiv.org/abs/1912.06059
Loussaief S, Abdelkrim A, 2018. Convolutional neural network hyper-parameters optimization based on genetic algorithms. International Journal of Advanced Computer Science and Applications, 9(10):252–266. https://doi.org/10.14569/IJACSA.2018.091031
Motta D, Santos AÁB, Machado BAS, et al., 2020. Optimization of convolutional neural network hyperparameters for automatic classification of adult mosquitoes. PloS One, 15(7):e0234959. https://doi.org/10.1371/journal.pone.0234959
Najafi B, Faizollahzadeh Ardabili S, Mosavi A, et al., 2018. An intelligent artificial neural network-response surface methodology method for accessing the optimum biodiesel and diesel fuel blending conditions in a diesel engine from the viewpoint of exergy and energy analysis. Energies, 11(4):860. https://doi.org/10.3390/en11040860
Nassif AB, Shahin I, Attili I, et al., 2019. Speech recognition using deep neural networks: a systematic review. IEEE Access, 7:19143–19165. https://doi.org/10.1109/ACCESS.2019.2896880
Nguyen-Thanh VM, Nguyen LTK, Rabczuk T, et al., 2019. A surrogate model for computational homogenization of elastostatics at finite strain using the HDMR-based neural network approximator. https://arxiv.org/abs/1906.02005
Samaniego E, Anitescu C, Goswami S, et al., 2020. An energy approach to the solution of partial differential equations in computational mechanics via machine learning: concepts, implementation and applications. Computer Methods in Applied Mechanics and Engineering, 362:112790. https://doi.org/10.1016/j.cma.2019.112790
Shamshirband S, Mosavi A, Rabczuk T, et al., 2020. Prediction of significant wave height; comparison between nested grid numerical model, and machine learning models of artificial neural networks, extreme learning and support vector machines. Engineering Applications of Computational Fluid Mechanics, 14(1):805–817. https://doi.org/10.1080/19942060.2020.1773932
Torres JF, Gutiérrez-Avilés D, Troncoso A, et al., 2019. Random hyper-parameter search-based deep neural network for power consumption forecasting. International Work-Conference on Artificial Neural Networks, p.259–269. https://doi.org/10.1007/978-3-030-20521-8_22
ul Hassan M, Sabar NR, Song A, 2018. Optimising deep learning by hyper-heuristic approach for classifying good quality images. International Conference on Computational Science, p.528–539. https://doi.org/10.1007/978-3-319-93701-4_41
Wei X, You ZN, 2019. Neural network hyperparameter tuning based on improved genetic algorithm. Proceedings of the 8th International Conference on Computing and Pattern Recognition, p.17–24. https://doi.org/10.1145/3373509.3373554
Wicaksono AS, Supianto AA, 2018. Hyper parameter optimization using genetic algorithm on machine learning methods for online news popularity prediction. International Journal of Advanced Computer Science and Applications, 9(12):263–267. https://doi.org/10.14569/IJACSA.2018.091238
Yu T, Zhu H, 2020. Hyper-parameter optimization: a review of algorithms and applications. https://arxiv.org/abs/2003.05689
Author information
Authors and Affiliations
Corresponding author
Additional information
Contributors
Timon RABCZUK devised the project, verified the computational process, and contributed to the final version of the manuscript. Mohammad SALAVATI contributed to the simulation process. Arvin MOJAHEDIN worked out the technical details, performed the computational calculations, and wrote the manuscript.
Conflict of interest
Arvin MOJAHEDIN, Mohammad SALAVATI, and Timon RABCZUK declare that they have no conflict of interest.
Rights and permissions
About this article
Cite this article
Nikbakht, S., Anitescu, C. & Rabczuk, T. Optimizing the neural network hyperparameters utilizing genetic algorithm. J. Zhejiang Univ. Sci. A 22, 407–426 (2021). https://doi.org/10.1631/jzus.A2000384
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1631/jzus.A2000384