Optimizing the neural network hyperparameters utilizing genetic algorithm

Nikbakht, Saeid; Anitescu, Cosmin; Rabczuk, Timon

doi:10.1631/jzus.A2000384

Article
Published: 30 June 2021

Journal of Zhejiang University-SCIENCE A, Volume 22, pages 407–426 (2021)

Abstract

Neural networks (NNs) are among the most robust and efficient machine learning methods and have been applied to a wide range of problems. Their accuracy, however, depends strongly on the choice of hyperparameters (e.g. the number of layers and the number of neurons in each layer), and a considerable number of studies have therefore addressed NN hyperparameter optimization. In this study, a genetic algorithm is applied to an NN to find the optimal hyperparameters. The deep energy method, which contains a deep neural network, is first applied to a Timoshenko beam and a plate with a hole. Subsequently, the numbers of hidden layers, integration points, and neurons in each layer are optimized to reach the highest accuracy in predicting the stress distribution through these structures. Across the examples considered, applying a proper optimization method to the NN leads to a significant increase in its prediction accuracy.
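The search loop described in the abstract can be illustrated with a minimal genetic algorithm sketch. Everything specific here is an assumption for illustration: the value ranges in `SEARCH_SPACE`, the selection and mutation settings, and above all `surrogate_error`, a toy stand-in for training the network and measuring its error. It is not the authors' implementation.

```python
import random

# Hypothetical search space; the ranges are illustrative, not taken from the paper.
SEARCH_SPACE = {
    "hidden_layers": [1, 2, 3, 4, 5],
    "neurons_per_layer": [10, 20, 30, 40, 50],
    "integration_points": [20, 40, 60, 80],
}

def surrogate_error(individual):
    """Stand-in for training a network and returning its error.

    A real run would train the model with these hyperparameters. This toy
    function has a known minimum at (3, 30, 60) so the sketch is testable.
    """
    layers, neurons, points = individual
    return abs(layers - 3) + abs(neurons - 30) / 10 + abs(points - 60) / 20

def random_individual(rng):
    return tuple(rng.choice(values) for values in SEARCH_SPACE.values())

def crossover(parent_a, parent_b, rng):
    # Uniform crossover: each gene comes from either parent with equal probability.
    return tuple(rng.choice(pair) for pair in zip(parent_a, parent_b))

def mutate(individual, rng, rate=0.2):
    # Each gene is resampled from its allowed values with probability `rate`.
    genes = list(individual)
    for i, values in enumerate(SEARCH_SPACE.values()):
        if rng.random() < rate:
            genes[i] = rng.choice(values)
    return tuple(genes)

def genetic_search(pop_size=10, generations=15, seed=0):
    rng = random.Random(seed)
    population = [random_individual(rng) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=surrogate_error)
        survivors = population[: pop_size // 2]  # elitist truncation selection
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = rng.sample(survivors, 2)
            children.append(mutate(crossover(a, b, rng), rng))
        population = survivors + children
    return min(population, key=surrogate_error)

best = genetic_search()
print(best, surrogate_error(best))
```

Because the better half of each generation survives unchanged, the best error found never gets worse from one generation to the next. In practice the fitness call dominates the cost, since each evaluation means training a network.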

Abstract

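Objective

To demonstrate the effect of hyperparameter optimization on the accuracy of the deep energy method (DEM), and the ability of DEM to predict the stress distribution in structures such as beams and plates under different loads.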

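Innovations

1. To improve the accuracy of DEM, various hyperparameter combinations are fed into a genetic algorithm (GA) to find the best combination.
2. To avoid repeated computation and improve the efficiency of this metaheuristic algorithm, a tabu list of hyperparameter combinations is also maintained during the GA process.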
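The tabu list in item 2 can be sketched as a record of hyperparameter combinations that have already been evaluated. The class and function names below are hypothetical, and whether the paper's tabu list reuses stored results or rejects repeated candidates outright is not stated in this summary; the sketch supports both uses.

```python
class TabuEvaluator:
    """Wraps an expensive fitness function with a tabu list of seen combinations."""

    def __init__(self, fitness_fn):
        self.fitness_fn = fitness_fn
        self.tabu = {}            # combination -> stored fitness value
        self.expensive_calls = 0  # how often the wrapped function really ran

    def is_tabu(self, combination):
        return combination in self.tabu

    def evaluate(self, combination):
        # Only combinations not yet on the tabu list trigger a real evaluation.
        if combination not in self.tabu:
            self.expensive_calls += 1
            self.tabu[combination] = self.fitness_fn(combination)
        return self.tabu[combination]

def propose_new(candidates, evaluator):
    """Return the first candidate not on the tabu list, or None if all are tabu."""
    for candidate in candidates:
        if not evaluator.is_tabu(candidate):
            return candidate
    return None

# Usage with a toy fitness function (sum of the hyperparameter values).
evaluator = TabuEvaluator(lambda combo: sum(combo))
for combo in [(3, 30), (4, 20), (3, 30), (3, 30)]:
    evaluator.evaluate(combo)
print(evaluator.expensive_calls)
next_candidate = propose_new([(3, 30), (5, 10)], evaluator)
```

Hyperparameter tuples are hashable, so a dictionary keyed by the combination gives constant-time lookups even after many generations.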

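Methods

1. Non-uniform rational B-splines (NURBS) are used to generate integration points throughout the body and boundary of the structure.
2. DEM is used to compute the displacement and stress distributions.
3. A genetic algorithm is used to optimize the DEM hyperparameters, which have a significant effect on the model's accuracy in predicting the stress and displacement fields within the structure.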
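The role of integration points in an energy-based loss can be illustrated on a one-dimensional bar. This sketch makes several simplifying assumptions that are not from the paper: a two-parameter polynomial replaces the neural network, a 3-point Gauss-Legendre rule replaces the NURBS-generated points, plain gradient descent replaces the optimizers, and the bar has unit length and stiffness, a unit body load, and is fixed at x = 0. What it shares with the method is the idea of minimizing the potential energy evaluated at integration points.

```python
import math

# 3-point Gauss-Legendre rule mapped from [-1, 1] to the bar domain [0, 1].
_T = math.sqrt(3.0 / 5.0)
POINTS = [(1.0 - _T) / 2.0, 0.5, (1.0 + _T) / 2.0]
WEIGHTS = [5.0 / 18.0, 8.0 / 18.0, 5.0 / 18.0]

BODY_LOAD = 1.0  # constant axial load f; axial stiffness EA is taken as 1

def displacement(x, a, b):
    # Trial field u(x) = a*x + b*x^2, which satisfies u(0) = 0 by construction.
    return a * x + b * x * x

def strain(x, a, b):
    return a + 2.0 * b * x

def potential_energy(a, b):
    # Internal strain energy minus external work, summed over integration points.
    return sum(
        w * (0.5 * strain(x, a, b) ** 2 - BODY_LOAD * displacement(x, a, b))
        for x, w in zip(POINTS, WEIGHTS)
    )

def energy_gradient(a, b):
    # Analytic derivatives of the quadrature sum with respect to a and b.
    d_a = sum(w * (strain(x, a, b) - BODY_LOAD * x) for x, w in zip(POINTS, WEIGHTS))
    d_b = sum(
        w * (strain(x, a, b) * 2.0 * x - BODY_LOAD * x * x)
        for x, w in zip(POINTS, WEIGHTS)
    )
    return d_a, d_b

def minimize_energy(steps=2000, learning_rate=0.5):
    a, b = 0.0, 0.0
    for _ in range(steps):
        d_a, d_b = energy_gradient(a, b)
        a -= learning_rate * d_a
        b -= learning_rate * d_b
    return a, b

a_opt, b_opt = minimize_energy()
print(a_opt, b_opt)
```

For this load case the exact displacement is u(x) = x - x^2/2, so the minimizer approaches a = 1 and b = -0.5. With a neural network as the trial field, the number and placement of integration points affects how well the energy integral is approximated, which is presumably why it is treated as a hyperparameter.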

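Conclusions

1. Among the different optimizers and activation functions, the combination of the Adam and L-BFGS-B methods with the ReLU2 function gives the DEM model the highest accuracy.
2. Other hyperparameters that affect the prediction accuracy of the model include the number of hidden layers, the number of neurons in each layer, and the number of integration points through the structures mentioned above.
3. Optimizing the DEM hyperparameters can reduce the relative strain energy error by nearly 50%, improving the ability of the DEM model to predict stress and displacement distributions.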
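Two quantities named in the conclusions can be written down compactly. The sketch assumes ReLU2 denotes the squared rectified linear unit, max(0, x)^2, and that the relative error is the absolute difference divided by the reference value; the exact error definition used in the paper is not given in this summary, and the numbers in the usage lines are made up.

```python
def relu2(x):
    """Squared rectified linear unit: max(0, x) ** 2."""
    return max(0.0, x) ** 2

def relu2_derivative(x):
    """Derivative 2 * max(0, x), which is continuous at x = 0."""
    return 2.0 * max(0.0, x)

def relative_error(predicted, reference):
    """Relative error of a scalar quantity such as the total strain energy."""
    return abs(predicted - reference) / abs(reference)

def error_reduction(error_before, error_after):
    """Fractional reduction of an error measure after tuning."""
    return (error_before - error_after) / error_before

# Illustrative values only: an error that drops from 0.04 to 0.02 is halved.
reduction = error_reduction(0.04, 0.02)
print(relu2(2.0), relu2_derivative(2.0), reduction)
```

Unlike plain ReLU, whose derivative jumps at zero, this activation has a continuous first derivative. That can matter when the loss involves derivatives of the network output, as strains do.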

Author information

Authors and Affiliations

Division of Computational Mechanics, Ton Duc Thang University, Ho Chi Minh City, Vietnam
Timon Rabczuk
Faculty of Civil Engineering, Ton Duc Thang University, Ho Chi Minh City, Vietnam
Timon Rabczuk
Institute of Structural Mechanics, Bauhaus-Universität Weimar, Weimar, 99423, Germany
Saeid Nikbakht & Cosmin Anitescu

Corresponding author

Correspondence to Timon Rabczuk.

Additional information

Contributors

Timon RABCZUK devised the project, verified the computational process, and contributed to the final version of the manuscript. Mohammad SALAVATI contributed to the simulation process. Arvin MOJAHEDIN worked out the technical details, performed the computational calculations, and wrote the manuscript.

Conflict of interest

Arvin MOJAHEDIN, Mohammad SALAVATI, and Timon RABCZUK declare that they have no conflict of interest.

About this article

Cite this article

Nikbakht, S., Anitescu, C. & Rabczuk, T. Optimizing the neural network hyperparameters utilizing genetic algorithm. J. Zhejiang Univ. Sci. A 22, 407–426 (2021). https://doi.org/10.1631/jzus.A2000384

Received: 25 August 2020
Accepted: 21 December 2020
Published: 30 June 2021
Issue Date: June 2021
DOI: https://doi.org/10.1631/jzus.A2000384

CLC number

TP18
