Abstract
Neuroevolutionary computation has emerged as a promising approach to designing neural network architectures without human intervention. However, the often high computational cost of these approaches is a serious obstacle to their application and to further research. In this work, we empirically analyse standard practices with Coevolution of Deep NeuroEvolution of Augmenting Topologies (CoDeepNEAT) and the effect that different initialization functions have when experiments are tuned to evolve networks quickly, over a small number of generations and with small populations. We compare networks initialized with the He, Glorot, and Random initializations under different settings of population size, number of generations, and training epochs per generation. Our results suggest that properly setting hyperparameters for short training sessions in each generation may be sufficient to produce competitive neural networks. We also observe that the He initialization, when combined with neuroevolution, tends to produce architectures with multiple residual connections, whereas the Glorot initializer has the opposite effect.
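The initializers compared here differ only in how they scale the variance of a layer's initial weights: Glorot draws weights with variance proportional to 2/(fan_in + fan_out), He uses 2/fan_in, and a plain Random baseline uses a fixed variance independent of layer width. As a rough illustration only, the following Keras sketch builds the same small model under each initializer; the toy architecture, dataset-free setup, and hyperparameters are our assumptions, not the evolved networks or training protocol of the paper.

    # A minimal sketch (not the paper's implementation) of comparing
    # the studied initializers in Keras. The toy architecture and
    # hyperparameters below are illustrative assumptions.
    import tensorflow as tf

    def build_model(initializer: str) -> tf.keras.Model:
        """Small CNN whose kernels are drawn with the given initializer."""
        return tf.keras.Sequential([
            tf.keras.Input(shape=(32, 32, 3)),
            tf.keras.layers.Conv2D(32, 3, activation="relu",
                                   kernel_initializer=initializer),
            tf.keras.layers.MaxPooling2D(),
            tf.keras.layers.Flatten(),
            tf.keras.layers.Dense(10, activation="softmax",
                                  kernel_initializer=initializer),
        ])

    # "he_normal" scales weight variance as 2/fan_in; "glorot_uniform"
    # (the Keras default) uses 2/(fan_in + fan_out); "random_normal"
    # draws from a fixed-variance Gaussian regardless of layer width.
    for init in ("he_normal", "glorot_uniform", "random_normal"):
        model = build_model(init)
        model.compile(optimizer="adam",
                      loss="sparse_categorical_crossentropy",
                      metrics=["accuracy"])
        # In a CoDeepNEAT-style loop, each candidate would be trained
        # for only a few epochs per generation before evaluating fitness.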
We thank the Coordination for the Improvement of Higher Education Personnel (CAPES/PROAP) and the Amazonas State Research Support Foundation (FAPEAM/POSGRAD 2021). This research was partially supported by CAPES via student support grant #88887.498437/2020-00.
References
Ma, Y., Xie, Y.: Evolutionary neural networks for deep learning: a review. Int. J. Mach. Learn. Cybern. (2022). https://doi.org/10.1007/s13042-022-01578-8
Kumar, S.K.: On weight initialization in deep neural networks. arXiv preprint arXiv:1704.08863 (2017)
Initializing neural networks. https://www.deeplearning.ai/ai-notes/initialization/. Accessed 12 June 2022
Goodfellow, I.J., Bengio, Y., Courville, A.: Deep Learning, 1st edn. MIT Press, Cambridge (2016)
Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pp. 249–256. JMLR Workshop and Conference Proceedings, Sardinia (2010)
He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectifiers: surpassing human-level performance on ImageNet classification. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1026–1034. IEEE, Santiago (2015)
Koutnik, J., Gomez, F., Schmidhuber, J.: Evolving neural networks in compressed weight space. In: Proceedings of the 12th Annual Conference on Genetic and Evolutionary Computation, pp. 619–626. ACM, Portland (2010)
Togelius, J., Gomez, F., Schmidhuber, J.: Learning what to ignore: memetic climbing in topology and weight space. In: Proceedings of the 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence), pp. 3274–3281. IEEE, Hong Kong (2008)
Okada, H., Wada, T., Yamashita, A., Matsue, T.: Interval-valued evolution strategy for evolving neural networks with interval weights and biases. In: Proceedings of the International Conference on Soft Computing and Intelligent Systems, and the 13th International Symposium on Advanced Intelligent Systems, pp. 2056–2060. IEEE, Kobe (2012)
Desell, T.: Accelerating the evolution of convolutional neural networks with node-level mutations and epigenetic weight initialization. In: Proceedings of the Genetic and Evolutionary Computation Conference Companion, pp. 157–158. ACM, Kyoto (2018)
Lyu, Z., ElSaid, A., Karns, J., Mkaouer, M., Desell, T.: An experimental study of weight initialization and weight inheritance effects on neuroevolution. In: Applications of Evolutionary Computation: 24th International Conference, EvoApplications 2021. Springer, Cham (2021)
Miikkulainen, R., Liang, J., Meyerson, E., Rawal, A.: Evolving deep neural networks. In: Artificial Intelligence in the Age of Neural Networks and Brain Computing, pp. 293–312. Academic Press (2019)
Papavasileiou, E., Cornelis, J., Jansen, B.: A systematic literature review of the successors of ‘NeuroEvolution of augmenting topologies’. Evol. Comput. 29(1), 1–73 (2021)
Bohrer, J.S., Grisci, B.I., Dorn, M.: Neuroevolution of neural network architectures using CoDeepNEAT and Keras. arXiv preprint arXiv:2002.04634 (2020)
Zhou, X., Li, X., Hu, K., Zhang, Y., Chen, Z., Gao, X.: ERV-Net: an efficient 3D residual neural network for brain tumor segmentation. Expert Syst. Appl. 170, 114566 (2021)
Dogan, S., et al.: Automated accurate fire detection system using ensemble pretrained residual network. Expert Syst. Appl. 203, 117407 (2022)
Hoorali, F., Khosravi, H., Moradi, B.: IRUNet for medical image segmentation. Expert Syst. Appl. 191, 116399 (2022)
Li, H., Xu, Z., Taylor, G., Studer, C., Goldstein, T.: Visualizing the loss landscape of neural nets. In: Advances in Neural Information Processing Systems (2018)
Intuitive Explanation of Skip Connections in Deep Learning. https://theaisummer.com/skip-connections/. Accessed 12 June 2022
Keras Documentation - Glorot Uniform. https://www.tensorflow.org/api_docs/python/tf/keras/initializers/GlorotUniform. Accessed 10 July 2022
Keras Documentation - Glorot Normal. https://www.tensorflow.org/api_docs/python/tf/keras/initializers/GlorotNormal. Accessed 10 July 2022
Keras Documentation - He Uniform. https://www.tensorflow.org/api_docs/python/tf/keras/initializers/HeUniform. Accessed 10 July 2022
Keras Documentation - He Normal. https://www.tensorflow.org/api_docs/python/tf/keras/initializers/HeNormal. Accessed 10 July 2022
Ramachandran, P., Zoph, B., Le, Q.V.: Searching for activation functions. arXiv preprint arXiv:1710.05941 (2017)
Misra, D.: Mish: a self regularized non-monotonic neural activation function. arXiv preprint arXiv:1908.08681 (2019)
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
Cite this paper
Evangelista, L.G.C., Giusti, R. (2022). Short-and-Long-Term Impact of Initialization Functions in NeuroEvolution. In: Xavier-Junior, J.C., Rios, R.A. (eds.) Intelligent Systems. BRACIS 2022. Lecture Notes in Computer Science, vol. 13653. Springer, Cham. https://doi.org/10.1007/978-3-031-21686-2_21
Print ISBN: 978-3-031-21685-5
Online ISBN: 978-3-031-21686-2