Improving Convolutional Neural Network Design via Variable Neighborhood Search

Araújo, Teresa; Aresta, Guilherme; Almada-Lobo, Bernardo; Mendonça, Ana Maria; Campilho, Aurélio

doi:10.1007/978-3-319-59876-5_41

Improving Convolutional Neural Network Design via Variable Neighborhood Search

Teresa Araújo^16,17,
Guilherme Aresta^16,17,
Bernardo Almada-Lobo^16,17,
Ana Maria Mendonça^16,17 &
…
Aurélio Campilho^16,17

Conference paper
First Online: 02 June 2017

2693 Accesses
3 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10317))

Abstract

An unsupervised method for convolutional neural network (CNN) architecture design is proposed. The method relies on a variable neighborhood search-based approach for finding CNN architectures and hyperparameter values that improve classification performance. For this purpose, t-Distributed Stochastic Neighbor Embedding (t-SNE) is applied to effectively represent the solution space in 2D. Then, k-Means clustering divides this representation space having in account the relative distance between neighbors. The algorithm is tested in the CIFAR-10 image dataset. The obtained solution improves the CNN validation loss by over \(15\%\) and the respective accuracy by \(5\%\). Moreover, the network shows higher predictive power and robustness, validating our method for the optimization of CNN design.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Cireşan, D.C., Giusti, A., Gambardella, L.M., Schmidhuber, J.: Mitosis detection in breast cancer histology images with deep neural networks. In: Mori, K., Sakuma, I., Sato, Y., Barillot, C., Navab, N. (eds.) MICCAI 2013. LNCS, vol. 8150, pp. 411–418. Springer, Heidelberg (2013). doi:10.1007/978-3-642-40763-5_51
Chapter Google Scholar
Domhan, T., Springenberg, J.T., Hutter, F.: Speeding up automatic hyperparameter optimization of deep neural networks by extrapolation of learning curves. In: IJCAI International Joint Conference on Artificial Intelligence 2015, pp. 3460–3468, January 2015
Google Scholar
Github: Cifar 10 CNN. https://github.com/fchollet/keras/blob/master/examples/cifar10_cnn.py
Hansen, P., Mladenovi, N.: Variable neighborhood search: principles and applications. Eur. J. Oper. Res. 130, 449–467 (2001)
Article MathSciNet MATH Google Scholar
Jin, J., Yan, Z., Fu, K., Jiang, N., Zhang, C.: Neural Network Architecture Optimization through Submodularity and Supermodularity, pp. 1–10 (2016)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25, 11061114 (2012)
Google Scholar
Krizhevsky, A.: Learning multiple layers of features from tiny images. Master’s thesis, Department of Computer Science, University of Toronto (2009)
Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Article Google Scholar
Maaten, L., Hinton, G.E.: Visualizing high-dimensional data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008)
MATH Google Scholar
Macqueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, no. 233, pp. 281–297 (1967)
Google Scholar
Snoek, J., Larochelle, H., Adams, R.: Practical Bayesian optimization of machine learning algorithms. In: Advances in Neural Information Processing Systems, pp. 1–9 (2012)
Google Scholar
Springenberg, J.T., Dosovitskiy, A., Brox, T., Riedmiller, M.: Striving for simplicity: the all convolutional net. In: ICLR 2015, pp. 1–14 (2015)
Google Scholar

Download references

Acknowledgements

Teresa Araújo and Guilherme Aresta equally contributed to this work. Project “NanoSTIMA: Macro-to-Nano Human Sensing: Towards Integrated Multimodal Health Monitoring and Analytics/NORTE-01-0145-FEDER-000016” is financed by the North Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, and through the European Regional Development Fund (ERDF). Teresa Araújo is funded by the FCT grant contract SFRH/BD/122365/2016. Guilherme Aresta is funded by the FCT grant contract SFRH/BD/120435/2016.

Author information

Authors and Affiliations

INESC TEC - Institute for Systems and Computer Engineering, Technology and Science, Porto, Portugal
Teresa Araújo, Guilherme Aresta, Bernardo Almada-Lobo, Ana Maria Mendonça & Aurélio Campilho
Faculdade de Engenharia da Universidade do Porto, Porto, Portugal
Teresa Araújo, Guilherme Aresta, Bernardo Almada-Lobo, Ana Maria Mendonça & Aurélio Campilho

Authors

Teresa Araújo
View author publications
You can also search for this author in PubMed Google Scholar
Guilherme Aresta
View author publications
You can also search for this author in PubMed Google Scholar
Bernardo Almada-Lobo
View author publications
You can also search for this author in PubMed Google Scholar
Ana Maria Mendonça
View author publications
You can also search for this author in PubMed Google Scholar
Aurélio Campilho
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Teresa Araújo .

Editor information

Editors and Affiliations

University of Waterloo, Waterloo, Ontario, Canada
Fakhri Karray
University of Porto, Porto, Portugal
Aurélio Campilho
Politechnique Montreal, Montreal, Québec, Canada
Farida Cheriet

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Araújo, T., Aresta, G., Almada-Lobo, B., Mendonça, A.M., Campilho, A. (2017). Improving Convolutional Neural Network Design via Variable Neighborhood Search. In: Karray, F., Campilho, A., Cheriet, F. (eds) Image Analysis and Recognition. ICIAR 2017. Lecture Notes in Computer Science(), vol 10317. Springer, Cham. https://doi.org/10.1007/978-3-319-59876-5_41

Download citation

DOI: https://doi.org/10.1007/978-3-319-59876-5_41
Published: 02 June 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59875-8
Online ISBN: 978-3-319-59876-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics