Abstract
Neural Architecture Search (NAS), i.e., the automation of neural network design, has gained much popularity in recent years, with increasingly complex search algorithms being proposed. Yet, solid comparisons with simple baselines are often missing. At the same time, recent retrospective studies have found many new algorithms to be no better than random search (RS). In this work we consider the use of a simple Local Search (LS) algorithm for NAS. We particularly consider a multi-objective NAS formulation, with network accuracy and network complexity as two objectives, as understanding the trade-off between these two objectives is arguably among the most interesting aspects of NAS. The proposed LS algorithm is compared with RS and two evolutionary algorithms (EAs), as these are often heralded as being ideal for multi-objective optimization. To promote reproducibility, we create and release two benchmark datasets, named MacroNAS-C10 and -C100, containing 200K saved network evaluations for two established image classification tasks, CIFAR-10 and CIFAR-100. Our benchmarks are designed to be complementary to existing benchmarks, especially in that they are better suited for multi-objective search. We additionally consider a version of the problem with a much larger architecture space. While we show that the considered algorithms explore the search space in fundamentally different ways, we also find that LS substantially outperforms RS and performs nearly as well as state-of-the-art EAs. We believe that this provides strong evidence that LS is truly a competitive baseline for NAS against which new NAS algorithms should be benchmarked.
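To illustrate the kind of baseline the abstract refers to, the sketch below implements a basic first-improvement local search over a discrete architecture encoding. It is a minimal illustration, not the paper's exact implementation: the `evaluate` and `scalarize` callables, the fixed-cardinality encoding, and the evaluation budget are all assumptions made for the example (in a multi-objective setting, LS can be restarted with different scalarization weights to sample the accuracy-complexity trade-off front).

```python
import random

def local_search(evaluate, num_vars, num_options, scalarize, max_evals=1000):
    """Illustrative single-solution local search over a discrete encoding.

    `evaluate` maps an encoding (tuple of ints, each in [0, num_options)) to an
    objective vector, e.g. (validation error, complexity), and `scalarize`
    collapses that vector to a single number to be minimized. These names are
    hypothetical, chosen only for this sketch.
    """
    current = tuple(random.randrange(num_options) for _ in range(num_vars))
    best_val = scalarize(evaluate(current))
    evals = 1
    improved = True
    while improved and evals < max_evals:
        improved = False
        # Visit variables in a random order each sweep.
        for i in random.sample(range(num_vars), num_vars):
            for option in range(num_options):
                if option == current[i]:
                    continue
                neighbor = current[:i] + (option,) + current[i + 1:]
                val = scalarize(evaluate(neighbor))
                evals += 1
                if val < best_val:  # accept first improvement found
                    current, best_val = neighbor, val
                    improved = True
    return current, best_val
```

On a tabular benchmark such as the ones released with this paper, `evaluate` would simply be a lookup of the pre-computed metrics for the encoded architecture, making each LS run extremely cheap.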
Notes
- 1.
- 2. Benchmark datasets: https://github.com/ArkadiyD/MacroNASBenchmark. Source code for reproducibility: https://github.com/tdenottelander/MacroNAS.
- 3. Recall that this net is pre-inserted in the archive because it is uninteresting.
Acknowledgements
This work is part of the research programme Commit2Data (project #628.011.012), which is financed by the Dutch Research Council (NWO), and part of a project (#15198) included in the research programme Technology for Oncology, which is financed by the Netherlands Organisation for Scientific Research (NWO), the Dutch Cancer Society (KWF), the TKI Life Sciences & Health, Asolutions, Brocacef, and Cancer Health Coach.
Copyright information
© 2021 Springer Nature Switzerland AG
Cite this paper
Den Ottelander, T., Dushatskiy, A., Virgolin, M., Bosman, P.A.N. (2021). Local Search is a Remarkably Strong Baseline for Neural Architecture Search. In: Ishibuchi, H., et al. Evolutionary Multi-Criterion Optimization. EMO 2021. Lecture Notes in Computer Science(), vol 12654. Springer, Cham. https://doi.org/10.1007/978-3-030-72062-9_37
Print ISBN: 978-3-030-72061-2
Online ISBN: 978-3-030-72062-9