Abstract
We present a new method for structured pruning of neural networks, based on the recently proposed neuron merging technique, in which, following a pruning operation, the weights of the next layer are suitably modified to compensate for the pruned neurons. Through a rigorous mathematical analysis of neuron merging we prove an upper bound on the reconstruction error; this bound defines a new objective function for pruning-and-merging. Our new optimal algorithm provably achieves the lowest objective cost among all possible prune-and-merge strategies. We also show empirically that nuclear norm regularization yields even better pruning-and-merging accuracy, a finding supported by our theoretical analysis.
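To make the prune-and-merge idea concrete, below is a minimal NumPy sketch on a toy two-layer ReLU network: a pruned neuron's weight row is approximated by a non-negative multiple of a retained row, and that multiple is folded into the corresponding column of the next layer's weight matrix, as in the neuron merging scheme of [9]. The cosine-similarity selection and least-squares scale used here are illustrative stand-ins, not the optimal strategy proved in this paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy two-layer network: h = relu(W1 @ x), y = W2 @ h
n_in, n_hid, n_out = 8, 6, 4
W1 = rng.standard_normal((n_hid, n_in))
W2 = rng.standard_normal((n_out, n_hid))

# Prune neuron p from layer 1 and merge it into its most similar
# retained neuron q (highest cosine similarity between weight rows).
p = 3
keep = [i for i in range(n_hid) if i != p]
cos = W1 @ W1[p] / (np.linalg.norm(W1, axis=1) * np.linalg.norm(W1[p]) + 1e-12)
cos[p] = -np.inf
q = int(np.argmax(cos))

# Non-negative scale s with W1[p] ~ s * W1[q]; non-negativity matters
# because relu(s * z) = s * relu(z) only holds for s >= 0, which is what
# lets the merge commute with the activation.
s = max(np.dot(W1[p], W1[q]) / np.dot(W1[q], W1[q]), 0.0)

# Merge: the next layer's column for q absorbs the pruned column for p.
W1_new = W1[keep]
W2_new = W2[:, keep].copy()
W2_new[:, keep.index(q)] += s * W2[:, p]

# Compare outputs before and after prune-and-merge on random inputs.
x = rng.standard_normal((n_in, 100))
y_full = W2 @ np.maximum(W1 @ x, 0)
y_merged = W2_new @ np.maximum(W1_new @ x, 0)
print("mean reconstruction error:", np.abs(y_full - y_merged).mean())
```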
Notes
1. The bias also needs to be taken into account; see [9, Section 6.1] for details on how to do that. One common way to fold the bias into the merging machinery is sketched below.
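A standard trick is to append each neuron's bias as an extra weight coordinate and feed a constant 1 as an extra input; merging the augmented rows then carries the bias along automatically. This is a common augmentation device, not necessarily the exact treatment of [9, Section 6.1]:

```python
import numpy as np

rng = np.random.default_rng(1)

n_in, n_hid = 8, 6
W1 = rng.standard_normal((n_hid, n_in))
b1 = rng.standard_normal(n_hid)

# Augment each weight row with its bias and each input with a constant 1:
# W1 @ x + b1 == W1_aug @ x_aug, so merging rows of W1_aug handles the
# bias automatically (a merged row carries a merged bias).
W1_aug = np.hstack([W1, b1[:, None]])

x = rng.standard_normal((n_in, 5))
x_aug = np.vstack([x, np.ones((1, 5))])
assert np.allclose(W1 @ x + b1[:, None], W1_aug @ x_aug)
```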
References
Church, R.L.: BEAMR: an exact and approximate model for the p-median problem. Comput. Oper. Res. 35(2), 417–426 (2008)
Enderich, L., Timm, F., Burgard, W.: Holistic filter pruning for efficient deep neural networks. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 2596–2605 (2021)
Hakimi, S.L.: Optimum distribution of switching centers in a communication network and some related graph theoretic problems. Oper. Res. 13(3), 462–475 (1965)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
He, Y., Kang, G., Dong, X., Fu, Y., Yang, Y.: Soft filter pruning for accelerating deep convolutional neural networks. arXiv preprint arXiv:1808.06866 (2018)
He, Y., Liu, P., Wang, Z., Hu, Z., Yang, Y.: Filter pruning via geometric median for deep convolutional neural networks acceleration. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4340–4349 (2019)
He, Y., Zhang, X., Sun, J.: Channel pruning for accelerating very deep neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1389–1397 (2017)
Hoefler, T., Alistarh, D., Ben-Nun, T., Dryden, N., Peste, A.: Sparsity in deep learning: pruning and growth for efficient inference and training in neural networks. arXiv preprint arXiv:2102.00554 (2021)
Kim, W., Kim, S., Park, M., Jeon, G.: Neuron merging: compensating for pruned neurons. In: Advances in Neural Information Processing Systems, vol. 33 (2020)
Krizhevsky, A., Hinton, G., et al.: Learning multiple layers of features from tiny images (2009)
Le, D.H., Hua, B.S.: Network pruning that matters: a case study on retraining variants. In: International Conference on Learning Representations (2021). https://openreview.net/forum?id=Cb54AMqHQFP
Li, T., Li, J., Liu, Z., Zhang, C.: Few sample knowledge distillation for efficient network compression. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14639–14647 (2020)
Liu, Z., Li, J., Shen, Z., Huang, G., Yan, S., Zhang, C.: Learning efficient convolutional networks through network slimming. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2736–2744 (2017)
Luo, J.H., Wu, J., Lin, W.: ThiNet: a filter level pruning method for deep neural network compression. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5058–5066 (2017)
Mao, H., et al.: Exploring the granularity of sparsity in convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 13–20 (2017)
Nie, F., Huang, H., Ding, C.: Low-rank matrix recovery via efficient Schatten p-norm minimization. In: Twenty-sixth AAAI Conference on Artificial Intelligence (2012)
Recht, B., Xu, W., Hassibi, B.: Necessary and sufficient conditions for success of the nuclear norm heuristic for rank minimization. In: 2008 47th IEEE Conference on Decision and Control, pp. 3065–3070. IEEE (2008)
Reese, J.: Solution methods for the p-median problem: an annotated bibliography. Netw. Int. J. 48(3), 125–142 (2006)
Renda, A., Frankle, J., Carbin, M.: Comparing rewinding and fine-tuning in neural network pruning. In: International Conference on Learning Representations (2020). https://openreview.net/forum?id=S1gSj0NKvB
Russakovsky, O., et al.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211–252 (2015)
Vadera, S., Ameen, S.: Methods for pruning deep neural networks. arXiv preprint arXiv:2011.00241 (2020)
You, Z., Yan, K., Ye, J., Ma, M., Wang, P.: Gate decorator: global filter pruning method for accelerating deep convolutional neural networks. arXiv preprint arXiv:1909.08174 (2019)
Zagoruyko, S., Komodakis, N.: Wide residual networks. arXiv preprint arXiv:1605.07146 (2016)
Zhou, D., et al.: Go wide, then narrow: efficient training of deep thin networks. In: International Conference on Machine Learning, pp. 11546–11555. PMLR (2020)