Compressing Deep Neural Network

He, Shiming; Li, Zhuozhou; Wang, Jin; Xie, Kun; Zhang, Dafang

doi:10.1007/978-981-13-9341-9_107

Shiming He³⁸,
Zhuozhou Li³⁸,
Jin Wang³⁸,
Kun Xie³⁹ &
…
Dafang Zhang³⁹

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 536))

Included in the following conference series:

840 Accesses

Abstract

Deep learning is the most useful tool for may applications, such as image recognize, nature language processing. But huge computation power and millions of parameters are needed in large models which may can’t be supported and stored. For this problem, some works tried to compress the dense weight matrices with sparse representations technologies, such as matrix decomposition and tensor decomposition. But it is still unknown which is the largest compress ratio. Therefore, in this paper, we analyse the relationship between the shape of tensor and the number of parameters, formulate the problem of minimizing the number of parameters, and solve it to find the best compress ratio. We compare the compressed ration on three data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Convolutional Neural Network Compression via Tensor-Train Decomposition on Permuted Weight Tensor with Automatic Rank Determination

Stable Low-Rank Tensor Decomposition for Compression of Convolutional Neural Network

Compressed neural architecture utilizing dimensionality reduction and quantization

Article 27 April 2022

References

Denil, B., Shakibi, L., Dinh, N., de Freitas et al., Predicting parameters in deep learning. In: Advances in Neural Information Processing Systems, pp. 2148–2156. IEEE Press, New York (2013)
Google Scholar
Chien, J.T., Bao, Y.T.: Tensor-factorized neural networks. IEEE Trans. Neural Netw. Learn. Syst. 29(5), 1998–2011 (2018)
Article MathSciNet Google Scholar
Tjandra, A., Sakti, S., Nakamura, S., Compressing recurrent neural network with tensor train. In: 2017 International Joint Conference on in Neural Networks (IJCNN), pp. 4451–4458. IEEE Press, New York (2017)
Google Scholar
Lathauwer, L.D., Moor, B.D., Vandewalle, J.: A multilinear singular value decomposition. SIAM J. Matrix Anal. Appl 21(4), 1253–1278 (2000)
Article MathSciNet Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Article Google Scholar
Netzer, Y., Wang, T., Coates, A., Bissacco, A., Wu, B., Ng, A.Y.: Reading digits in natural images with unsupervised feature learning. In: Proceedings of NIPS Workshop Deep Learning and Unsupervised Feature Learning, p. 5. IEEE Press. New York (2011)
Google Scholar
Liao, C.P., Chien J.T.: Graphical modeling of conditional random fields for human motion recognition. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1969–1972. IEEE Press, New York (2008)
Google Scholar

Download references

Acknowledgments

This work was supported by National Natural Science Foundation of China (Nos. 61802030, 61572184, 61502054), the Science and Technology Projects of Hunan Province (No. 2016JC2075), the Research Foundation of Education Bureau of Hunan Province, China (Nos. 16C0047, 16B085).

Author information

Authors and Affiliations

School of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha, 410114, China
Shiming He, Zhuozhou Li & Jin Wang
College of Computer Science and Electronics Engineering, Hunan University, Changsha, 410082, China
Kun Xie & Dafang Zhang

Authors

Shiming He
View author publications
You can also search for this author in PubMed Google Scholar
Zhuozhou Li
View author publications
You can also search for this author in PubMed Google Scholar
Jin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Kun Xie
View author publications
You can also search for this author in PubMed Google Scholar
Dafang Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shiming He .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Seoul National University of Science and Technology, Seoul, Korea (Republic of)
James J. Park
Department of Computer Software Engineering, Soon Chun Hyang University, Asan, Korea (Republic of)
Doo-Soon Park
Department of Multimedia Engineering, Dongguk University, Seoul, Korea (Republic of)
Young-Sik Jeong
Department of Computer Science, Georgia State University, Atlanta, GA, USA
Yi Pan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

He, S., Li, Z., Wang, J., Xie, K., Zhang, D. (2020). Compressing Deep Neural Network. In: Park, J., Park, DS., Jeong, YS., Pan, Y. (eds) Advances in Computer Science and Ubiquitous Computing. CUTE CSA 2018 2018. Lecture Notes in Electrical Engineering, vol 536. Springer, Singapore. https://doi.org/10.1007/978-981-13-9341-9_107

Download citation

DOI: https://doi.org/10.1007/978-981-13-9341-9_107
Published: 04 December 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9340-2
Online ISBN: 978-981-13-9341-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Compressing Deep Neural Network

Abstract

Access this chapter

Similar content being viewed by others

Convolutional Neural Network Compression via Tensor-Train Decomposition on Permuted Weight Tensor with Automatic Rank Determination

Stable Low-Rank Tensor Decomposition for Compression of Convolutional Neural Network

Compressed neural architecture utilizing dimensionality reduction and quantization

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Compressing Deep Neural Network

Abstract

Access this chapter

Similar content being viewed by others

Convolutional Neural Network Compression via Tensor-Train Decomposition on Permuted Weight Tensor with Automatic Rank Determination

Stable Low-Rank Tensor Decomposition for Compression of Convolutional Neural Network

Compressed neural architecture utilizing dimensionality reduction and quantization

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation