Multipath Convolutional-Recursive Neural Networks for Object Recognition

Li, Xiangyang; Jiang, Shuqiang; Song, Xinhang; Herranz, Luis; Shi, Zhiping

doi:10.1007/978-3-662-44980-6_30

Part of the book series: IFIP Advances in Information and Communication Technology (IFIPAICT, volume 432)

Included in the following conference series:

International Conference on Intelligent Information Processing

Abstract

Extracting good representations from images is essential for many computer vision tasks. While progress in deep learning shows the importance of learning hierarchical features, it is also important to learn features through multiple paths. This paper presents Multipath Convolutional-Recursive Neural Networks (M-CRNNs), a novel scheme that learns image features along multiple paths, using models that combine convolutional neural networks (CNNs) and recursive neural networks (RNNs). In each path, the CNNs learn low-level features, and the RNNs, which take the outputs of the CNNs as input, learn efficient high-level features. The final representation of an image is the combination of the features from all paths. The results show that the features learned by M-CRNNs form a highly discriminative image representation that increases precision in object recognition.
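The pipeline the abstract describes (a convolutional stage feeding a recursive stage, repeated over several paths whose outputs are combined) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the function names, the random filters and weights standing in for learned parameters, the absolute-value rectification, the average pooling, the fixed block-wise merging with tanh, the use of concatenation to combine paths, and the two path configurations are all assumptions made for this example. It also assumes that the pooled map side is a power of `block`.

```python
import math
import random

def conv_pool(image, filters, pool):
    """Valid 2-D correlation per filter, abs rectification, average pooling."""
    n, f = len(image), len(filters[0])
    m = n - f + 1          # side of each response map
    side = m // pool       # side of each pooled map
    maps = []
    for filt in filters:
        resp = [[abs(sum(filt[a][b] * image[i + a][j + b]
                         for a in range(f) for b in range(f)))
                 for j in range(m)] for i in range(m)]
        maps.append([[sum(resp[i * pool + a][j * pool + b]
                          for a in range(pool) for b in range(pool)) / pool ** 2
                      for j in range(side)] for i in range(side)])
    return maps  # K maps, each side x side

def recursive_merge(maps, weights, block):
    """Merge block x block neighbouring K-dim vectors into one parent
    vector, p = tanh(W [c_1; ...; c_{block^2}]), until one vector is left."""
    k, side = len(maps), len(maps[0])
    # grid[i][j] is the K-dim feature vector at spatial position (i, j)
    grid = [[[maps[c][i][j] for c in range(k)] for j in range(side)]
            for i in range(side)]
    while len(grid) > 1:
        side = len(grid) // block
        grid = [[[math.tanh(sum(w * x for w, x in zip(w_row, children)))
                  for w_row in weights]
                 for children in ([v for a in range(block) for b in range(block)
                                   for v in grid[i * block + a][j * block + b]]
                                  for j in range(side))]
                for i in range(side)]
    return grid[0][0]

def path_features(image, rng, k, fsize, pool, block, n_rnn):
    """One path: a convolutional stage followed by n_rnn recursive networks."""
    # Random filters stand in for learned ones (assumption of this sketch).
    filters = [[[rng.gauss(0, 1) for _ in range(fsize)] for _ in range(fsize)]
               for _ in range(k)]
    maps = conv_pool(image, filters, pool)
    feats = []
    for _ in range(n_rnn):
        weights = [[rng.gauss(0, 0.1) for _ in range(block * block * k)]
                   for _ in range(k)]
        feats.extend(recursive_merge(maps, weights, block))
    return feats

def multipath_features(image, paths, seed=0):
    """Concatenate the features produced by every path."""
    rng = random.Random(seed)
    feats = []
    for cfg in paths:
        feats.extend(path_features(image, rng, **cfg))
    return feats

# Toy 12x12 grayscale image and two hypothetical path configurations.
img_rng = random.Random(1)
image = [[img_rng.random() for _ in range(12)] for _ in range(12)]
paths = [dict(k=4, fsize=5, pool=2, block=2, n_rnn=3),
         dict(k=4, fsize=3, pool=5, block=2, n_rnn=3)]
features = multipath_features(image, paths)
print(len(features))  # 2 paths * 3 RNNs * 4 dims = 24
```

In a sketch like this, each path contributes `n_rnn * k` values, so the length of the final representation grows with the number of paths. The resulting vector would then be passed to a classifier, which is outside the scope of this example.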

Author information

Authors and Affiliations

College of Information Engineering, Capital Normal University, Beijing, China
Xiangyang Li & Zhiping Shi
Key Lab of Intelligent Information Processing, Institute of Computing Tech., Beijing, China
Xiangyang Li, Shuqiang Jiang, Xinhang Song & Luis Herranz

Editor information

Editors and Affiliations

Institute of Computing Technology, Chinese Academy of Sciences, 100190, Beijing, China
Zhongzhi Shi
Department of Computer Science, Zhejiang University, 310027, Hangzhou, China
Zhaohui Wu
Computer Science Department, Indiana University, 47405, Bloomington, IN, USA
David Leake
School of Computer Science, University of Manchester, M13 9PL, Manchester, UK
Uli Sattler

About this paper

Cite this paper

Li, X., Jiang, S., Song, X., Herranz, L., Shi, Z. (2014). Multipath Convolutional-Recursive Neural Networks for Object Recognition. In: Shi, Z., Wu, Z., Leake, D., Sattler, U. (eds) Intelligent Information Processing VII. IIP 2014. IFIP Advances in Information and Communication Technology, vol 432. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44980-6_30

DOI: https://doi.org/10.1007/978-3-662-44980-6_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-44979-0
Online ISBN: 978-3-662-44980-6
eBook Packages: Computer Science, Computer Science (R0)
