Hierarchical Learning for Large-Scale Image Classification via CNN and Maximum Confidence Path

Lu, Chang; Qu, Yanyun; Shi, Cuiting; Fan, Jianping; Wu, Yang; Wang, Hanzi

doi:10.1007/978-3-319-24078-7_23

Chang Lu¹⁸,
Yanyun Qu¹⁸,
Cuiting Shi¹⁸,
Jianping Fan¹⁹,
Yang Wu²⁰ &
…
Hanzi Wang¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9315))

Included in the following conference series:

Pacific Rim Conference on Multimedia

1977 Accesses
1 Citations

Abstract

We propose a framework to integrate the large scale image data visualization with image classification. The Convolution Neural Network is used to learn the feature vector for an image. A fast algorithm is developed for inter-class similarity measurement. The spectral clustering is implemented to construct a hierarchical visual tree. Instead of the flat classification way, a hierarchical classification is designed according to the visual tree, which is transformed to a path search problem. The path with the maximum joint probability is the final solution. Experimental results on the ILSVRC2010 dataset demonstrate that our method achieves the highest top-1 and top-5 classification accuracy in comparison with 6 state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

http://www.image-net.org/
Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, ECCV, pp. 1–2, Prague (2004)
Google Scholar
Winn, J., Criminisi, A., Minka, T.: Object categorization by learned universal visual dictionary. In: Tenth IEEE International Conference on Computer Vision, ICCV 2005, pp. 1800–1807. IEEE (2005)
Google Scholar
Nowak, E., Jurie, F., Triggs, B.: Sampling strategies for bag-of-features image classification. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 490–503. Springer, Heidelberg (2006)
Chapter Google Scholar
Fan, J., Gao, Y., Luo, H., Jain, R.: Mining multilevel image semantics via hierarchical classification. IEEE Trans. Multimed. 10, 167–187 (2008)
Article Google Scholar
Miller, G., Fellbaum, C.: Wordnet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
MATH Google Scholar
Fan, J., Shen, Y., Yang, C., Zhou, N.: Structured max-margin learning for inter-related classifier training and multilabel image annotation. IEEE Trans. Image Process. 20, 837–854 (2011)
Article MathSciNet Google Scholar
Fan, J., He, X., Zhou, N., Peng, J., Jain, R.: Quantitative characterization of semantic gaps for learning complexity estimation and inference model selection. IEEE Trans. Multimed. 14, 1414–1428 (2012)
Article Google Scholar
Sivic, J., Russell, B.C., Zisserman, A., Freeman, W.T., Efros, A.A.: Unsupervised discovery of visual object class hierarchies. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8. IEEE (2008)
Google Scholar
Bart, E., Porteous, I., Perona, P., Welling, M.: Unsupervised learning of visual taxonomies. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8. IEEE (2008)
Google Scholar
Griffin, G., Perona, P.: Learning and using taxonomies for fast visual categorization. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8. IEEE (2008)
Google Scholar
Marszałek, M., Schmid, C.: Constructing category hierarchies for visual recognition. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part IV. LNCS, vol. 5305, pp. 479–491. Springer, Heidelberg (2008)
Chapter Google Scholar
Bengio, S., Weston, J., Grangier, D.: Label embedding trees for large multi-class tasks. In: Advances in Neural Information Processing Systems, pp. 163–171 (2010)
Google Scholar
Deng, J., Satheesh, S., Berg, A.C., Li, F.: Fast and balanced: efficient label tree learning for large scale object recognition. In: Advances in Neural Information Processing Systems, pp. 567–575 (2011)
Google Scholar
Moosmann, F., Triggs, B., Jurie, F.: Fast discriminative visual codebooks using randomized clustering forests. In: Twentieth Annual Conference on Neural Information Processing Systems (NIPS 2006), pp. 985–992. MIT Press (2007)
Google Scholar
http://caffe.berkeleyvision.org/
Frey, B.J., Dueck, D.: Clustering by passing messages between data points. Sci. 315(5814), 972–976 (2007)
Article MathSciNet MATH Google Scholar
http://www.csie.ntu.edu.tw/~cjlin/liblinear/
Berg, A., Deng, J., Fei-Fei, L.: Large scale visual recognition challenge 2010 (2010). www.image-net.org
Sánchez, J., Perronnin, F.: High-dimensional signature compression for large-scale image classification. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1665–1672. IEEE (2011)
Google Scholar
Zhou, N., Fan, J.: Jointly learning visually correlated dictionaries for large-scale visual recognition applications. IEEE Trans. Pattern Anal. Mach. Intell. 36, 715–730 (2014)
Article Google Scholar
Perronnin, F., Akata, Z., Harchaoui, Z., Schmid, C.: Towards good practice in large-scale learning for image classification. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3482–3489. IEEE (2012)
Google Scholar
Lin, Y., Lv, F., Zhu, S., Yang, M., Cour, T., Yu, K., Cao, L., Huang, T.: Large-scale image classification: fast feature extraction and SVM training. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1689–1696. IEEE (2011)
Google Scholar
Fan, J., Zhang, J., Mei, K., Peng, J., Gao, L.: Cost-sensitive learning of hierarchical tree classifiers for large-scale image classification and novel category detection. Pattern Recognit. 48, 1673–1687 (2014)
Article Google Scholar

Download references

Acknowledgements

This work was supported by the National Natural Science Foundations of China under Grants 61373077, 61472334 and 61170179,the Natural Science Foundation of Fujian Province of China Under Grant 2013J01257,the Fundamental Research Funds for the Central Universities under Grant 20720130720,the 2014 national college students’ innovative and entrepreneurial training project, and the Scientific Research Foundation for the Introduction of Talent at Xiamen University of Technology YKJ12023R.

Author information

Authors and Affiliations

Computer Science Department, Xiamen University, Xiamen, China
Chang Lu, Yanyun Qu, Cuiting Shi & Hanzi Wang
Computer Science Department, University of North Carolina at Charlotte, Charlotte, NC, USA
Jianping Fan
Center for Frontier Science and Technology, Nara Institute of Science and Technology, Ikoma, Japan
Yang Wu

Authors

Chang Lu
View author publications
You can also search for this author in PubMed Google Scholar
Yanyun Qu
View author publications
You can also search for this author in PubMed Google Scholar
Cuiting Shi
View author publications
You can also search for this author in PubMed Google Scholar
Jianping Fan
View author publications
You can also search for this author in PubMed Google Scholar
Yang Wu
View author publications
You can also search for this author in PubMed Google Scholar
Hanzi Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yanyun Qu .

Editor information

Editors and Affiliations

Gwangju Institute of Science and Technology, Gwangju, Korea (Republic of)
Yo-Sung Ho
Chinese Academy of Sciences, Institute of Automation, Beijing, China
Jitao Sang
KAIST, Daejeon, Korea (Republic of)
Yong Man Ro
KAIST, Daejeon, Korea (Republic of)
Junmo Kim
College of Computer Science, Zhejiang University, Hangzhou, China
Fei Wu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lu, C., Qu, Y., Shi, C., Fan, J., Wu, Y., Wang, H. (2015). Hierarchical Learning for Large-Scale Image Classification via CNN and Maximum Confidence Path. In: Ho, YS., Sang, J., Ro, Y., Kim, J., Wu, F. (eds) Advances in Multimedia Information Processing -- PCM 2015. PCM 2015. Lecture Notes in Computer Science(), vol 9315. Springer, Cham. https://doi.org/10.1007/978-3-319-24078-7_23

Download citation

DOI: https://doi.org/10.1007/978-3-319-24078-7_23
Published: 15 December 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24077-0
Online ISBN: 978-3-319-24078-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics