Abstract
In this paper, we propose a Coupled Discriminative Dictionary Learning framework to tackle the zero-shot image classification problem. Instead of the original attribute vectors and sample feature vectors, we use their corresponding sparse coefficients attained from sparse coding to do the classification. The purpose of our framework is that, when an unseen-class sample shows during test time, we first attain its corresponding sparse coefficient through learned feature dictionary. Then we use a mapping method to map it to the attribute sparse coefficients category histogram domain where we can accomplish the classification. We evaluate our method performance on two benchmark datasets for zero-shot image classification. The results are compelling to other state-of-the-art, especially on fine-grained dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bishop, C., Bishop, C.M.: Pattern recognition and machine learning. J. Electron. Imaging 16(4), 140–155 (2006). Springer
Palatucci, M., Pomerleau, D., Hinton, G., et al.: Zero-shot learning with semantic output codes. In: International Conference on Neural Information Processing Systems, pp. 1410–1418. Curran Associates Inc. (2009)
Lampert, C.H., Nickisch, H., Harmeling, S.: Learning to detect unseen object classes by between-class attribute transfer. IEEE (2009)
Akata, Z., Perronnin, F., Harchaoui, Z., et al.: Label-embedding for attribute-based classification. In: Computer Vision and Pattern Recognition, pp. 819–826. IEEE (2013)
Yu, F.X., Cao, L., Feris, R.S., et al.: Designing category-level attributes for discriminative visual recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 771–778. IEEE Computer Society (2013)
Fu, Y., Hospedales, T.M., Xiang, T., Fu, Z., Gong, S.: Transductive multi-view embedding for zero-shot recognition and annotation. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8690, pp. 584–599. Springer, Cham (2014). doi:10.1007/978-3-319-10605-2_38
Farhadi, A., Endres, I., Hoiem, D., et al.: Describing objects by their attributes. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 1778–1785. IEEE (2009)
Lampert, C.H., Nickisch, H., Harmeling, S.: Attribute-based classification for zero-shot visual object categorization. IEEE Trans. Pattern Anal. Mach. Intell. 36(3), 453–465 (2014)
Mensink, T., Verbeek, J., Perronnin, F., Csurka, G.: Metric learning for large scale image classification: generalizing to new classes at near-zero cost. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, pp. 488–501. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33709-3_35
Parikh, D., Grauman, K.: Interactively building a discriminative vocabulary of nameable attributes. In: The IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2011, DBLP, 20–25 June, Colorado Springs, Co, USA, pp. 1681–1688 (2011)
Duan, K., Parikh, D., Crandall, D., et al.: Discovering localized attributes for fine-grained recognition. In: Computer Vision and Pattern Recognition, pp. 3474–3481. IEEE (2012)
Rohrbach, M., Stark, M., Schiele, B.: Evaluating knowledge transfer and zero-shot learning in a large-scale setting. In: Computer Vision and Pattern Recognition, pp. 1641–1648. IEEE (2011)
Berg, T.L., Berg, A.C., Shih, J.: Automatic attribute discovery and characterization from noisy web data. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6311, pp. 663–676. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15549-9_48
Frome, A., Corrado, G.S., Shlens, J., et al.: DeViSE: a deep visual-semantic embedding model. In: International Conference on Neural Information Processing Systems, pp. 2121–2129. Curran Associates Inc. (2013)
Socher, R., Ganjoo, M., Sridhar, H., et al.: Zero-shot learning through cross-modal transfer. In: Advances in Neural Information Processing Systems, pp. 935–943 (2013)
Mensink, T., Gavves, E., Snoek, C.G.M.: COSTA: co-occurrence statistics for zero-shot classification. In: Computer Vision and Pattern Recognition, pp. 2441–2448. IEEE (2014)
Akata, Z., Reed, S., Walter, D., et al.: Evaluation of output embeddings for fine-grained image classification. 2927–2936 (2015)
Norouzi, M., et al.: Zero-shot learning by convex combination of semantic embeddings. arXiv preprint arXiv:1312.5650 (2013)
Akata, Z., Perronnin, F., Harchaoui, Z., et al.: Label-embedding for image classification. IEEE Trans. Pattern Anal. Mach. Intell. 38(7), 1425–1438 (2015)
Zhang, Z., Saligrama, V.: Zero-shot learning via semantic similarity embedding. In: IEEE International Conference on Computer Vision, pp. 4166–4174. IEEE (2015)
Liang, Y.: Semi-coupled dictionary learning with applications to image super-resolution and photo-sketch synthesis. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2216–2223. IEEE Computer Society (2012)
Li, F., Zhang, S.: Double sparse dictionary learning for image super resolution. In: Chinese Control and Decision Conference, pp. 4344–4348. IEEE (2016)
Skau, E., Wohlberg, B., Krim, H., et al.: Pansharpening via coupled triple factorization dictionary learning. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1234–1237. IEEE (2016)
Huang, D.A., Wang, Y.C.F.: Coupled dictionary and feature space learning with applications to cross-domain image synthesis and recognition. In: IEEE International Conference on Computer Vision, pp. 2496–2503. IEEE (2013)
Duda, R., Hart, P., Stork, D.: Pattern Classification, 2nd edn. En Broeck the Statistical Mechanics of Learning (2001)
Yang, M., Zhang, L., Yang, J., et al.: Metaface learning for sparse representation based face recognition. In: IEEE International Conference on Image Processing, 1601–1604. IEEE (2010)
Rosasco, L., Verri, A., Santoro, M., et al.: Iterative projection methods for structured sparsity regularization. Computation (2009)
Yang, M., Zhang, L., Feng, X., et al.: Fisher discrimination dictionary learning for sparse representation. In: International Conference on Computer Vision, pp. 543–550. IEEE (2011)
Jayaraman, D., Grauman, K.: Zero-shot recognition with unreliable attributes. In: NIPS, pp. 3464–3472 (2014)
Romera-Paredes, B., Torr, P.H.S.: An embarrassingly simple approach to zero-shot learning. In: ICML (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Liu, L., Wu, S., Chen, R., Zhou, M. (2017). Zero-Shot Image Classification via Coupled Discriminative Dictionary Learning. In: Yue, D., Peng, C., Du, D., Zhang, T., Zheng, M., Han, Q. (eds) Intelligent Computing, Networked Control, and Their Engineering Applications. ICSEE LSMS 2017 2017. Communications in Computer and Information Science, vol 762. Springer, Singapore. https://doi.org/10.1007/978-981-10-6373-2_37
Download citation
DOI: https://doi.org/10.1007/978-981-10-6373-2_37
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6372-5
Online ISBN: 978-981-10-6373-2
eBook Packages: Computer ScienceComputer Science (R0)