Novel Sparse Kernel Manifold Learner for Image Classification Applications

Puthenputhussery, Ajit; Liu, Chengjun

doi:10.1007/978-3-319-52081-0_5

Ajit Puthenputhussery⁴ &
Chengjun Liu⁴

Part of the book series: Intelligent Systems Reference Library ((ISRL,volume 121 ))

636 Accesses
1 Citations

Abstract

This chapter presents a sparse kernel manifold learner framework for different image classification applications. First, a new DAISY Fisher vector (D-FV) feature is created by computing Fisher vectors on densely sampled DAISY features. Second, a WLD-SIFT Fisher vector (WS-FV) feature is developed by fusing the Weber local descriptors (WLD) with the SIFT descriptors, and the Fisher vectors are computed on the fused WLD-SIFT features. Third, an innovative fused Fisher vector (FFV) feature is developed by integrating the most expressive features of the D-FV, the WS-FV and the SIFT-FV features. The FFV feature is then further assessed in eight different color spaces and a novel fused color Fisher vector (FCFV) feature is computed by integrating the PCA features of the eight color FFV descriptors. Finally, we propose a sparse kernel manifold learner (SKML) method for learning a discriminative sparse representation by considering the local manifold structure and the label information based on the marginal Fisher criterion. The objective of the SKML method is to minimize the intraclass scatter and maximize the interclass separability which are defined based on the sparse criterion. The effectiveness of the proposed SKML method is assessed on different image classification datasets such as the Painting-91 dataset for computational art painting classification, the CalTech-101 dataset for object categorization, and the 15 Scenes dataset for scene classification . Experimental results show that our proposed method is able to achieve better performance than other popular image descriptors and learning methods for different visual recognition applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Amiri, S.M., Nasiopoulos, P., Leung, V.C.M.: Non-negative sparse coding for human action recognition. In: 2012 19th IEEE International Conference on Image Processing, pp. 1421–1424 (2012)
Google Scholar
Bosch, A., Zisserman, A., Munoz, X.: Representing shape with a spatial pyramid kernel. In: Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR ’07, pp. 401–408 (2007)
Google Scholar
Cai, D., He, X., Zhou, K., Han, J., Bao, H.: Locality sensitive discriminant analysis. In: Proceedings of the 20th International Joint Conference on Artifical Intelligence, IJCAI’07, pp. 708–713 (2007)
Google Scholar
Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: delving deep into convolutional nets. In: BMVC (2014)
Google Scholar
Chen, J., Shan, S., He, C., Zhao, G., Pietikainen, M., Chen, X., Gao, W.: Wld: a robust local image descriptor. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1705–1720 (2010)
Article Google Scholar
Chen, S., Liu, C.: Clustering-based discriminant analysis for eye detection. IEEE Trans. Image Processing 23(4), 1629–1638 (2014)
Article MathSciNet Google Scholar
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In: CVPRW, pp. 178–178 (2004)
Google Scholar
Fukunaga, K.: Introduction to Statistical Pattern Recognition. Academic Press Professional Inc, San Diego, CA, USA (1990)
MATH Google Scholar
Gao, S., Tsang, I.W.H., Chia, L.T.: Laplacian sparse coding, hypergraph laplacian sparse coding, and applications. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 92–104 (2013)
Article Google Scholar
Gao, Z., Liu, A., Zhang, H., Xu, G., Xue, Y.: Human action recognition based on sparse representation induced by l1/l2 regulations. In: 21st International Conference on Pattern Recognition (ICPR), pp. 1868–1871 (2012)
Google Scholar
Goh, H., Thome, N., Cord, M., Lim, J.H.: Learning deep hierarchical visual feature coding. IEEE Trans. Neural Netw. Learn. Syst. 25(12), 2212–2225 (2014)
Article Google Scholar
Guo, Z., Zhang, D., Zhang, D.: A completed modeling of local binary pattern operator for texture classification. IEEE Trans. Image Process. 19(6), 1657–1663 (2010)
Article MathSciNet Google Scholar
He, X., Yan, S., Hu, Y., Niyogi, P., Zhang, H.J.: Face recognition using laplacianfaces. IEEE Trans. Pattern Anal. Mach. Intell. 27(3), 328–340 (2005)
Article Google Scholar
Jegou, H., Perronnin, F., Douze, M., Sanchez, J., Perez, P., Schmid, C.: Aggregating local image descriptors into compact codes. IEEE Trans. Pattern Anal. Mach. Intell. 34(9), 1704–1716 (2012)
Article Google Scholar
Jiang, Z., Lin, Z., Davis, L.S.: Label consistent k-svd: learning a discriminative dictionary for recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35(11), 2651–2664 (2013)
Article Google Scholar
Khan, F., Beigpour, S., van de Weijer, J., Felsberg, M.: Painting-91: a large scale database for computational painting categorization. Mach. Vision Appl. 25(6), 1385–1397 (2014)
Article Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of the IEEE Conference on CVPR, vol. 2, pp. 2169–2178 (2006)
Google Scholar
Li, Q., Schonfeld, D.: Multilinear discriminant analysis for higher-order tensor data classification. IEEE Trans. Pattern Anal. Mach. Intell. 36(12), 2524–2537 (2014)
Article Google Scholar
Liu, C.: Enhanced independent component analysis and its application to content based face image retrieval. Trans. Sys. Man Cyber. Part B 34(2), 1117–1127 (2004). doi:10.1109/TSMCB.2003.821449
Liu, C.: Gabor-based kernel pca with fractional power polynomial models for face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 26(5), 572–581 (2004)
Article Google Scholar
Liu, C.: Extracting discriminative color features for face recognition. Pattern Recogn. Lett. 32(14), 1796–1804 (2011)
Article Google Scholar
Liu, C.: Discriminant analysis and similarity measure. Pattern Recogn. 47(1), 359–367 (2014)
Article Google Scholar
Liu, C., Wechsler, H.: Gabor feature based classification using the enhanced fisher linear discriminant model for face recognition. Trans. Img. Proc. 11(4), 467–476 (2002). doi:10.1109/TIP.2002.999679
Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)
Article Google Scholar
Ojala, T., Pietikainen, M., Maenpaa, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 971–987 (2002)
Article MATH Google Scholar
Olshausen, B., Field, D.: Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381(6583), 607–609 (1996)
Article Google Scholar
Olshausen, B., Field, D.: Sparse coding with an overcomplete basis set: a strategy employed by v1? Vision Res. 37(23), 3311–3325 (1997)
Article Google Scholar
Peng, K.C., Chen, T.: Cross-layer features in convolutional neural networks for generic classification tasks. In: 2015 IEEE International Conference on Image Processing (ICIP), pp. 3057–3061 (2015)
Google Scholar
Peng, K.C., Chen, T.: A framework of extracting multi-scale features using multiple convolutional neural networks. In: 2015 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6 (2015)
Google Scholar
Rathus, L.: Foundations of Art and Design. Wadsworth Cengage Learning, Boston, MA (2008)
Google Scholar
van de Sande, K., Gevers, T., Snoek, C.: Evaluating color descriptors for object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1582–1596 (2010)
Article Google Scholar
Shechtman, E., Irani, M.: Matching local self-similarities across images and videos. In: CVPR, pp. 1–8 (2007)
Google Scholar
Simonyan, K., Parkhi, O.M., Vedaldi, A., Zisserman, A.: Fisher Vector Faces in the Wild. In: BMVC (2013)
Google Scholar
Sinha, A., Banerji, S., Liu, C.: New color gphog descriptors for object and scene image classification. Mach. Vis. Appl. 25(2), 361–375 (2014)
Article Google Scholar
Tola, E., Lepetit, V., Fua, P.: Daisy: an efficient dense descriptor applied to wide-baseline stereo. IEEE Trans. Pattern Anal. Mach. Intell. 32(5), 815–830 (2010)
Article Google Scholar
Wang, H., Yuan, C., Hu, W., Ling, H., Yang, W., Sun, C.: Action recognition using nonnegative action component representation and sparse basis selection. IEEE Trans. Image Process. 23(2), 570–581 (2014)
Article MathSciNet Google Scholar
Wang, J., Wonka, P., Ye, J.: Lasso screening rules via dual polytope projection. J. Mach. Learn. Res. 16, 1063–1101 (2015)
MathSciNet MATH Google Scholar
Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., Gong, Y.: Locality-constrained linear coding for image classification. In: Proceedings of the IEEE Conference on CVPR, pp. 3360–3367 (2010)
Google Scholar
Wang, J., Zhou, J., Liu, J., Wonka, P., Ye, J.: A safe screening rule for sparse logistic regression. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems, vol. 27, pp. 1053–1061 (2014)
Google Scholar
van de Weijer, J., Schmid, C., Verbeek, J., Larlus, D.: Learning color names for real-world applications. IEEE Trans. Image Process. 18(7), 1512–1523 (2009)
Article MathSciNet Google Scholar
Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y.: Robust face recognition via sparse representation. IEEE Transa. Pattern Anal. Mach. Intell. 31(2), 210–227 (2009)
Article Google Scholar
Xiang, Z.J., Ramadge, P.J.: Fast lasso screening tests based on correlations. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2137–2140 (2012)
Google Scholar
Xiang, Z.J., Xu, H., Ramadge, P.J.: Learning sparse representations of high dimensional data on large scale dictionaries. In: Shawe-Taylor, J., Zemel, R.S., Bartlett, P.L., Pereira, F., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems, pp. 900–908 (2011)
Google Scholar
Xin, M., Zhang, H., Sun, M., Yuan, D.: Recurrent temporal sparse autoencoder for attention-based action recognition. In: 2016 International Joint Conference on Neural Networks (IJCNN), pp. 456–463 (2016)
Google Scholar
Yan, S., Xu, D., Zhang, B., Zhang, H., Yang, Q., Lin, S.: Graph embedding and extensions: a general framework for dimensionality reduction. IEEE Trans. Pattern Anal. Mach. Intell. 29(1), 40–51 (2007)
Google Scholar
Yan, Y., Ricci, E., Subramanian, R., Liu, G., Sebe, N.: Multitask linear discriminant analysis for view invariant action recognition. IEEE Trans. Image Process. 23(12), 5599–5611 (2014)
Article MathSciNet Google Scholar
Yang, M., Zhang, L., Feng, X., Zhang, D.: Fisher discrimination dictionary learning for sparse representation. In: 2011 International Conference on Computer Vision, pp. 543–550 (2011)
Google Scholar
Yang, M., Zhang, L., Feng, X., Zhang, D.: Sparse representation based fisher discrimination dictionary learning for image classification. Int. J. Comput. Vision 109(3), 209–232 (2014)
Article MathSciNet MATH Google Scholar
Yuan, C., Hu, W., Tian, G., Yang, S., Wang, H.: Multi-task sparse learning with beta process prior for action recognition. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 423–429 (2013)
Google Scholar
Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: European Conference on Computer Vision, pp. 818–833. Springer (2014)
Google Scholar
Zhang, H., Berg, A.C., Maire, M., Malik, J.: Svm-knn: discriminative nearest neighbor classification for visual category recognition. In: Proceedings of the IEEE Conference on CVPR, vol. 2, pp. 2126–2136 (2006)
Google Scholar
Zhang, Q., Li, B.: Discriminative k-svd for dictionary learning in face recognition. In: Proceedings of the IEEE Conference on CVPR, pp. 2691–2698 (2010)
Google Scholar
Zhang, X., Chu, D., Tan, R.C.E.: Sparse uncorrelated linear discriminant analysis for undersampled problems. IEEE Trans. Neural Netw. Learn. Syst. 27(7), 1469–1485 (2016)
Article MathSciNet Google Scholar
Zheng, J., Jiang, Z.: Learning view-invariant sparse representations for cross-view action recognition. In: 2013 IEEE International Conference on Computer Vision, pp. 3176–3183 (2013)
Google Scholar
Zhou, B., Lapedriza, A., Xiao, J., Torralba, A., Oliva, A.: Learning deep features for scene recognition using places database. In: Proceedings of the NIPS, pp. 487–495 (2014)
Google Scholar
Zhou, N., Shen, Y., Peng, J., Fan, J.: Learning inter-related visual dictionary for object recognition. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3490–3497 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

New Jersey Institute of Technology, Newark, NJ, 07102, USA
Ajit Puthenputhussery & Chengjun Liu

Authors

Ajit Puthenputhussery
View author publications
You can also search for this author in PubMed Google Scholar
Chengjun Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Ajit Puthenputhussery or Chengjun Liu .

Editor information

Editors and Affiliations

Department of Computer Science, New Jersey Institute of Technology, University Heights, Newark, New Jersey, USA
Chengjun Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Puthenputhussery, A., Liu, C. (2017). Novel Sparse Kernel Manifold Learner for Image Classification Applications. In: Liu, C. (eds) Recent Advances in Intelligent Image Search and Video Retrieval. Intelligent Systems Reference Library, vol 121 . Springer, Cham. https://doi.org/10.1007/978-3-319-52081-0_5

Download citation

DOI: https://doi.org/10.1007/978-3-319-52081-0_5
Published: 19 April 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-52080-3
Online ISBN: 978-3-319-52081-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics