Advertisement

Human Gesture Recognition in Still Images Using GMM Approach

  • Soumya Ranjan Mishra
  • Tusar Kanti Mishra
  • Goutam Sanyal
  • Anirban Sarkar
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 695)

Abstract

Human gesture and activity recognition is an important topic, and it gains popularity in the field research in several sectors associated with computer vision. The requirements are still challenging, and researchers are proposing handful of methods to come up with those requirements. In this work, the objective is to compute and analyze native space-time features in a general experimentation for recognition of several human gestures. Particularly, we have considered four distinct feature extraction methods and six native feature representation methods. Thus, we have used a bag-of-features. As a classifier, the support vector machine (SVM) is used for classification purpose. The performance of the scheme has been analyzed using ten distinct gesture images that have been derived from the Willow 7-action dataset (Delaitre et al, Proceedings British Machine Vision Conference, 2010). Interesting experimental results are obtained that validates the efficiency of the proposed technique.

References

  1. 1.
    Delaitre, V., Laptev, I., Sivic, J.: Recognizing human actions in stilll images: a study of bag-of-features and part-based representations. In: Proceedings British Machine Vision Conference (2010)Google Scholar
  2. 2.
    Poppe, R.: A survey on vision-based human action recognition. Image Vis. Comput. 28(6), 97690 (2010)CrossRefGoogle Scholar
  3. 3.
    Cheng, G., Wan, Y., Saudagar, A., Namuduri, K., Buckles, B.: Advances in human action recognition: a survey, 130 (2015)Google Scholar
  4. 4.
    Everingham, M., Gool, L.V., Williams, C., Winn, J., Zisserman, A.: The PASCAL visual object classes challenge 2012 (VOC2012) results. http://www.pascalnetwork.org/challenges/VOC/voc2012/workshop/index.html
  5. 5.
    Wu, J., Zhang, Y., Lin, W.: Towards good practices for action video encoding. In: Proceedings IEEE International Conference on Computer Vision and Pattern Recognition, pp. 2577–2584 (2014)Google Scholar
  6. 6.
    Zhang, T., Zhang, Y., Cai, J., Kot, A.: Efficient object feature selection for action recognition. In: International Conference on Acoustics, Speech and Signal Processing (2016)Google Scholar
  7. 7.
    Guo, G.-D., Lai, A.: A survey on stilll image based human action recognition. Pattern Recogn. 47(10), 334–361 (2014)CrossRefGoogle Scholar
  8. 8.
    Maji, S., Bourdev, L., Malik, J.: Action recognition from a distributed representation of pose and appearance. In: Proceedings IEEE International Conference on Computer Vision and Pattern Recognition, pp. 3177–3184 (2011)Google Scholar
  9. 9.
    Hoai, M.: Regularized max pooling for image categorization. In: Proceedings British Machine Vision Conference (2014)Google Scholar
  10. 10.
    Oquab, M., Bottou, L., Laptev, I., Sivic, J.: Learning and transferring mid-level image representations using convolutional neural networks. In: Proceedings IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1717–1724 (2014)Google Scholar
  11. 11.
    Gupta, S., Malik, J.: Visual semantic role labeling (2015). arXiv:1505.0447
  12. 12.
    Gkioxari, G., Girshick, R., Malik, J.: Contextual action recognition with R*CNN. In: Proceedings IEEE International Conference on Computer Vision, pp. 1080–1088 (2015)Google Scholar
  13. 13.
    Sharma, G., Jurie, F., Schmid, C.: Expanded parts model for semantic description of humans in stilll images (2015). arXiv:1509.04186
  14. 14.
    Yang, H., Zhou, J.T., Zhang, Y., Gao, B.-B., Wu, J., Cai, J.: Exploit bounding box annotations for multi-label object recognition. In: Proceedings IEEE International Conference on Computer Vision and Pattern Recognition, pp. 280–288 (2016)Google Scholar
  15. 15.
    Gkioxari, G., Girshick, R., Malik, J.: Actions and attributes from wholes and parts. In: Proceedings IEEE International Conference on Computer Vision, pp. 2470–2478 (2015)Google Scholar
  16. 16.
    Prest, A., Schmid, C., Ferrari, V.: Weakly supervised learning of interactions between humans and objects. IEEE Trans. Pattern Anal. Mach. Intell. 34(3), 601–614 (2012)CrossRefGoogle Scholar
  17. 17.
    Mahapatra, A., Mishra, T.K., Sa, P.K., Majhi, B.: Human recognition system for outdoor videos using hidden markov model. AEU-Int. J. Electron. Commun. 68(3), 227–236 (2014)CrossRefGoogle Scholar
  18. 18.
    Rother, C., Kolmogorov, V., Blake, A.: GrabCutinteractive foreground extraction using iterated graph cuts. In: SIGGRAPH, pp. 309–314 (2004)Google Scholar
  19. 19.
    Sanchez, J., Perronnin, F., Mensink, T., Verbeek, J.: Image classifica- tion with the Fisher vector: theory and practice. Int. J. Comput. Vis. 105(3), 222–245 (2013)MathSciNetCrossRefGoogle Scholar
  20. 20.
    Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., Gong, Y.: Localityconstrained linear coding for image classification. In: Proceedings IEEE International Conference on Computer Vision and Pattern Recognition, pp. 3360–3367 (2010)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  • Soumya Ranjan Mishra
    • 1
    • 2
  • Tusar Kanti Mishra
    • 1
    • 2
  • Goutam Sanyal
    • 1
    • 2
  • Anirban Sarkar
    • 1
    • 2
  1. 1.Department of Computer Science and EngineeringNIT DurgapurDurgapurIndia
  2. 2.Department of Computer Science and EngineeringANITSVisakhapatnamIndia

Personalised recommendations