Research on HVS-Inspired, Parallel, and Hierarchical Scene Classification Framework

Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 215)

Abstract

A novel bionic, parallel, and hierarchical scene classification framework is presented in this paper. Moreover, we build the model based on the perception as defined by the human visual system. At first, we use an image pyramid to present both the global scene and local patches containing specific objects. Second, we build our own codebooks, which satisfy both long stare and short saccade similar to humans. Next, we train the visual words by generative and discriminative methods, respectively, which could obtain the initial scene categories based on the potential semantics using the bag-of-words model. Then, we use a neural network to simulate a human decision process. This leads to the final scene category. Experiments show that the parallel, hierarchical image representation, and classification model obtain superior results with respect to accuracy.

Keywords

Image pyramid Visual codebook Generative method Discriminative method Neural network 

Notes

Acknowledgments

This work was financially supported by the Chinese People’s Public Security University Natural Science Foundation (2011LG08).

References

  1. 1.
    Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: IEEE conference on computer vision and pattern recognition, pp 1–8Google Scholar
  2. 2.
    Malisiewicz T, Efros AA (2008) Recognition by association via learning per-exemplar distances. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–8Google Scholar
  3. 3.
    Fei–Fei L, Perona P (2005) A Bayesian hierarchical model for learning natural scene categories, In: IEEE conference on computer vision and pattern recognition, pp 524–531Google Scholar
  4. 4.
    Talter BW (2009) Current understanding of eye guidance. Vis Cognition 17:777–789CrossRefGoogle Scholar
  5. 5.
    Talter BW, Vincent BT (2008) Systematic tendencies in scene viewing. J Eye Mov Res 2:1–18Google Scholar
  6. 6.
    Talter BW, Vincent BT (2009) The prominence of behavioural biases in eye guidance. Vis Cognition 17:1029–1054Google Scholar
  7. 7.
    Grauman K, Darrell T (2005) Pyramid match kernels: discriminative classification with sets of image features. In: Proceedings of the international conference on computer vision, pp 1–8Google Scholar
  8. 8.
    Lowe DG (1999) Object recognition from local scale-invariant features. In: International conference on computer vision, Corfu, Greece, pp 1150–1157Google Scholar
  9. 9.
    Hofmann T (1998) Probabilistic latent semantic indexing. In: Proceedings of the SIGIR conference research and development in information retrieval, pp 1–13Google Scholar
  10. 10.
    Torralba A (2003) Contextual priming for object detection. Int J Comput Vision 53(2):169–191CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2014

Authors and Affiliations

  1. 1.Department of Policing IntelligenceChinese People’s Public Security UniversityBeijingChina
  2. 2.Public Security Intelligence Research CenterChinese People’s Public Security UniversityBeijingChina

Personalised recommendations