Introducing a Inter-frame Relational Feature Model for Pedestrian Detection

  • Andreas Zweng
  • Martin Kampel
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7944)


Pedestrian detection has been used with the help of various local features in still images such as histograms of oriented gradients (HOG), local binary patterns (LBP) and more recently, the histograms of optical flow (HOF). In order to improve the robustness of pedestrian detection, movement of people can be taken into the training process which has been done in the HOF descriptor. Optical flow is used to model the movement of a person and to detect actions in image sequences. For action recognition it is necessary to incorporate movement into models when using feature descriptors such as the HOF descriptor. In this paper we introduce a novel method to train and to detect human movement for pedestrian detection using relational gradient features within multiple consecutive frames. The goal of this descriptor is to detect pedestrians using multiple frames for moving cameras instead of static cameras. The relational features between consecutive frames help to robustly find pedestrians in image sequences due to a flexible detection algorithm. We demonstrate the robustness of the resulting feature model computed for a temporal time window of three frames. In our experiments we show the improvement regarding true positives as well as false positives using our inter-frame HOG (ifHOG) model compared to other feature descriptors.


pedestrian detection local features relational features machine learning histograms of oriented gradients 


  1. 1.
    Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2005), vol. 1, pp. 886–893 (June 2005)Google Scholar
  2. 2.
    Wang, X., Han, T.X., Yan, S.: An hog-lbp human detector with partial occlusion handling. In: IEEE 12th International Conference on Computer Vision (ICCV 2009), pp. 32–39 (October 2009)Google Scholar
  3. 3.
    Liao, W.H.: Region description using extended local ternary patterns. In: 20th International Conference on Pattern Recognition (ICPR 2010), pp. 1003–1006 (August 2010)Google Scholar
  4. 4.
    Felzenszwalb, P.F., Huttenlocher, D.P.: Pictorial structures for object recognition. International Journal of Computer Vision (IJCV 2005) 61(1), 55–79 (2005)CrossRefGoogle Scholar
  5. 5.
    Wu, B., Nevatia, R.: Detection of multiple, partially occluded humans in a single image by bayesian combination of edgelet part detectors. In: 10th IEEE International Conference on Computer Vision (ICCV 2005), vol. 1, pp. 90–97 (October 2005)Google Scholar
  6. 6.
    Ronfard, R., Schmid, C., Triggs, B.: Learning to parse pictures of people. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002, Part IV. LNCS, vol. 2353, pp. 700–714. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  7. 7.
    Watanabe, T., Ito, S., Yokoi, K.: Co-occurrence histograms of oriented gradients for pedestrian detection. In: Wada, T., Huang, F., Lin, S. (eds.) PSIVT 2009. LNCS, vol. 5414, pp. 37–47. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  8. 8.
    Tuzel, O., Porikli, F., Meer, P.: Human detection via classification on riemannian manifolds. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2007), pp. 1–8 (June 2007)Google Scholar
  9. 9.
    Ren, H., Heng, C.K., Zheng, W., Liang, L., Chen, X.: Fast object detection using boosted co-occurrence histograms of oriented gradients. In: 17th IEEE International Conference on Image Processing (ICIP 2010), pp. 2705–2708 (September 2010)Google Scholar
  10. 10.
    Yamauchi, Y., Matsushima, C., Yamashita, T., Fujiyoshi, H.: Relational hog feature with wild-card for object detection. In: IEEE International Conference on Computer Vision Workshops (ICCV 2011 Workshops), pp. 1785–1792 (November 2011)Google Scholar
  11. 11.
    Zweng, A., Kampel, M.: Improved relational feature model for people detection using histogram similarity functions. In: IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance (AVSS 2012), pp. 422–427 (September 2012)Google Scholar
  12. 12.
    Viola, P., Jones, M.J., Snow, D.: Detecting pedestrians using patterns of motion and appearance. International Journal of Computer Vision (IJCV 2005) 63(2), 153–161 (2005)CrossRefGoogle Scholar
  13. 13.
    Dalal, N., Triggs, B., Schmid, C.: Human detection using oriented histograms of flow and appearance. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3952, pp. 428–441. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  14. 14.
    Laptev, I., Marszalek, M., Schmid, C., Rozenfeld, B.: Learning realistic human actions from movies. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2008), pp. 1–8 (June 2008)Google Scholar
  15. 15.
    Kläser, A., Marszałek, M., Schmid, C.: A spatio-temporal descriptor based on 3d-gradients. In: British Machine Vision Conference, pp. 995–1004 (September 2008)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Andreas Zweng
    • 1
  • Martin Kampel
    • 1
  1. 1.Computer Vision LabVienna University of TechnologyViennaAustria

Personalised recommendations