Advertisement

Pedestrian Detection via Structure-Sensitive Deep Representation Learning

  • Deliang Huang
  • Shijia Huang
  • Hefeng WuEmail author
  • Ning Liu
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10666)

Abstract

Pedestrian detection is a fundamental task in a wide range of computer vision applications. Detecting the head-shoulder appearance is an attractive way for pedestrian detection, especially in scenes with crowd, heavy occlusion or large camera tilt angles. However, the head-shoulder part contains less information than the full human body, which requires better feature extraction to ensure the effectiveness of the detection. This paper proposes a head-shoulder detection method based on the convolutional neural network (CNN). According to the characteristics of the head and shoulders, our method integrates a structure-sensitive ROI pooling layer into the CNN architecture. The proposed CNN is trained in a multi-task scheme with classification and localization outputs. Furthermore, the convolutional layers of the network are pre-trained using a triplet loss to capture better features of the head-shoulder appearance. Extensive experimental results demonstrate that the average accuracy of the proposed method is 89.6% when the IoU threshold is 0.5. Our method obtains close results to the state-of-the-art method Faster R-CNN while outperforming it in speed. Even when the number of extracted candidate regions increases, the increased detection time is negligible. In addition, when the IoU threshold is greater than 0.6, the average accuracy of our method is higher than that of Faster R-CNN, which indicates that our results have higher IoU with ground truth.

Notes

Acknowledgement

This research is supported by Natural Science Foundation of Guangdong Province (2014A030310348, 2014A030313154), National Natural Science Foundation of China (61472455, 61402120), Guangdong Provincial Department of Science and Technology (GDST16EG04) 2016A050503024, and the Startup Program in Guangdong University of Foreign Studies (299-X5122029).

References

  1. 1.
    Liu, N., Wu, H., Lin, L.: Hierarchical ensemble of background models for PTZ-based video surveillance. IEEE Trans. Cybern. 45, 89–102 (2015)CrossRefGoogle Scholar
  2. 2.
    Dollár, P., Wojek, C., Schiele, B., Perona, P.: Pedestrian detection: an evaluation of the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 34, 743–761 (2012)CrossRefGoogle Scholar
  3. 3.
    Wu, H., Liu, N., Luo, X., Su, J., Chen, L.: Real-time background subtraction-based video surveillance of people by integrating local texture patterns. Sig. Image Video Process. 8, 665–676 (2014)CrossRefGoogle Scholar
  4. 4.
    Teichman, A., Thrun, S.: Practical object recognition in autonomous driving and beyond. In: Advanced Robotics and its Social Impacts (ARSO), pp. 35–38 (2011)Google Scholar
  5. 5.
    Li, M., Zhang, Z., Huang, K., Tan, T.: Rapid and robust human detection and tracking based on omega-shape features. In: IEEE International Conference on Image Processing, pp. 2545–2548 (2010)Google Scholar
  6. 6.
    Li, M., Zhang, Z., Huang, K., Tan, T.: Estimating the number of people in crowded scenes by MID based foreground segmentation and head-shoulder detection. In: International Conference on Pattern Recognition, pp. 1–4 (2008)Google Scholar
  7. 7.
    Zeng, C., Ma, H.: Robust head-shoulder detection by PCA-based multilevel HOG-LBP detector for people counting. In: International Conference on Pattern Recognition, pp. 2069–2072 (2010)Google Scholar
  8. 8.
    Wu, B., Nevatia, R.: Tracking of multiple humans in meetings. In: IEEE Conference on Computer Vision and Pattern Recognition Workshop, p. 143 (2006)Google Scholar
  9. 9.
    Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 886–893 (2005)Google Scholar
  10. 10.
    Wang, X., Han, T.X., Yan, S.: An HOG-LBP human detector with partial occlusion handling. In: IEEE International Conference on Computer Vision (ICCV), pp. 32–39 (2010)Google Scholar
  11. 11.
    Zhu, Q., Yeh, M.C., Cheng, K.T., Avidan, S.: Fast human detection using a cascade of histograms of oriented gradients. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1491–1498 (2006)Google Scholar
  12. 12.
    Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580–587 (2013)Google Scholar
  13. 13.
    He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37, 1904–1916 (2015)CrossRefGoogle Scholar
  14. 14.
    Girshick, R.: Fast R-CNN. In: IEEE International Conference on Computer Vision (ICCV), pp. 1440–1448 (2015)Google Scholar
  15. 15.
    Ren, S., He, K., Girshick, R.B., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39, 1137–1149 (2017)CrossRefGoogle Scholar
  16. 16.
    Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., Lecun, Y.: Overfeat: integrated recognition, localization and detection using convolutional networks. arXiv:1312.6229 (2015)
  17. 17.
    Schroff, F., Kalenichenko, D., Philbin, J.: Facenet: a unified embedding for face recognition and clustering. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 815–823 (2015)Google Scholar
  18. 18.
    Felzenszwalb, P.F., Girshick, R.B., McAllester, D.A., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32, 1627–1645 (2010)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Deliang Huang
    • 1
  • Shijia Huang
    • 2
  • Hefeng Wu
    • 2
    Email author
  • Ning Liu
    • 1
  1. 1.School of Data and Computer ScienceSun Yat-sen UniversityGuangzhouChina
  2. 2.School of Information Science and TechnologyGuangdong University of Foreign StudiesGuangzhouChina

Personalised recommendations