A Channel-Cascading Pedestrian Detection Network for Small-Size Pedestrians

He, Jiaojiao; Liu, Ken; Zhang, Yongping; Yao, Tuozhong; Zhao, Zhongjie; Xiao, Jiangjian; Peng, Chengbin

doi:10.1007/978-3-030-02698-1_28

Jiaojiao He^17,18,
Ken Liu^18,19,
Yongping Zhang¹⁸,
Tuozhong Yao¹⁸,
Zhongjie Zhao¹⁷,
Jiangjian Xiao²⁰ &
…
Chengbin Peng²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11266))

Included in the following conference series:

International Conference on Intelligent Science and Big Data Engineering

1773 Accesses
1 Citations

Abstract

At present, there are several new challenges for multi-scale pedestrian detection in wide-angle field of view, especially small-size pedestrians. So the problem is how we can detect pedestrians efficiently and accurately with limited resources in wide-angle field of vision. In this work, we propose a Channel-Cascading pedestrian detection network for small-size pedestrians. In combination with the two-stage idea of Faster-RCNN in our detector, the optimized network was applied and the regional proposal network was improved. We propose a novel feature extraction network as optimized network, which we call the “Channel-Cascading Network” (CCN), that fuses information between channels by progressive cascading strategy and adapts our idea to other network designs. The experimental results show that our detector performs better for small-size pedestrians, it not only the precision of pedestrian detection in wide field of view is greatly improved especially small-size pedestrian, but also the speed is accelerated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Zhang, Q.: Research on pedestrian detection methods on still images. University of Science and Technology of China (2015). (In Chinese)
Google Scholar
Lin, T.Y., et al.: Feature pyramid networks for object detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 936–944 (2016)
Google Scholar
Wang, B.: Pedestrian Detection Based on Deep Learning. Beijing Jiaotong University (2015). (In Chinese)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: International Conference on Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 91–110 (2004)
Article MathSciNet Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 886–893 (2005)
Google Scholar
Zhu, Q., et al.: Fast human detection using a cascade of histograms of oriented gradients. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 1491–1498 (2006)
Google Scholar
Chen, P.H., Lin, C.J.: A Tutorial on -support vector machines. In: Applied Stochastic Models in Business & Industry, vol. 21, No. 2, pp. 111–136 (2005)
Article MathSciNet Google Scholar
Felzenszwalb, P.F., et al.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)
Article Google Scholar
Wang, X.: An HOG-LBP human detector with partial occlusion handling. In: Proceedings of IEEE International Conference on Computer Vision, September, Kyoto, Japan, pp. 32–39 (2009)
Google Scholar
Kuo, W., Hariharan, B., Malik, J.: DeepBox: learning objectness with convolutional networks. In: IEEE International Conference on Computer Vision, pp. 2479–2487 (2015)
Google Scholar
Girshick, R., Donahue, J., Darrell, T., et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Google Scholar
He, K., et al.: Spatial pyramid pooling in deep convolutional networks for visual recognition. In: IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015)
Article Google Scholar
Girshick, R.: Fast R-CNN. Computer Science, pp. 1440–1448 (2015)
Google Scholar
Ren, S., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: International Conference on Neural Information Processing Systems, pp. 91–9 (2015)
Google Scholar
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 6517–6525 (2016)
Google Scholar
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2018)
Google Scholar
Liu, W., et al.: SSD: single shot multibox detector. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 21–37 (2016)
Chapter Google Scholar
He, K., et al.: Deep residual learning for image recognition. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2015)
Google Scholar
Kong, T., et al.: HyperNet: towards accurate region proposal generation and joint object detection. In: Computer Vision and Pattern Recognition (2016)
Google Scholar
Cai, Z., Fan, Q., Feris, Rogerio, S., Vasconcelos, N.: A unified multi-scale deep convolutional neural network for fast object detection. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9908, pp. 354–370. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46493-0_22
Chapter Google Scholar
Hariharan, B., et al.: Hypercolumns for object segmentation and fine-grained localization. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 447–456 (2014)
Google Scholar
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: IEEE on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
Google Scholar
Sermanet, P., Kavukcuoglu, K., Chintala, S., et al.: Pedestrian detection with unsupervised multi-stage feature learning. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3626–3633 (2013)
Google Scholar
Simonyan, K., Zisserman, A.: Very Deep Convolutional Networks for Computer Science. pp. 730–734 (2014)
Google Scholar

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (N0. 61771270), the Natural Science Foundation of Zhejiang Province (No. 2017A610109) and (LQ15F020004), Key research and development plan of Zhejiang province (2018C01086).

Author information

Authors and Affiliations

Institute of Electronic Information and Control Engineering, Chang’an University, Xi’an, 713100, Shaanxi, China
Jiaojiao He & Zhongjie Zhao
School of Electronic and Information Engineering, Ningbo University of Technology, Ningbo, 315211, Zhejiang, China
Jiaojiao He, Ken Liu, Yongping Zhang & Tuozhong Yao
Institute of Information, Chang’an University, Xi’an, 713100, Shaanxi, China
Ken Liu
Ningbo Institution of Industrial Technology, Chinese Academy of Sciences, Ningbo, 315201, Zhejiang, China
Jiangjian Xiao & Chengbin Peng

Authors

Jiaojiao He
View author publications
You can also search for this author in PubMed Google Scholar
Ken Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yongping Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Tuozhong Yao
View author publications
You can also search for this author in PubMed Google Scholar
Zhongjie Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Jiangjian Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Chengbin Peng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yongping Zhang .

Editor information

Editors and Affiliations

Peking University, Beijing, China
Yuxin Peng
Shanghai Jiao Tong University, Shanghai, China
Kai Yu
Tsinghua University, Beijing, China
Jiwen Lu
Central China Normal University, Wuhan, China
Xingpeng Jiang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

He, J. et al. (2018). A Channel-Cascading Pedestrian Detection Network for Small-Size Pedestrians. In: Peng, Y., Yu, K., Lu, J., Jiang, X. (eds) Intelligence Science and Big Data Engineering. IScIDE 2018. Lecture Notes in Computer Science(), vol 11266. Springer, Cham. https://doi.org/10.1007/978-3-030-02698-1_28

Download citation

DOI: https://doi.org/10.1007/978-3-030-02698-1_28
Published: 09 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-02697-4
Online ISBN: 978-3-030-02698-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics