Learning rich features from objectness estimation for human lying-pose detection

Xia, Dao-Xun; Su, Song-Zhi; Geng, Li-Chuan; Wu, Guo-Xi; Li, Shao-Zi

doi:10.1007/s00530-016-0518-5

Learning rich features from objectness estimation for human lying-pose detection

Regular Paper
Published: 17 May 2016

Volume 23, pages 515–526, (2017)
Cite this article

Multimedia Systems Aims and scope Submit manuscript

Dao-Xun Xia^1,2,
Song-Zhi Su¹,
Li-Chuan Geng³,
Guo-Xi Wu⁴ &
…
Shao-Zi Li¹

629 Accesses
10 Citations
Explore all metrics

Abstract

Lying-pose human detection is an active research field of computer vision in recent years. It has a good theoretical significance and furthermore many applications, such as victim detection or home service robot. But the study on lying-pose human detection in low-altitude overlooking images have many unsolved problems owing to multiple poses, arbitrary orientation, in-plane rotation, perspective distortion, and time-consuming. In this paper, the proposed framework of human lying-pose detection is optimization and machine learning algorithms inspired by processes of neurobiology suggest and human vision system to select possible object locations. First, the proposed model effectively utilizes binarized normed gradient features to obtain the objectness rapidly based on the vision saliency. Further, deep-learning techniques based on the convolution neural network are trained for learning rich feature hierarchies, in order to obtain the object of lying-pose human from objectness estimation, unlike the classical sliding-window algorithm. Eventually, employed pyramid mean-shift algorithm and rotation-angle recovery method to find position and direction of human lying-pose. The experimental results show that our method is rapid and efficient, and that it achieves state-of-the-art results with our XMULP dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Human Pose Estimation via Deep Part Detection

Multi-Human Pose Estimation by Deep Learning-Based Sequential Approach for Human Keypoint Position and Human Body Detection

Article 28 October 2023

Lightweight human pose estimation: CVC-net

Article 07 March 2022

Notes

References

Andreopoulos, A., John, K.T.: 50 years of object recognition: directions forward. Comput. Vis. Image Underst. 8(117), 827–891 (2013)
Article Google Scholar
Benenson, R., Mathias, M., Timofte, R., Gool, L.V.: Pedestrian detection at 100 frames per second. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2903–2910 (2012)
Dollar, P., Wojek, C., Schiele, B., Perona, P.: Pedestrian detection: an evaluation of the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 34(4), 743–761 (2012)
Article Google Scholar
Song-zhi, S., Shao-zi, L., Shu-yuan, C., Guo-rong, C., Yundong, W.: A survey on pedestrian detection. Dianzi Xuebao (Acta Electronica Sinica) 40(4), 814–820 (2012)
Google Scholar
Geronimo, D., Lopez, A.M.: Vision-Based Pedestrian Protection Systems for Intelligent Vehicles, Springer Briefs in Computer Science. Springer, Berlin (2014)
Book Google Scholar
Wang, X.G., Wang, M., Li, W.: Scene-specific pedestrian detection for static video surveillance. IEEE Trans. Pattern Anal. Mach. Intell. 36(2), 361–374 (2014)
Article Google Scholar
Wang, S.M.: Lying pose recognition for elderly fall detection. Robot. Sci. Syst. VII 1, 345–353 (2012)
Google Scholar
Muhammad, M., Ling, S., Luke, S.: A survey on fall detection: principles and approaches. Neurocomputing 100, 144–152 (2013)
Article Google Scholar
Xia, D.X., Su, S.Z., Li, S.Z., Pierre-Marc, J.: Pose-specific lying human detection with samples expanding. In: IEEE International Conference on Image Processing, Oct. 2014
Zhang, Z., Conly, C., Athitsos, V.: A survey on vision-based fall detection. In: 8th ACM Int. Conf. on PErvasive Technologies Related to Assistive Environments, pp. 1–7, July 2015
Yang, D., Kriegman, M.H., Ahuja, N.: Detecting faces in images: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 24(1), 34–58 (2002)
Article Google Scholar
Fan, H. et al.: Learning deep face representation. In: CoRR (2014). arXiv:abs/1403.2802
Mirmahboub, B., Samavi, S., Karimi, N., Shirani, S.: Automatic monocular system for human fall detection based on variations in silhouette area. IEEE Trans. Biomed. Eng. 60(2), 427–436 (2013)
Article Google Scholar
Rudol, P., Doherty, P.: Human body detection and geolocalization for UAV search and rescue missions using color and thermal imagery. In: IEEE Aerospace Conference, pp. 1–8 (2008)
Diraco, G., Leone, A., Siciliano, P.: An active vision system for fall detection and posture recognition in elderly healthcare. In: Europe Conference and Exhibition on Design, Automation and Test, pp. 1536–1541, Mar. 2010
Yu, X.G.: Approaches and principles of fall detection for elderly and patient. In: 10th Int. Conf. on e-health Networking, Applications and Services, pp. 42–47, July 2008
Doherty, P., Rudol, P.: A UAV search and rescue scenario with human body detection and geolocalization. In: Advances in Artificial Intelligence, pp. 1–13. Springer, Berlin (2007)
Papageorgiou, C., Poggio, T.: A trainable system for object detection. Int. J. Comput. Vis. 38(1), 15–33 (2000)
Article MATH Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587, June 2014
Jia, Y.Q., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama S., Darrell, T.: Caffe: convolutional architecture for fast feature embedding. In: Proceedings of the ACM International Conference on Multimedia. ACM, pp. 675–678 (2014)
Heitz, G., Koller, D.: Learning spatial context: Using stuff to find things. In: European Conference on Computer Vision, of Lecture Notes in Computer Science. Springer, Berlin, pp. 30–43 (2008)
Bogdan, A., Thomas, D., Vittorio, F.: Measuring the objectness of image windows. IEEE Trans. Pattern Anal. Mach. Intell. 34(11), 2189–2202 (2012)
Article Google Scholar
Cheng, M.M., Ziming, Z., Wen-Yan, L., Philip, T.: BING: binarized normed gradients for objectness estimation at 300fps. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3286–3293 (2014)
Xia D.X., Li, S.Z.: Rotation angle recovery for rotation invariant detector in lying pose human body detection. J. Eng. (2015). doi:10.1049/joe.2015.0032
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. Int. Conf. Comput. Vis. Pattern Recognit. 2, 886–893 (2005)
Google Scholar
Wang, X.Y., Han, T.X., Yan, S.C.: An hog-lbp human detector with partial occlusion handling. In: IEEE 12th Int. Conf. on, Computer Vision, pp. 32–39, Sept. 2009
Dollar, P., Tu, Z., Perona, P., Belongie, S.: Integral channel features. In: Proc. British Machine Vision Conference, pp. 1–11, Sept. 2009
Ren, X.F., Ramanan, D.: Histograms of sparse codes for object detection. In: IEEE Conf. Computer Vision and Pattern Recognition, pp. 3246–3253, June 2013
Dollar, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. Pattern Anal. Mach. Intell. 36(8), 1532–1545 (2014)
Article Google Scholar
Costea, A.D., Nedevschi, S.: Word channel based multiscale pedestrian detection without image resizing and using only one classifier. In: IEEE Conf. Computer Vision and Pattern Recognition, pp.2393–2400, June 2014
Mohan, A., Papageorgiou, C., Poggio, T.: Example-based object detection in images by components. IEEE Trans. Pattern Anal. Mach. Intell. 23(4), 349–361 (2001)
Article Google Scholar
Felzenszwalb, P.F., Girshick, R.B., McAllester, D.: Cascade object detection with deformable part models. In: IEEE Conf. Computer Vision and Pattern Recognition, pp. 2241–2248, June 2010
Yan, J.J., Lei, Z., Wen, L.Y., Li, S.Z.: The fastest deformable part model for object detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2497–2504, June 2014
Zhang, S., Bauckhage, C., Cremers, A.B.: Informed Haar-Like features improve pedestrian detection. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 947–954, June 2014
Leibe, B., Leonardis, A., Schiele, B.: Robust object detection with interleaved categorization and segmentation. Int. J. Comput. Vis. 77(1–3), 259–289 (2008)
Article Google Scholar
Gall, J., Lempitsky, V.: Class-specific hough forests for object detection. In: IEEE Conf. Computer Vision and Pattern Recognition, pp. 1022–1029, June 2009
Ouyang, W.L., Wang, X.G.: A discriminative deep model for pedestrian detection with occlusion handling. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3258–3265, June 2012
Ouyang, W.L., Wang, X.G.: Joint deep learning for pedestrian detection. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 2056–2063, Dec. 2013
Luo, P., Tian, Y.L., Wang, X.G., Tang, X.O.: Switchable deep network for pedestrian detection. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 899–906, June 2014
Amin, M.G., Zhang, Y.D., Ahmad, F., Ho, K.C.D.: Radar signal processing for elderly fall detection: the future for in-home monitoring. IEEE Signal Process. Mag. 33(2), 71–80 (2016)
Article Google Scholar
Zhang, C.Y., Tian, Y.L., Capezuti, E.: Privacy preserving automatic fall detection for elderly using RGBD cameras. In: 13th International Conference on Computers Helping People with Special Needs, no. 9, pp. 625–633, July 2012
Wand, R.D., Zhang, Y.L., Dong, L.P., Lu, J.W,. Zhang, Z.Q., He, X.: Fall detection algorithm for the elderly based on human characteristic matrix and SVM. In: 15th Int. Conf. on Control, Automation and Systems, pp. 1190–1195, Oct. 2015
Wang, S., Xu, Z.W., Yang, Y., Li, X., Pang, C.Y., Alexander, G.: Fall detection in multi-camera surveillance videos: experimentations and observations. In: 1st ACM Int. Workshop on Multimedia Indexing and Information Retrieval for Healthcare, pp. 33–38, Oct. 2013
Feng, W.G., Liu, R., Zhu, M.: Fall detection for elderly person care in a vision-based home surveillance environment using a monocular camera, on Signal. Image Video Process. 8, 1129–1138 (2014)
Article Google Scholar
Liu, C.L., Lee, C.H., Lin, P.M.: A fall detection system using k-nearest neighbor classifier. Expert Syst. Appl. 37(10), 7174–7181 (2010)
Article Google Scholar
Tasoulis, S.K., Doukas, C.N., Plagianakos, V.P., Maglogiannis, I.: Statistical data mining of streaming motion data for activity and fall recognition in assistive environments. Neurocomputing 107, 87–96 (2013)
Article Google Scholar
Htike, Z.Z., Egerton, S., Chow, K.Y.: A monocular view-invariant fall detection system for the elderly in assisted home environments. In: 7th Int. Conf. on Intelligent Environments, pp. 40–46, July 2011
Yu, M., Yu, Y., Rhuma, A., Naqvi, S.M.R., Wang, L., Chambers, J.A.: An online one class support vector machine-based person-specific fall detection system for monitoring an elderly individual in a room environment. IEEE J. Biomed. Health Inform. 17(6), 1002–1014 (2013)
Article Google Scholar
Alazrai, R., Zmily, A., Mowafi, Y.: Fall detection for elderly using anatomical-plane-based representation. In: 36th Annual Int. Conf. of the IEEE on Engineering in Medicine and Biology Society, pp. 5916–5919, Aug. 2014
Andriluka, M., Schnitzspan, P., Meyer, J., Kohlbrecher, S., Petersen, K., Stryk, O.V., Roth, S., Schiele, B.: Vision based victim detection from unmanned aerial vehicles. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1740–1747 (2010)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)
Article Google Scholar
Felzenszwalb, P., Huttenlocher, D.: Pictorial structures for object recognition. Int. J. Comput. Vis. 61(1), 55–79 (2005)
Article Google Scholar
Bourdev, L., Malik, J.: Poselets: body part detectors trained using 3D human pose annotations. In: IEEE 12th Int. Conf. on Computer Vision, pp. 1365–1372, Sept. 2009
Zhang, Z.M., Warrell, J., Torr, P.H.S.: Proposal generation for object detection using cascaded ranking SVMs. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1497–1504, June 2011

Download references

Acknowledgments

This work was supported by the Nature Science Foundation of China (No. 61202143), the Collaborative Innovation Special Foundation of Xuchang University (No. XCUXT2014-08), and the Natural Science Foundation of Guizhou Province (No. QKHLHZi [2015] 7784).

Author information

Authors and Affiliations

School of Information Science and Technology, Xiamen University, Xiamen, 361005, Fujian, China
Dao-Xun Xia, Song-Zhi Su & Shao-Zi Li
School of Big Data and Computer Science, Guizhou Normal University, Guiyang, 550001, Guizhou, China
Dao-Xun Xia
School of Urban Planning and Landscaping, Xuchang University, Xuchang, 461000, Hebei, China
Li-Chuan Geng
The UAVLARS Collaborative Innovation Center, Xuchang University, Xuchang, 461000, Hebei, China
Guo-Xi Wu

Authors

Dao-Xun Xia
View author publications
You can also search for this author in PubMed Google Scholar
Song-Zhi Su
View author publications
You can also search for this author in PubMed Google Scholar
Li-Chuan Geng
View author publications
You can also search for this author in PubMed Google Scholar
Guo-Xi Wu
View author publications
You can also search for this author in PubMed Google Scholar
Shao-Zi Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shao-Zi Li.

Additional information

Communicated by S. Kopf.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xia, DX., Su, SZ., Geng, LC. et al. Learning rich features from objectness estimation for human lying-pose detection. Multimedia Systems 23, 515–526 (2017). https://doi.org/10.1007/s00530-016-0518-5

Download citation

Received: 30 July 2015
Accepted: 28 April 2016
Published: 17 May 2016
Issue Date: July 2017
DOI: https://doi.org/10.1007/s00530-016-0518-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Learning rich features from objectness estimation for human lying-pose detection

Abstract

Access this article

Similar content being viewed by others

Human Pose Estimation via Deep Part Detection

Multi-Human Pose Estimation by Deep Learning-Based Sequential Approach for Human Keypoint Position and Human Body Detection

Lightweight human pose estimation: CVC-net

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Learning rich features from objectness estimation for human lying-pose detection

Abstract

Access this article

Similar content being viewed by others

Human Pose Estimation via Deep Part Detection

Multi-Human Pose Estimation by Deep Learning-Based Sequential Approach for Human Keypoint Position and Human Body Detection

Lightweight human pose estimation: CVC-net

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation