Abstract
Lying-pose human detection is an active research field of computer vision in recent years. It has a good theoretical significance and furthermore many applications, such as victim detection or home service robot. But the study on lying-pose human detection in low-altitude overlooking images have many unsolved problems owing to multiple poses, arbitrary orientation, in-plane rotation, perspective distortion, and time-consuming. In this paper, the proposed framework of human lying-pose detection is optimization and machine learning algorithms inspired by processes of neurobiology suggest and human vision system to select possible object locations. First, the proposed model effectively utilizes binarized normed gradient features to obtain the objectness rapidly based on the vision saliency. Further, deep-learning techniques based on the convolution neural network are trained for learning rich feature hierarchies, in order to obtain the object of lying-pose human from objectness estimation, unlike the classical sliding-window algorithm. Eventually, employed pyramid mean-shift algorithm and rotation-angle recovery method to find position and direction of human lying-pose. The experimental results show that our method is rapid and efficient, and that it achieves state-of-the-art results with our XMULP dataset.
Similar content being viewed by others
References
Andreopoulos, A., John, K.T.: 50 years of object recognition: directions forward. Comput. Vis. Image Underst. 8(117), 827–891 (2013)
Benenson, R., Mathias, M., Timofte, R., Gool, L.V.: Pedestrian detection at 100 frames per second. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2903–2910 (2012)
Dollar, P., Wojek, C., Schiele, B., Perona, P.: Pedestrian detection: an evaluation of the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 34(4), 743–761 (2012)
Song-zhi, S., Shao-zi, L., Shu-yuan, C., Guo-rong, C., Yundong, W.: A survey on pedestrian detection. Dianzi Xuebao (Acta Electronica Sinica) 40(4), 814–820 (2012)
Geronimo, D., Lopez, A.M.: Vision-Based Pedestrian Protection Systems for Intelligent Vehicles, Springer Briefs in Computer Science. Springer, Berlin (2014)
Wang, X.G., Wang, M., Li, W.: Scene-specific pedestrian detection for static video surveillance. IEEE Trans. Pattern Anal. Mach. Intell. 36(2), 361–374 (2014)
Wang, S.M.: Lying pose recognition for elderly fall detection. Robot. Sci. Syst. VII 1, 345–353 (2012)
Muhammad, M., Ling, S., Luke, S.: A survey on fall detection: principles and approaches. Neurocomputing 100, 144–152 (2013)
Xia, D.X., Su, S.Z., Li, S.Z., Pierre-Marc, J.: Pose-specific lying human detection with samples expanding. In: IEEE International Conference on Image Processing, Oct. 2014
Zhang, Z., Conly, C., Athitsos, V.: A survey on vision-based fall detection. In: 8th ACM Int. Conf. on PErvasive Technologies Related to Assistive Environments, pp. 1–7, July 2015
Yang, D., Kriegman, M.H., Ahuja, N.: Detecting faces in images: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 24(1), 34–58 (2002)
Fan, H. et al.: Learning deep face representation. In: CoRR (2014). arXiv:abs/1403.2802
Mirmahboub, B., Samavi, S., Karimi, N., Shirani, S.: Automatic monocular system for human fall detection based on variations in silhouette area. IEEE Trans. Biomed. Eng. 60(2), 427–436 (2013)
Rudol, P., Doherty, P.: Human body detection and geolocalization for UAV search and rescue missions using color and thermal imagery. In: IEEE Aerospace Conference, pp. 1–8 (2008)
Diraco, G., Leone, A., Siciliano, P.: An active vision system for fall detection and posture recognition in elderly healthcare. In: Europe Conference and Exhibition on Design, Automation and Test, pp. 1536–1541, Mar. 2010
Yu, X.G.: Approaches and principles of fall detection for elderly and patient. In: 10th Int. Conf. on e-health Networking, Applications and Services, pp. 42–47, July 2008
Doherty, P., Rudol, P.: A UAV search and rescue scenario with human body detection and geolocalization. In: Advances in Artificial Intelligence, pp. 1–13. Springer, Berlin (2007)
Papageorgiou, C., Poggio, T.: A trainable system for object detection. Int. J. Comput. Vis. 38(1), 15–33 (2000)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587, June 2014
Jia, Y.Q., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama S., Darrell, T.: Caffe: convolutional architecture for fast feature embedding. In: Proceedings of the ACM International Conference on Multimedia. ACM, pp. 675–678 (2014)
Heitz, G., Koller, D.: Learning spatial context: Using stuff to find things. In: European Conference on Computer Vision, of Lecture Notes in Computer Science. Springer, Berlin, pp. 30–43 (2008)
Bogdan, A., Thomas, D., Vittorio, F.: Measuring the objectness of image windows. IEEE Trans. Pattern Anal. Mach. Intell. 34(11), 2189–2202 (2012)
Cheng, M.M., Ziming, Z., Wen-Yan, L., Philip, T.: BING: binarized normed gradients for objectness estimation at 300fps. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3286–3293 (2014)
Xia D.X., Li, S.Z.: Rotation angle recovery for rotation invariant detector in lying pose human body detection. J. Eng. (2015). doi:10.1049/joe.2015.0032
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. Int. Conf. Comput. Vis. Pattern Recognit. 2, 886–893 (2005)
Wang, X.Y., Han, T.X., Yan, S.C.: An hog-lbp human detector with partial occlusion handling. In: IEEE 12th Int. Conf. on, Computer Vision, pp. 32–39, Sept. 2009
Dollar, P., Tu, Z., Perona, P., Belongie, S.: Integral channel features. In: Proc. British Machine Vision Conference, pp. 1–11, Sept. 2009
Ren, X.F., Ramanan, D.: Histograms of sparse codes for object detection. In: IEEE Conf. Computer Vision and Pattern Recognition, pp. 3246–3253, June 2013
Dollar, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. Pattern Anal. Mach. Intell. 36(8), 1532–1545 (2014)
Costea, A.D., Nedevschi, S.: Word channel based multiscale pedestrian detection without image resizing and using only one classifier. In: IEEE Conf. Computer Vision and Pattern Recognition, pp.2393–2400, June 2014
Mohan, A., Papageorgiou, C., Poggio, T.: Example-based object detection in images by components. IEEE Trans. Pattern Anal. Mach. Intell. 23(4), 349–361 (2001)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D.: Cascade object detection with deformable part models. In: IEEE Conf. Computer Vision and Pattern Recognition, pp. 2241–2248, June 2010
Yan, J.J., Lei, Z., Wen, L.Y., Li, S.Z.: The fastest deformable part model for object detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2497–2504, June 2014
Zhang, S., Bauckhage, C., Cremers, A.B.: Informed Haar-Like features improve pedestrian detection. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 947–954, June 2014
Leibe, B., Leonardis, A., Schiele, B.: Robust object detection with interleaved categorization and segmentation. Int. J. Comput. Vis. 77(1–3), 259–289 (2008)
Gall, J., Lempitsky, V.: Class-specific hough forests for object detection. In: IEEE Conf. Computer Vision and Pattern Recognition, pp. 1022–1029, June 2009
Ouyang, W.L., Wang, X.G.: A discriminative deep model for pedestrian detection with occlusion handling. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3258–3265, June 2012
Ouyang, W.L., Wang, X.G.: Joint deep learning for pedestrian detection. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 2056–2063, Dec. 2013
Luo, P., Tian, Y.L., Wang, X.G., Tang, X.O.: Switchable deep network for pedestrian detection. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 899–906, June 2014
Amin, M.G., Zhang, Y.D., Ahmad, F., Ho, K.C.D.: Radar signal processing for elderly fall detection: the future for in-home monitoring. IEEE Signal Process. Mag. 33(2), 71–80 (2016)
Zhang, C.Y., Tian, Y.L., Capezuti, E.: Privacy preserving automatic fall detection for elderly using RGBD cameras. In: 13th International Conference on Computers Helping People with Special Needs, no. 9, pp. 625–633, July 2012
Wand, R.D., Zhang, Y.L., Dong, L.P., Lu, J.W,. Zhang, Z.Q., He, X.: Fall detection algorithm for the elderly based on human characteristic matrix and SVM. In: 15th Int. Conf. on Control, Automation and Systems, pp. 1190–1195, Oct. 2015
Wang, S., Xu, Z.W., Yang, Y., Li, X., Pang, C.Y., Alexander, G.: Fall detection in multi-camera surveillance videos: experimentations and observations. In: 1st ACM Int. Workshop on Multimedia Indexing and Information Retrieval for Healthcare, pp. 33–38, Oct. 2013
Feng, W.G., Liu, R., Zhu, M.: Fall detection for elderly person care in a vision-based home surveillance environment using a monocular camera, on Signal. Image Video Process. 8, 1129–1138 (2014)
Liu, C.L., Lee, C.H., Lin, P.M.: A fall detection system using k-nearest neighbor classifier. Expert Syst. Appl. 37(10), 7174–7181 (2010)
Tasoulis, S.K., Doukas, C.N., Plagianakos, V.P., Maglogiannis, I.: Statistical data mining of streaming motion data for activity and fall recognition in assistive environments. Neurocomputing 107, 87–96 (2013)
Htike, Z.Z., Egerton, S., Chow, K.Y.: A monocular view-invariant fall detection system for the elderly in assisted home environments. In: 7th Int. Conf. on Intelligent Environments, pp. 40–46, July 2011
Yu, M., Yu, Y., Rhuma, A., Naqvi, S.M.R., Wang, L., Chambers, J.A.: An online one class support vector machine-based person-specific fall detection system for monitoring an elderly individual in a room environment. IEEE J. Biomed. Health Inform. 17(6), 1002–1014 (2013)
Alazrai, R., Zmily, A., Mowafi, Y.: Fall detection for elderly using anatomical-plane-based representation. In: 36th Annual Int. Conf. of the IEEE on Engineering in Medicine and Biology Society, pp. 5916–5919, Aug. 2014
Andriluka, M., Schnitzspan, P., Meyer, J., Kohlbrecher, S., Petersen, K., Stryk, O.V., Roth, S., Schiele, B.: Vision based victim detection from unmanned aerial vehicles. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1740–1747 (2010)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)
Felzenszwalb, P., Huttenlocher, D.: Pictorial structures for object recognition. Int. J. Comput. Vis. 61(1), 55–79 (2005)
Bourdev, L., Malik, J.: Poselets: body part detectors trained using 3D human pose annotations. In: IEEE 12th Int. Conf. on Computer Vision, pp. 1365–1372, Sept. 2009
Zhang, Z.M., Warrell, J., Torr, P.H.S.: Proposal generation for object detection using cascaded ranking SVMs. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1497–1504, June 2011
Acknowledgments
This work was supported by the Nature Science Foundation of China (No. 61202143), the Collaborative Innovation Special Foundation of Xuchang University (No. XCUXT2014-08), and the Natural Science Foundation of Guizhou Province (No. QKHLHZi [2015] 7784).
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by S. Kopf.
Rights and permissions
About this article
Cite this article
Xia, DX., Su, SZ., Geng, LC. et al. Learning rich features from objectness estimation for human lying-pose detection. Multimedia Systems 23, 515–526 (2017). https://doi.org/10.1007/s00530-016-0518-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00530-016-0518-5