HCI 2014: Human-Computer Interaction. Advanced Interaction Modalities and Techniques pp 325-336 | Cite as
View-Invariant Human Detection from RGB-D Data of Kinect Using Continuous Hidden Markov Model
Abstract
In this paper authors have presented a method to detect human from a Kinect captured Gray-Depth (G-D) using Continuous Hidden Markov models (C-HMMs). In our proposed approach, we initially generate multiple gray scale images from a single gray scale image/ video frame based on their depth connectivity. Thus, we initially segment the G image using depth information and then relevant components were extracted. These components were further filtered out and features were extracted from the candidate components only. Here a robust feature named Local gradients histogram(LGH) is used to detect human from G-D video. We have evaluated our system against the data set published by LIRIS in ICPR 2012 and on our own data set captured in our lab. We have observed that our proposed method can detect human from this data-set with a 94.25% accuracy.
Keywords
Gaussian Mixture Model Video Frame Recognition Accuracy Depth Information Human DetectionPreview
Unable to display preview. Download preview PDF.
References
- 1.Henry, P., Krainin, M., Herbst, E., Ren, X., Fox, D.: RGBD Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments. In: Proc. ISER 2010 (2010)Google Scholar
- 2.Trinh, H., Fan, Q., Pankanti, S., et al.: Detecting Human Activities in Retail Surveillance Using Hierarchical Finite State Machine. In: Proc. ICASSP 2011, pp. 1337–1340 (2011)Google Scholar
- 3.Trinh, H., Fan, Q., Gabbur, P., Pankanti, S.: Hand tracking by binary quadratic programming and its application to retail activity recognition. In: Proc. CVPR 2012, pp. 1902–1909 (2012)Google Scholar
- 4.Gabbur, P., Pankanti, S., Fan, Q., Trinh, H.: A pattern discovery approach to retail fraud detection. In: Proc. KDD 2011, pp. 307–315 (2011)Google Scholar
- 5.Sinha, A., Chattopadhyay, T., Mallik, A.: Segmentation of Kinect Captured Images using Grid Based 3D Connected Component Labeling. In: Proc. VISAPP 2013, pp. 327–332 (2013)Google Scholar
- 6.Wolf, C., Mille, J., Lombardi, L.E., Celiktutan, O., Jiu, M., Baccouche, M., Dellandrea, E., Bichot, C.-E., Garcia, C., Sankur, B.: The LIRIS Human activities dataset and the ICPR 2012 human activities recognition and localization competition, Technical Report RR-LIRIS-2012-004. In: Proc. ICPR 2012 (2012)Google Scholar
- 7.Salembier, P., Serra, J.: Flat Zone Filtering, Connected Operator, and Filters by Reconstruction. Proc. IEEE Transactions on Image Processing 1995, 1153–1160 (1995)Google Scholar
- 8.Jin, R., Hauptmann, A.G.: Learning to Identify Video Shots With People Based on Face Detection. In: Proc. ICME 2003, pp. 6–9 (2003)Google Scholar
- 9.Low&, B.K., Hjelmas, E.: Face Detection: A Survey, Computer Vision and Image Understanding 2001 (2001)Google Scholar
- 10.Khan, M.U.G., Saeed, A.: Human Detecion in Videos. Journal of Theoretical and Applied Information Technology (2009)Google Scholar
- 11.Zhu, Q., Yeh, M., Cheng, K., Avidan, S.: Fast Human Detection Using a Cascade of Histograms of Oriented Gradients. In: Proc. CVPR 2006, pp. 1491–1498 (2006)Google Scholar
- 12.Xu, R., Zhang, B., Ye, Q., Jiao, J.: Cascaded L1-norm Minimization Learning (CLML) classifier for human detection. In: Proc. CVPR 2010, pp. 89–96 (2010)Google Scholar
- 13.Schwartz, W.R., Kembhavi, A., Harwood, D., Davis, L.S.: Human detection using partial least squares analysis. In: Proc. ICCV 2009, pp. 24–31 (2009)Google Scholar
- 14.Jain, H.P., Subramanian, A., Das, S., Mittal, A.: Real-time upper-body human pose estimation using a depth camera. In: Gagalowicz, A., Philips, W. (eds.) MIRAGE 2011. LNCS, vol. 6930, pp. 227–238. Springer, Heidelberg (2011)CrossRefGoogle Scholar
- 15.Xia, L., Chen, C., Aggarwal, J.K.: Human Detection Using Depth Information by Kinect. In: Proc. CVPRW 2011, pp. 15–22 (2011)Google Scholar
- 16.Micilotta, A., Ong, E., Bowden, R.: Detection and tracking of humans by probabilistic body part assembly. In: Proc. British Machine Vision Conference 2005, pp. 429–438 (2005)Google Scholar
- 17.Mohan, A., Papageorgiou, C., Poggio, T.: Example-based object detection in images by components. Proc. IEEE Transaction on Pattern Analysis and Machine Intelligence 2001, 349–361 (2001)Google Scholar
- 18.Alon, J., Athitsos, V., Yuan, Q., Sclaroff, S.: A Unified Framework for Gesture Recognition and Spatiotemporal Gesture Segmentation. Proc. IEEE Transaction on Pattern Analysis and Machine Intelligence 2009, 1685–1699 (2009)Google Scholar
- 19.Dalal, N., Triggs, B., Schmid, C.: Human Detection Using Oriented Histograms of Flow and Appearance. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3952, pp. 428–441. Springer, Heidelberg (2006)CrossRefGoogle Scholar
- 20.Young, S.J., Jansen, J., Odell, J.J., Ollason, D., Woodland, P.C.: The HTK Hidden Markov Model Toolkit Book. Entropic Cambridge Research Laboratory (1995)Google Scholar
- 21.Chakraborty, B., Rudovic, O.N., Gonzlez, J.: View-invariant human-body detection with extension to human action recognition using component-wise HMM of body parts. In: Proc. FG 2008, pp. 1–6 (2008)Google Scholar
- 22.Aggarwal, J.K., Ryoo, M.S.: Human activity analysis: A review. ACM Computing Surveys (CSUR) 2011 43(3) (2011)Google Scholar
- 23.Chattopadhyay, T., Roy, S.: Human Localization at Home Using Kinect. In: Proc. HomeSys, UbiComp (Adjunct Publication) 2013, pp. 821–828 (2013)Google Scholar
- 24.Jos, A., Serrano, R., Perronnin, F.: Handwritten word-spotting using hidden Markov models and universal vocabularies. In: Proc. Pattern Recognition 2009, pp. 2106–2116 (2009)Google Scholar