Abstract
The traditional human tracking systems are often prone to problems which can be deduced either from the subject, camera or background movements which includes major changes in posture, appearance, clothing and lighting of the background. The work introduced here proposes a system for extracting and tracking objects from a video sequence by initializing the process of feature extraction of the selected area of the human. The need for cascade extraction by using Haar-like features, is to basically decrease the use of crude or raw pixel values and then make classification easier. The major issue here is the problem in extracting the required finite set of input for identifying the characteristics necessary for domain encoding rule In the proposed work the major aim is to address the tracking procedure by facilitating the exclusive histogram of oriented gradient (HOG) methodology, using which only the human’s unit element is embraced in the usual tracking system, making it problematic for robust monitoring. The features of Haar-like and HOG methods are combined to propose a tracking system which uses the Haar characteristics for the object’s structure and the HOG features for the edge. A set of mixed features is developed with these two features. Boosting Online’s selection feature is used to select important features and update these online features to understand the optimal choice using cascading SVM classifier.
Similar content being viewed by others
Change history
05 December 2022
This article has been retracted. Please see the Retraction Notice for more detail: https://doi.org/10.1007/s10586-022-03897-5
References
Nagabhushana, S.: Introduction in Computer Vision and Image Processing, p. 3. New Age International (P) Ltd. Publishers, New Delhi (2005)
Lovell, N., Estivill-Castro, V.: Color Classification and Object Recognition for robotic Soccer Under Variable Illumination. Griffith University, Queensland
Jones, V.: Rapid object detection using a boosted cascade of simple features. In: Computer Vision and Pattern Recognition (CVPR) (2001)
T. M. Inc.: Train a cascade object detector. http://www.mathworks.se/help/vision/ug/train-a-cascadeobject-detector.html#btugex8. Accessed July 2017
Zhang, G., Liu, J., Li, H., Chen, Y.Q., Davis, L.S.: Joint human detection and head pose estimation via multi-stream networks for RGB-D’Videos. IEEE Signal Process. Lett. 24(11), 1666–1670 (2017)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. IEEE Computer Vision and Pattern Recognition (CVPR) 1, 886–893 (2005)
Benenson, R., Mathias, M., Timofte, R., Van Gool, L.: Pedestrian detection at 100 frames per second. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 2903–2910 (2012)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D.: Cascade object detection with deformable part models. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 2241–2248 (2010)
Doll’ar, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. In: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 36, no. 8, pp. 1532–1545 (2014)
Xia, L., Chen, C.-C., Aggarwal, J.K.: Human detection using depth information by kinect. In: IEEE Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 15–22 (2011)
Spinello, L., Arras, K. O.: People detection in RGB-D data. In: IEEE Intelligent Robots and Systems (IROS), pp. 3838–3843 (2011)
Choi, W., Pantofaru, C., Savarese, S.: A general framework for tracking multiple people from a moving camera. In: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 35, no. 7, pp. 1577–1591 (2013)
Jafari, O.H., Mitzel, D., Leibe, B.: Real-time RGB-D based people detection and tracking for mobile robots and head-worn cameras. In: IEEE International Conference on Robotics and Automation (ICRA), pp. 5636–5643 (2014)
Wang, L., Qiao, Y., Tang, X.: Action recognition with trajectorypooled deep-convolutional descriptors. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 4305–4314 (2015)
Du, Y., Wang, W., Wang, L.: Hierarchical recurrent neural network for skeleton based action recognition. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 1110–1118 (2015)
Liu, J., Shahroudy, A., Xu, D., Wang, G.: Spatio-temporal lstm with trust gates for 3d human action recognition. In: European Conference on Computer Vision (ECCV), pp. 816–833. Springer, New York (2016)
Ke, Q., Bennamoun, M., An, S., Boussaid, F., Sohel, F.: Human interaction prediction using deep temporal features. In: European Conference on Computer Vision (ECCV), pp. 403–414. Springer, New York (2016)
Liu, J., Wang, G.: Global context-aware attention LSTM networks for 3d action recognition. In: IEEE Computer Vision and Pattern Recognition (CVPR) (2017)
Toshev, A., Szegedy, C.: Deeppose: human pose estimation via deep neural networks. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 1653–1660 (2014)
Pfister, T., Charles, J., Zisserman, A.: Flowing convnets for human pose estimation in videos. In: IEEE International Conference on Computer Vision (ICCV), pp. 1913–1921 (2015)
Ouyang, W., Wang, X.: Joint deep learning for pedestrian detection. In: IEEE International Conference on Computer Vision (ICCV), pp. 2056–2063 (2013)
Luo, P., Tian, Y., Wang, X., Tang, X.: Switchable deep network for pedestrian detection. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 899–906 (2014)
Angelova, A., Krizhevsky, A., Vanhoucke, V.: Pedestrian detection with a large-field-of-view deep network. In: IEEE International Conference on Robotics and Automation (ICRA), pp. 704–711 (2015)
Zhao, J., Zhang, G., Tian, L., Chen, Y. Q.: Real-time human detection with depth camera via a physical radius-depth detector and a cnn descriptor. In: IEEE International Conference on Multimedia and Expo (ICME) (2017)
Ng, J.Y.-H., Hausknecht, M., Vijayanarasimhan, S., Vinyals, O., Monga, R., Toderici, G.: Beyond short snippets: deep networks for video classification. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 4694–4702 (2015)
Wu, Z., Wang, X., Jiang, Y.-G., Ye, H., Xue, X.: Modeling spatialtemporal clues in a hybrid deep learning framework for video classification. In: International Conference on Multimedia, pp. 461–470. ACM (2015)
Wang, L., Xiong, Y., Wang, Z., Qiao, Y., Lin, D., Tang, X., Van Gool, L.: Temporal segment networks: towards good practices for deep action recognition. In: European Conference on Computer Vision (ECCV), pp. 20-36. Springer, New York (2016)
Ke, Q., An, S., Bennamoun, M., Sohel, F., Boussaid, F.: Skeletonnet: mining deep part features for 3-d action recognition. In: IEEE Signal Processing Letters, vol. 24, no. 6, pp. 731–735 (2017)
Tompson, J.J., Jain, A., LeCun, Y., Bregler, C.: Joint training of a convolutional network and a graphical model for human pose estimation. In: Advances in Neural Information Processing Systems (NIPS), pp. 1799–1807 (2014)
Jain, A., Tompson, J., LeCun, Y., Bregler, C.: Modeep: a deep learning framework using motion features for human pose estimation. In: Asian Conference on Computer Vision (ACCV), pp. 302–315. Springer, New York (2014)
Ch’eron, G., Laptev, I., Schmid, C.: P-cnn: Pose-based cnn features for action recognition. In: IEEE International Conference on Computer Vision (ICCV), pp. 3218–3226 (2015)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 580–587 (2014)
Soo, S.: Object Detection Using Haar-Cascade Classifier. Institute of Computer Science, University of Tartu, Estonia (2014)
Author information
Authors and Affiliations
Corresponding author
Additional information
This article has been retracted. Please see the retraction notice for more detail: https://doi.org/10.1007/s10586-022-03897-5
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Prasanna, D., Prabhakar, M. RETRACTED ARTICLE: An effiecient human tracking system using Haar-like and hog feature extraction. Cluster Comput 22 (Suppl 2), 2993–3000 (2019). https://doi.org/10.1007/s10586-018-1747-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-018-1747-5