Robust human tracking using harmonious polling tracker

Wagh, Kavita; Kanade, Sudhir S.

doi:10.1007/s42452-019-1219-4

Robust human tracking using harmonious polling tracker

Research Article
Published: 17 September 2019

Volume 1, article number 1227, (2019)
Cite this article

Download PDF

SN Applied Sciences Aims and scope Submit manuscript

Robust human tracking using harmonious polling tracker

Download PDF

Kavita Wagh¹ &
Sudhir S. Kanade²

722 Accesses
1 Citation
Explore all metrics

Abstract

Human tracking is one of the challenging and essential components of an intelligent surveillance system. Variety of correlation filter-based tracking algorithms has presented their brilliance in human tracking. However, few of them have demonstrated disappointments in the presence of occlusion, background clutters, illumination variation, scale variation, fast motion, in-plane rotation, and out of the plane rotation. The paper presents, an improved correlation filter-based tracking algorithm, harmonious polling of patched correlation technique for tracking a human in a video sequence thinking about all the challenging attributes. A human to be tracked is represented by using multiple image patches, as patch-based tracking framework resolves the issues based on occlusion and global scene changes to a great extent. An innovative methodology utilized in the proposed framework is, every individual patch is treated independently thought the process and applied to the polling mechanism, which improves the performance of the system to an extraordinary degree. Kernelized correlation filter is applied to each patch individually generating the correlation score. A polling mechanism is a novel technique used in the proposed framework, which generates the confidence map from the correlation score. The maximum score achieved from the confidence map gives an exact position of the target. The tracker is applied to the number of challenging sequences and contrasted with various outperforming algorithms. The precision and success rate of the proposed tracker is improved by 15% and 19%, respectively. From the qualitative and quantitative analysis, it can be demonstrated that the proposed algorithm beats the cutting-edge execution.

BoostTrack: boosting the similarity measure and detection confidence for improved multiple object tracking

Article Open access 12 April 2024

HOTA: A Higher Order Metric for Evaluating Multi-object Tracking

Article Open access 08 October 2020

ByteTrack: Multi-object Tracking by Associating Every Detection Box

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Human tracking is one of the core problems in the field of computer vision The fundamental thought behind the tracking system is estimating the target position from the sequence of the frame where real-time depth cameras have simplified the tracking process to a great extent. Increasing the number of surveillance cameras and day by day development in it makes the tracking system extremely popular and more accessible nowadays. The number of differential methods [1,2,3] and procreative methods [4,5,6] tracked the appearance model of the target and used for estimating the position of the target in the next frame. There are different applications based on the human tracking framework, like human–computer interaction, security, telepresence, military, and health-care systems [7,8,9,10].

Various skin detection methods introduced the deep learning techniques used the AdaBoost algorithm, cascade classifier with AdaBoost algorithm. The techniques discriminate skin and non-skin pixels, making the skin detector robust and practicable [11, 12]. Ghaziasgar et al. [13] adopted the process of filling the skin holes. For the computer vision and the medical image analysis, Criminisi and Shotton [14] presented the unified and efficient model of decision forest used for scene recognition from photographs, object recognition in the images and automatic diagnosis from radiological scans with supervised or unsupervised machine learning techniques. Random forests are highly non-linear learners that are usually extremely fast during both learning and evaluation.

In this paper, the harmonious polling of patched correlation (HPPC) technique is presented, which is the improvement in the kernelized correlation filter-based tracking algorithm. There are four significant contributions presented in the proposed framework. The first contribution is an Improved Patch Based Tracking approach, which is the significant innovation used in the proposed framework. In this approach, 50 patches are extracted from the bounding box of every image. In the second contribution, each patch is processed by using the windowing mechanism and feature extraction with HOG and tracked using the kernelized correlation filter, which generates 50 correlation scores. The third contribution is a polling mechanism, which is the next novel concept used in the proposed framework. In the polling mechanism, 50 correlation scores from the KCF provided as an input, and the confidence map is drawn from all correlation score. The maximum score achieved from the confidence map is the target.

The correlation filter further trained by applying the same procedure for all the patches. The trained correlation filter is utilized continuously throughout the sequence for tracking the target. The fourth contribution is the proposed framework applied to the online object tracking benchmark OOTB-100 [15, 16] and National Laboratory of Pattern Recognition NLPR_MCT [17] dataset which are open source. Moreover, the system compared with the existing correlation filter-based tracking algorithms.

The rest of the paper is systematized as follows. Section 2 presents the previous work related to the proposed HPPC tracking framework. Section 3 includes harmonious polling of patch confidence tracker. Section 4 shows the qualitative and quantitative analysis of the HPPC tracker, which is compared with the existing correlation filter-based tracking algorithms and applied to [15,16,17].

2 Related work

Human tracking has long been a popular topic in Computer vision. A large number of trackers have been proposed and standardized the quantitative and qualitative evaluation metrics which is accelerating the pace of development in this field. Various approaches that form the basis of existing trackers can be used for tracking numerous unusual circumstances. Henriques et al. [1] introduced the improved MOSSE filter with circulant structure and the Kernel matrix, which proposed that the correlation filters can be effectively kernelized, improving the tracking performance. Bolme et al. [18] presented an adaptive training algorithm, minimum output sum of squared error filter (MOSSE) which is the vigorous and successful method utilized for tracking. After the MOSSE filters various advancements made for improving the performance of the tracking system. Danelljan et al. [16] applied color-attributes for proper representation of the input data and improved the baseline intensity-based tracker by 24% in median distance precision. The correlation filter-based tracking algorithms (CFT) [19] geared up the concept of the correlation filters and presented various algorithms for tracking the object and pedestrian [1, 20,21,22,23].

Further, the advanced CFTs increased the efficiency and robustness in the correlation filter-based tracking frameworks improving it to the next level of progression [16, 24,25,26]. The context-aware correlation filter-based tracking system [27] has built up by incorporating the CFT with the global context. Learned specific convolutional neural network (CNN) presented the trackers without pre-training, which averts the issues brought about by the offline training [28] where CNN treated as a black box. A perceptual hash (pHash) algorithm [29] is a straightforward and fast technique to update the observation model dynamically with image similarity. Chen et al. [30] worked on face tracking algorithm, which is an online feature selection mechanism. The algorithm chooses the most discriminative feature during the tracking process with the unconstrained correlation filters. A while later, Yang et al. [31] introduced an on-line feature selection mechanism to choose the most discriminative feature during the tracking to make the tracker more accurate. A boolean map representations method for visual tracking is a simple and effective Boolean map-based way of representation that exploits connectivity cues for visual tracking [32]. As of late, in 2018, Yang et al. [33, 34] presented the spatiotemporal nonlocally regularized correlation filter and parallel attentive correlation filter utilized for tracking.

Viola and Jones [35] described a machine learning approach for visual object detection for processing images rapidly and achieving high detection rates. The work distinguished the Integral Images, AdaBoost learning algorithm, and cascade classifiers method. Further, Pooya and Yazdi used a train set selection method, based on histograms generated from AdaBoost for selecting the features [36]. Moreover, Viola and Jones face detection method used a simple method to select few features in beginning cascades are proposed in [37]. Moreover, a cascaded classifier using the AdaBoost algorithm is trained in [38] with two edge detectors.

Several classification and regression methods are there, utilized for analyzing different type of data [39, 40]. The methods classify subjects: the technique of “classification trees” or “recursive partitioning” as defined by Breiman et al. [39].

The feature descriptor algorithms like SIFT, HAAR, HOG takes an image and outputs the feature which encodes the information into a series of numbers and differentiates one feature from another [41, 42]. In hatred of the vital advancement in this area, the tracking system has experienced many challenging situations like occlusion, complex motions, fast motion, illumination variation, deformation, image blur, background clutter, scale variation, rotation which debases the general execution of the framework [43, 44].

3 Harmonious polling of patch confidence tracker

In the proposed framework, shown in Fig. 1, the main idea is to process each patch from the bounding box, separately. In the processing stage, the patch boundaries extracted from the bounding box smoothed by passing it through a cosine window. HOG feature description algorithm simplifies the image by extracting the useful information about the patches. It works on the gradient and orientation information. Patches are tracked using KCF, which provides a correlation score. HOG extracts the positive and the negative training samples, which are further applied to train the correlation filters for the next frames. The process is repeated for 50 times, and 50 correlation scores are there at the output of the correlation filters, which are further applied for the polling mechanism. The polling is the next innovative mechanism used in the proposed system, where the correlation score is used to draw the confidence map. The highest point of matching in the confidence map is the exact position of the target. The patch tracking using a kernelized correlation filter followed with a polling mechanism, effectively improve the accuracy and the overall performance of the system.

In this paper, 50 patches are extracted from the bounding box, as shown in Fig. 2. Information about the target position, patch position, target size for all the patches is stored in the Context Field.

3.1 Improved patch based tracking

The improved patch-based tracking algorithm used in the proposed framework is an innovative idea which is treated separately throughout the process. If an entire bounding box is considered at once for tracking, the effect of the occlusion is for the whole bounding box, which degrades the correlation score. In case of patch-based tracking approach, when occlusion is detected on a patch, it affects only on that patch, not on the complete image. So, the patch-based tracking approach significantly reduce the effect of occlusion. In the proposed framework, patches are extracted from the bounding box, and each patch is tracked separately. Information of each patch fitted in the context field. It means, 50 patches are sampled from one weak patch, as shown in Fig. 2. Each small patch is treated separately for processing. There is an abrupt change in the patch boundaries when we crop the patch from an image. It is essential to nullify the effect of these abrupt changes to get smooth patch boundaries. In the HPPC tracker, cosine window applied to these patches, nullify the effect of these abrupt changes [19]. A feature description algorithm, histogram of oriented gradient (HOG) applied to the patches for extracting the features, provides gradient and orientation information [41].

3.1.1 Kernelized correlation filter

The patches cropped from an image further produces peaks for the target using kernelized correlation filters (KCF). The KCF used in the proposed system for tracking the human provides a correlation score for the polling mechanism. A preparatory version of this work was presented earlier [3]. The connection between ridge regression with cyclically shifted samples and classical correlation filters is well explained in [1].

Ridge regression This method uses a simple solution which is closely related to the support vector machine. The aim is to find out a function, f (z) = w^tz to minimize the squared error over the input training samples x_i and regression target y_i where λ is regularization parameter.

$$\mathop {\hbox{min} }\limits_{w} \sum\limits_{i} {\mathop {\left( {f\left( {\mathop x\nolimits_{i} } \right) - \left( {\mathop y\nolimits_{i} } \right)} \right)}\nolimits^{2} } + \mathop {\lambda \parallel w\parallel }\nolimits^{2}$$

(1)

Equation (1) is the error between the output of the training sample and given input. The difference should be as low as possible, which is the minimum output sum of squared error filter (MOSSE) [1]. The objective is to minimize the error in the regression equation, which is an objective equation.

Circulant matrix For computing a regression with the shifted sample, consider $n*1$ vector representing a patch with the target where x referred to as the base sample. The goal is to train a classifier with positive, negative, and the base samples. The cyclic property [1] indicates that the shifted signal is obtained $\left\{ {\mathop P\nolimits^{u} x} \right\}$

$$\left\{ {P^{u} x|u = 0, \ldots ,n - 1} \right\}$$

(2)

Due to the cyclic property, the first half of the overall set is shifted in the positive direction and the second half in a negative direction. A full kernel correlation function can be given by the following equation which is well explained in Henriques et al. [1],

$$\mathop k\nolimits^{{xx^{\prime } }} = \exp \left( { - \frac{1}{{\mathop \sigma \nolimits^{2} }}\left( {\mathop {\parallel X\parallel }\nolimits^{2} + \mathop {\parallel X^{\prime } \parallel }\nolimits^{2} - 2\mathop F\nolimits^{ - 1} \mathop {\hat{X}}\nolimits^{*} \odot\,\hat{X}^{\prime } } \right)} \right)$$

(3)

Now, for the next frame, the target is detected by the trained parameter and maintain the training sample. For a new sample, confidence map is,

$$y = C\left( {\mathop k\nolimits^{xz} } \right)a = \mathop F\nolimits^{ - 1} \left( {\mathop {\hat{X}}\nolimits^{xz} \odot\,\hat{a}} \right)$$

(4)

So, the position of the maximum value in y is predicted as a new targeted position. All the equations starting from (1) to (4) are from [1, 8].

3.2 Polling mechanism

The most appealing approach in the proposed system is a polling mechanism. It describes the possible position of each patch in every frame, which is said to be a confidence score. A confidence score is achieved from the KCF equation. Combining all the confidence scores, robustly gives the maximum point of matching, which is the target positions. The polling mechanism is an innovative idea, which gives an exact position of the target, improving accuracy. The polling for tracking the target is based on spatial and temporal evaluation of patches.

(1)
KCF provides 50 values of correlation scores from 50 patches.
(2)
Length of trajectory is found out according to the patch location matched.
(3)
From the length of trajectory and correlation score, the poll (weight) of each patch w is found out and calculated using Eq. (5).
(4)
The final confidence map is achieved ψ_t, by normalizing the polls.
(5)
The maximum position in the confidence map is the exact position of the target.

The polling score increases in correspondence to the existence of patches in successive frames. Contexts are provided as an input for the polling. In the process of polling, the length of trajectory is calculated from the count of patch location match. For example, in the patch location matching process, if the first four patch locations are matched, and the fifth patch location does not match then the length of trajectory is 4. This way length of trajectory is found out.

Now to calculate the weight of the patch correlation score is divided by the lobe width. Let yⁱ be the confidence map or the response map of ith patch, and l be the side lobe of the corresponding confidence map.

$$w_{t}^{i} = n_{t}^{i} \frac{{y_{t}^{i} }}{{l_{t}^{i} }}$$

(5)

The ith patch appears in consecutive n frames and nⁱ which is defined as the trajectory of an ith patch. The polling score or weight of the patch is expressed as (5). The final confidence map to represent the target using a set of N patches is given by,

$$\psi_{t} = \sum\limits_{k = 1}^{N} {w_{t}^{i} }$$

(6)

The process is repeated for each patch. For all the patches the polling score is combined and normalized. Finally, a graph is plotted to find out a maximum value which can be recognized as the final detected target for a frame. The process is applied for every patch of the frame. From the performance of the system it is observed that the polling mechanism has profoundly improved the accuracy of the tracking system.

4 Quantitative and qualitative analysis

The extensive quantitative and qualitative evaluations are presented in the framework, shows the precision and success rate of the proposed system over the CFT’s few currently available source codes KCF [1], STC [23, 25, 45], CN [16], MUSTer [24]. The proposed framework is contrasted with the few state-of-the-art human tracking algorithms like Particle Filter (PF), Kalman filter (KF), Camshift (CS) algorithm, Mean shift (MS) algorithm [46, 47]. The system is applied over more than 100 test sequences from the online object tracking benchmark OOTB [15, 16] and National Laboratory of Pattern Recognition NLPR_MCT dataset [17]. The protocols used for evaluation in the proposed system are the area under the precision curve (APC), the area under the success curve (ASC).

4.1 Quantitative evaluation

The plots in Figs. 3, 4 and 5 shows the APC and ASC of the few sequences from [15,16,17]. Figure 3 shows the Girl sequence where the HPPC tracker successfully tracks the target even in case of scale variation, occlusion, in-plane rotation, out-of-plane rotation. Figure 4 shows the sequence Surfer which suffers from problems like scale variation, fast motion, in-plane rotation, out-plane rotation, low resolution. Figure 5 shows the APC and ASC plot of the NLPR_MCT dataset. The HPPC tracker successfully tracks in all these unfavorable conditions. According to the quantitative results presented in the APC and ASC, it is observed that the performance of the proposed system has been improved exceptionally.

Tables 1 and 2 shows quantitative evaluations of [15,16,17] by comparing HPPC tracker with the CFT’s few currently available source codes. Tables 3 and 4 shows quantitative evaluations of [15,16,17] by comparing HPPC tracker with state-of-the-art human tracking algorithms. Experimentally validating the results, it is being proved that the proposed technique outperforms the state-of-the-art performance.

Table 1 OOTB benchmark for the CFT’s few currently available source codes

Full size table

Table 2 NLPR benchmark for the CFT’s few currently available source codes

Full size table

Table 3 OOTB benchmark for state-of-the-art human tracking algorithms

Full size table

Table 4 NLPR benchmark for state-of-the-art human tracking algorithms

Full size table

4.2 Qualitative evaluation

The tracker is applied to the online object tracking benchmark OOTB-100 [15, 16], National Laboratory of Pattern Recognition NLPR_MCT dataset [17]. For the evaluation, all the challenging attributes have been selected like image blur, occlusion, change in illumination, in-plane rotation, out of plane rotation, deformation, which makes the database extremely challenging.

Despite of the critical scenario, the proposed framework works appropriately even in crowdy areas. In the sequences shown below few trackers which gives its best performance in almost every sequence are included in top-performing trackers, i.e., KCF, CFT, CN, MUSTer is compared with the proposed HPPC tracker and human tracking algorithms. From the evaluation in Fig. 6, it is observed that in the sequence of Jump, the tracker is leaving the tracking sequence in many frames as KCF has the limitation of a fixed window.

The same issue is observed in the sequences like Bolt-2, Walking, Basketball. The proposed system shows a much better result in such a case.

In case of partial occlusions like Girl, KCF is leaving the tracking sequence for some time, but the proposed algorithm still working better in such a scenario from starting to the last frame. It works excellent even the face is 360° rotating. However, in the case, when there is a partial occlusion, it is continuing the tracking properly.

From the sequences of the NLPR_MCT dataset HPPC tracker is giving an outstanding performance, as shown in Fig. 7. Figure 8 shows the comparison of the proposed system with correlation filters based algorithms and few state-of-the-art human tracking algorithms. So, it is being observed that HPPC tracker gives its best performance in almost every sequence.

5 Conclusion

The HPPC tracker includes an innovative technique where the bounding box is divided into 50 patches, and each patch is tracked separately using the kernelized correlation filter. A novel methodology polling, used in the proposed framework where, the maximum score achieved from the confidence map gives the exact position of the target. The results have been validated based on qualitative and quantitative evaluations. The tracker is applied to the online object tracking benchmark OOTB-100 [15, 16] tracking dataset and National Laboratory of Pattern Recognition NLPR benchmarks [17]. The algorithm is also compared with the correlation filter-based tracking algorithms MUSTer, KCF, STC, CN and human tracking algorithms Particle Filter, Kalman Filter, Camshift, Mean shift. From the experiment it has been proved that the HPPC tracker successfully track the human in all the challenging situations like occlusion, background clutters, illumination variation, scale variation, fast motion, in-plane rotation, out of the plane rotation. The precision value is improved by 15%, and the success rate is improved by 19% as compared to the existing techniques.

The limitation of the HPPC tracker is, the HPPC tracker is taking more run time as compared to the existing trackers since we are taking a higher number of patches to improve accuracy. However, it is justified by the high value of APC and ASC. One more limitation of the framework is, in the case of long-time occlusion, the performance of tracker degrades.

Henceforth, still human tracking is a challenging topic and there is a scope for further improvements.

References

Henriques JF, Caseiro R, Martins P, Batista J (2015) High-speed tracking with kernelized correlation filters. IEEE Trans Pattern Anal Mach Intell 37(3):583–596
Article Google Scholar
Hare S, Golodetz S, Saffari A, Vineet V, Cheng M-M, Hicks SL, Torr PHS (2011) Struck: structured output tracking with kernels. IEEE Trans Pattern Anal Mach Intell 38(10):2096–2109
Article Google Scholar
Henriques JF, Caseiro R, Martins P, Batista J (2012) Exploiting the circulant structure of tracking-by-detection with kernels. In: European conference on computer vision. Springer, Berlin, pp 702–715
Chapter Google Scholar
He S, Yang Q, Lau R, Wang J, Yang M-H (2013) Visual tracking via locality sensitive histograms. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 2427–2434
Sevilla-Lara L, Learned-Miller EG (2012) Distribution fields for tracking. In: Computer vision and pattern recognition IEEE conference on (CVPR), pp 19101917
Boiman O, Irani M (2007) Detecting irregularities in images and in the video. Int J Comput Vis 74(1):17–31
Article Google Scholar
Agarwal S, Awan A, Roth D (2004) Learning to detect objects in images via sparse, part-based representation. IEEE Trans Pattern Anal Mach Intell 20(11):14751490
Google Scholar
Wren CR, Azarbayejani A, Darrell T, Pentland P (1997) Real-time tracking of the human body. In: IEEE conference on in proceedings of IEEE transactions on pattern analysis and machine intelligence, vol 19, No. 7. pp 780–785
Article Google Scholar
Raudonis V, Simutis R, Narvydas G (2009) Discrete eye-tracking for medical applications. In: 2nd international symposium on applied sciences in biomedical and communication technologies, ISABEL
Yas QM et al (2018) A systematic review on smartphone skin cancer apps: coherent taxonomy, motivations, open challenges and recommendations, and new research direction. J Circuits Syst Comput 27(05):1830003
Article Google Scholar
Chyad MA, Alsattar HA, Zaidan BB, Zaidan AA, Al Shafeey GA (2019) A review of skin detector based deep learning techniques: coherent taxonomy, open challenges, motivations, recommendations and statistical analysis, future direction. IEEE Access, 106536–106575
Article Google Scholar
Danelljan M, Khan FS, Felsberg M, Weijer JVD (2014) Adaptive color attributes for real-time visual tracking. In: IEEE conference on computer vision and pattern recognition (CVPR), Columbus, Ohio, USA, pp 1090–1097
Ghaziasgar M, Connan J, Bagula AB (2016) Enhanced adaptive skin detection with contextual tracking feedback. In: 2016 pattern recognition association of South Africa and robotics and mechatronics international conference (PRASA-RobMech). IEEE
Criminisi A, Shotton J (2013) Decision forests for computer vision and medical image analysis. Springer, Heidelberg
Book Google Scholar
cvlab.hanyang.ac.kr/tracker benchmark/datasets.html
Wu Y, Lim J, Yang M-H, Torr PH (2013) Online object tracking: a benchmark. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 2411–2418
http://nlpr-mct.oss-us-west-1.aliyuncs.com/NLPR. MCT Dataset
Bolme DS, Beveridge JR, Draper BA, Lui YM (2010) Visual object tracking using adaptive correlation filters. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 2544–2550
Chen Z, Hong Z, Tao D (2018) An experimental survey on correlation filter-based tracking. Preprint arXiv:1509.05520
Li Y, Zhu J (2014) A scale adaptive kernel correlation filter tracker with feature integration. In: Computer vision-ECCV workshops. Springer, Berlin, pp 254265
Danelljan M, Hager G, Khan FS, Feldberg M (2014) Accurate scale estimation for robust visual tracking. In: British machine vision conference, Nottingham
Jeong KH, Pokharel PP, Xu J-W, Han S, Principe JC (2006) Kernel-based synthetic discriminant function for object recognition. In: IEEE international conference on acoustics, speech and signal processing, ICASSP, vol 5. pp 55
Wen L, Cai Z, Lei Z, Yi D, Li SZ, Yang M-H (2014) Robust online learned spatio-temporal context model for visual tracking. IEEE Trans Image Process 23(2):785–796
Article MathSciNet Google Scholar
Hong Z, Chen Z, Wang C, Mei X, Prokhorov D, Tao D (2015) Multistore tracker (muster): a cognitive psychology inspired approach to object tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 749758
Jeon B, Landgrebe DA (1992) Classification with spatio-temporal interpixel class dependency contexts. In: IEEE transactions on geoscience and remote sensing, pp 663–672
Article Google Scholar
Valmadre J, Bertinetto L, Henriques JF, Vedaldi A, Torr PH (2017) End-to-end representation learning for correlation filter-based tracking. In: IEEE conference on in computer vision and pattern recognition (CVPR), pp 5000–5008
Mueller M, Smith N, Ghanem B (2017) Context-aware correlation filter tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1396–1404
Zhang K, Liu Q, Wu Y, Yang MH (2016) Robust visual tracking via convolutional networks without training. IEEE Trans Image Process 25:17791792
MathSciNet MATH Google Scholar
Song H, Zheng Y, Zhang K (2016) Robust visual tracking via self-similarity learning. Electron Lett 53:2022
Google Scholar
Chen W, Zhang K, Liu Q (2016) Robust visual tracking via patch-based kernel correlation filters with adaptive multiple feature ensemble. Neurocomputing 214:607617
Google Scholar
Yang J, Zhang K, Liu Q (2016) Robust object tracking by online fisher discrimination boosting feature selection. Comput Vis Image Underst 153:100108
Google Scholar
Yang J, Zhang K, Liu Q (2016) arXiv
Yang J, Zhang K, Liu Q (2018) Visual tracking using spatio-temporally nonlocally regularized correlation filter. Pattern Recogn 83:185–195
Article Google Scholar
Zhang K, Fan J, Liu Q, Yang J, Lian W (2019) Parallel attentive correlation tracking. IEEE Trans Image Process 28(1):479–491
Article MathSciNet Google Scholar
Viola PA, Jones MJ (2001) Rapid object detection using a boosted cascade of simple features. In: CVPR, No. 1, pp 511–518
Tavallali P, Yazdi M (2015) Robust skin detector based on AdaBoost and statistical luminance features. In: 2015 International congress on technology, communication, and knowledge (ICTCK). IEEE
Tavallali P, Mehran Y, Khosravi MR (2017) An efficient training procedure for Viola–Jones face detector. In: 2017 International conference on computational science and computational intelligence (CSCI). IEEE
Tavallali P, Yazdi M, Khosravi MR (2019) Robust cascaded skin detector based on AdaBoost. Multimed Tools Appl 78(2):2599–2620
Article Google Scholar
Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. Wadsworth, Belmont
MATH Google Scholar
Denison DG et al (2002) Bayesian methods for nonlinear classification and regression, vol 386. Wiley, New York
MATH Google Scholar
Dalal N, Triggs B, Schmid C (2006) Human detection using oriented histograms of flow and appearance. In: European conference on computer vision, pp 428–441
Chapter Google Scholar
Nair BM et al (2011) Multi-pose face recognition and tracking system. Procedia Comput Sci 6:381–386
Article Google Scholar
Hager GD, Dewan M, Stewart CV (2004) Multiple kernel tracking with SSD. In: IEEE computer society conference on computer vision and pattern recognition, vol 1
Zhang T, Xu C, Yang M-H (2017) Multi-task correlation particle filter for robust object tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, vol 1, No. 2, p 3
Zhang K, Zhang L, Liu Q, Zhang D, Yang M-H (2014) Fast visual tracking via dense spatio-temporal context learning. In: European conference on computer vision. Springer, Cham
Chapter Google Scholar
Yussiff A-L, Yong S-P, Baharudin BB (2015) Human tracking in video surveillance using particle filter. In: International symposium on mathematical sciences and computing research (iSMSC). IEEE, pp 83–88
Hourali F, Sedaaghi M (2015) Robust and real-time face tracking using particle filter based on probabilistic face model. Int J Res Comput Appl Robot 3(2):71–78
Google Scholar

Download references

Author information

Authors and Affiliations

Babasaheb Ambedkar Marathwada University, Aurangabad, 431004, India
Kavita Wagh
Government Polytechnic, Ambad., Jalna, 431204, India
Sudhir S. Kanade

Authors

Kavita Wagh
View author publications
You can also search for this author in PubMed Google Scholar
Sudhir S. Kanade
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kavita Wagh.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wagh, K., Kanade, S.S. Robust human tracking using harmonious polling tracker. SN Appl. Sci. 1, 1227 (2019). https://doi.org/10.1007/s42452-019-1219-4

Download citation

Received: 02 May 2019
Accepted: 03 September 2019
Published: 17 September 2019
DOI: https://doi.org/10.1007/s42452-019-1219-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Robust human tracking using harmonious polling tracker

Abstract

Similar content being viewed by others

BoostTrack: boosting the similarity measure and detection confidence for improved multiple object tracking

HOTA: A Higher Order Metric for Evaluating Multi-object Tracking

ByteTrack: Multi-object Tracking by Associating Every Detection Box

1 Introduction

2 Related work