Localization of region of interest in surveillance scene

Ahmed, Sk. Arif; Dogra, Debi Prosad; Kar, Samarjit; Kim, Byung-Gyu; Hill, Paul; Bhaskar, Harish

doi:10.1007/s11042-016-3762-y

Localization of region of interest in surveillance scene

Published: 20 July 2016

Volume 76, pages 13651–13680, (2017)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Sk. Arif Ahmed¹,
Debi Prosad Dogra ORCID: orcid.org/0000-0002-3904-732X²,
Samarjit Kar³,
Byung-Gyu Kim⁴,
Paul Hill⁵ &
…
Harish Bhaskar⁶

404 Accesses
7 Citations
Explore all metrics

Abstract

In this paper, we present a method for autonomously detecting and extracting region(s)-of-interest (ROI) from surveillance videos using trajectory-based analysis. Our approach, localizes ROI in a stochastic manner using correlated probability density functions that model motion dynamics of multiple moving targets. The motion dynamics model is built by analyzing trajectories of multiple moving targets and associating importance to regions in the scene. The importance of each region is estimated as a function of the total time spent by multiple targets, their instantaneous velocity and direction of movement whilst passing through that region. We systematically validate our model and benchmark our technique against competing baselines through extensive experimentation using public datasets such as CAVIAR, ViSOR, and CUHK as well as a scenario-specific in-house surveillance dataset. Results obtained have demonstrated the superiority of the proposed technique against a few popular existing state-of-the-art techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Notes

References

Bharath R, Nicholas L, Cheng X (2013) Scalable scene understanding using saliency-guided object localization. In: 10th IEEE International Conference on Control and Automation, pp 1503–1508
Bao X, Javanbakhti S, Zinger S, Wijnhoven R, de With P (2014) Context-based object-of-interest detection for a generic traffic surveillance analysis system. In: IEEE International Conference on Advanced Video and Signal Based Surveillance, pp 1087–1090
Bharath R, Nicholas L, Cheng X (2013) Scalable scene understanding using saliency-guided object localization. In: Proceedings of the IEEE International Conference on Control and Automation, pp 1503–1508
Brun L, Saggese A, Vento M (2014) Dynamic scene understanding for behavior analysis based on string kernels. IEEE Trans Circuits Syst Video Technol 24 (10):1669–1681
Article Google Scholar
Colque R, Jnior C, Schwartz W (2015) Histograms of optical flow orientation and magnitude to detect anomalous events in videos. In: SIBGRAPI conference on Graphics, Patterns and Images, pp 126–133
Dinh T, Vo N, Medioni G (2011) Context tracker Exploring supporters and distracters in unconstrained environments. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 1177–1184
Dogra D, Ahmed A, Bhaskar H (2015) Interest area localization using trajectory analysis in surveillance scenes. In: Proceedings of the 10th International Conference on Computer Vision Theory and Applications, pp 478–485
Dogra D, Reddy R, Subramanyam K, Ahmed A, Bhaskar H (2015) Scene representation and anomalous activity detection using weighted region association graph. In: Proceedings of the 10th International Conference on Computer Vision Theory and Applications, pp 31–38
Dogra D, Ahmed A, Bhaskar H (2015) Smart video summarization using mealy machine-based trajectory modelling for surveillance applications. Multimedia Tools and Applications, 1–29. doi:10.1007/s11042-015-2576-7
Fisher R, Santos-Victor J, Crowley J (2001) Caviar: Context aware vision using image-based active recognition. http://homepages.inf.ed.ac.uk/rbf/CAVIAR/. Accessed: July 2014
Mathworks Inc (2014) Abandoned object detection http://www.mathworks.in/help/vision/examples/abandoned-object-detection.html. Accessed: July 2014
Jiang H, Wang J, Yuan Z, Wu Y, Zheng N, Li S (2013) Salient object detection: a discriminative regional feature integration approach. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 2083–2090
Javanbakhti S, Zinger S, de With P (2014) Context-based region labeling for event detection in surveillance video. In: International Conference on Information Science, Electronics and Electrical Engineering, vol 1, pp 94–98
Kapsalas P, Rapantzikos K, Sofou A, Avrithis Y (2008) Regions of interest for accurate object detection. In: Proceedings of the International Workshop on Content-Based Multimedia Indexing, pp 147–154
Keum J, Lee H, Hagiwara M (2012) Mean shift-based sift keypoint filtering for region-of-interest determination. In: Proceedings of the International Conference on Soft Computing and Intelligent Systems and International Symposium on Advanced Intelligent Systems, pp 266–271
Kim G, Torralba A (2009) Unsupervised detection of regions of interest using iterative link analysis, pp 961–969
Lai Y, Yang C (2015) Video object retrieval by trajectory and appearance. IEEE Trans Circuits Syst Video Technol 25(6):1026–1037
Article Google Scholar
Lee W, Huang T, Yeh S, Chen H (2011) Learning-based prediction of visual attention for video signals. IEEE Trans Image Process 20(11):3028–3038
Article MathSciNet Google Scholar
Li J, Tian Y, Huang T, Gao W (2010) Probabilistic multi-task learning for visual saliency estimation in video. Int J Comput Vis 90(2):150–165
Article Google Scholar
Lin W, Zhang Y, Lu J, Zhou B, Wang J, Zhou Y (2015) Summarizing surveillance videos with local-patch-learning-based abnorMality detection, blob sequence optimization, and type-based synopsis. Neurocomputing 155(0):84–98
Article Google Scholar
Liu T, Zheng N, Ding W, Yuan Z (2008) Video attention: Learning to detect a salient object sequence. In: 19th International Conference on Pattern Recognition, ICPR 2008, pp 1–4
Loy C, Xiang T, Gong S (2009) Multi-camera activity correlation analysis. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 1988–1995
Manikandan M, Soman K (2012) A novel method for detecting r-peaks in electrocardiogram (ecg) signal. Biomed Signal Process Control 7(2):118–128
Article Google Scholar
Margolin R, Tal A, Zelnik-Manor L (2013) What makes a patch distinct?. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 1139–1146
Mitri S, Frintrop S, Pervolz K, Surmann H, Nuchter A (2005) Robust object detection at regions of interest with an application in ball recognition. In: Proceedings of the IEEE International Conference on Robotics and Automation, pp 125–130
Morris B, Trivedi M (2008) Learning and classification of trajectories in dynamic scenes: a general framework for live video analysis. In: Proceedings of the IEEE Fifth International Conference on Advanced Video and Signal Based Surveillance, pp 154–161
Osberger W, Rohaly A (2001) Automatic detection of regions of interest in complex video sequences. In: Proceedings of the Photonics West-Electronic Imaging, pp 361–372
Piciarelli C, Micheloni C, Foresti G (2008) Trajectory-based anomalous event detection. IEEE Trans Circuits Syst Video Technol 18(11):1544–1554
Article Google Scholar
Rahtu E, Kannala J, Salo M, Heikkilä J. (2010) Segmenting salient objects from images and videos. In: Proceedings of the European Conference on Computer Vision. Springer, pp 366–379
Rokunuzzaman M, Sekiyama K, Fukuda T (2010) Automatic roi detection and evaluation in video sequences based on human interest. J Rob Mechatronics 22 (1):65–75
Article Google Scholar
Saleemi I, Shafique K, Shah M (2009) Probabilistic modeling of scene dynamics for applications in visual surveillance. IEEE Trans Pattern Anal Mach Intell 31(8):1472–1485
Article Google Scholar
Shou N, Peng H, Wang H, Meng L, Du K (2012) An rois based pedestrian detection system for single images. In: Proceedings of the International Congress on Image and Signal Processing, pp 1205– 1208
Suzuki N, Hirasawa K, Tanaka K, Kobayashi Y, Sato Y, Fujino Y (2007) Learning motion patterns and anomaly detection by human trajectory analysis. In: Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, pp 498–503
Uddin M, Ravishankar C, Tsotras V (2011) Finding regions of interest from trajectory data. In: IEEE International Conference on Mobile Data Management, vol 1, pp 39–48
Vezzani R, Cucchiara R (2010) Video surveillance online repository (visor): an integrated framework. Multimedia Tools Appl 50(2):359–380
Article Google Scholar
Wang W, Lin W, Chen Y, Wu J, Wang J, Sheng B (2014) Finding coherent motions and semantic regions in crowd scenes: a diffusion and clustering approach. In: Proceedings of the European Conference on Computer Vision, volume 8689 of Lecture Notes in Computer Science, pp 756– 771
Wu T, Vu C, Cheng Q, Chandler D (2009) Region-of-importance detection based on fusion of audio and video. In: Forty-Third Asilomar Conference on Signals, Systems and Computers, pp 1673– 1677
Wang J, Wang Y, Zhang Z (2011) Interesting region detection in aerial video using Bayesian topic models. In: First Asian Conference on Pattern Recognition, pp 706–710
Wang X, Tieu K, Grimson E (2006) Learning semantic scene models by trajectory analysis. In: Proceedings of the European Conference on Computer Vision. Springer, pp 110–123
Xiang M, Bashir F, Khokhar A, Schonfeld D (2009) Event analysis based on multiple interactive motion trajectories. IEEE Trans Circuits Syst Video Technol 19 (3):397–406
Article Google Scholar
Xu D, Wu X, Song D, Li N, Chen Y (2013) Hierarchical activity discovery within spatio-temporal context for video anomaly detection. In: Proceedings of the IEEE International Conference on Image Processing, pp 3597–3601
Xuan M, Monga V, Bala R, Zhigang F (2014) Adaptive sparse representations for video anomaly detection. IEEE Trans Circuits Syst Video Technol 24(4):631–645
Article Google Scholar
Yan Y, Ricci E, Subramanian R, Liu G, Lanz O, Sebe N (2015) A Multi-task Learning Framework for Head Pose Estimation under Target Motion. IEEE Trans Pattern Anal Mach Intell PP(99):1– 1
Google Scholar
Yan Y, Ricci E, Liu G, Sebe N (2015) Egocentric daily activity recognition via multitask clustering. IEEE Trans Image Process 24(10):2984–2995
Article MathSciNet Google Scholar
Zhai Y, Shah M (2006) Visual attention detection in video sequences using spatiotemporal cues. In: Proceedings of the 14th annual ACM international conference on Multimedia, pp 815– 824
Zhou B, Wang X, Tang X (2012) Understanding collective crowd behaviors: Learning a mixture model of dynamic pedestrian-agents. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 2871–2878
Zhou Y, Yan S, Huang T (2007) Detecting anomaly in videos from trajectory similarity analysis. In: Proceedings of the IEEE International Conference on Multimedia and Expo, pp 1087– 1090

Download references

Author information

Authors and Affiliations

Haldia Institute of Technology, Haldia, India
Sk. Arif Ahmed
Indian Institute of Technology, Bhubaneswar, India
Debi Prosad Dogra
National Institute of Technology, Durgapur, India
Samarjit Kar
Sookmyung Women’s University, Seoul, Republic of Korea
Byung-Gyu Kim
University of Bristol, Bristol, UK
Paul Hill
Khalifa University, P.O.Box 127788, Abu Dhabi, United Arab Emirates
Harish Bhaskar

Authors

Sk. Arif Ahmed
View author publications
You can also search for this author in PubMed Google Scholar
Debi Prosad Dogra
View author publications
You can also search for this author in PubMed Google Scholar
Samarjit Kar
View author publications
You can also search for this author in PubMed Google Scholar
Byung-Gyu Kim
View author publications
You can also search for this author in PubMed Google Scholar
Paul Hill
View author publications
You can also search for this author in PubMed Google Scholar
Harish Bhaskar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Debi Prosad Dogra.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ahmed, S.A., Dogra, D.P., Kar, S. et al. Localization of region of interest in surveillance scene. Multimed Tools Appl 76, 13651–13680 (2017). https://doi.org/10.1007/s11042-016-3762-y

Download citation

Received: 04 November 2015
Revised: 17 May 2016
Accepted: 05 July 2016
Published: 20 July 2016
Issue Date: June 2017
DOI: https://doi.org/10.1007/s11042-016-3762-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Localization of region of interest in surveillance scene

Abstract

Access this article

Similar content being viewed by others

Smart video summarization using mealy machine-based trajectory modelling for surveillance applications

UMTSS: a unifocal motion tracking surveillance system for multi-object tracking in videos

Motion anomaly detection and trajectory analysis in visual surveillance

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Localization of region of interest in surveillance scene

Abstract

Access this article

Similar content being viewed by others

Smart video summarization using mealy machine-based trajectory modelling for surveillance applications

UMTSS: a unifocal motion tracking surveillance system for multi-object tracking in videos

Motion anomaly detection and trajectory analysis in visual surveillance

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation