Action boundaries detection in a video

Wehbe, Hassan; Haidar, Bassem; Joly, Philippe

doi:10.1007/s11042-015-2748-5

Action boundaries detection in a video

Published: 02 August 2015

Volume 75, pages 8239–8266, (2016)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Hassan Wehbe¹,
Bassem Haidar² &
Philippe Joly¹

193 Accesses
Explore all metrics

Abstract

In the video analysis domain, automatic detection of actions performed in a recorded video represents an important scientific and industrial challenge. This paper presents a new method to approximate the boundaries of actions performed by a person while interacting with his environment (such as moving objects). This method relies on a Codebook quantization method to analyze the rough evolution of each pixel and then decide whether this evolution corresponds to an action or not; this decision is taken by an automated system. Statistics are then produced - at the scale of the whole frame - to estimate the start and the end of an action. According to our proposed evaluation protocol, this method produces interesting results on both real and simulated videos. This statistic-based protocol is discussed at the end of this paper. The interpretation of this evaluation protocol nominates this method to be a solid base to localize the exact boundaries of actions or - in the framework of this research activity - to associate prescriptive text with a visual content.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Notes

A pixel’s RGB value (R, G, B) matches the codeword C if, and only if, the point (R, G, B) - in the RGB system - is located inside the cylinder corresponding to C.
Synchronization of a video with the text that describes its content

References

Ambata LU, Caluyo FS (2012) Background change detection using wavelet transform. TENCON 2012 I.E. Region 10 Conference, pp. 1–6. doi: 10.1109/tencon.2012.6412298.
Bouwmans T (2011) Recent Advanced Statistical Background Modeling for Foreground Detection - A Systematic Survey. Recent Patents Comput Sci 4:147–176
Google Scholar
Cucchiara R, Grana C, Piccardi M, Prati A (2003) Detecting moving objects, ghosts and shadows in video streams. IEEE Trans Pattern Anal Mach Intell 25:1337–1342. doi:10.1109/tpami.2003.1233909
Article Google Scholar
Elgammal A, Duraiswami R, Harwood D, Davis LS (2002) Background and foreground modeling using nonparametric kernel density estimation for visual surveillance. Proc IEEE 90:1151–1163. doi:10.1109/JPROC.2002.801448
Article Google Scholar
Fihl P, Corlin R, Park S, Moeslund TB, Trivedi MM (2006) Tracking of individuals in very long video sequences. Adv Vis Comput Lect Notes Comput Sci 4291:60–69. doi:10.1007/11919476_7
Article Google Scholar
Geng L, Xiao Z (2011) Real Time Foreground-Background Segmentation Using Two-Layer Codebook Model. International Conference on Control. Aut Syst Eng(CASE) 1:1–5. doi:10.1109/ICCASE.2011.5997546
Google Scholar
Gibbins D, Newsam GN, Brooks MJ (1996) Detecting suspicious background changes in video surveillance of busy scenes. Proceedings Third IEEE Workshop on Applications of Computer Vision, pp. 22–26. doi: 10.1109/acv.1996.571990
Gong Y, Sin LT, Chuan CH, Zhang H, Sakauchi M (1995) Automatic Parsing of TV Soccer Programs. International Conference on Multimedia Computing and Systems 1:167–174. doi:10.1109/MMCS.1995.484921
Article Google Scholar
Horprasert T, Harwood D, Davis LS (1999) A statistical approach for real-time robust background subtraction and shadow detection. IEEE Int Conf Comp Vis 99:1–19
Google Scholar
Kim K, Chalidabhongse TH, Harwood D, Davis L (2005) Real-time foreground - background segmentation using codebook model. Real-time Imaging 11:172–185. doi:10.1016/j.rti.2004.12.004
Article Google Scholar
Leykin A, Ran Y, Hammoud R (2007) Thermal-visible video fusion for moving target tracking and pedestrian classification. IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. doi:10.1109/CVPR.2007.383444
MathWorks laboratories (2013) Scene Change Detection System (vipscenechange). Computer Vision System Toolbox in MATLAB (R2013a). http://fr.mathworks.com/help/vision/examples/scene-change-detection.html, Accessed 1 December 2014.
Radke RJ, Andra S, Al-Kofahi O, Roysam B (2005) Image Change Detection Algorithms - A Systematic Survey. IEEE Trans Image Process 14:294–307. doi:10.1109/tip.2004.838698
Article MathSciNet Google Scholar
Rodriguez-Gomez R, Fernandez-Sanchez EJ, Diaz J, Ros E (2012) Codebook hardware implementation on FPGA for background subtraction. J Real-Time Image Proc 10:43–57. doi:10.1007/s11554-012-0249-6
Article Google Scholar
Rui Y, Gupta A, Acero A (2000) Automatically Extracting Highlights for TV Baseball Programs. Proceedings of the eighth ACM international conference on Multimedia, pp. 105–115. doi: 10.1145/354384.354443.
Sigari MH, Fathy M (2008) Real-time Background Modeling/Subtraction using Two-Layer Codebook Model. Proc Int MultiConference Eng Comp Scientists (IMECS) 1:717–720
Google Scholar
Stauffer C, Grimson WEL (1999) Adaptive background mixture models for real-time tracking. IEEE Comp Soc Conf Comp Vision PattRecog 2:252. doi:10.1109/CVPR.1999.784637
Google Scholar
Subudhi BN, Ghosh S, Ghosh A (2013) Change detection for moving object segmentation with robust background construction under Wronskian framework. Mach Vis Appl 24:795–809. doi:10.1007/s00138-012-0475-8
Article Google Scholar
Sudhir G, Lee JCM, Jain AK (1998) Automatic classification of tennis video for high-level content-based retrieval. Proceedings of IEEE International Workshop on Content-Based Access of Image and Video Database 1:81–90. doi:10.1109/caivd.1998.646036
Article Google Scholar
Szwoch G, Ellwart D, Czyżewski A (2012) Parallel implementation of background subtraction algorithms for real-time video processing on a supercomputer platform. J Real-Time Image Proc. doi:10.1007/s11554-012-0310-5
Google Scholar
Wren CR, Azarbayejani A, Darrell T, Pentland AP (1997) Pfinder: Real-time tracking of the human body. IEEE Trans Pattern Anal Mach Intell 19:780–785. doi:10.1109/34.598236
Article Google Scholar
Zhang D, Chang SF (2002) Event detection in baseball video using superimposed caption recognition. Proceedings of the tenth ACM international conference on Multimedia, pp. 315–318. doi: 10.1145/641007.641073.

Download references

Author information

Authors and Affiliations

IRIT – Toulouse University, 118 route de Narbonne, 31062, Toulouse Cedex 9, France
Hassan Wehbe & Philippe Joly
Lebanese University, Faculty of Sciences, Hadath, Beirut, Lebanon
Bassem Haidar

Authors

Hassan Wehbe
View author publications
You can also search for this author in PubMed Google Scholar
Bassem Haidar
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Joly
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hassan Wehbe.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wehbe, H., Haidar, B. & Joly, P. Action boundaries detection in a video. Multimed Tools Appl 75, 8239–8266 (2016). https://doi.org/10.1007/s11042-015-2748-5

Download citation

Received: 11 March 2014
Revised: 21 May 2015
Accepted: 15 June 2015
Published: 02 August 2015
Issue Date: July 2016
DOI: https://doi.org/10.1007/s11042-015-2748-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Action boundaries detection in a video

Abstract

Access this article

Similar content being viewed by others

A Motion-Driven Approach for Fine-Grained Temporal Segmentation of User-Generated Videos

Action identification using a descriptor with autonomous fragments in a multilevel prediction scheme

Diagnosing Error in Temporal Action Detectors

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Action boundaries detection in a video

Abstract

Access this article

Similar content being viewed by others

A Motion-Driven Approach for Fine-Grained Temporal Segmentation of User-Generated Videos

Action identification using a descriptor with autonomous fragments in a multilevel prediction scheme

Diagnosing Error in Temporal Action Detectors

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation