Semantic Content Analysis of Video: Issues and Trends

Garg, Aparna; Ramsay, Allan

doi:10.1007/978-3-642-19551-8_16

Aparna Garg⁷ &
Allan Ramsay⁷

Part of the book series: Studies in Computational Intelligence ((SCI,volume 346))

Abstract

Key issues in bridging the semantic gap for content analysis of video include flexibility required from the software, real time implementation and cost effectiveness. In recent years industry has begun to take a more realistic view of what to expect from video content analysis systems in the near future. This chapter presents the state-of–the-art trends in semantic video analysis in industry. The key challenges in bridging the semantic gap are discussed. It also presents the research trends in video analytics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 299.00; Price excludes VAT (USA)

Softcover Book: USD 379.99; Price excludes VAT (USA)

Hardcover Book: USD 379.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Allen, J.F.: Maintaining knowledge about temporal interval. Communications of ACM 26(11), 832–843 (1983)
Article MATH Google Scholar
Andrade Ernesto, L., Scott, B., Fisher Robert, B.: Hidden Markov Models for Optical Flow Analysis in Crowds. In: 18th International Conference on Pattern Recognition (ICPR 2006), vol. 1, pp. 460–463 (2006)
Google Scholar
Scott, B., Ernesto, A., Robert, F.: Non Parametric Classification of Human Interaction. In: Martí, J., Benedí, J.M., Mendonça, A.M., Serrat, J. (eds.) IbPRIA 2007. LNCS, vol. 4478, pp. 347–354. Springer, Heidelberg (2007)
Chapter Google Scholar
Bramberger, M., Brunner, J., Rinner, B., Schwabach, H.: Real-time video analysis on an embedded smart camera for traffic surveillance. In: Proceedings of 10th IEEE Real-Time and Embedded Technology and Applications Symposium, RTAS 2004, pp. 174–181 (2004)
Google Scholar
Brand, M., Kettnaker, V.: Discovery and Segmentation of Activities in Video. IEEE Trans. Pattern Analysis and Machine Intelligence 22(8), 844–851 (2000)
Article Google Scholar
Dee Hannah, M., Velastin Sergio, A.: How close are we to solving the problem of automated visual surveillance? Machine Vision and Applications 19, 329–343 (2008)
Article Google Scholar
Desurmont, X., Wijnhoven, R.G.J.: Performance evaluation of real-time video content analysis systems in the CANDELA project. In: Proc. of the SPIE - Real-Time Imaging IX (2005)
Google Scholar
Duong, T.V., Bui, H.B., Phung, D.Q., Venkatesh, S.: Activity Recognition and Abnormality Detection with the Switching Hidden Semi-Markov Model. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. I, pp. 838–845 (2005)
Google Scholar
Ilker, E., Filiz, B., Subramanya, S.R.: A Framework for Trajectory Based Visual Event Retrieval. In: Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC 2004), vol. 2, pp. 23–27 (2004)
Google Scholar
Luca, F.G., Lucio, M., Regazzoni Carlo, S.: Automatic Detection and Indexing of Video-Event Shots for Surveillance Application. IEEE Transactions On Multimedia 4(4), 459–471 (2002)
Article Google Scholar
Xinbo, G., Yimin, Y., Dacheng, T., Xuelong, L.: Discriminative Optical Flow Tensor for Video Semantic Analysis. In: Computer Vision and Image Understanding, vol. 113, pp. 372–383. Elsevier, Amsterdam (2009)
Google Scholar
Jungong, H., Dirk, F., Peter, H.N., Weilun, L.: Real-Time Video Content Analysis Tool for Consumer Media Storage System. IEEE Transactions on Consumer Electronics 52(3), 870–878 (2006)
Article Google Scholar
Ismail, H., David, H., Larry, S.: W4: Real-Time Surveillance of People and Their Activities. IEEE Transactions On Pattern Analysis And Machine Intelligence 22(8), 809–830 (2000)
Article Google Scholar
Yongzhen, H., Kaiqi, H., Liangsheng, W., Dacheng, T., Tieniu, T., Xuelong, L.: Enhanced Biologically Inspired Model. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8 (2008)
Google Scholar
Hoogs, A., Bush, S., Brooksby, G., Perera, A.G.A., Dausch, M., Krahnstoever, N.: Detecting Semantic Group Activities Using Relational Clustering. In: IEEE Workshop on Motion and video Computing, WMVC 2008, January 8-9, pp. 1–8 (2008)
Google Scholar
Izo, T., Grimson, W.E.L.: Unsupervised Modeling of Object Tracks for Fast Anomaly Detection. In: IEEE International Conference on Image Processing, ICIP 2007, vol. 4, pp. 529–532 (2007)
Google Scholar
Jacobsen, C., Zscherpel, U., Perner, P.: A Comparison between Neural Networks and Decision Trees. In: Perner, P., Petrou, M. (eds.) MLDM 1999. LNCS (LNAI), vol. 1715, pp. 44–157. Springer, Heidelberg (1999)
Chapter Google Scholar
Jhuang, H., Serre, T., Wolf, L., Poggio, T.: A Biologically Inspired System for Action Recognition. In: IEEE 11th International Conference on Computer Vision, vol. 14(21), pp. 1–8 (2007)
Google Scholar
Yan, K., Rahul, S., Martial, H.: Event Detection in Crowded Videos. In: IEEE 11th International Conference on Computer Vision, ICCV 2007, October 14-21, pp. 1–8 (2007)
Google Scholar
Shehzad, K., Andrew, N.: Classifying spatiotemporal object trajec-tories using unsupervised learning of basis function coefficients. Multimedia Systems 12(3), 227–238 (2006)
Article Google Scholar
Richard, K., Anteneh, A., Ben, S., Alexander, D.: Camera mote with a high-performance parallel processor for real-time frame-based video processing. In: First ACM/IEEE International Conference on Distributed Smart Cameras, ICDSC 2007, pp. 109–116 (2007)
Google Scholar
Suha, K., Hee, H.J.: Hierarchical Event Representation and Rec-ognition Method for Scalable Video Event Analysis. In: Tenth IEEE International Symposium on Multimedia, pp. 586–591 (2008)
Google Scholar
Li, J.Z., Ozsu, M.T., Szafron, D.: Modeling of moving objects in a video database. In: Proc. 4th Int. Conf. On Multimedia and Computing System, pp. 336–343 (1997)
Google Scholar
Makris, D., Ellis, T.J.: Automatic Learning of an Activity-Based Semantic Scene Model. In: IEEE International Conference on Advanced Video and Signal Based Surveillance, Miami, FL, USA, pp. 183–188 (2003)
Google Scholar
Nascimento, J.C., Figueiredo, M.A.T., Marques, J.S.: Segmentation and Classification of Human Activities. In: HAREM 2005: International Workshop on Human Activity Recognition and Modelling (2005)
Google Scholar
Nevatia, R., Hobbs Jand Bolles, B.: An Ontology for Video Event Representation. In: Proceedings of the 2004 IEEE Computer Society Conference on Com-puter Vision and Patter Recognition Workshops(CVPRW 2004) (2004)
Google Scholar
Nguyen, N.T., Bui, H.H., Venkatesh, S., West, G.: Recognizing and Monitor-ing High-Level Behaviors in Complex Spatial Environments. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 620–625 (2003)
Google Scholar
Oliver, N.M., Rosario, B., Pentland, A.P.: A Bayesian Computer Vision System for Modeling Human Interactions. IEEE Trans. Pattern Analysis and Machine Intelligence 22(8), 831–843 (2000)
Article Google Scholar
Park, S., Aggarwal, J.K.: Semantic-Level Understanding of Human Actions and Interactions Using Event Hierarchy. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition Workshop, pp. 2–12 (2004)
Google Scholar
Neil, R., Ian, R.: Behaviour understanding in video: a combined method. In: Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV 2005), vol. 1, pp. 808–815 (2005)
Google Scholar
Serre, T., Wolf, L., Poggio, T.: Object recognition with features inspired by visual cortex. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, pp. 994–1000 (2005)
Google Scholar
Shim, C.-B., Chang, J.-W., Kim, Y.-C.: Trajectory-Based Video Retrieval for Multimedia Information Systems. In: Yakhno, T. (ed.) ADVIS 2004. LNCS, vol. 3261, pp. 372–382. Springer, Heidelberg (2004)
Chapter Google Scholar
Siskind, J.: Grounding the Lexical Semantics of Verbs in Visual Perception using free Dynamics and Event logic. Artificial Intelligence Review 1(5), 31–90 (2001)
Google Scholar
Smeaton Alan, F.: Techniques used and open challenges to the analysis, indexing and retrieval of digital video. Information Systems 32, 545–559 (2007)
Article Google Scholar
Smith John, R.: VERL: An Ontology Framework for Representing and Annotating Video Events. IEEE MultiMedia, 76–86 (October-December 2005)
Google Scholar
Smith, J.R.: The Real Problem of Bridging the “Semantic Gap”. In: Sebe, N., Liu, Y., Zhuang, Y.-t., Huang, T.S. (eds.) MCAM 2007. LNCS, vol. 4577, pp. 16–17. Springer, Heidelberg (2007)
Chapter Google Scholar
Dongjin, S., Dacheng, T.: C1 Units for Scene Classification. In: 19th International Conference on Pattern Recognition, ICPR 2008, pp. 1–4 (2008)
Google Scholar
Tim, T., Wai, Y., Arbee, L., Chen, P.: Retrieving Video Data via Motion Tracks of Content Symbols. In: Proceedings of the Sixth International Conference on Information and Knowledge Management, pp. 105–112 (1997)
Google Scholar
Town, C.: Ontology-Driven Bayesian Networks for Dynamic Scene Understanding. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition Workshops (2004)
Google Scholar
Bogdan, V., Dimitrios, M., John-Paul, R.: A Framework for Ontology Enriched Semantic Annotation of CCTV Video. In: Eight International Workshop on Image Analysis for Multimedia Interactive Services, WIAMIS 2007 (2007)
Google Scholar
Xiaogang, W., Xiaoxu, M., Grimson, E.: Unsupervised Activity Per-ception by Hierarchical Bayesian Models. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2007, pp. 1–8 (2007)
Google Scholar
Tao, X., Shaogang, G.: Activity based surveillance video content modeling. Pattern Recognition 41(7), 2309–2326 (2008)
MATH Google Scholar
Tao, X., Shaogang, G.: Video behaviour profiling and abnormality detection without manual labeling. In: Tenth IEEE International Conference on Computer Vision, ICCV 2005, vol. 2(17-21), pp. 1238–1245 (2005)
Google Scholar
http://groups.inf.ed.ac.uk/vision/CAVIAR/CAVIARDATA1
http://ngsim.fhwa.dot.gov/
http://www.multitel.be/cantata/

Download references

Author information

Authors and Affiliations

School of Computer Science, University of Manchester, U.K.
Aparna Garg & Allan Ramsay

Authors

Aparna Garg
View author publications
You can also search for this author in PubMed Google Scholar
Allan Ramsay
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Engineering , Nanyang Technological University, 639798, Singapore
Weisi Lin & Dacheng Tao &
Intelligent Systems Laboratory Systems Research Institute , Polish Academy of Sciences, Poland
Janusz Kacprzyk
Department of Computing , Hong Kong Polytechnic University, Hung Hom, Hong Kong
Zhu Li
School of Electronic Engineering and Computer Science, Queen Mary, University of London, London, U.K.
Ebroul Izquierdo
TCL-Thomson Electronics , Santa Clara, California
Haohong Wang

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Garg, A., Ramsay, A. (2011). Semantic Content Analysis of Video: Issues and Trends. In: Lin, W., Tao, D., Kacprzyk, J., Li, Z., Izquierdo, E., Wang, H. (eds) Multimedia Analysis, Processing and Communications. Studies in Computational Intelligence, vol 346. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19551-8_16

Download citation

DOI: https://doi.org/10.1007/978-3-642-19551-8_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19550-1
Online ISBN: 978-3-642-19551-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics