Skip to main content

Semantic Content Analysis of Video: Issues and Trends

  • Chapter
Multimedia Analysis, Processing and Communications

Part of the book series: Studies in Computational Intelligence ((SCI,volume 346))

Abstract

Key issues in bridging the semantic gap for content analysis of video include flexibility required from the software, real time implementation and cost effectiveness. In recent years industry has begun to take a more realistic view of what to expect from video content analysis systems in the near future. This chapter presents the state-of–the-art trends in semantic video analysis in industry. The key challenges in bridging the semantic gap are discussed. It also presents the research trends in video analytics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 299.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 379.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 379.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Allen, J.F.: Maintaining knowledge about temporal interval. Communications of ACM 26(11), 832–843 (1983)

    Article  MATH  Google Scholar 

  2. Andrade Ernesto, L., Scott, B., Fisher Robert, B.: Hidden Markov Models for Optical Flow Analysis in Crowds. In: 18th International Conference on Pattern Recognition (ICPR 2006), vol. 1, pp. 460–463 (2006)

    Google Scholar 

  3. Scott, B., Ernesto, A., Robert, F.: Non Parametric Classification of Human Interaction. In: Martí, J., Benedí, J.M., Mendonça, A.M., Serrat, J. (eds.) IbPRIA 2007. LNCS, vol. 4478, pp. 347–354. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  4. Bramberger, M., Brunner, J., Rinner, B., Schwabach, H.: Real-time video analysis on an embedded smart camera for traffic surveillance. In: Proceedings of 10th IEEE Real-Time and Embedded Technology and Applications Symposium, RTAS 2004, pp. 174–181 (2004)

    Google Scholar 

  5. Brand, M., Kettnaker, V.: Discovery and Segmentation of Activities in Video. IEEE Trans. Pattern Analysis and Machine Intelligence 22(8), 844–851 (2000)

    Article  Google Scholar 

  6. Dee Hannah, M., Velastin Sergio, A.: How close are we to solving the problem of automated visual surveillance? Machine Vision and Applications 19, 329–343 (2008)

    Article  Google Scholar 

  7. Desurmont, X., Wijnhoven, R.G.J.: Performance evaluation of real-time video content analysis systems in the CANDELA project. In: Proc. of the SPIE - Real-Time Imaging IX (2005)

    Google Scholar 

  8. Duong, T.V., Bui, H.B., Phung, D.Q., Venkatesh, S.: Activity Recognition and Abnormality Detection with the Switching Hidden Semi-Markov Model. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. I, pp. 838–845 (2005)

    Google Scholar 

  9. Ilker, E., Filiz, B., Subramanya, S.R.: A Framework for Trajectory Based Visual Event Retrieval. In: Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC 2004), vol. 2, pp. 23–27 (2004)

    Google Scholar 

  10. Luca, F.G., Lucio, M., Regazzoni Carlo, S.: Automatic Detection and Indexing of Video-Event Shots for Surveillance Application. IEEE Transactions On Multimedia 4(4), 459–471 (2002)

    Article  Google Scholar 

  11. Xinbo, G., Yimin, Y., Dacheng, T., Xuelong, L.: Discriminative Optical Flow Tensor for Video Semantic Analysis. In: Computer Vision and Image Understanding, vol. 113, pp. 372–383. Elsevier, Amsterdam (2009)

    Google Scholar 

  12. Jungong, H., Dirk, F., Peter, H.N., Weilun, L.: Real-Time Video Content Analysis Tool for Consumer Media Storage System. IEEE Transactions on Consumer Electronics 52(3), 870–878 (2006)

    Article  Google Scholar 

  13. Ismail, H., David, H., Larry, S.: W4: Real-Time Surveillance of People and Their Activities. IEEE Transactions On Pattern Analysis And Machine Intelligence 22(8), 809–830 (2000)

    Article  Google Scholar 

  14. Yongzhen, H., Kaiqi, H., Liangsheng, W., Dacheng, T., Tieniu, T., Xuelong, L.: Enhanced Biologically Inspired Model. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8 (2008)

    Google Scholar 

  15. Hoogs, A., Bush, S., Brooksby, G., Perera, A.G.A., Dausch, M., Krahnstoever, N.: Detecting Semantic Group Activities Using Relational Clustering. In: IEEE Workshop on Motion and video Computing, WMVC 2008, January 8-9, pp. 1–8 (2008)

    Google Scholar 

  16. Izo, T., Grimson, W.E.L.: Unsupervised Modeling of Object Tracks for Fast Anomaly Detection. In: IEEE International Conference on Image Processing, ICIP 2007, vol. 4, pp. 529–532 (2007)

    Google Scholar 

  17. Jacobsen, C., Zscherpel, U., Perner, P.: A Comparison between Neural Networks and Decision Trees. In: Perner, P., Petrou, M. (eds.) MLDM 1999. LNCS (LNAI), vol. 1715, pp. 44–157. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  18. Jhuang, H., Serre, T., Wolf, L., Poggio, T.: A Biologically Inspired System for Action Recognition. In: IEEE 11th International Conference on Computer Vision, vol. 14(21), pp. 1–8 (2007)

    Google Scholar 

  19. Yan, K., Rahul, S., Martial, H.: Event Detection in Crowded Videos. In: IEEE 11th International Conference on Computer Vision, ICCV 2007, October 14-21, pp. 1–8 (2007)

    Google Scholar 

  20. Shehzad, K., Andrew, N.: Classifying spatiotemporal object trajec-tories using unsupervised learning of basis function coefficients. Multimedia Systems 12(3), 227–238 (2006)

    Article  Google Scholar 

  21. Richard, K., Anteneh, A., Ben, S., Alexander, D.: Camera mote with a high-performance parallel processor for real-time frame-based video processing. In: First ACM/IEEE International Conference on Distributed Smart Cameras, ICDSC 2007, pp. 109–116 (2007)

    Google Scholar 

  22. Suha, K., Hee, H.J.: Hierarchical Event Representation and Rec-ognition Method for Scalable Video Event Analysis. In: Tenth IEEE International Symposium on Multimedia, pp. 586–591 (2008)

    Google Scholar 

  23. Li, J.Z., Ozsu, M.T., Szafron, D.: Modeling of moving objects in a video database. In: Proc. 4th Int. Conf. On Multimedia and Computing System, pp. 336–343 (1997)

    Google Scholar 

  24. Makris, D., Ellis, T.J.: Automatic Learning of an Activity-Based Semantic Scene Model. In: IEEE International Conference on Advanced Video and Signal Based Surveillance, Miami, FL, USA, pp. 183–188 (2003)

    Google Scholar 

  25. Nascimento, J.C., Figueiredo, M.A.T., Marques, J.S.: Segmentation and Classification of Human Activities. In: HAREM 2005: International Workshop on Human Activity Recognition and Modelling (2005)

    Google Scholar 

  26. Nevatia, R., Hobbs Jand Bolles, B.: An Ontology for Video Event Representation. In: Proceedings of the 2004 IEEE Computer Society Conference on Com-puter Vision and Patter Recognition Workshops(CVPRW 2004) (2004)

    Google Scholar 

  27. Nguyen, N.T., Bui, H.H., Venkatesh, S., West, G.: Recognizing and Monitor-ing High-Level Behaviors in Complex Spatial Environments. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 620–625 (2003)

    Google Scholar 

  28. Oliver, N.M., Rosario, B., Pentland, A.P.: A Bayesian Computer Vision System for Modeling Human Interactions. IEEE Trans. Pattern Analysis and Machine Intelligence 22(8), 831–843 (2000)

    Article  Google Scholar 

  29. Park, S., Aggarwal, J.K.: Semantic-Level Understanding of Human Actions and Interactions Using Event Hierarchy. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition Workshop, pp. 2–12 (2004)

    Google Scholar 

  30. Neil, R., Ian, R.: Behaviour understanding in video: a combined method. In: Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV 2005), vol. 1, pp. 808–815 (2005)

    Google Scholar 

  31. Serre, T., Wolf, L., Poggio, T.: Object recognition with features inspired by visual cortex. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, pp. 994–1000 (2005)

    Google Scholar 

  32. Shim, C.-B., Chang, J.-W., Kim, Y.-C.: Trajectory-Based Video Retrieval for Multimedia Information Systems. In: Yakhno, T. (ed.) ADVIS 2004. LNCS, vol. 3261, pp. 372–382. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  33. Siskind, J.: Grounding the Lexical Semantics of Verbs in Visual Perception using free Dynamics and Event logic. Artificial Intelligence Review 1(5), 31–90 (2001)

    Google Scholar 

  34. Smeaton Alan, F.: Techniques used and open challenges to the analysis, indexing and retrieval of digital video. Information Systems 32, 545–559 (2007)

    Article  Google Scholar 

  35. Smith John, R.: VERL: An Ontology Framework for Representing and Annotating Video Events. IEEE MultiMedia, 76–86 (October-December 2005)

    Google Scholar 

  36. Smith, J.R.: The Real Problem of Bridging the “Semantic Gap”. In: Sebe, N., Liu, Y., Zhuang, Y.-t., Huang, T.S. (eds.) MCAM 2007. LNCS, vol. 4577, pp. 16–17. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  37. Dongjin, S., Dacheng, T.: C1 Units for Scene Classification. In: 19th International Conference on Pattern Recognition, ICPR 2008, pp. 1–4 (2008)

    Google Scholar 

  38. Tim, T., Wai, Y., Arbee, L., Chen, P.: Retrieving Video Data via Motion Tracks of Content Symbols. In: Proceedings of the Sixth International Conference on Information and Knowledge Management, pp. 105–112 (1997)

    Google Scholar 

  39. Town, C.: Ontology-Driven Bayesian Networks for Dynamic Scene Understanding. In: Proc. IEEE Conf. Computer Vision and Pattern Recognition Workshops (2004)

    Google Scholar 

  40. Bogdan, V., Dimitrios, M., John-Paul, R.: A Framework for Ontology Enriched Semantic Annotation of CCTV Video. In: Eight International Workshop on Image Analysis for Multimedia Interactive Services, WIAMIS 2007 (2007)

    Google Scholar 

  41. Xiaogang, W., Xiaoxu, M., Grimson, E.: Unsupervised Activity Per-ception by Hierarchical Bayesian Models. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2007, pp. 1–8 (2007)

    Google Scholar 

  42. Tao, X., Shaogang, G.: Activity based surveillance video content modeling. Pattern Recognition 41(7), 2309–2326 (2008)

    MATH  Google Scholar 

  43. Tao, X., Shaogang, G.: Video behaviour profiling and abnormality detection without manual labeling. In: Tenth IEEE International Conference on Computer Vision, ICCV 2005, vol. 2(17-21), pp. 1238–1245 (2005)

    Google Scholar 

  44. http://groups.inf.ed.ac.uk/vision/CAVIAR/CAVIARDATA1

  45. http://ngsim.fhwa.dot.gov/

  46. http://www.multitel.be/cantata/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Garg, A., Ramsay, A. (2011). Semantic Content Analysis of Video: Issues and Trends. In: Lin, W., Tao, D., Kacprzyk, J., Li, Z., Izquierdo, E., Wang, H. (eds) Multimedia Analysis, Processing and Communications. Studies in Computational Intelligence, vol 346. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19551-8_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-19551-8_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-19550-1

  • Online ISBN: 978-3-642-19551-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics