Skip to main content

Parsing a Video into Semantic Segments

  • Chapter
Content-Based Analysis of Digital Video
  • 204 Accesses

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References and Further Reading

  1. Adams B.: Where does computational media aesthetics fit?, IEEE Multimedia, pp. 18–27, April-June 2003

    Google Scholar 

  2. Aigrain P., Joly P., Leplain P., Longueville V.: Medium knowledge-based macro-segmentation into sequences, Working notes of IJCAI Workshop on Intelligent Multimedia Information Retrieval, pp. 5–14, 1995

    Google Scholar 

  3. Allan J.: Topic detection and tracking: Event-based information organization, Kluwer Academic Publishers, February 2002

    Google Scholar 

  4. Aner A.: Video summaries and cross-referencing, Ph.D. thesis, Columbia University, New York, 2002

    Google Scholar 

  5. Arijon D.: Grammar of the film language, Silman-James Press, 1976

    Google Scholar 

  6. Ariki Y., Saito Y.: Extraction of TV news articles based on scene cut detection using DCT Clustering, ICIP’ 96, Vol. 3, pp. 847–850, Lausanne CH, 1996

    Google Scholar 

  7. Beaver F.: Dictionary of film terms, Twayne Publishing, New York, 1994

    Google Scholar 

  8. Beeferman D., Berger A., Lafferty J.: Statistical models for text segmentation, Machine Learning, 34/1, pp. 177–210, 1999

    Google Scholar 

  9. Del Bimbo A.: Visual information retrieval, Morgan Kaufmann Publishers, Inc., 1999

    Google Scholar 

  10. Boccignone G., De Santo M., Percannella G.: Joint audio-video processing of MPEG encoded sequences, proceedings of the IEEE International Conference on Multimedia Computing and Systems (ICMCS), 1999

    Google Scholar 

  11. Boccignone G., De Santo M., Percannella G.: A system for parsing MPEG videos, IS&T/SPIE Internet Imaging, 2000

    Google Scholar 

  12. Boggs J.M., Petrie D.W.: The art of watching films, 5th ed., Mountain View, CA: Mayfield 2000

    Google Scholar 

  13. Bordwell D., Thompson K.: Film Art: An Introduction, McGraw-Hill, New York, 1997

    Google Scholar 

  14. Chang S.-F., Smith J.R.: Extracting multidimensional signal features for content-based visual query, SPIE Symposium on Visual Communications and Signal Processing, pp. 995–1006, 1995

    Google Scholar 

  15. Chiu P., Girgensohn A., Polak W., Rieffel E., Wilcox L.: A genetic algorithm for video segmentation and summarization, Proceedings of IEEE International Conference on Multiemdia and EXPO (ICME), Vol. 3, pp. 1329–1332, 2000

    Google Scholar 

  16. Choi F.: Advances in domain independent linear text segmentation, NAACL’ 00, pp.26–33, 2000

    Google Scholar 

  17. Choi F., Wiemer-Hastings P., Moore J: Latent semantic analysis for text segmentation, proceedings of the NAACL’ 01, pp. 109–117, 2001

    Google Scholar 

  18. Corridoni J.M., Del Bimbo A.: Stuctured representation andautomatic indexing of movie information content, Pattern Recognition, 31(12), pp. 2027–2045, 1998

    Article  Google Scholar 

  19. Davies D.L., Bouldin D.W.: A cluster separation measure, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. PAMI-1, pp. 224–227, April 1979

    Google Scholar 

  20. Dawson J.L.: Suffix removal and word connation, ALLC Bulletin, 2(3), pp. 33–46, 1974

    Google Scholar 

  21. Ferret O.: How to thematically segment texts by using lexical cohesion?, ACL-COLING’ 98, pp. 1481–1483,1998

    Google Scholar 

  22. Ferret O.: Using collocations for topic segmentation and link detection, COLING’ 02, 2002

    Google Scholar 

  23. Fiscus J., Doddington G., Garofolo J., Martin A.: MSTS’s 1998 topic detection and tracking evaluation, DARPA Broadcast News Workshop, 1999

    Google Scholar 

  24. Foltz P.W., Kintsch W., Landauer T.K.: The measurement of textual coherence with Latent Semantic Analysis, Discourse Processes, 25, 2&3, pp. 285–307, 1998

    Google Scholar 

  25. Furht B., Smoliar S.W., Zhang H.: Video and Image Processing in Multimedia Systems, Kluwer Academic Publishers, 1995

    Google Scholar 

  26. Girgensohn A., Boreczky J., Wilcox L.: Keyframe-based user interfaces for digital video, IEEE Computer, September 2001

    Google Scholar 

  27. Gonzales R.C., Woods R.E.: Digital image processing, Addison Wesley, 1993

    Google Scholar 

  28. Gunsel B., Fu Y., Tekalp A.M.: Hierarchical temporal video segmentation and content characterization, in Multimedia Storage and Archiving Systems II, proceedings of SPIE, Vol. 3229, pp. 46–56, 1997

    Google Scholar 

  29. Halliday M.A.K., Hasan R.: Cohesion in English, Lohman, London 1976

    Google Scholar 

  30. Hanjalic A., Lagendijk R.L., Biemond J.: Template-based detection of anchorperson shots in news programs, Proceedings of IEEE International Conference on Image Processing (ICIP), 1998

    Google Scholar 

  31. Hanjalic A., Lagendijk R.L., Biemond J.: Automated high-level movie segmentation for advanced video-retrieval systems, IEEE Transactions on Circuits and Systems for Video Technology, Vol.9, No.4, pp. 580–588, June 1999

    Article  Google Scholar 

  32. Hanjalic A., Zhang H.-J.: An integrated schemefor automated video abstraction based on unsupervised cluster-validity analysis, IEEE Transactions on Circuits and Systems for Video Technology, Vol.9, No.8, December 1999

    Google Scholar 

  33. Hanjalic A., Langelaar G.C., van Roosmalen P.M.B., Biemond J., Lagendijk R.L.: Image and video databases: restoration, watermarking and retrieval, Elsevier Science, Amsterdam 2000

    Google Scholar 

  34. Hanjalic A., Kakes G., Lagendijk R.L., Biemond J.: Indexing and retrieval of TV broadcast news using DANCERS, Journal of Electronic Imaging, 10(4), pp. 871–882, October 2001

    Article  Google Scholar 

  35. Hartley R., Zisserman A.: Multiple view geometry in computer vision, Cambridge University Press, 2001

    Google Scholar 

  36. Hearst M.: TextTiling: Segmenting text into multi-paragraph subtopic passages, Computational Linguistics, 23/1, pp. 33–64, 1997

    Google Scholar 

  37. Irani M., Anandan P., Bergenand J., Kumar R., Hsu S.: Efficient representation of video sequences and their applications, Signal Processing: Image Communication, Volume 8, 1996

    Google Scholar 

  38. Jain A.K., Dubes R.C.: Algorithms for clustering data, Engelwood Cliffs, NJ, Prentice Hall, 1988

    Google Scholar 

  39. Jiang H., Lin T., Zhang H.-J.: Video Segmentation with the assistance of audio content analysis, IEEE International Conference on Multimedia and Expo (ICME2000), 2000

    Google Scholar 

  40. Kaufmann S.: Cohesion and collocation: Using context vectors in text segmentation, ACL’ 99, pp. 591–595, 1999

    Google Scholar 

  41. Kender J.R., Yeo B.-L.: Video scene segmentation via continuous video coherence, Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 1998

    Google Scholar 

  42. Kozima H.: Text segmentation based on similarity between words, Proceedings of the 31st Annual Meeting of the Association for Compuational Linguistics, 1993

    Google Scholar 

  43. Krovetz R.: Viewing morphology as an inference process, Proceedings of the 16th ACM SIGIR Conference, pp. 191–202, 1993

    Google Scholar 

  44. Kwon Y.-M., Song C.-I, Kim I.-J.: A new approach for high level video structuring, Proceedings of the IEEE International Conference on Multimedia and EXPO (ICME), Vol. 2, pp/ 773–776, 2000

    Google Scholar 

  45. Landauer, T. K., Foltz, P. W., Laham, D.: Introduction to Latent Semantic Analysis. Discourse Processes, Vol. 25, pp. 259–284,1998

    Google Scholar 

  46. Lee S.-Y., Lee S.-T,, Chen D.-Y.: Automatic video summary and description, Lecture Notes in Computer Science, Vol. 1929, pp. 37–48, Springer Verlag, Berlin 2000

    Google Scholar 

  47. Li D., Dimitrova N., Li M., Sethi I.K.: Multimedia content processing through cross-modal association, Proceedings of ACM Multimedia’ 03, Berkeley 2003

    Google Scholar 

  48. Lienhart R., Pfeiffer S., Effelsberg W.: Scene determination based on video and audio features, Proceedings of the IEEE International Conference on Multimedia Computing and Systems (ICMCS), 1999

    Google Scholar 

  49. Lin T., Zhang H.J.: Automatic video scene extraction by shot grouping, Proceedings of International Conference on Pattern Recognition (ICPR), 2000

    Google Scholar 

  50. Lovins J.B.: Development of a stemming algorithm, Mechanical Translation and Computational Linguistics, 11, pp. 22–31, 1968

    Google Scholar 

  51. Lu X., Y.-F. Ma, H. Zhang, L. Wu, An integrated correlation measure for semantic video segmentation, Proceedings of IEEE International Conference on Multimedia and Expo, ICME, Lausanne, Switzerland, August, 2002.

    Google Scholar 

  52. Lu L., Zhang H.-J., Jiang H.: Content analysis for audio classification and segmentation, IEEE Transactions on Speech and Audio Processing, Vol. 10, No.7, October 2002

    Google Scholar 

  53. Mohan R.: Text-Based Search of TV News Stories, Proceedings of SPIE Conference on Multimedia Storage and Archiving Systems SPIE, Boston, MA, November 1996 pp. 2–13.

    Google Scholar 

  54. Morris J., Hirst G.: Lexical cohesion computed by thesaural relations as an indicator of the structure of text, Computational Linguistics, 17(1), pp. 21–48, 1991

    Google Scholar 

  55. O’ Connor N., Czirjek C. and al.: News Story Segmentation in The Físchlár Video Indexing System, Proceedings of the IEEE International Conference on Image Processing (ICIP), pp. 7–10, 2001

    Google Scholar 

  56. Paice C.P.: Another stemmer, Department of Computing, Lancaster University, UK, 1990

    Google Scholar 

  57. Passoneau R.J., Litman D.J.: Discourse segmentation by human and automated means, Computational Linguistics, 23(1): 103–139, 1997

    Google Scholar 

  58. Patel N.V., Sethi I.K.: Audio characterization for video indexing, IS&T/SPIE Electronic Imaging: Storage and Retrieval for Image and Video Databases IV, Vol. 2670, pp. 373–384, 1996

    Google Scholar 

  59. Patterson R.D., Robinson K., Holdsworth J., McKeown D., Zhang C, Allerhand M.H.: Complex sounds and auditory images, in Auditory Psychology and Perception, (Eds.) Y. Cazals, L. Demany, K. Horner, Pergamon, Oxford, 1992.

    Google Scholar 

  60. Pfeiffer S.: The importance of perceptive adaptation of sound features for audio content processing, IS&T/SPIE Electronic Imaging: Storage and Retrieval for Image and Video Databases VII, 1999

    Google Scholar 

  61. Ponte J., Croft W.B.: Text segmentation by topic, In proceedings of the First European Conference on Research and Advanced Technology for Digital Libraries, pp. 120–129, 1997

    Google Scholar 

  62. Porter M.F.: An algorithm for suffix stripping, Program, 14, No.3, pp. 130–137, July 1980

    Google Scholar 

  63. Rabiner L.R., Huang B.H.: Fundamentals of speech recognition. Prentice Hall, 1993

    Google Scholar 

  64. Rabiner L.R., Shafer R.W.: Digital processing of speech signals, Prentice Hall, 1978

    Google Scholar 

  65. Raskin V., Weiser I.: Language and writing: applications of linguistics to thetoric and composition, ABLEX Publishing Corporation, Norwood, NJ, 1987

    Google Scholar 

  66. Reynar J.C.: An automatic method of finding topic boundaries, Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, 1994

    Google Scholar 

  67. Robertson S.E., Sparck Jones K.: Simple proven approaches to text retrieval, Technical Report TR356, Cambridge University, Computer laboratory, 1997

    Google Scholar 

  68. Rui Y., Huang T.S., Mehrotra S.: Exploring video structure beyond the shots, Proceedings of IEEE International Conference on Multimedia Computing and Systems (ICMCS), pp. 237–240, 1998

    Google Scholar 

  69. Rui Y., Huang T.S., Mehrotra S.: Constructing table-of-content for videos, Multimedia Systems, Special Section on Video Libraries, 7(5), pp. 359–368, 1999

    Google Scholar 

  70. Sahouria E., Zakhor A.: Content analysis of video using principal components, IEEE Transactions on Circuits and Systems for Video Technology, 9(8), pp. 1290–1298, 1999

    Article  Google Scholar 

  71. Saraceno C, Leonardi R.: Audio as a support to scene change detection and characterization of video sequences, proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1997

    Google Scholar 

  72. Scheirer E., Slaney M.: Construction and evaluation of a robust multifeature speech/music discriminator, Proceedings of (ICASSP), 1997

    Google Scholar 

  73. Shan M.-K., Lee S.-Y.: Content-based video retrieval based on similarity of frame sequence, Proceedings of the International Workshop on Multi-Media Database Management Systems, pp 90–97, August 1998

    Google Scholar 

  74. Skorochod’ko E.: Adaptive method of automatic abstracting and indexing, In C. Freiman (Eds.): Information Processing 71: Proceedings of the IFIP Congress 71, pp. 1179–1182, North-Holland Publishing Company, 1972

    Google Scholar 

  75. Smeulders A.W.M., Worring M., Santini S., Gupta A., Jain R., Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12): 1349–1380, 2000

    Article  Google Scholar 

  76. Srinivasan S., Petkovic D., Ponceleon D.: Towards robust features for classifying audio in the CueVideo system, Proceedings of the seventh ACM international conference on Multimedia, October 1999

    Google Scholar 

  77. Sundaram H., Chang S.-F.: Determining computable scenes in films and their structures using audio-visual memory models, Proceedings of the 8th ACM Multimedia Conference, 2000

    Google Scholar 

  78. Takao S., Ogata J., Ariki Y.: Topic segmentation of news speech using word similarity, Proceedings of the ACM Multimedia Conference, 2000

    Google Scholar 

  79. Tannen D.: Talking voices: repetition, dialogue and imagery in conversational discourse, Studies in International Sociolinguistics 6, Cambridge University Press, 1989

    Google Scholar 

  80. Truong B.T., Venkatesh S., Dorai C.: Neighborhood coherence and edge-based approach for scene extraction in films, proceedings of IEEE International Conference on Pattern Recognition (ICPR), 2002

    Google Scholar 

  81. Utiyama M., Isihara H.: A statistical model for domain-independent text segmentation, ACL’ 01, pp. 491498, 2001

    Google Scholar 

  82. Vendrig J., Worring M. Smeulders A.W.M.: Model based interactive story unit segmentation, IEEE International Conference on Multimedia and Expo (ICME), pages 1084–1087, August 22–25, 2001

    Google Scholar 

  83. Vendrig, J., Worring, M.: Systematic evaluation of logical story unit segmentation IEEE Transactions on Multimedia, Vol. 4, No. 4, pp. 492–499, Dec. 2002

    Article  Google Scholar 

  84. Vendrig, J.; Worring, M.: Interactive adaptive movie annotation, IEEE Multimedia, Vol. 10, No. 3, pp. 30–37, July–Sept. 2003

    Article  Google Scholar 

  85. Veneau E., Ronfard R., Bouthemy P.: From video shot clustering to sequence segmentation, Proceedings of International Conference on Pattern Recognition (ICPR), Vol. 4, pp. 254–257, 2000

    Google Scholar 

  86. Walker M.: Redundancy in collaborative dialogue, In J. Hirschberg, D. Litman, K. McCoy, C. Sidner (Eds.): AAAI Fall Symposium on Discourse Structure in Natural Language Understanding and Generation, Pacific Grove, CA, 1991

    Google Scholar 

  87. Wang J., Chua T.-S., Chen L.: Cinematic-based model for scene boundary detection. Proceedings of MMM’ 2001 (Multimedia Modeling Conference), Amsterdam, Netherlands, Nov 2001. pp 3–18

    Google Scholar 

  88. Yamron J., Carp I., Gillick L, Lowe S., van Mulbregt P.: A hidden Markov model approach to text segmentation and event tracking, Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1998

    Google Scholar 

  89. Yeung M., Liu B.: Efficient matching and clustering of video shots, Proceedings of the International Conference on Image Processing (ICIP), pp 23–26, 1995

    Google Scholar 

  90. Yeung M., Yeo B.-L., Wolf W., Liu B.: Video browsing using clustering and scene transitions on compressed sequences, Proceedings of Multimedia Computing and Networking 1995, Vo. SPIE 2417, pp. 399–413, 1995

    Google Scholar 

  91. Yeung M., Yeo B.-L.: Time-constrained clustering for segmentation of video into story units, Proceedings of the International Conference on Pattern Recognition (ICPR), pp. 375–380, 1996

    Google Scholar 

  92. Yeung M., Yeo B.-L., Liu B.: Extracting story units from long programs for video browsing and navigation, Proceedings of the IEEE international Conference on Multimedia Computing and Systems, pp. 296–305, 1996

    Google Scholar 

  93. Yeung M., Yeo B.-L.: Video visualization for compact presentation and fast browsing of pictorial content, IEEE Transactions on Circuits and Systems for Video Technology, Vol.7, No.5, pp. 771–785, 1997

    Article  Google Scholar 

  94. Yeung M., Yeo B.-L., Liu B.: Segmentation of video by clustering and graph analysis, Computer Vision and Image Understanding, 71(1), pp.94–109, 1998

    Article  Google Scholar 

  95. Zhang HJ., Tan S.Y., Smoliar S.W., Yihong G.: Automatic parsing and indexing of news video, Multimedia Systems, 2(6), pp. 256–266, 1995

    Article  Google Scholar 

Download references

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer Science + Business Media, Inc.

About this chapter

Cite this chapter

(2004). Parsing a Video into Semantic Segments. In: Content-Based Analysis of Digital Video. Springer, Boston, MA. https://doi.org/10.1007/1-4020-8115-4_3

Download citation

  • DOI: https://doi.org/10.1007/1-4020-8115-4_3

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4020-8114-9

  • Online ISBN: 978-1-4020-8115-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics