Parsing a Video into Semantic Segments

Keywords

Video Clip Time Stamp Latent Semantic Analysis Visual Content Shot Boundary 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References and Further Reading

  1. [Ada03]
    Adams B.: Where does computational media aesthetics fit?, IEEE Multimedia, pp. 18–27, April-June 2003Google Scholar
  2. [Aig95]
    Aigrain P., Joly P., Leplain P., Longueville V.: Medium knowledge-based macro-segmentation into sequences, Working notes of IJCAI Workshop on Intelligent Multimedia Information Retrieval, pp. 5–14, 1995Google Scholar
  3. [All02]
    Allan J.: Topic detection and tracking: Event-based information organization, Kluwer Academic Publishers, February 2002Google Scholar
  4. [Ane02]
    Aner A.: Video summaries and cross-referencing, Ph.D. thesis, Columbia University, New York, 2002Google Scholar
  5. [Ari76]
    Arijon D.: Grammar of the film language, Silman-James Press, 1976Google Scholar
  6. [Ari96]
    Ariki Y., Saito Y.: Extraction of TV news articles based on scene cut detection using DCT Clustering, ICIP’ 96, Vol. 3, pp. 847–850, Lausanne CH, 1996Google Scholar
  7. [Bea94]
    Beaver F.: Dictionary of film terms, Twayne Publishing, New York, 1994Google Scholar
  8. [Bee99]
    Beeferman D., Berger A., Lafferty J.: Statistical models for text segmentation, Machine Learning, 34/1, pp. 177–210, 1999Google Scholar
  9. [Bim99]
    Del Bimbo A.: Visual information retrieval, Morgan Kaufmann Publishers, Inc., 1999Google Scholar
  10. [Boc99]
    Boccignone G., De Santo M., Percannella G.: Joint audio-video processing of MPEG encoded sequences, proceedings of the IEEE International Conference on Multimedia Computing and Systems (ICMCS), 1999Google Scholar
  11. [Boc00]
    Boccignone G., De Santo M., Percannella G.: A system for parsing MPEG videos, IS&T/SPIE Internet Imaging, 2000Google Scholar
  12. [Bog00]
    Boggs J.M., Petrie D.W.: The art of watching films, 5th ed., Mountain View, CA: Mayfield 2000Google Scholar
  13. [Bor97]
    Bordwell D., Thompson K.: Film Art: An Introduction, McGraw-Hill, New York, 1997Google Scholar
  14. [Cha95]
    Chang S.-F., Smith J.R.: Extracting multidimensional signal features for content-based visual query, SPIE Symposium on Visual Communications and Signal Processing, pp. 995–1006, 1995Google Scholar
  15. [Chi00]
    Chiu P., Girgensohn A., Polak W., Rieffel E., Wilcox L.: A genetic algorithm for video segmentation and summarization, Proceedings of IEEE International Conference on Multiemdia and EXPO (ICME), Vol. 3, pp. 1329–1332, 2000Google Scholar
  16. [Cho00]
    Choi F.: Advances in domain independent linear text segmentation, NAACL’ 00, pp.26–33, 2000Google Scholar
  17. [Cho01]
    Choi F., Wiemer-Hastings P., Moore J: Latent semantic analysis for text segmentation, proceedings of the NAACL’ 01, pp. 109–117, 2001Google Scholar
  18. [Cor98]
    Corridoni J.M., Del Bimbo A.: Stuctured representation andautomatic indexing of movie information content, Pattern Recognition, 31(12), pp. 2027–2045, 1998CrossRefGoogle Scholar
  19. [Dav79]
    Davies D.L., Bouldin D.W.: A cluster separation measure, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. PAMI-1, pp. 224–227, April 1979Google Scholar
  20. [Daw74]
    Dawson J.L.: Suffix removal and word connation, ALLC Bulletin, 2(3), pp. 33–46, 1974Google Scholar
  21. [Fer98]
    Ferret O.: How to thematically segment texts by using lexical cohesion?, ACL-COLING’ 98, pp. 1481–1483,1998Google Scholar
  22. [Fer02]
    Ferret O.: Using collocations for topic segmentation and link detection, COLING’ 02, 2002Google Scholar
  23. [Fis99]
    Fiscus J., Doddington G., Garofolo J., Martin A.: MSTS’s 1998 topic detection and tracking evaluation, DARPA Broadcast News Workshop, 1999Google Scholar
  24. [Fol98]
    Foltz P.W., Kintsch W., Landauer T.K.: The measurement of textual coherence with Latent Semantic Analysis, Discourse Processes, 25, 2&3, pp. 285–307, 1998Google Scholar
  25. [Fur95]
    Furht B., Smoliar S.W., Zhang H.: Video and Image Processing in Multimedia Systems, Kluwer Academic Publishers, 1995Google Scholar
  26. [Gir01]
    Girgensohn A., Boreczky J., Wilcox L.: Keyframe-based user interfaces for digital video, IEEE Computer, September 2001Google Scholar
  27. [Gon93]
    Gonzales R.C., Woods R.E.: Digital image processing, Addison Wesley, 1993Google Scholar
  28. [Gun97]
    Gunsel B., Fu Y., Tekalp A.M.: Hierarchical temporal video segmentation and content characterization, in Multimedia Storage and Archiving Systems II, proceedings of SPIE, Vol. 3229, pp. 46–56, 1997Google Scholar
  29. [Hal76]
    Halliday M.A.K., Hasan R.: Cohesion in English, Lohman, London 1976Google Scholar
  30. [Han98]
    Hanjalic A., Lagendijk R.L., Biemond J.: Template-based detection of anchorperson shots in news programs, Proceedings of IEEE International Conference on Image Processing (ICIP), 1998Google Scholar
  31. [Han99a]
    Hanjalic A., Lagendijk R.L., Biemond J.: Automated high-level movie segmentation for advanced video-retrieval systems, IEEE Transactions on Circuits and Systems for Video Technology, Vol.9, No.4, pp. 580–588, June 1999CrossRefGoogle Scholar
  32. [Han99b]
    Hanjalic A., Zhang H.-J.: An integrated schemefor automated video abstraction based on unsupervised cluster-validity analysis, IEEE Transactions on Circuits and Systems for Video Technology, Vol.9, No.8, December 1999Google Scholar
  33. [Han00]
    Hanjalic A., Langelaar G.C., van Roosmalen P.M.B., Biemond J., Lagendijk R.L.: Image and video databases: restoration, watermarking and retrieval, Elsevier Science, Amsterdam 2000Google Scholar
  34. [Han01]
    Hanjalic A., Kakes G., Lagendijk R.L., Biemond J.: Indexing and retrieval of TV broadcast news using DANCERS, Journal of Electronic Imaging, 10(4), pp. 871–882, October 2001CrossRefGoogle Scholar
  35. [Har01]
    Hartley R., Zisserman A.: Multiple view geometry in computer vision, Cambridge University Press, 2001Google Scholar
  36. [Hea97]
    Hearst M.: TextTiling: Segmenting text into multi-paragraph subtopic passages, Computational Linguistics, 23/1, pp. 33–64, 1997Google Scholar
  37. [Ira96]
    Irani M., Anandan P., Bergenand J., Kumar R., Hsu S.: Efficient representation of video sequences and their applications, Signal Processing: Image Communication, Volume 8, 1996Google Scholar
  38. [Jai88]
    Jain A.K., Dubes R.C.: Algorithms for clustering data, Engelwood Cliffs, NJ, Prentice Hall, 1988Google Scholar
  39. [Jia00]
    Jiang H., Lin T., Zhang H.-J.: Video Segmentation with the assistance of audio content analysis, IEEE International Conference on Multimedia and Expo (ICME2000), 2000Google Scholar
  40. [Kau99]
    Kaufmann S.: Cohesion and collocation: Using context vectors in text segmentation, ACL’ 99, pp. 591–595, 1999Google Scholar
  41. [Ken98]
    Kender J.R., Yeo B.-L.: Video scene segmentation via continuous video coherence, Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 1998Google Scholar
  42. [Koz93]
    Kozima H.: Text segmentation based on similarity between words, Proceedings of the 31st Annual Meeting of the Association for Compuational Linguistics, 1993Google Scholar
  43. [Kro93]
    Krovetz R.: Viewing morphology as an inference process, Proceedings of the 16th ACM SIGIR Conference, pp. 191–202, 1993Google Scholar
  44. [Kwo00]
    Kwon Y.-M., Song C.-I, Kim I.-J.: A new approach for high level video structuring, Proceedings of the IEEE International Conference on Multimedia and EXPO (ICME), Vol. 2, pp/ 773–776, 2000Google Scholar
  45. [Lan98]
    Landauer, T. K., Foltz, P. W., Laham, D.: Introduction to Latent Semantic Analysis. Discourse Processes, Vol. 25, pp. 259–284,1998Google Scholar
  46. [Lee00]
    Lee S.-Y., Lee S.-T,, Chen D.-Y.: Automatic video summary and description, Lecture Notes in Computer Science, Vol. 1929, pp. 37–48, Springer Verlag, Berlin 2000Google Scholar
  47. [Li03]
    Li D., Dimitrova N., Li M., Sethi I.K.: Multimedia content processing through cross-modal association, Proceedings of ACM Multimedia’ 03, Berkeley 2003Google Scholar
  48. [Lie99]
    Lienhart R., Pfeiffer S., Effelsberg W.: Scene determination based on video and audio features, Proceedings of the IEEE International Conference on Multimedia Computing and Systems (ICMCS), 1999Google Scholar
  49. [Lin00]
    Lin T., Zhang H.J.: Automatic video scene extraction by shot grouping, Proceedings of International Conference on Pattern Recognition (ICPR), 2000Google Scholar
  50. [Lov68]
    Lovins J.B.: Development of a stemming algorithm, Mechanical Translation and Computational Linguistics, 11, pp. 22–31, 1968Google Scholar
  51. [Lu02a]
    Lu X., Y.-F. Ma, H. Zhang, L. Wu, An integrated correlation measure for semantic video segmentation, Proceedings of IEEE International Conference on Multimedia and Expo, ICME, Lausanne, Switzerland, August, 2002.Google Scholar
  52. [Lu02b]
    Lu L., Zhang H.-J., Jiang H.: Content analysis for audio classification and segmentation, IEEE Transactions on Speech and Audio Processing, Vol. 10, No.7, October 2002Google Scholar
  53. [Moh96]
    Mohan R.: Text-Based Search of TV News Stories, Proceedings of SPIE Conference on Multimedia Storage and Archiving Systems SPIE, Boston, MA, November 1996 pp. 2–13.Google Scholar
  54. [Mor91]
    Morris J., Hirst G.: Lexical cohesion computed by thesaural relations as an indicator of the structure of text, Computational Linguistics, 17(1), pp. 21–48, 1991Google Scholar
  55. [OCo01]
    O’ Connor N., Czirjek C. and al.: News Story Segmentation in The Físchlár Video Indexing System, Proceedings of the IEEE International Conference on Image Processing (ICIP), pp. 7–10, 2001Google Scholar
  56. [Pai90]
    Paice C.P.: Another stemmer, Department of Computing, Lancaster University, UK, 1990Google Scholar
  57. [Pas97]
    Passoneau R.J., Litman D.J.: Discourse segmentation by human and automated means, Computational Linguistics, 23(1): 103–139, 1997Google Scholar
  58. [Pat96]
    Patel N.V., Sethi I.K.: Audio characterization for video indexing, IS&T/SPIE Electronic Imaging: Storage and Retrieval for Image and Video Databases IV, Vol. 2670, pp. 373–384, 1996Google Scholar
  59. [Pat92]
    Patterson R.D., Robinson K., Holdsworth J., McKeown D., Zhang C, Allerhand M.H.: Complex sounds and auditory images, in Auditory Psychology and Perception, (Eds.) Y. Cazals, L. Demany, K. Horner, Pergamon, Oxford, 1992.Google Scholar
  60. [Pfe99]
    Pfeiffer S.: The importance of perceptive adaptation of sound features for audio content processing, IS&T/SPIE Electronic Imaging: Storage and Retrieval for Image and Video Databases VII, 1999Google Scholar
  61. [Pon97]
    Ponte J., Croft W.B.: Text segmentation by topic, In proceedings of the First European Conference on Research and Advanced Technology for Digital Libraries, pp. 120–129, 1997Google Scholar
  62. [Por80]
    Porter M.F.: An algorithm for suffix stripping, Program, 14, No.3, pp. 130–137, July 1980Google Scholar
  63. [Rab93]
    Rabiner L.R., Huang B.H.: Fundamentals of speech recognition. Prentice Hall, 1993Google Scholar
  64. [Rab78]
    Rabiner L.R., Shafer R.W.: Digital processing of speech signals, Prentice Hall, 1978Google Scholar
  65. [Ras87]
    Raskin V., Weiser I.: Language and writing: applications of linguistics to thetoric and composition, ABLEX Publishing Corporation, Norwood, NJ, 1987Google Scholar
  66. [Rey94]
    Reynar J.C.: An automatic method of finding topic boundaries, Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, 1994Google Scholar
  67. [Rob97]
    Robertson S.E., Sparck Jones K.: Simple proven approaches to text retrieval, Technical Report TR356, Cambridge University, Computer laboratory, 1997Google Scholar
  68. [Rui98]
    Rui Y., Huang T.S., Mehrotra S.: Exploring video structure beyond the shots, Proceedings of IEEE International Conference on Multimedia Computing and Systems (ICMCS), pp. 237–240, 1998Google Scholar
  69. [Rui99]
    Rui Y., Huang T.S., Mehrotra S.: Constructing table-of-content for videos, Multimedia Systems, Special Section on Video Libraries, 7(5), pp. 359–368, 1999Google Scholar
  70. [Sah99]
    Sahouria E., Zakhor A.: Content analysis of video using principal components, IEEE Transactions on Circuits and Systems for Video Technology, 9(8), pp. 1290–1298, 1999CrossRefGoogle Scholar
  71. [Sar97]
    Saraceno C, Leonardi R.: Audio as a support to scene change detection and characterization of video sequences, proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1997Google Scholar
  72. [Sch97]
    Scheirer E., Slaney M.: Construction and evaluation of a robust multifeature speech/music discriminator, Proceedings of (ICASSP), 1997Google Scholar
  73. [Sha98]
    Shan M.-K., Lee S.-Y.: Content-based video retrieval based on similarity of frame sequence, Proceedings of the International Workshop on Multi-Media Database Management Systems, pp 90–97, August 1998Google Scholar
  74. [Sko72]
    Skorochod’ko E.: Adaptive method of automatic abstracting and indexing, In C. Freiman (Eds.): Information Processing 71: Proceedings of the IFIP Congress 71, pp. 1179–1182, North-Holland Publishing Company, 1972Google Scholar
  75. [Sme00]
    Smeulders A.W.M., Worring M., Santini S., Gupta A., Jain R., Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12): 1349–1380, 2000CrossRefGoogle Scholar
  76. [Sri99]
    Srinivasan S., Petkovic D., Ponceleon D.: Towards robust features for classifying audio in the CueVideo system, Proceedings of the seventh ACM international conference on Multimedia, October 1999Google Scholar
  77. [Sun00]
    Sundaram H., Chang S.-F.: Determining computable scenes in films and their structures using audio-visual memory models, Proceedings of the 8th ACM Multimedia Conference, 2000Google Scholar
  78. [Tak00]
    Takao S., Ogata J., Ariki Y.: Topic segmentation of news speech using word similarity, Proceedings of the ACM Multimedia Conference, 2000Google Scholar
  79. [Tan89]
    Tannen D.: Talking voices: repetition, dialogue and imagery in conversational discourse, Studies in International Sociolinguistics 6, Cambridge University Press, 1989Google Scholar
  80. [Tru02]
    Truong B.T., Venkatesh S., Dorai C.: Neighborhood coherence and edge-based approach for scene extraction in films, proceedings of IEEE International Conference on Pattern Recognition (ICPR), 2002Google Scholar
  81. [Uti01]
    Utiyama M., Isihara H.: A statistical model for domain-independent text segmentation, ACL’ 01, pp. 491498, 2001Google Scholar
  82. [Ven01]
    Vendrig J., Worring M. Smeulders A.W.M.: Model based interactive story unit segmentation, IEEE International Conference on Multimedia and Expo (ICME), pages 1084–1087, August 22–25, 2001Google Scholar
  83. [Ven02]
    Vendrig, J., Worring, M.: Systematic evaluation of logical story unit segmentation IEEE Transactions on Multimedia, Vol. 4, No. 4, pp. 492–499, Dec. 2002CrossRefGoogle Scholar
  84. [Ven03]
    Vendrig, J.; Worring, M.: Interactive adaptive movie annotation, IEEE Multimedia, Vol. 10, No. 3, pp. 30–37, July–Sept. 2003CrossRefGoogle Scholar
  85. [Ven00]
    Veneau E., Ronfard R., Bouthemy P.: From video shot clustering to sequence segmentation, Proceedings of International Conference on Pattern Recognition (ICPR), Vol. 4, pp. 254–257, 2000Google Scholar
  86. [Wal91]
    Walker M.: Redundancy in collaborative dialogue, In J. Hirschberg, D. Litman, K. McCoy, C. Sidner (Eds.): AAAI Fall Symposium on Discourse Structure in Natural Language Understanding and Generation, Pacific Grove, CA, 1991Google Scholar
  87. [Wan01]
    Wang J., Chua T.-S., Chen L.: Cinematic-based model for scene boundary detection. Proceedings of MMM’ 2001 (Multimedia Modeling Conference), Amsterdam, Netherlands, Nov 2001. pp 3–18Google Scholar
  88. [Yam98]
    Yamron J., Carp I., Gillick L, Lowe S., van Mulbregt P.: A hidden Markov model approach to text segmentation and event tracking, Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1998Google Scholar
  89. [Yeu95a]
    Yeung M., Liu B.: Efficient matching and clustering of video shots, Proceedings of the International Conference on Image Processing (ICIP), pp 23–26, 1995Google Scholar
  90. [Yeu95b]
    Yeung M., Yeo B.-L., Wolf W., Liu B.: Video browsing using clustering and scene transitions on compressed sequences, Proceedings of Multimedia Computing and Networking 1995, Vo. SPIE 2417, pp. 399–413, 1995Google Scholar
  91. [Yeu96a]
    Yeung M., Yeo B.-L.: Time-constrained clustering for segmentation of video into story units, Proceedings of the International Conference on Pattern Recognition (ICPR), pp. 375–380, 1996Google Scholar
  92. [Yeu96b]
    Yeung M., Yeo B.-L., Liu B.: Extracting story units from long programs for video browsing and navigation, Proceedings of the IEEE international Conference on Multimedia Computing and Systems, pp. 296–305, 1996Google Scholar
  93. [Yeu97]
    Yeung M., Yeo B.-L.: Video visualization for compact presentation and fast browsing of pictorial content, IEEE Transactions on Circuits and Systems for Video Technology, Vol.7, No.5, pp. 771–785, 1997CrossRefGoogle Scholar
  94. [Yeu98]
    Yeung M., Yeo B.-L., Liu B.: Segmentation of video by clustering and graph analysis, Computer Vision and Image Understanding, 71(1), pp.94–109, 1998CrossRefGoogle Scholar
  95. [Zha95]
    Zhang HJ., Tan S.Y., Smoliar S.W., Yihong G.: Automatic parsing and indexing of news video, Multimedia Systems, 2(6), pp. 256–266, 1995CrossRefGoogle Scholar

Copyright information

© Springer Science + Business Media, Inc. 2004

Personalised recommendations