Parsing a Video into Semantic Segments

doi:10.1007/1-4020-8115-4_3

References and Further Reading

Adams B.: Where does computational media aesthetics fit?, IEEE Multimedia, pp. 18–27, April-June 2003
Aigrain P., Joly P., Leplain P., Longueville V.: Medium knowledge-based macro-segmentation into sequences, Working notes of IJCAI Workshop on Intelligent Multimedia Information Retrieval, pp. 5–14, 1995
Allan J.: Topic detection and tracking: Event-based information organization, Kluwer Academic Publishers, February 2002
Aner A.: Video summaries and cross-referencing, Ph.D. thesis, Columbia University, New York, 2002
Arijon D.: Grammar of the film language, Silman-James Press, 1976
Ariki Y., Saito Y.: Extraction of TV news articles based on scene cut detection using DCT Clustering, ICIP’96, Vol. 3, pp. 847–850, Lausanne CH, 1996
Beaver F.: Dictionary of film terms, Twayne Publishing, New York, 1994
Beeferman D., Berger A., Lafferty J.: Statistical models for text segmentation, Machine Learning, 34/1, pp. 177–210, 1999
Del Bimbo A.: Visual information retrieval, Morgan Kaufmann Publishers, Inc., 1999
Boccignone G., De Santo M., Percannella G.: Joint audio-video processing of MPEG encoded sequences, proceedings of the IEEE International Conference on Multimedia Computing and Systems (ICMCS), 1999
Boccignone G., De Santo M., Percannella G.: A system for parsing MPEG videos, IS&T/SPIE Internet Imaging, 2000
Boggs J.M., Petrie D.W.: The art of watching films, 5th ed., Mountain View, CA: Mayfield, 2000
Bordwell D., Thompson K.: Film Art: An Introduction, McGraw-Hill, New York, 1997
Chang S.-F., Smith J.R.: Extracting multidimensional signal features for content-based visual query, SPIE Symposium on Visual Communications and Signal Processing, pp. 995–1006, 1995
Chiu P., Girgensohn A., Polak W., Rieffel E., Wilcox L.: A genetic algorithm for video segmentation and summarization, Proceedings of IEEE International Conference on Multimedia and Expo (ICME), Vol. 3, pp. 1329–1332, 2000
Choi F.: Advances in domain independent linear text segmentation, NAACL’00, pp. 26–33, 2000
Choi F., Wiemer-Hastings P., Moore J.: Latent semantic analysis for text segmentation, proceedings of the NAACL’01, pp. 109–117, 2001
Corridoni J.M., Del Bimbo A.: Structured representation and automatic indexing of movie information content, Pattern Recognition, 31(12), pp. 2027–2045, 1998
Davies D.L., Bouldin D.W.: A cluster separation measure, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. PAMI-1, pp. 224–227, April 1979
Dawson J.L.: Suffix removal and word conflation, ALLC Bulletin, 2(3), pp. 33–46, 1974
Ferret O.: How to thematically segment texts by using lexical cohesion?, ACL-COLING’98, pp. 1481–1483, 1998
Ferret O.: Using collocations for topic segmentation and link detection, COLING’02, 2002
Fiscus J., Doddington G., Garofolo J., Martin A.: NIST’s 1998 topic detection and tracking evaluation, DARPA Broadcast News Workshop, 1999
Foltz P.W., Kintsch W., Landauer T.K.: The measurement of textual coherence with Latent Semantic Analysis, Discourse Processes, 25, 2&3, pp. 285–307, 1998
Furht B., Smoliar S.W., Zhang H.: Video and Image Processing in Multimedia Systems, Kluwer Academic Publishers, 1995
Girgensohn A., Boreczky J., Wilcox L.: Keyframe-based user interfaces for digital video, IEEE Computer, September 2001
Gonzalez R.C., Woods R.E.: Digital image processing, Addison Wesley, 1993
Gunsel B., Fu Y., Tekalp A.M.: Hierarchical temporal video segmentation and content characterization, in Multimedia Storage and Archiving Systems II, proceedings of SPIE, Vol. 3229, pp. 46–56, 1997
Halliday M.A.K., Hasan R.: Cohesion in English, Longman, London, 1976
Hanjalic A., Lagendijk R.L., Biemond J.: Template-based detection of anchorperson shots in news programs, Proceedings of IEEE International Conference on Image Processing (ICIP), 1998
Hanjalic A., Lagendijk R.L., Biemond J.: Automated high-level movie segmentation for advanced video-retrieval systems, IEEE Transactions on Circuits and Systems for Video Technology, Vol.9, No.4, pp. 580–588, June 1999
Hanjalic A., Zhang H.-J.: An integrated scheme for automated video abstraction based on unsupervised cluster-validity analysis, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 9, No. 8, December 1999
Hanjalic A., Langelaar G.C., van Roosmalen P.M.B., Biemond J., Lagendijk R.L.: Image and video databases: restoration, watermarking and retrieval, Elsevier Science, Amsterdam 2000
Hanjalic A., Kakes G., Lagendijk R.L., Biemond J.: Indexing and retrieval of TV broadcast news using DANCERS, Journal of Electronic Imaging, 10(4), pp. 871–882, October 2001
Hartley R., Zisserman A.: Multiple view geometry in computer vision, Cambridge University Press, 2001
Hearst M.: TextTiling: Segmenting text into multi-paragraph subtopic passages, Computational Linguistics, 23/1, pp. 33–64, 1997
Irani M., Anandan P., Bergen J., Kumar R., Hsu S.: Efficient representation of video sequences and their applications, Signal Processing: Image Communication, Volume 8, 1996
Jain A.K., Dubes R.C.: Algorithms for clustering data, Englewood Cliffs, NJ, Prentice Hall, 1988
Jiang H., Lin T., Zhang H.-J.: Video Segmentation with the assistance of audio content analysis, IEEE International Conference on Multimedia and Expo (ICME2000), 2000
Kaufmann S.: Cohesion and collocation: Using context vectors in text segmentation, ACL’99, pp. 591–595, 1999
Kender J.R., Yeo B.-L.: Video scene segmentation via continuous video coherence, Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 1998
Kozima H.: Text segmentation based on similarity between words, Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics, 1993
Krovetz R.: Viewing morphology as an inference process, Proceedings of the 16th ACM SIGIR Conference, pp. 191–202, 1993
Kwon Y.-M., Song C.-I., Kim I.-J.: A new approach for high level video structuring, Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), Vol. 2, pp. 773–776, 2000
Landauer T.K., Foltz P.W., Laham D.: Introduction to Latent Semantic Analysis, Discourse Processes, Vol. 25, pp. 259–284, 1998
Lee S.-Y., Lee S.-T., Chen D.-Y.: Automatic video summary and description, Lecture Notes in Computer Science, Vol. 1929, pp. 37–48, Springer Verlag, Berlin 2000
Li D., Dimitrova N., Li M., Sethi I.K.: Multimedia content processing through cross-modal association, Proceedings of ACM Multimedia’03, Berkeley 2003
Lienhart R., Pfeiffer S., Effelsberg W.: Scene determination based on video and audio features, Proceedings of the IEEE International Conference on Multimedia Computing and Systems (ICMCS), 1999
Lin T., Zhang H.J.: Automatic video scene extraction by shot grouping, Proceedings of International Conference on Pattern Recognition (ICPR), 2000
Lovins J.B.: Development of a stemming algorithm, Mechanical Translation and Computational Linguistics, 11, pp. 22–31, 1968
Lu X., Ma Y.-F., Zhang H., Wu L.: An integrated correlation measure for semantic video segmentation, Proceedings of IEEE International Conference on Multimedia and Expo (ICME), Lausanne, Switzerland, August 2002
Lu L., Zhang H.-J., Jiang H.: Content analysis for audio classification and segmentation, IEEE Transactions on Speech and Audio Processing, Vol. 10, No.7, October 2002
Mohan R.: Text-based search of TV news stories, Proceedings of SPIE Conference on Multimedia Storage and Archiving Systems, Boston, MA, pp. 2–13, November 1996
Morris J., Hirst G.: Lexical cohesion computed by thesaural relations as an indicator of the structure of text, Computational Linguistics, 17(1), pp. 21–48, 1991
O’Connor N., Czirjek C. et al.: News story segmentation in the Físchlár video indexing system, Proceedings of the IEEE International Conference on Image Processing (ICIP), pp. 7–10, 2001
Paice C.P.: Another stemmer, Department of Computing, Lancaster University, UK, 1990
Passonneau R.J., Litman D.J.: Discourse segmentation by human and automated means, Computational Linguistics, 23(1), pp. 103–139, 1997
Patel N.V., Sethi I.K.: Audio characterization for video indexing, IS&T/SPIE Electronic Imaging: Storage and Retrieval for Image and Video Databases IV, Vol. 2670, pp. 373–384, 1996
Patterson R.D., Robinson K., Holdsworth J., McKeown D., Zhang C., Allerhand M.H.: Complex sounds and auditory images, in Auditory Physiology and Perception, (Eds.) Y. Cazals, L. Demany, K. Horner, Pergamon, Oxford, 1992
Pfeiffer S.: The importance of perceptive adaptation of sound features for audio content processing, IS&T/SPIE Electronic Imaging: Storage and Retrieval for Image and Video Databases VII, 1999
Ponte J., Croft W.B.: Text segmentation by topic, In proceedings of the First European Conference on Research and Advanced Technology for Digital Libraries, pp. 120–129, 1997
Porter M.F.: An algorithm for suffix stripping, Program, 14, No.3, pp. 130–137, July 1980
Rabiner L.R., Juang B.H.: Fundamentals of speech recognition, Prentice Hall, 1993
Rabiner L.R., Schafer R.W.: Digital processing of speech signals, Prentice Hall, 1978
Raskin V., Weiser I.: Language and writing: applications of linguistics to rhetoric and composition, ABLEX Publishing Corporation, Norwood, NJ, 1987
Reynar J.C.: An automatic method of finding topic boundaries, Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, 1994
Robertson S.E., Sparck Jones K.: Simple proven approaches to text retrieval, Technical Report TR356, Cambridge University, Computer laboratory, 1997
Rui Y., Huang T.S., Mehrotra S.: Exploring video structure beyond the shots, Proceedings of IEEE International Conference on Multimedia Computing and Systems (ICMCS), pp. 237–240, 1998
Rui Y., Huang T.S., Mehrotra S.: Constructing table-of-content for videos, Multimedia Systems, Special Section on Video Libraries, 7(5), pp. 359–368, 1999
Sahouria E., Zakhor A.: Content analysis of video using principal components, IEEE Transactions on Circuits and Systems for Video Technology, 9(8), pp. 1290–1298, 1999
Saraceno C., Leonardi R.: Audio as a support to scene change detection and characterization of video sequences, proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1997
Scheirer E., Slaney M.: Construction and evaluation of a robust multifeature speech/music discriminator, Proceedings of ICASSP, 1997
Shan M.-K., Lee S.-Y.: Content-based video retrieval based on similarity of frame sequence, Proceedings of the International Workshop on Multi-Media Database Management Systems, pp. 90–97, August 1998
Skorochod’ko E.: Adaptive method of automatic abstracting and indexing, In C. Freiman (Ed.): Information Processing 71: Proceedings of the IFIP Congress 71, pp. 1179–1182, North-Holland Publishing Company, 1972
Smeulders A.W.M., Worring M., Santini S., Gupta A., Jain R.: Content-based image retrieval at the end of the early years, IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12), pp. 1349–1380, 2000
Srinivasan S., Petkovic D., Ponceleon D.: Towards robust features for classifying audio in the CueVideo system, Proceedings of the seventh ACM international conference on Multimedia, October 1999
Sundaram H., Chang S.-F.: Determining computable scenes in films and their structures using audio-visual memory models, Proceedings of the 8th ACM Multimedia Conference, 2000
Takao S., Ogata J., Ariki Y.: Topic segmentation of news speech using word similarity, Proceedings of the ACM Multimedia Conference, 2000
Tannen D.: Talking voices: repetition, dialogue and imagery in conversational discourse, Studies in Interactional Sociolinguistics 6, Cambridge University Press, 1989
Truong B.T., Venkatesh S., Dorai C.: Neighborhood coherence and edge-based approach for scene extraction in films, proceedings of IEEE International Conference on Pattern Recognition (ICPR), 2002
Utiyama M., Isahara H.: A statistical model for domain-independent text segmentation, ACL’01, pp. 491–498, 2001
Vendrig J., Worring M., Smeulders A.W.M.: Model based interactive story unit segmentation, IEEE International Conference on Multimedia and Expo (ICME), pp. 1084–1087, August 22–25, 2001
Vendrig J., Worring M.: Systematic evaluation of logical story unit segmentation, IEEE Transactions on Multimedia, Vol. 4, No. 4, pp. 492–499, Dec. 2002
Vendrig J., Worring M.: Interactive adaptive movie annotation, IEEE Multimedia, Vol. 10, No. 3, pp. 30–37, July–Sept. 2003
Veneau E., Ronfard R., Bouthemy P.: From video shot clustering to sequence segmentation, Proceedings of International Conference on Pattern Recognition (ICPR), Vol. 4, pp. 254–257, 2000
Walker M.: Redundancy in collaborative dialogue, In J. Hirschberg, D. Litman, K. McCoy, C. Sidner (Eds.): AAAI Fall Symposium on Discourse Structure in Natural Language Understanding and Generation, Pacific Grove, CA, 1991
Wang J., Chua T.-S., Chen L.: Cinematic-based model for scene boundary detection, Proceedings of MMM’2001 (Multimedia Modeling Conference), Amsterdam, Netherlands, pp. 3–18, Nov 2001
Yamron J., Carp I., Gillick L., Lowe S., van Mulbregt P.: A hidden Markov model approach to text segmentation and event tracking, Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1998
Yeung M., Liu B.: Efficient matching and clustering of video shots, Proceedings of the International Conference on Image Processing (ICIP), pp. 23–26, 1995
Yeung M., Yeo B.-L., Wolf W., Liu B.: Video browsing using clustering and scene transitions on compressed sequences, Proceedings of Multimedia Computing and Networking 1995, Vol. SPIE 2417, pp. 399–413, 1995
Yeung M., Yeo B.-L.: Time-constrained clustering for segmentation of video into story units, Proceedings of the International Conference on Pattern Recognition (ICPR), pp. 375–380, 1996
Yeung M., Yeo B.-L., Liu B.: Extracting story units from long programs for video browsing and navigation, Proceedings of the IEEE International Conference on Multimedia Computing and Systems, pp. 296–305, 1996
Yeung M., Yeo B.-L.: Video visualization for compact presentation and fast browsing of pictorial content, IEEE Transactions on Circuits and Systems for Video Technology, Vol.7, No.5, pp. 771–785, 1997
Yeung M., Yeo B.-L., Liu B.: Segmentation of video by clustering and graph analysis, Computer Vision and Image Understanding, 71(1), pp. 94–109, 1998
Zhang H.J., Tan S.Y., Smoliar S.W., Yihong G.: Automatic parsing and indexing of news video, Multimedia Systems, 2(6), pp. 256–266, 1995

About this chapter

(2004). Parsing a Video into Semantic Segments. In: Content-Based Analysis of Digital Video. Springer, Boston, MA. https://doi.org/10.1007/1-4020-8115-4_3

DOI: https://doi.org/10.1007/1-4020-8115-4_3
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4020-8114-9
Online ISBN: 978-1-4020-8115-6
eBook Packages: Springer Book Archive