Content-Based Analysis of Digital Video pp 57-106 | Cite as
Parsing a Video into Semantic Segments
Chapter
Keywords
Video Clip Time Stamp Latent Semantic Analysis Visual Content Shot Boundary
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Preview
Unable to display preview. Download preview PDF.
References and Further Reading
- [Ada03]Adams B.: Where does computational media aesthetics fit?, IEEE Multimedia, pp. 18–27, April-June 2003Google Scholar
- [Aig95]Aigrain P., Joly P., Leplain P., Longueville V.: Medium knowledge-based macro-segmentation into sequences, Working notes of IJCAI Workshop on Intelligent Multimedia Information Retrieval, pp. 5–14, 1995Google Scholar
- [All02]Allan J.: Topic detection and tracking: Event-based information organization, Kluwer Academic Publishers, February 2002Google Scholar
- [Ane02]Aner A.: Video summaries and cross-referencing, Ph.D. thesis, Columbia University, New York, 2002Google Scholar
- [Ari76]Arijon D.: Grammar of the film language, Silman-James Press, 1976Google Scholar
- [Ari96]Ariki Y., Saito Y.: Extraction of TV news articles based on scene cut detection using DCT Clustering, ICIP’ 96, Vol. 3, pp. 847–850, Lausanne CH, 1996Google Scholar
- [Bea94]Beaver F.: Dictionary of film terms, Twayne Publishing, New York, 1994Google Scholar
- [Bee99]Beeferman D., Berger A., Lafferty J.: Statistical models for text segmentation, Machine Learning, 34/1, pp. 177–210, 1999Google Scholar
- [Bim99]Del Bimbo A.: Visual information retrieval, Morgan Kaufmann Publishers, Inc., 1999Google Scholar
- [Boc99]Boccignone G., De Santo M., Percannella G.: Joint audio-video processing of MPEG encoded sequences, proceedings of the IEEE International Conference on Multimedia Computing and Systems (ICMCS), 1999Google Scholar
- [Boc00]Boccignone G., De Santo M., Percannella G.: A system for parsing MPEG videos, IS&T/SPIE Internet Imaging, 2000Google Scholar
- [Bog00]Boggs J.M., Petrie D.W.: The art of watching films, 5th ed., Mountain View, CA: Mayfield 2000Google Scholar
- [Bor97]Bordwell D., Thompson K.: Film Art: An Introduction, McGraw-Hill, New York, 1997Google Scholar
- [Cha95]Chang S.-F., Smith J.R.: Extracting multidimensional signal features for content-based visual query, SPIE Symposium on Visual Communications and Signal Processing, pp. 995–1006, 1995Google Scholar
- [Chi00]Chiu P., Girgensohn A., Polak W., Rieffel E., Wilcox L.: A genetic algorithm for video segmentation and summarization, Proceedings of IEEE International Conference on Multiemdia and EXPO (ICME), Vol. 3, pp. 1329–1332, 2000Google Scholar
- [Cho00]Choi F.: Advances in domain independent linear text segmentation, NAACL’ 00, pp.26–33, 2000Google Scholar
- [Cho01]Choi F., Wiemer-Hastings P., Moore J: Latent semantic analysis for text segmentation, proceedings of the NAACL’ 01, pp. 109–117, 2001Google Scholar
- [Cor98]Corridoni J.M., Del Bimbo A.: Stuctured representation andautomatic indexing of movie information content, Pattern Recognition, 31(12), pp. 2027–2045, 1998CrossRefGoogle Scholar
- [Dav79]Davies D.L., Bouldin D.W.: A cluster separation measure, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. PAMI-1, pp. 224–227, April 1979Google Scholar
- [Daw74]Dawson J.L.: Suffix removal and word connation, ALLC Bulletin, 2(3), pp. 33–46, 1974Google Scholar
- [Fer98]Ferret O.: How to thematically segment texts by using lexical cohesion?, ACL-COLING’ 98, pp. 1481–1483,1998Google Scholar
- [Fer02]Ferret O.: Using collocations for topic segmentation and link detection, COLING’ 02, 2002Google Scholar
- [Fis99]Fiscus J., Doddington G., Garofolo J., Martin A.: MSTS’s 1998 topic detection and tracking evaluation, DARPA Broadcast News Workshop, 1999Google Scholar
- [Fol98]Foltz P.W., Kintsch W., Landauer T.K.: The measurement of textual coherence with Latent Semantic Analysis, Discourse Processes, 25, 2&3, pp. 285–307, 1998Google Scholar
- [Fur95]Furht B., Smoliar S.W., Zhang H.: Video and Image Processing in Multimedia Systems, Kluwer Academic Publishers, 1995Google Scholar
- [Gir01]Girgensohn A., Boreczky J., Wilcox L.: Keyframe-based user interfaces for digital video, IEEE Computer, September 2001Google Scholar
- [Gon93]Gonzales R.C., Woods R.E.: Digital image processing, Addison Wesley, 1993Google Scholar
- [Gun97]Gunsel B., Fu Y., Tekalp A.M.: Hierarchical temporal video segmentation and content characterization, in Multimedia Storage and Archiving Systems II, proceedings of SPIE, Vol. 3229, pp. 46–56, 1997Google Scholar
- [Hal76]Halliday M.A.K., Hasan R.: Cohesion in English, Lohman, London 1976Google Scholar
- [Han98]Hanjalic A., Lagendijk R.L., Biemond J.: Template-based detection of anchorperson shots in news programs, Proceedings of IEEE International Conference on Image Processing (ICIP), 1998Google Scholar
- [Han99a]Hanjalic A., Lagendijk R.L., Biemond J.: Automated high-level movie segmentation for advanced video-retrieval systems, IEEE Transactions on Circuits and Systems for Video Technology, Vol.9, No.4, pp. 580–588, June 1999CrossRefGoogle Scholar
- [Han99b]Hanjalic A., Zhang H.-J.: An integrated schemefor automated video abstraction based on unsupervised cluster-validity analysis, IEEE Transactions on Circuits and Systems for Video Technology, Vol.9, No.8, December 1999Google Scholar
- [Han00]Hanjalic A., Langelaar G.C., van Roosmalen P.M.B., Biemond J., Lagendijk R.L.: Image and video databases: restoration, watermarking and retrieval, Elsevier Science, Amsterdam 2000Google Scholar
- [Han01]Hanjalic A., Kakes G., Lagendijk R.L., Biemond J.: Indexing and retrieval of TV broadcast news using DANCERS, Journal of Electronic Imaging, 10(4), pp. 871–882, October 2001CrossRefGoogle Scholar
- [Har01]Hartley R., Zisserman A.: Multiple view geometry in computer vision, Cambridge University Press, 2001Google Scholar
- [Hea97]Hearst M.: TextTiling: Segmenting text into multi-paragraph subtopic passages, Computational Linguistics, 23/1, pp. 33–64, 1997Google Scholar
- [Ira96]Irani M., Anandan P., Bergenand J., Kumar R., Hsu S.: Efficient representation of video sequences and their applications, Signal Processing: Image Communication, Volume 8, 1996Google Scholar
- [Jai88]Jain A.K., Dubes R.C.: Algorithms for clustering data, Engelwood Cliffs, NJ, Prentice Hall, 1988Google Scholar
- [Jia00]Jiang H., Lin T., Zhang H.-J.: Video Segmentation with the assistance of audio content analysis, IEEE International Conference on Multimedia and Expo (ICME2000), 2000Google Scholar
- [Kau99]Kaufmann S.: Cohesion and collocation: Using context vectors in text segmentation, ACL’ 99, pp. 591–595, 1999Google Scholar
- [Ken98]Kender J.R., Yeo B.-L.: Video scene segmentation via continuous video coherence, Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 1998Google Scholar
- [Koz93]Kozima H.: Text segmentation based on similarity between words, Proceedings of the 31st Annual Meeting of the Association for Compuational Linguistics, 1993Google Scholar
- [Kro93]Krovetz R.: Viewing morphology as an inference process, Proceedings of the 16th ACM SIGIR Conference, pp. 191–202, 1993Google Scholar
- [Kwo00]Kwon Y.-M., Song C.-I, Kim I.-J.: A new approach for high level video structuring, Proceedings of the IEEE International Conference on Multimedia and EXPO (ICME), Vol. 2, pp/ 773–776, 2000Google Scholar
- [Lan98]Landauer, T. K., Foltz, P. W., Laham, D.: Introduction to Latent Semantic Analysis. Discourse Processes, Vol. 25, pp. 259–284,1998Google Scholar
- [Lee00]Lee S.-Y., Lee S.-T,, Chen D.-Y.: Automatic video summary and description, Lecture Notes in Computer Science, Vol. 1929, pp. 37–48, Springer Verlag, Berlin 2000Google Scholar
- [Li03]Li D., Dimitrova N., Li M., Sethi I.K.: Multimedia content processing through cross-modal association, Proceedings of ACM Multimedia’ 03, Berkeley 2003Google Scholar
- [Lie99]Lienhart R., Pfeiffer S., Effelsberg W.: Scene determination based on video and audio features, Proceedings of the IEEE International Conference on Multimedia Computing and Systems (ICMCS), 1999Google Scholar
- [Lin00]Lin T., Zhang H.J.: Automatic video scene extraction by shot grouping, Proceedings of International Conference on Pattern Recognition (ICPR), 2000Google Scholar
- [Lov68]Lovins J.B.: Development of a stemming algorithm, Mechanical Translation and Computational Linguistics, 11, pp. 22–31, 1968Google Scholar
- [Lu02a]Lu X., Y.-F. Ma, H. Zhang, L. Wu, An integrated correlation measure for semantic video segmentation, Proceedings of IEEE International Conference on Multimedia and Expo, ICME, Lausanne, Switzerland, August, 2002.Google Scholar
- [Lu02b]Lu L., Zhang H.-J., Jiang H.: Content analysis for audio classification and segmentation, IEEE Transactions on Speech and Audio Processing, Vol. 10, No.7, October 2002Google Scholar
- [Moh96]Mohan R.: Text-Based Search of TV News Stories, Proceedings of SPIE Conference on Multimedia Storage and Archiving Systems SPIE, Boston, MA, November 1996 pp. 2–13.Google Scholar
- [Mor91]Morris J., Hirst G.: Lexical cohesion computed by thesaural relations as an indicator of the structure of text, Computational Linguistics, 17(1), pp. 21–48, 1991Google Scholar
- [OCo01]O’ Connor N., Czirjek C. and al.: News Story Segmentation in The Físchlár Video Indexing System, Proceedings of the IEEE International Conference on Image Processing (ICIP), pp. 7–10, 2001Google Scholar
- [Pai90]Paice C.P.: Another stemmer, Department of Computing, Lancaster University, UK, 1990Google Scholar
- [Pas97]Passoneau R.J., Litman D.J.: Discourse segmentation by human and automated means, Computational Linguistics, 23(1): 103–139, 1997Google Scholar
- [Pat96]Patel N.V., Sethi I.K.: Audio characterization for video indexing, IS&T/SPIE Electronic Imaging: Storage and Retrieval for Image and Video Databases IV, Vol. 2670, pp. 373–384, 1996Google Scholar
- [Pat92]Patterson R.D., Robinson K., Holdsworth J., McKeown D., Zhang C, Allerhand M.H.: Complex sounds and auditory images, in Auditory Psychology and Perception, (Eds.) Y. Cazals, L. Demany, K. Horner, Pergamon, Oxford, 1992.Google Scholar
- [Pfe99]Pfeiffer S.: The importance of perceptive adaptation of sound features for audio content processing, IS&T/SPIE Electronic Imaging: Storage and Retrieval for Image and Video Databases VII, 1999Google Scholar
- [Pon97]Ponte J., Croft W.B.: Text segmentation by topic, In proceedings of the First European Conference on Research and Advanced Technology for Digital Libraries, pp. 120–129, 1997Google Scholar
- [Por80]Porter M.F.: An algorithm for suffix stripping, Program, 14, No.3, pp. 130–137, July 1980Google Scholar
- [Rab93]Rabiner L.R., Huang B.H.: Fundamentals of speech recognition. Prentice Hall, 1993Google Scholar
- [Rab78]Rabiner L.R., Shafer R.W.: Digital processing of speech signals, Prentice Hall, 1978Google Scholar
- [Ras87]Raskin V., Weiser I.: Language and writing: applications of linguistics to thetoric and composition, ABLEX Publishing Corporation, Norwood, NJ, 1987Google Scholar
- [Rey94]Reynar J.C.: An automatic method of finding topic boundaries, Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, 1994Google Scholar
- [Rob97]Robertson S.E., Sparck Jones K.: Simple proven approaches to text retrieval, Technical Report TR356, Cambridge University, Computer laboratory, 1997Google Scholar
- [Rui98]Rui Y., Huang T.S., Mehrotra S.: Exploring video structure beyond the shots, Proceedings of IEEE International Conference on Multimedia Computing and Systems (ICMCS), pp. 237–240, 1998Google Scholar
- [Rui99]Rui Y., Huang T.S., Mehrotra S.: Constructing table-of-content for videos, Multimedia Systems, Special Section on Video Libraries, 7(5), pp. 359–368, 1999Google Scholar
- [Sah99]Sahouria E., Zakhor A.: Content analysis of video using principal components, IEEE Transactions on Circuits and Systems for Video Technology, 9(8), pp. 1290–1298, 1999CrossRefGoogle Scholar
- [Sar97]Saraceno C, Leonardi R.: Audio as a support to scene change detection and characterization of video sequences, proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1997Google Scholar
- [Sch97]Scheirer E., Slaney M.: Construction and evaluation of a robust multifeature speech/music discriminator, Proceedings of (ICASSP), 1997Google Scholar
- [Sha98]Shan M.-K., Lee S.-Y.: Content-based video retrieval based on similarity of frame sequence, Proceedings of the International Workshop on Multi-Media Database Management Systems, pp 90–97, August 1998Google Scholar
- [Sko72]Skorochod’ko E.: Adaptive method of automatic abstracting and indexing, In C. Freiman (Eds.): Information Processing 71: Proceedings of the IFIP Congress 71, pp. 1179–1182, North-Holland Publishing Company, 1972Google Scholar
- [Sme00]Smeulders A.W.M., Worring M., Santini S., Gupta A., Jain R., Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12): 1349–1380, 2000CrossRefGoogle Scholar
- [Sri99]Srinivasan S., Petkovic D., Ponceleon D.: Towards robust features for classifying audio in the CueVideo system, Proceedings of the seventh ACM international conference on Multimedia, October 1999Google Scholar
- [Sun00]Sundaram H., Chang S.-F.: Determining computable scenes in films and their structures using audio-visual memory models, Proceedings of the 8th ACM Multimedia Conference, 2000Google Scholar
- [Tak00]Takao S., Ogata J., Ariki Y.: Topic segmentation of news speech using word similarity, Proceedings of the ACM Multimedia Conference, 2000Google Scholar
- [Tan89]Tannen D.: Talking voices: repetition, dialogue and imagery in conversational discourse, Studies in International Sociolinguistics 6, Cambridge University Press, 1989Google Scholar
- [Tru02]Truong B.T., Venkatesh S., Dorai C.: Neighborhood coherence and edge-based approach for scene extraction in films, proceedings of IEEE International Conference on Pattern Recognition (ICPR), 2002Google Scholar
- [Uti01]Utiyama M., Isihara H.: A statistical model for domain-independent text segmentation, ACL’ 01, pp. 491498, 2001Google Scholar
- [Ven01]Vendrig J., Worring M. Smeulders A.W.M.: Model based interactive story unit segmentation, IEEE International Conference on Multimedia and Expo (ICME), pages 1084–1087, August 22–25, 2001Google Scholar
- [Ven02]Vendrig, J., Worring, M.: Systematic evaluation of logical story unit segmentation IEEE Transactions on Multimedia, Vol. 4, No. 4, pp. 492–499, Dec. 2002CrossRefGoogle Scholar
- [Ven03]Vendrig, J.; Worring, M.: Interactive adaptive movie annotation, IEEE Multimedia, Vol. 10, No. 3, pp. 30–37, July–Sept. 2003CrossRefGoogle Scholar
- [Ven00]Veneau E., Ronfard R., Bouthemy P.: From video shot clustering to sequence segmentation, Proceedings of International Conference on Pattern Recognition (ICPR), Vol. 4, pp. 254–257, 2000Google Scholar
- [Wal91]Walker M.: Redundancy in collaborative dialogue, In J. Hirschberg, D. Litman, K. McCoy, C. Sidner (Eds.): AAAI Fall Symposium on Discourse Structure in Natural Language Understanding and Generation, Pacific Grove, CA, 1991Google Scholar
- [Wan01]Wang J., Chua T.-S., Chen L.: Cinematic-based model for scene boundary detection. Proceedings of MMM’ 2001 (Multimedia Modeling Conference), Amsterdam, Netherlands, Nov 2001. pp 3–18Google Scholar
- [Yam98]Yamron J., Carp I., Gillick L, Lowe S., van Mulbregt P.: A hidden Markov model approach to text segmentation and event tracking, Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1998Google Scholar
- [Yeu95a]Yeung M., Liu B.: Efficient matching and clustering of video shots, Proceedings of the International Conference on Image Processing (ICIP), pp 23–26, 1995Google Scholar
- [Yeu95b]Yeung M., Yeo B.-L., Wolf W., Liu B.: Video browsing using clustering and scene transitions on compressed sequences, Proceedings of Multimedia Computing and Networking 1995, Vo. SPIE 2417, pp. 399–413, 1995Google Scholar
- [Yeu96a]Yeung M., Yeo B.-L.: Time-constrained clustering for segmentation of video into story units, Proceedings of the International Conference on Pattern Recognition (ICPR), pp. 375–380, 1996Google Scholar
- [Yeu96b]Yeung M., Yeo B.-L., Liu B.: Extracting story units from long programs for video browsing and navigation, Proceedings of the IEEE international Conference on Multimedia Computing and Systems, pp. 296–305, 1996Google Scholar
- [Yeu97]Yeung M., Yeo B.-L.: Video visualization for compact presentation and fast browsing of pictorial content, IEEE Transactions on Circuits and Systems for Video Technology, Vol.7, No.5, pp. 771–785, 1997CrossRefGoogle Scholar
- [Yeu98]Yeung M., Yeo B.-L., Liu B.: Segmentation of video by clustering and graph analysis, Computer Vision and Image Understanding, 71(1), pp.94–109, 1998CrossRefGoogle Scholar
- [Zha95]Zhang HJ., Tan S.Y., Smoliar S.W., Yihong G.: Automatic parsing and indexing of news video, Multimedia Systems, 2(6), pp. 256–266, 1995CrossRefGoogle Scholar
Copyright information
© Springer Science + Business Media, Inc. 2004