A Semantic Image Category for Structuring TV Broadcast Video Streams

  • Jinqiao Wang
  • Lingyu Duan
  • Hanqing Lu
  • Jesse S. Jin
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4261)


A TV broadcast video stream consists of various kinds of programs, such as sitcoms, news, sports, commercials, and weather. In this paper, we propose a semantic image category, named Program Oriented Informative Images (POIM), to facilitate the segmentation, indexing, and retrieval of different programs. The assumption is that most stations insert lead-in/-out video shots to explicitly introduce the current program and to indicate the transitions between consecutive programs within TV streams. Such shots often overlay text, graphics, and storytelling images to create an image sequence of POIM as a visual representation of the current program. With the advance of post-editing effects, POIM is becoming an effective indicator for structuring TV streams, and it is also a fairly common “prop” in program content production. We have attempted to develop a POIM recognizer that combines a set of global/local visual features with supervised/unsupervised learning. Comparison experiments have been carried out, and a promising result, F1 = 90.2%, has been achieved on a part of the TRECVID 2005 video corpus. The recognition of POIM, together with other audiovisual features, can be used to further determine program boundaries.
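The F1 measure reported in the abstract is the harmonic mean of precision and recall. The following is a minimal sketch of that computation, assuming POIM recognition is scored as a binary detection task; the function name `f1_score` and the counts in the usage line are hypothetical and are not taken from the paper.

```python
def f1_score(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Return the harmonic mean of precision and recall (0.0 when undefined)."""
    predicted = true_positives + false_positives   # items labeled as POIM
    actual = true_positives + false_negatives      # items that really are POIM
    if predicted == 0 or actual == 0:
        return 0.0
    precision = true_positives / predicted
    recall = true_positives / actual
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for illustration only (not results from the paper).
print(f"F1 = {f1_score(90, 10, 10):.3f}")
```

Because F1 penalizes an imbalance between precision and recall, a recognizer that flags nearly every shot as POIM, or almost none, scores poorly even when one of the two measures is high.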


Spectral Cluster, Video Shot, Dominant Color, Radical Basis Function, Canny Edge Detection Algorithm
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Jinqiao Wang (1)
  • Lingyu Duan (2)
  • Hanqing Lu (1)
  • Jesse S. Jin (3)

  1. National Lab of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
  2. Institute for Infocomm Research, Singapore
  3. The School of Design, Communication and Information Technology, University of Newcastle, Australia
