Multimedia Tools and Applications, Volume 75, Issue 10, pp 5645–5672

Automatic topics segmentation for TV news video using prior knowledge



TV streams are a principal source of multimedia information. The goal of the proposed approach is to enable better exploitation of this video source by multimedia services (e.g., TV-on-demand, catch-up TV), social communities, and video-sharing platforms (Vimeo, YouTube, Facebook …). In this work, we present an approach for automatically structuring TV news. Its originality lies in using contextual and operational characteristics as prior knowledge. This knowledge is modeled as a video grammar that governs the structuring of TV stream content. Structuring is carried out on two levels. The first level identifies news programs in the TV stream. The second level identifies the internal structure of those news programs. At this level, we focus on TV news programs because of their large audience and the pertinent information they contain. Comparative experiments against similar works were carried out on the TRECVID 2003 database. The experiments show significant improvements in TV news structuring, with results exceeding 90 %.
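The two-level idea can be illustrated with a minimal sketch. It is not the grammar used in this work: the segment labels (`opening`, `closing`, `anchor`, `report`, `other`), the `Segment` type, and both rules are hypothetical assumptions chosen only to show how prior knowledge about program layout could drive structuring. Level 1 assumes a news program lies between an opening and a closing segment; level 2 assumes each topic starts with an anchor shot.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Segment:
    start: float  # seconds from the beginning of the stream
    end: float
    label: str    # hypothetical labels: opening, closing, anchor, report, other


def find_news_programs(stream: List[Segment]) -> List[List[Segment]]:
    """Level 1 (assumed rule): a program is what lies between an
    'opening' segment and the next 'closing' segment."""
    programs: List[List[Segment]] = []
    current: Optional[List[Segment]] = None
    for seg in stream:
        if seg.label == "opening":
            current = []                 # start collecting a new program
        elif seg.label == "closing" and current is not None:
            programs.append(current)     # program is complete
            current = None
        elif current is not None:
            current.append(seg)          # segment inside a program
    return programs


def split_topics(program: List[Segment]) -> List[Tuple[float, float]]:
    """Level 2 (assumed rule): every 'anchor' shot opens a new topic,
    which lasts until the next anchor shot or the end of the program."""
    topics: List[Tuple[float, float]] = []
    start: Optional[float] = None
    end: Optional[float] = None
    for seg in program:
        if seg.label == "anchor":
            if start is not None:
                topics.append((start, end))  # close the previous topic
            start = seg.start
        if start is not None:
            end = seg.end                    # extend the open topic
    if start is not None:
        topics.append((start, end))
    return topics
```

In a real system the labels would come from audio-visual analysis rather than being given, and the grammar would encode channel-specific knowledge; the sketch only shows how the two levels compose.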


Keywords: TV stream structuring; Video grammar; TV news segmentation; Content-based video indexing; Automatic topics detection


Copyright information

© Springer Science+Business Media New York 2015

Authors and Affiliations

  1. Higher Institute of Information Technology and Multimedia, University of Sfax, Sfax, Tunisia
  2. College of Computers and Information Technology, Taif University, KSA, Taif, Saudi Arabia
  3. MIRACL Laboratory, Sfax, Tunisia