A Multi-stage Approach for Anchor Shot Detection

  • L. D’Anna
  • G. Marrazzo
  • G. Percannella
  • C. Sansone
  • M. Vento
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4109)


In this paper we present a novel algorithm for anchor shot detection (ASD). ASD is a fundamental step for segmenting news video into stories that is among key issues for achieving efficient treatment of news-based digital libraries.

The proposed algorithm creates a set of audio/video templates of anchorperson shots in an unsupervised way, then classifies shots by comparing them to the templates. Audio similarity is evaluated by means of a new index and helps to achieve better performance than a pure video approach. The method has been tested on a wide database and compared with other state-of-the-art algorithms, demonstrating its effectiveness with respect to them.


Minimum Span Tree News Video Video Shot Video Database Video Segmentation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    De Santo, M., Percannella, G., Sansone, C., Vento, M.: An Unsupervised Shot Classification System for News Video Story Detection. In: Abate, A.F., Nappi, M., Sebillo, M. (eds.) Multimedia Database and Image Communication, pp. 93–104. World Scientific Publ., Singapore (2005)Google Scholar
  2. 2.
    Gao, X., Tang, X.: Unsupervised Video-Shot Segmentation and Model-Free Anchorperson Detection for News Video Story Parsing. IEEE Transactions on Circuits and Systems for Video Technology 12(9), 765–776 (2002)CrossRefGoogle Scholar
  3. 3.
    Gunsel, B., Ferman, A.M., Tekalp, A.M.: Video Indexing Through Integration of Syntactic and Semantic Features. In: Proc. of Workshop Applications of Computer Vision, Sarasota, FL, pp. 90–95 (1996)Google Scholar
  4. 4.
    Swanberg, D., Shu, C.F., Jain, R.: Knowledge Guided Parsing in Video Databases. In: Proc. of SPIE Symposium on Electronic Imaging: Science and Technology, San Jose, CA, pp. 13–24 (1993)Google Scholar
  5. 5.
    Smoliar, S.W., Zhang, H.J., Tao, S.Y., Gong, Y.: Automatic Parsing and Indexing of News Video. Multimedia Systems 2(6), 256–265 (1995)CrossRefGoogle Scholar
  6. 6.
    Hanjalic, A., Lagendijk, R.L., Biemond, J.: Semi-Automatic News Analysis, Indexing, and Classification System Based on Topics Preselection. In: Proc. of SPIE, Electronic Imaging: Storage and Retrieval of Image and Video Databases, San Jose (CA) (1999)Google Scholar
  7. 7.
    Bertini, M., Del Bimbo, A., Pala, P.: Content-Based Indexing and Retrieval of TV News. Pattern Recognition Letters 22, 503–516 (2001)MATHCrossRefGoogle Scholar
  8. 8.
    Snoek, C.G.M., Worring, M.: Multimodal Video Indexing: A Review of the State-of-the-art. Multimedia Tools and Applications 25, 5–35 (2005)CrossRefGoogle Scholar
  9. 9.
    Eickeler, S., Muller, S.: Content-based video indexing of TV broadcast news using Hidden Markov Models. In: ICASSP 1999, pp. 2997–3000 (1999)Google Scholar
  10. 10.
    Qi, W., Gu, L., Jiang, H., Chen, X.R., Zhang, H.J.: Integrating Visual, Audio and Text Analysis for News Video. In: 7th IEEE International Conference on Image Processing, Vancouver, British Columbia, Canada (2000)Google Scholar
  11. 11.
    Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)MATHGoogle Scholar
  12. 12.
    Viola, P., Jones, M.: Rapid Object Detection Using a Boosted Cascade of Simple Features. In: Proc. of the IEEE CVPR Conference, vol. 1, pp. 511–518 (2001)Google Scholar
  13. 13.
    Lee, H.Y., Lee, H.K., Ha, Y.H.: Spatial Color Descriptor for Image Retrieval and Video Segmentation. IEEE Transactions on Multimedia 5(3), 358–367 (2003)CrossRefMathSciNetGoogle Scholar
  14. 14.
    Cordella, L.P., Foggia, P., Sansone, C., Vento, M.: A Real-Time Text-Independent Speaker Identification System. In: 12th International Conference on Image Analysis and Processing, September 17-19, pp. 632–637. IEEE Computer Society Press, Mantova, Italy (2003)CrossRefGoogle Scholar
  15. 15.
    Wang, D., Lu, L., Zhang, H.-J.: Speech Segmentation Without Speech Recognition. In: ICASSP 2003, vol. I, pp. 468–471 (2003)Google Scholar
  16. 16.
    Gargi, U., Kasturi, R., Strayer, S.H.: Performance Characterization of Video-Shot-Change Detection Methods. IEEE Trans. on Circuits and Systems for Video Technology 10(1), 1–13 (2000)CrossRefGoogle Scholar
  17. 17.
    De Santo, M., Percannella, G., Sansone, C., Vento, M.: A Comparison of Unsupervised Shot Classification Algorithms for News Video Segmentation. In: Fred, A., Caelli, T.M., Duin, R.P.W., Campilho, A.C., de Ridder, D. (eds.) SSPR&SPR 2004. LNCS, vol. 3138, pp. 233–241. Springer, Heidelberg (2004)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • L. D’Anna
    • 1
  • G. Marrazzo
    • 1
  • G. Percannella
    • 1
  • C. Sansone
    • 2
  • M. Vento
    • 1
  1. 1.Dip. di Ingegneria dell’Informazione ed Ingegneria ElettricaUniversità degli Studi di SalernoFisciano (SA)Italy
  2. 2.Dipartimento di Informatica e SistemisticaUniversità degli Studi di Napoli “Federico II”NapoliItaly

Personalised recommendations