Pattern Recognition and Image Analysis, Volume 24, Issue 2, pp 243–255

Temporal video segmentation by event detection: A novelty detection approach

  • Mahesh Venkata Krishna
  • P. Bodesheim
  • M. Körner
  • J. Denzler
Representation, Processing, Analysis and Understanding of Images


Temporal segmentation of videos into meaningful image sequences, each containing a particular activity, is an interesting problem in computer vision. We present a novel algorithm for this semantic video segmentation. The segmentation is accomplished through event detection in a frame-by-frame processing setup. We propose using one-class classification (OCC) techniques to detect the events that indicate a new segment, since these techniques have proven successful in object classification and allow for unsupervised event detection in a natural way. Various OCC schemes are tested and compared, and an approach based on temporal self-similarity maps (TSSMs) is also presented. Testing was done on a challenging, publicly available thermal video dataset. The results are promising and show the suitability of our approaches for temporal video segmentation.
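To make the frame-by-frame idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes each frame has already been reduced to a numeric feature vector, and it swaps in a very simple one-class model (distance to the centroid of recent frames, scaled by their spread) in place of the OCC schemes evaluated in the paper. The function names, `window_size`, and `threshold` are hypothetical choices made for illustration. A helper that builds a self-similarity map as the matrix of pairwise Euclidean distances between frame features is included as one common way to define a TSSM.

```python
import math
from typing import List, Sequence

Feature = Sequence[float]


def euclidean(a: Feature, b: Feature) -> float:
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def self_similarity_map(features: List[Feature]) -> List[List[float]]:
    """Pairwise distance matrix between all frames (one common TSSM definition)."""
    return [[euclidean(fi, fj) for fj in features] for fi in features]


def novelty_score(window: List[Feature], x: Feature) -> float:
    """Distance of x to the window centroid, relative to the window's mean spread.

    This is an illustrative stand-in for a one-class classifier score.
    """
    n = len(window)
    dim = len(x)
    centroid = [sum(f[d] for f in window) / n for d in range(dim)]
    spread = sum(euclidean(f, centroid) for f in window) / n
    return euclidean(x, centroid) / (spread + 1e-9)


def segment_boundaries(
    features: List[Feature], window_size: int = 5, threshold: float = 3.0
) -> List[int]:
    """Return frame indices at which a new segment is assumed to start."""
    boundaries = []
    start = 0  # first frame of the current segment
    for t in range(len(features)):
        # Model only frames that belong to the current segment.
        window = features[max(start, t - window_size):t]
        if len(window) < window_size:
            continue  # too few frames in this segment to model it yet
        if novelty_score(window, features[t]) > threshold:
            boundaries.append(t)
            start = t  # novel frame: open a new segment and relearn
    return boundaries


# Synthetic example: ten frames near (0, 0), then ten frames near (5, 5).
frames = [(0.1 * (i % 3), 0.1 * ((i + 1) % 3)) for i in range(10)]
frames += [(5 + 0.1 * (i % 3), 5 + 0.1 * ((i + 1) % 3)) for i in range(10)]
print(segment_boundaries(frames))
```

In this sketch the model is rebuilt from scratch after every detected event, so each segment is described only by its own frames. A real system would replace `novelty_score` with a trained one-class classifier and tune the threshold on held-out data.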


Keywords: temporal video segmentation, one-class classification, novelty detection, temporal self-similarity maps, unsupervised video analysis





Copyright information

© Pleiades Publishing, Ltd. 2014

Authors and Affiliations

  • Mahesh Venkata Krishna (1)
  • P. Bodesheim (1)
  • M. Körner (1)
  • J. Denzler (1)
  1. Computer Vision Group, Friedrich Schiller University Jena, Jena, Germany
