Pattern Recognition and Image Analysis

, Volume 28, Issue 1, pp 53–58 | Cite as

Relative Flow Estimates for Shot Boundary Detection

  • Muwei Jian
  • Yilong Yin
  • Junyu Dong
Representation, Processing, Analysis, and Understanding of Images


This paper proposes a simple approach based on Relative Flow Estimates (RFE) for shot cut detection. The property of Relative flow estimates can be used for abrupt cut detection and a correction mechanism for gradual camera-shot transition detection (e.g., fade-in and fade-out, dissolves, wipes). The exacted feature vector in each frame can be mapped into a 3-D space along the continuous time axis, and these feature data can be treated as a virtually constructed pipe with fluid flowing in the 3-D axis. Compared with existing approaches, the new RFE-based algorithm can directly detect shot cut. A wide range of test videos are used to evaluate the performance of the proposed method. The experimental results show that the new scheme can produce promising results.


shot boundary detection relative flow estimates correction nechanism 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    M. Cooper, T. Liu, and E. Rieffel, “Video segmentation via temporal pattern classification,” IEEE Trans. Multimedia 9 (3), 610–618 (2007).CrossRefGoogle Scholar
  2. 2.
    S. Lefevre, J. Holler, and N. Vincent, “A review of realtime segmentation of uncompressed video sequences for content-based search and retrieval,” Real-Time Imag. 9, 73–98 (2003).CrossRefGoogle Scholar
  3. 3.
    Oguzhan Urhan, M. Kemal Güllü, and Sarp Ertürk, “Modified phase-correlation based robust hard-cut detection with application to archive film,” IEEE Trans. Circuits Syst. Video Technol. 16 (6) (2006).Google Scholar
  4. 4.
    A. Nagasaka and Y. Tanaka, “Automatic video indexing and full-video search for object appearances,” in Proc. IFIP Working Conf. Visual Database Systems (Budapest, Oct. 1991), pp. 113–127.Google Scholar
  5. 5.
    I. K. Sethi and N. Patel, “A statistical approach to scene change detection,” in SPIE Conf. Proc. Storage and Retrieval for Image and Video Databases III (La Jolla, CA, Feb. 1995), Vol. 2420, pp. 329–339.CrossRefGoogle Scholar
  6. 6.
    M. S. Lee, Y. M. Yang, and S. W. Lee, “Automatic video parsing using shot boundary detection and camera operation analysis,” Pattern Recogn. 34, 711–719 (2001).CrossRefMATHGoogle Scholar
  7. 7.
    S. Lefevre, J. Holler, and N. Vincent, “Real time temporal segmentation of compressed and uncompressed dynamic colour image sequences,” in Proc. Int. Workshop on Real Time Image Sequence Analysis (Oulu, Aug. 2000), pp. 56–62.Google Scholar
  8. 8.
    T. Truong, S. Venkatesh, and C. Dorai, “Scene extraction in motion pictures,” IEEE Trans. Circuits Syst. Video Technol. 13 (1), 5–15 (2003).CrossRefGoogle Scholar
  9. 9.
    H. J. Heng and K. N. Ngan, “Integrated shot boundary detection using object-based technique,” in IEEE Int. Conf. Image Processing (Kobe, Oct. 1999), Vol. 3, pp. 289–293.Google Scholar
  10. 10.
    P. Bouthemy, M. Gelgon, and F. Ganansia, “A unified approach to shot change detection and camera motion characterization,” IEEE Trans. Circuits Syst. Video Technol. 9 (10), 1030–1044 (1999).CrossRefGoogle Scholar
  11. 11.
    T. Liu, X. Zhang, D. Wang, J. Feng, and K. T. Lo, “Inertia-based cut detection technique: a step to the integration of video coding and content-based retrieval,” in Proc. IEEE Int. Conf. Signal Processing (Beijing, Aug. 2000), pp. 1018–1025.Google Scholar
  12. 12.
    W. K. Li and S. H. Lai, “Integrated video shot segmentation algorithm,” in Proc. SPIE Conf. Storage and Retrieval for Media Databases (Santa Clara, CA, 2003), pp. 264–271.Google Scholar
  13. 13.
    M. Flickner, “Query by image content: the QBIC system,” IEEE Comput. Mag. 28 (9) (1995).Google Scholar
  14. 14.
    J. Huang, S. Kumar, M. Mitra, W. Zhu, and R. Zabih, “Image indexing using color correlograms,” in Proc. IEEE Conf. on Computer Vision and Pattern Recognition (San Juan, 1997), pp. 762–768.CrossRefGoogle Scholar
  15. 15.
    G. Pass and R. Zabith, “Histogram refinement for content- based image retrieval,” in Proc. IEEE Workshop on Applications of Computer Vision (Sarasota, 1996), pp. 96–102.CrossRefGoogle Scholar
  16. 16.
    S. Mallat, “A theory for multiresolution signal decomposition: the wavelet representation,” IEEE Trans. PAMI 11 (7), 674–693 (1989).CrossRefMATHGoogle Scholar
  17. 17.
    B. S. Manjunath and W. Y. Ma, “Texture features for browsing and retrieval of image data,” IEEE Trans. Pattern Anal. Mach. Intellig. 18 (8), 837–842 (1996).CrossRefGoogle Scholar
  18. 18.
    B. Manjunath, P. Wu, S. Newsam, and H. Shin, “A texture descriptor for browsing and similarity retrieval,” J. Signal Processing: Image Commun. 16 (1), 3343 (2000).Google Scholar
  19. 19.
    J. Smith and S.-F. Chang, “Transform features for texture classification and discrimination in large image database,” in Proc. IEEE Int. Conf. on Image Processing (Austin, TX, 1994).Google Scholar
  20. 20.
    Muwei, Jian and Junyu Dong, “New perceptual texture features based on wavelet transform,” Int. J. Comput. Inf. Sci. 9 (1), 11–18 (2008).Google Scholar
  21. 21.
    R. L. Devalois, D.G. Albrecht, and L.G. Thorell, “Spatial -frequency selectivity of cells in acaque visual cortex,” Vision Res. 22, 545–559 (1982).CrossRefGoogle Scholar
  22. 22.
    T. Caelli, Visual Perception (Pergamon Press, 1981).Google Scholar
  23. 23.
    J. Yuan, H. Wang, L. Xiao, W. Zheng, J. Li, F. Lin, and B. Zhang, “A formal study of shot boundary detection,” IEEE Trans. Circuits Syst. Video Technology 17 (2), 168–186 (2007).CrossRefGoogle Scholar
  24. 24.
    G. G. Lakshmi Priya and S. Domnic, “Walsh-Hadamard transform kernel-based feature vector for shot boundary detection,” IEEE Trans. Image Processing 23 (12), 5187–5197 (2014).MathSciNetCrossRefMATHGoogle Scholar
  25. 25.
    Sawitchaya Tippaya, Tele Tan, Masood Khan, and Kosin Chamnongthai, “A study of discriminant visual descriptors for sport video shot boundary detection,” in Proc. 10th Asian Control Conf. (ASCC) (Kota Kinabalu, 2015), pp. 1–4.Google Scholar
  26. 26.
    B. H. Shekar, K. P. Uma, and K Raghurama Holla, “Shot boundary detection using correlation based spectral residual saliency map,” in Proc. Int. Conf. on Advances in Computing Communications and Informatics (ICACCI) (Jaipur, 2016), pp. 2242–2247.Google Scholar
  27. 27.
    Cai Pingping, Yue Guan, Xu Ding, and Zang Yu, “Shot boundary detection with sparse presentation,” in Proc. 13th IEEE Int. Conf. on Signal Processing (ICSP) (Buzios, 2016), pp. 900–904.Google Scholar
  28. 28.
    Dong-ju Jeong, Hyoung Jin Yoo, and Nam Ik Cho, “A static video summarization method based on the sparse coding of features and representativeness of frames,” EURASIP J. Image Video Processing (2017).Google Scholar
  29. 29.
    M. Jian and J. J. Ma, “Image retrieval using wavelet-based salient regions,” Imag. Sci. J. 59 (4), 219–231 (2011).CrossRefGoogle Scholar
  30. 30. Scholar

Copyright information

© Pleiades Publishing, Ltd. 2018

Authors and Affiliations

  1. 1.School of Computer Science and TechnologyShandong University of Finance and EconomicsJinanChina
  2. 2.Department of Computer Science and TechnologyOcean University of ChinaQingdaoChina

Personalised recommendations