DC-Image for Real Time Compressed Video Matching

  • Saddam Bekhet
  • Amr Ahmed
  • Andrew Hunter
Conference paper


This chapter presents a suggested framework for video matching based on local features extracted directly from the DC-image of MPEG compressed videos, without full decompression. In addition, the relevant arguments and supporting evidences are discussed. Several local feature detectors will be examined to select the best for matching using the DC-image. Two experiments are carried to support the above. The first is comparing between the DC-image and I-frame, in terms of matching performance and computation complexity. The second experiment compares between using local features and global features regarding compressed video matching with respect to the DC-image. The results confirmed that the use of DC-image, despite its highly reduced size, is promising as it produces higher matching precision, compared to the full I-frame. Also, SIFT, as a local feature, outperforms most of the standard global features. On the other hand, its computation complexity is relatively higher, but it is still within the real-time margin which leaves a space for further optimizations that could be done to improve this computation complexity.


Compressed domain DC-image Global features Local features MPEG SIFT Video matching 



This work is funded by SouthValley University, Egypt.


  1. 1.
    YouTube Statistics [Online]. Available (2013)
  2. 2.
    S. Bekhet, A. Ahmed, A. Hunter, Video matching using DC-image and local features, in Proceedings of the World Congress on Engineering, UK (2013), pp. 2209–2214Google Scholar
  3. 3.
    A.B. Watson, Image compression using the discrete cosine transform. Math. J. 4, 81 (1994)Google Scholar
  4. 4.
    A. Torralba, R. Fergus, W.T. Freeman, 80 million tiny images: A large data set for nonparametric object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 30, 1958–1970 (2008)CrossRefGoogle Scholar
  5. 5.
    A. Altadmri, A. Ahmed, A framework for automatic semantic video annotation. Multimedia Appl. Tools 64(2), 1–25 (2013)Google Scholar
  6. 6.
    G. Miller, C. Fellbaum, Wordnet: An Electronic Lexical Database (1998)Google Scholar
  7. 7.
    A. Karpenko, P. Aarabi, Tiny videos: A large data set for nonparametric video retrieval and frame classification. IEEE Trans. Pattern Anal. Mach. Intell. 33, 618–630 (2011)CrossRefGoogle Scholar
  8. 8.
    A. Karpenko, P. Aarabi, Tiny videos: A large dataset for image and video frame categorization, in 11th IEEE International Symposium on Multimedia (ISM ‘09) (2009), pp. 281–289Google Scholar
  9. 9.
    A. Karpenko, P. Aarabi, Tiny videos: non-parametric content-based video retrieval and recognition, in Tenth IEEE International Symposium on Multimedia (ISM 2008) (2008), pp. 619–624Google Scholar
  10. 10.
    B.-L. Yeo, B. Liu, A unified approach to temporal segmentation of motion JPEG and MPEG compressed video, in Proceedings of the International Conference on Multimedia Computing and Systems (1995), pp. 81–88Google Scholar
  11. 11.
    J. Meng, Y. Juan, S.F. Chang, Scene change detection in a MPEG compressed video sequence, in IS&T/SPIE Symposium Proceedings (1995)Google Scholar
  12. 12.
    J.-L. Zheng, M.-J. Li, M.-X. Zhang, J. Zhou, Z.-D. Liu, An effective framework of shot segmentation based on I-frame in compressed-domain videos, in International Conference on Wavelet Analysis and Pattern Recognition (2012), pp. 84–90Google Scholar
  13. 13.
    P. Xu, L. Xie, S.F. Chang, A. Divakaran, A. Vetro, H. Sun, Algorithms and system for segmentation and structure analysis in soccer video, in Proceedings of ICME (2001), pp. 928–931Google Scholar
  14. 14.
    A. Divakaran, H. Ito, H. Sun, T. Poon, Scene change detection and feature extraction for MPEG-4 sequences, in Electronic Imaging’99 (1998), pp. 545–551Google Scholar
  15. 15.
    G. Liu, J. Zhao, Key frame extraction from MPEG video stream, in Third International Symposium on Information Processing (2010), pp. 423–427Google Scholar
  16. 16.
    E.K. Kang, S.J. Kim, J.S. Choi, Video retrieval based on scene change detection in compressed streams. IEEE Trans. Consum. Electron. 45, 932–936 (1999)CrossRefGoogle Scholar
  17. 17.
    O.N. Gerek, Y. Altunbasak, Key frame selection from MPEG video data, in Electronic Imaging (1997), pp. 920–925Google Scholar
  18. 18.
    J.C.S. Yu, M.S. Kankanhalli, P. Mulhen, Semantic video summarization in compressed domain MPEG video, in Proceedings International Conference on Multimedia and Expo, vol. 3 (2003), pp. 329–332Google Scholar
  19. 19.
    J. Almeida, N.J. Leite, R.S. Torres, Online video summarization on compressed domain. J. Vis. Commun. Image Represent. (2011)Google Scholar
  20. 20.
    H.-P. Gao, Z.-Q. Yang, Content based video retrieval using spatiotemporal salient objects, in 2010 International Symposium on Intelligence Information Processing and Trusted Computing (IPTC) (2010), pp. 689–692Google Scholar
  21. 21.
    D.G. Lowe, Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004)CrossRefGoogle Scholar
  22. 22.
    M.M. Yeung, B. Liu, Efficient matching and clustering of video shots, in Proceedings of the International Conference on Image Processing. vol. 1 (1995), pp. 338–341Google Scholar
  23. 23.
    M.R. Naphade, M.M. Yeung, B.L. Yeo, A novel scheme for fast and efficient video sequence matching using compact signatures, in Proceedings of SPIE, Storage and Retrieval for Media Databases (2000), pp. 564–572Google Scholar
  24. 24.
    R. Mohan, Video sequence matching, in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 6 (1998), pp. 3697–3700Google Scholar
  25. 25.
    N. Dimitrova, M.S. Abdel-Mottaleb, Video retrieval of MPEG compressed sequences using dc and motion signatures, in Video Retrieval of MPEG Compressed Sequences using DC and Motion Signatures, 1999Google Scholar
  26. 26.
    R.C. Gonzalez, E.W. Richard, Digital Image Processing, 3rd edn edn. (Prentice Hall, Englewood Cliffs, 2008), pp. 142–143Google Scholar
  27. 27.
    D.N. Bhat, S.K. Nayar, Ordinal measures for visual correspondence, in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, (CVPR ‘96) (1996), pp. 351–357Google Scholar
  28. 28.
    J. Almeida, N.J. Leite, R. S. Torres, Comparison of video sequences with histograms of motion patterns, in 18th IEEE International Conference on Image Processing (ICIP) (2011), pp. 3673–3676Google Scholar
  29. 29.
    H. Bay, A. Ess, T. Tuytelaars, L. Van Gool, Speeded-up robust features (SURF). Comput. Vis. Image Underst. 6(110), 346–359 (2008)CrossRefGoogle Scholar
  30. 30.
    D. Fuentes, R. Bardeli, J.A. Ortega, L. Gonzalez-Abril, A similarity measure between videos using alignment, graphical and speech features. Expert Syst. Appl. 39, 10278–10282 (2012). 9/1CrossRefGoogle Scholar
  31. 31.
    A. Basharat, Y. Zhai, M. Shah, Content based video matching using spatiotemporal volumes. Comput. Vis. Image Underst. 110, 360–377 (2008)CrossRefGoogle Scholar
  32. 32.
    K. Mikolajczyk, C. Schmid, A performance evaluation of local descriptors, in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2 (2003), pp. II-257–II-263Google Scholar
  33. 33.
    L. Juan, O. Gwun, A comparison of sift, pca-sift and surf. Int. J. Image Process. (IJIP) 3, 143–152 (2009)Google Scholar
  34. 34.
    K.E.A. van de Sande, T. Gevers, C.G.M. Snoek, Evaluating color descriptors for object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 32, 1582–1596 (2010)CrossRefGoogle Scholar
  35. 35.
    D.M. Powers, Evaluation: From precision, recall and f-factor to roc, informedness, markedness and correlation, in School of Informatics and Engineering, Flinders University, Adelaide, Australia, Tech.Rep.SIE-07-001, 2007Google Scholar
  36. 36.
    T. Tuytelaars, Dense interest points, in 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2010), pp. 2281–2288Google Scholar
  37. 37.
    S. Bekhet, A. Ahmed, A. Hunter, Video matching using DC-image and local features, in Lecture Notes in Engineering and Computer Science: Proceedings of the World Congress on Engineering, WCE 2013, London, 3–5 July 2013, pp. 2209–2214Google Scholar
  38. 38.
    N. Dimitrova, F. Golshani, Motion recovery for video content classification. ACM Trans. Inf. Syst. (TOIS) 13, 408–439 (1995)CrossRefGoogle Scholar
  39. 39.
    A.S. Abbass, A.A.A. Youssif, A.Z. Ghalwash, Compressed Domain Video Fingerprinting Technique Using the Singular Value Decomposition (2012)Google Scholar
  40. 40.
    P. Panchal, S. Merchant, Performance evaluation of fade and dissolve transition shot boundary detection in presence of motion in video, in 1st International Conference on Emerging Technology Trends in Electronics, Communication and Networking (ET2ECN) (2012), pp. 1–6Google Scholar

Copyright information

© Springer Science+Business Media Dordrecht 2014

Authors and Affiliations

  1. 1.School of Computer ScienceUniversity of LincolnLincolnUK

Personalised recommendations