Multimedia Tools and Applications

, Volume 76, Issue 1, pp 1331–1353 | Cite as

Efficient copy detection for compressed digital videos by spatial and temporal feature extraction

  • Po-Chyi Su
  • Chin-Song Wu


This research aims at developing a practical video copy detection mechanism to determine whether an investigated video is a duplicated copy that may infringe the intellectual property rights. The significant features of original videos are extracted and stored in the server. Given an uploaded video, the same feature is extracted and compared with the stored ones to seek a possible match. Both the spatial and temporal features of compressed videos are employed in the proposed scheme. The scene-change detection is applied to select the key frames, from which the robust spatial features are extracted to help search visually similar frames. The shot lengths are used as the temporal features to further ensure the matching accuracy. To ensure that the proposed method is practical in considered applications, the size of stored features in the server, efficiency and accuracy of matching features are the major design principles. The experimental results by testing a large number of compressed videos demonstrate the feasibility of the proposed scheme.


Copy detection Video coding Feature extraction Content management Multimedia databases Copyright protection 



This research is supported by the Ministry of Science and Technology in Taiwan, ROC, under Grants MOST 103-2221-E-008-080 and MOST 104-2221-E-008-075.


  1. 1.
    Awad G, Over P, Kraaij W (2014) Content-based video copy detection benchmarking at TRECVID. ACM Trans Inf Syst:32Google Scholar
  2. 2.
    Benchmark videos from Youtube [Online]. Available:
  3. 3.
    Coskun B, Sankur B, Memon N (2006) Spatio-temporal transform based video hashing. IEEE Trans Multimedia 8:1190–1208CrossRefGoogle Scholar
  4. 4.
    Cotsaces C, Nikolaidis N, Pitas I (2006) Shot detection and condensed representation—a review. IEEE Signal Process Mag 23:28–37CrossRefGoogle Scholar
  5. 5.
    De Roover C, De Vleeschouwer C, Lefebvre F, Macq B (2005) Robust video hashing based on radial projections of key frames. IEEE Trans Signal Process 53(10):4020–4037MathSciNetCrossRefGoogle Scholar
  6. 6.
    Esmaeili MM, Fatourechi M, Ward RK (2011) A robust and fast video copy detection system using content-based fingerprinting. IEEE Trans Inf Forensics Secur 6:213–226CrossRefGoogle Scholar
  7. 7.
    Ferman A, Tekalp M, Mehrotra R (2002) Robust color histogram descriptors for video segment retrieval and identification. IEEE Trans Multimedia 11:497–508Google Scholar
  8. 8.
    Gersho A, Gray RM (1992) Vector quantization and signal compression. Kluwer Academic PublishersGoogle Scholar
  9. 9.
    Hsu W, Chua TS, Pung HK (1995) An integrated color-spatial approach to content-based image retrieval. In: Proceeding of ACM MultimediaGoogle Scholar
  10. 10.
    Kang L-W, Hsu C-Y, Chen H-W, Lu C-S (2010) Secure SIFT-based sparse representation for image copy detection and recognition. In: IEEE International Conference on Multimedia Exposition, pp 1248–1253Google Scholar
  11. 11.
    Kashino K, Kurozumi T, Murase H (2003) A quick search method for audio and video signals based on histogram pruning. IEEE Trans Multimedia 5(3):348–357CrossRefGoogle Scholar
  12. 12.
    Katsavounidis I, Kuo C-C, Zhang Z (1994) A new initialization technique for generalized Lloyd iteration. IEEE Signal Processing Letters 1(10):144–146CrossRefGoogle Scholar
  13. 13.
    Law-To J, Joly A, Boujemaa N (2007) Muscle-VCD-2007: a live benchmark for video copy detection.
  14. 14.
    Ling H, Cheng H, Ma Q, Zou F, Yan W (2012) Efficient image copy detection using multiscale fingerprints. IEEE MultiMedia 19:60–69CrossRefGoogle Scholar
  15. 15.
    Liu H, Lu H, Xue X (2013) A segmentation and graph-based video sequence matching method for video copy detection. IEEE Trans Knowl Data Eng 25:1706–1718CrossRefGoogle Scholar
  16. 16.
    Liu J, Huang Z, Cai H, Ngo HTSCW, Wang W (2013) Near-duplicate video retrieval: current research and future trends. ACM Comput Surv:45Google Scholar
  17. 17.
    Liu T, Zhang H-J, Qi F (2003) A novel video key-frame-extraction algorithm based on perceived motion energy mode. IEEE Trans Circuits Syst Video Technol 13:1006–1013CrossRefGoogle Scholar
  18. 18.
    Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60:91–110CrossRefGoogle Scholar
  19. 19.
    Lu S, Wang Z, Mei T, Guan G, Feng D (2014) A bag-of-importance model with locality-constrained coding based feature learning for video summarization. IEEE Trans Multimedia 16:1497–1509CrossRefGoogle Scholar
  20. 20.
    ReefVid A Resource of Free Coral Reef Video Clips for Educational Use [Online]. Available:
  21. 21.
    Song J, Yang Y, Huang Z, Shen HT, Luo J (2013) Effective multiple feature hashing for large-scale near-duplicate video retrieval. IEEE Trans Multimedia 15:1997–2008CrossRefGoogle Scholar
  22. 22.
    Su PC, Chen C-C, Chang H-M (2009) Towards effective content authentication for digital videos by employing feature extraction and quantization. In: IEEE Transactions on Circuits and Systems for Video Technology, vol 19, pp 668–677Google Scholar
  23. 23.
    Swain M, Ballard D (1991) Color indexing. Int J Comput Vis:7Google Scholar
  24. 24.
    Tan YP, Saur DD, Kulkarni SR, Ramadge PJ (2000) Rapid estimation of camera motion from compressed video with application to video annotation. IEEE Trans Circuits Syst Video Technol:133–145Google Scholar
  25. 25.
    van Rijsbergen CJ (1979) Information retrieval. Butterworth-Heinemann, LondonzbMATHGoogle Scholar
  26. 26.
    Wang T, Yu W, Chen L (2007) An approach to video key-frame extraction based on rough set. In: International Conference on Multimedia and Ubiquitous Engineering, 2007. MUE ’07, pp 590–596Google Scholar
  27. 27.
    Wolf (1996) Key frame selection by motion analysis. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp 1228–1231Google Scholar
  28. 28.
    Wu P-H, Thaipanich T, Jay Kuo C-C (2009) A suffix array approach to video copy detection in video sharing social networks. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, TaiwanGoogle Scholar
  29. 29.
    Zargari F, Mehrabi M, Ghanbari M (2010) Compressed domain texture based visual information retrieval method for I-frame coded pictures. IEEE Trans Consum Electron 56:728–736CrossRefGoogle Scholar
  30. 30.
    Zhou X, Zhou X, Chen L, Bouguettaya A, Xiao N, Taylor JA (2009) An efficient near-duplicate video shot detection method using shot-based interest points. IEEE Trans Multimedia 11:879–891CrossRefGoogle Scholar
  31. 31.
    Zhuang Y, Rui Y, Huang TS, Mehrotra S (1998) Adaptive key frame extracting using unsupervised clustering. In: Proceedings of IEEE International Conference on Image Processing, pp 866–870Google Scholar

Copyright information

© Springer Science+Business Media New York 2015

Authors and Affiliations

  1. 1.Department of Computer Science and Information EngineeringNational Central UniversityJhongliRepublic of China

Personalised recommendations