Skip to main content
Log in

Compressed domain video zoom motion analysis utilizing CURL

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

This paper explores the application of the concept of CURL borrowed from vector calculus to the zoom motion detection and classification problems. The interframe block motion vectors extracted from the compressed bitstream form the input to the proposed method. These block motion vectors are analyzed by partitioning the motion vector field into 4 representative quadrants followed by quantizing the block motion vectors into 3 levels and converting the block motion vectors into complex motion vector space. The resultant vector for each of the 4 quadrants is estimated followed by estimating the velocity vector between the quadrants. The CURL of the velocity field is then estimated whose magnitude essentially provides the area enclosed between the resultant quadrant motion vectors which are utilized for separating the zooming and non-zooming camera types. The zooming camera frames are further classified into zoom-in and zoom-out types utilizing the direction information (anti-clockwise/clockwise) extracted from CURL of the velocity field. The novelty here stems from the fact that a concept borrowed from vector calculus is being applied to the zoom motion analysis problem. Although handcrafted features from CURL are utilized we demonstrate its superiority over existing methods including a deep learning architecture where we show the robustness of the proposed features extracted from CURL in the presence of noise. Experimental validation carried out utilizing block motion vectors extracted using Exhaustive Search Motion Estimation algorithm as well as H.264 decoded block motion vectors demonstrate superior performance for the proposed method both in terms of detection accuracy as well as computational complexity in comparison to existing techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Abdollahian G, Pizlo Z, Delp EJ (2008) A study on the effect of camera motion on human visual attention. In: 15th IEEE international conference on image processing, pp 693–696

  2. Chang C-C, Lin C-J (2011) LIBSVM: A library for support vector machines. ACM Trans Intell Syst Technol, 2:27:1–27:27. Software available at http://www.csie.ntu.edu.tw/cjlin/libsvm

  3. Chen YM, Bajic IV (2010) Motion vector outlier rejection cascade for global motion estimation. IEEE Signal Process Lett 17(2):197–200

    Article  Google Scholar 

  4. Deng Y, Manjunath BS (1997) Content-based search of video using color, texture, and motion. In: Proceedings of international conference on image processing, vol 2. pp 534–537, vol.2

  5. Duan L-Y, Jin JS, Tian Q, Xu C-S (2006) Nonparametric motion characterization for robust classification of camera motion patterns. IEEE Trans Multimed 8(2):323–340

    Article  Google Scholar 

  6. Fang Y, Lin W, Chen Z, Tsai C, Lin C (2014) A video saliency detection model in compressed domain. IEEE Trans Circuits Syst Video Technol 24 (1):27–38

    Article  Google Scholar 

  7. Ghosh S, Biswas J (2017) Joint perception and planning for efficient obstacle avoidance using stereo vision. In: IEEE/RSJ International conference on intelligent robots and systems (IROS), pp 1026–1031

  8. Guironnet M, Pellerin D, Rombaut M (2006) Camera motion classification based on transferable belief model. In: 14Th european signal processing conference, pp 1–5

  9. Hasan MA, Xu M, He X, Xu C (2014) Camhid: Camera motion histogram descriptor and its application to cinematographic shot classification. IEEE Trans Circuits Syst Video Technol 24(10):1682–1695

    Article  Google Scholar 

  10. Jin R, Qi Y, Hauptmann A (2002) A probabilistic model for camera zoom detection. In: 16Th IEEE international conference on pattern recognition, vol 3. IEEE, pp 859–862

  11. Kesana V, Okade M (2019) Compressed domain zoom motion classification using local tetra patterns. Signal Image and Video Processing 13(5):879–885

    Article  Google Scholar 

  12. Kilicarslan M, Zheng JY (2019) Predict vehicle collision by ttc from motion using a single video camera. IEEE Trans Intell Transp Syst 20(2):522–533

    Article  Google Scholar 

  13. Kreyszig E (2000) Advanced Engineering Mathematics, 8th edn. Wiley, New York

    MATH  Google Scholar 

  14. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems 25, pp 1097–1105

  15. Luo J, Papin C, Costello K (2009) Towards extracting semantically meaningful key frames from personal video clips: From humans to computers. IEEE Trans Circuits Syst Video Technol 19(2):289–301

    Article  Google Scholar 

  16. Murala S, Maheshwari RP, Balasubramanian R (2012) Local tetra patterns: a new feature descriptor for content-based image retrieval. IEEE Trans Image Process 21(5):2874–2886

    Article  MathSciNet  Google Scholar 

  17. Nakaya Y, Harashima H (1994) Motion compensation based on spatial transformations. IEEE Transactions on Circuits and Systems for Video Technology 4(3):339–356, 366–7

    Article  Google Scholar 

  18. Okade M, Patel G, Kumar Biswas P (2016) Robust learning-based camera motion characterization scheme with applications to video stabilization. IEEE Trans Circuits Syst Video Technol 26(3):453–466

    Article  Google Scholar 

  19. Po L-M, Wong K-M, Cheung K-W, Ng K-H (2010) Subsampled block-matching for zoom motion compensated prediction. IEEE Trans Circuits Syst Video Technol 20(11):1625–1637

    Article  Google Scholar 

  20. Sandula P, Okade M (2019) Camera zoom detection and classification based on application of histogram intersection and kullback leibler divergence. In: 2019 National conference on communications (NCC), pp 1–6

  21. Sandula P, Okade M (2019) Camera zoom detection in the compressed domain. In: International conference on range technology, (ICORT), pp 1–4

  22. Schoeffmann K, Taschwer M, Boeszoermenyi L (2009) Video browsing using motion visualization. In: IEEE International conference on multimedia and expo, pp 1835–1836

  23. The H.264 AVC JM Reference Software

  24. Wiegand T, Sullivan GJ, Bjontegaard G, Luthra A (2003) Overview of the h.264/avc video coding standard. IEEE Trans Circuits Syst Video Technol 13(7):560–576

    Article  Google Scholar 

  25. Yuan H, Chang Y, Lu Z, Ma Y (2010) Model based motion vector predictor for zoom motion. IEEE Signal Process Lett 17(9):787–790

    Article  Google Scholar 

  26. Zhang Z, Jing T, Ding B, Gao M, Li X (2019) A model-based approach of foreground region of interest detection for video codecs. Appl Sci 9:2670, 06

    Google Scholar 

Download references

Funding

This research work is supported by SERB, Government of India under grant No ECR/2016/ 000112.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pavan Sandula.

Ethics declarations

Conflict of Interests

The authors have no conflicts of interest to declare that are relevant to the content of this article.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sandula, P., Okade, M. Compressed domain video zoom motion analysis utilizing CURL. Multimed Tools Appl 81, 12759–12776 (2022). https://doi.org/10.1007/s11042-022-12363-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-12363-8

Keywords

Navigation