Compressed domain video zoom motion analysis utilizing CURL

Sandula, Pavan; Okade, Manish

doi:10.1007/s11042-022-12363-8

Compressed domain video zoom motion analysis utilizing CURL

Published: 21 February 2022

Volume 81, pages 12759–12776, (2022)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

134 Accesses
4 Citations
1 Altmetric
Explore all metrics

Abstract

This paper explores the application of the concept of CURL borrowed from vector calculus to the zoom motion detection and classification problems. The interframe block motion vectors extracted from the compressed bitstream form the input to the proposed method. These block motion vectors are analyzed by partitioning the motion vector field into 4 representative quadrants followed by quantizing the block motion vectors into 3 levels and converting the block motion vectors into complex motion vector space. The resultant vector for each of the 4 quadrants is estimated followed by estimating the velocity vector between the quadrants. The CURL of the velocity field is then estimated whose magnitude essentially provides the area enclosed between the resultant quadrant motion vectors which are utilized for separating the zooming and non-zooming camera types. The zooming camera frames are further classified into zoom-in and zoom-out types utilizing the direction information (anti-clockwise/clockwise) extracted from CURL of the velocity field. The novelty here stems from the fact that a concept borrowed from vector calculus is being applied to the zoom motion analysis problem. Although handcrafted features from CURL are utilized we demonstrate its superiority over existing methods including a deep learning architecture where we show the robustness of the proposed features extracted from CURL in the presence of noise. Experimental validation carried out utilizing block motion vectors extracted using Exhaustive Search Motion Estimation algorithm as well as H.264 decoded block motion vectors demonstrate superior performance for the proposed method both in terms of detection accuracy as well as computational complexity in comparison to existing techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Compressed domain zoom motion classification using local tetra patterns

Article 31 January 2019

Zoom motion estimation for color and depth videos using depth information

Article Open access 16 March 2020

Performance Analysis of Matching Criteria in Block-Based Motion Estimation for Video Encoding

References

Abdollahian G, Pizlo Z, Delp EJ (2008) A study on the effect of camera motion on human visual attention. In: 15th IEEE international conference on image processing, pp 693–696
Chang C-C, Lin C-J (2011) LIBSVM: A library for support vector machines. ACM Trans Intell Syst Technol, 2:27:1–27:27. Software available at http://www.csie.ntu.edu.tw/cjlin/libsvm
Chen YM, Bajic IV (2010) Motion vector outlier rejection cascade for global motion estimation. IEEE Signal Process Lett 17(2):197–200
Article Google Scholar
Deng Y, Manjunath BS (1997) Content-based search of video using color, texture, and motion. In: Proceedings of international conference on image processing, vol 2. pp 534–537, vol.2
Duan L-Y, Jin JS, Tian Q, Xu C-S (2006) Nonparametric motion characterization for robust classification of camera motion patterns. IEEE Trans Multimed 8(2):323–340
Article Google Scholar
Fang Y, Lin W, Chen Z, Tsai C, Lin C (2014) A video saliency detection model in compressed domain. IEEE Trans Circuits Syst Video Technol 24 (1):27–38
Article Google Scholar
Ghosh S, Biswas J (2017) Joint perception and planning for efficient obstacle avoidance using stereo vision. In: IEEE/RSJ International conference on intelligent robots and systems (IROS), pp 1026–1031
Guironnet M, Pellerin D, Rombaut M (2006) Camera motion classification based on transferable belief model. In: 14Th european signal processing conference, pp 1–5
Hasan MA, Xu M, He X, Xu C (2014) Camhid: Camera motion histogram descriptor and its application to cinematographic shot classification. IEEE Trans Circuits Syst Video Technol 24(10):1682–1695
Article Google Scholar
Jin R, Qi Y, Hauptmann A (2002) A probabilistic model for camera zoom detection. In: 16Th IEEE international conference on pattern recognition, vol 3. IEEE, pp 859–862
Kesana V, Okade M (2019) Compressed domain zoom motion classification using local tetra patterns. Signal Image and Video Processing 13(5):879–885
Article Google Scholar
Kilicarslan M, Zheng JY (2019) Predict vehicle collision by ttc from motion using a single video camera. IEEE Trans Intell Transp Syst 20(2):522–533
Article Google Scholar
Kreyszig E (2000) Advanced Engineering Mathematics, 8th edn. Wiley, New York
MATH Google Scholar
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems 25, pp 1097–1105
Luo J, Papin C, Costello K (2009) Towards extracting semantically meaningful key frames from personal video clips: From humans to computers. IEEE Trans Circuits Syst Video Technol 19(2):289–301
Article Google Scholar
Murala S, Maheshwari RP, Balasubramanian R (2012) Local tetra patterns: a new feature descriptor for content-based image retrieval. IEEE Trans Image Process 21(5):2874–2886
Article MathSciNet Google Scholar
Nakaya Y, Harashima H (1994) Motion compensation based on spatial transformations. IEEE Transactions on Circuits and Systems for Video Technology 4(3):339–356, 366–7
Article Google Scholar
Okade M, Patel G, Kumar Biswas P (2016) Robust learning-based camera motion characterization scheme with applications to video stabilization. IEEE Trans Circuits Syst Video Technol 26(3):453–466
Article Google Scholar
Po L-M, Wong K-M, Cheung K-W, Ng K-H (2010) Subsampled block-matching for zoom motion compensated prediction. IEEE Trans Circuits Syst Video Technol 20(11):1625–1637
Article Google Scholar
Sandula P, Okade M (2019) Camera zoom detection and classification based on application of histogram intersection and kullback leibler divergence. In: 2019 National conference on communications (NCC), pp 1–6
Sandula P, Okade M (2019) Camera zoom detection in the compressed domain. In: International conference on range technology, (ICORT), pp 1–4
Schoeffmann K, Taschwer M, Boeszoermenyi L (2009) Video browsing using motion visualization. In: IEEE International conference on multimedia and expo, pp 1835–1836
The H.264 AVC JM Reference Software
Wiegand T, Sullivan GJ, Bjontegaard G, Luthra A (2003) Overview of the h.264/avc video coding standard. IEEE Trans Circuits Syst Video Technol 13(7):560–576
Article Google Scholar
Yuan H, Chang Y, Lu Z, Ma Y (2010) Model based motion vector predictor for zoom motion. IEEE Signal Process Lett 17(9):787–790
Article Google Scholar
Zhang Z, Jing T, Ding B, Gao M, Li X (2019) A model-based approach of foreground region of interest detection for video codecs. Appl Sci 9:2670, 06
Google Scholar

Download references

Funding

This research work is supported by SERB, Government of India under grant No ECR/2016/ 000112.

Author information

Authors and Affiliations

National Institute of Technology Rourkela, Rourkela, Odisha, India
Pavan Sandula & Manish Okade

Authors

Pavan Sandula
View author publications
You can also search for this author in PubMed Google Scholar
Manish Okade
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pavan Sandula.

Ethics declarations

Conflict of Interests

The authors have no conflicts of interest to declare that are relevant to the content of this article.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sandula, P., Okade, M. Compressed domain video zoom motion analysis utilizing CURL. Multimed Tools Appl 81, 12759–12776 (2022). https://doi.org/10.1007/s11042-022-12363-8

Download citation

Received: 26 November 2020
Revised: 10 January 2022
Accepted: 18 January 2022
Published: 21 February 2022
Issue Date: April 2022
DOI: https://doi.org/10.1007/s11042-022-12363-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Compressed domain video zoom motion analysis utilizing CURL

Abstract

Access this article

Similar content being viewed by others

Compressed domain zoom motion classification using local tetra patterns

Zoom motion estimation for color and depth videos using depth information

Performance Analysis of Matching Criteria in Block-Based Motion Estimation for Video Encoding

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Compressed domain video zoom motion analysis utilizing CURL

Abstract

Access this article

Similar content being viewed by others

Compressed domain zoom motion classification using local tetra patterns

Zoom motion estimation for color and depth videos using depth information

Performance Analysis of Matching Criteria in Block-Based Motion Estimation for Video Encoding

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation