Abstract
This paper derives a mathematical relationship between the shape information of an object and their discrete cosine transform (DCT) coefficients. Here, each column of the object is put into a matrix. As the lengths of different columns in the object are different, the lengths of the columns of the matrix are reset to the maximum value among these lengths of the columns of the object and the rest elements in the columns of the matrix are set to zero. The mathematical relationship between the shape information of the object and the DCT coefficients is derived. By substituting the DCT coefficients in the derived model, the shape of the object can be obtained directly in the DCT domain. Since most of the images are coded in the DCT domain, the derived result can significantly improve the efficiency for identifying the objects.
Similar content being viewed by others
References
Foi A, Katkovnik V, Egiazarian K (2007) Pointwise shape-adaptive DCT for high-quality denoising and deblocking of grayscale and color images. IEEE Trans Image Process 16(5):1395–1411
Gonzalez RC, Woods RE (2000) Digital image processing, 3rd edn. Hall, Prentice
Guo H, Wu X, Cai S, Li N, Cheng J, Chen Y-L (2016) Quaternion discrete cosine transformation signature analysis in crowd scenes for abnormal event detection. Neurocomputing 204:106–115
Ng WK, Lin Z (2000) A new shape-adaptive DCT for coding of arbitrarily shaped image segments. IEEE International Conference on Acoustics, Speech, and Signal Processing 4:2115–2118
Pang C-Y, Zhou R-G, Hu B-Q, Hu W-W, El-Rafei A (2019) Signal and image compression using quantum discrete cosine transform. Inf Sci 473:121–141
Persoon E, King-Sun F (1986) Shape discrimination using fourier descriptors. IEEE Trans Pattern Anal Mach Intell 8(3):388–397
Shen G, Zeng B, Liou ML (1999) A new padding technique for coding of arbitrarily-shaped image/video segments. IEEE International Conference on Image Processing 2:838–842
Shen G, Zeng B, Liou ML (2001) Arbitrarily shaped transform coding based on a new padding technique. IEEE Transactions on Circuits and Systems for Video Technology 11(1):67–79
Tahir N Md, Hussain A, Mustafa MM, Samad SA and Husin H (2007) Fourier descriptor for pedestrian shape recognition using support vector machine. IEEE International Symposium on Signal Processing and Information Technology, pp. 636–641
Zahn CT, Roskies RZ (1972) Fourier descriptors for plane closed curves. IEEE Trans Comput 21(3):269–281
Zhang D, Lu G (2002) Shape-based image retrieval using generic fourier descriptor. Signal Process Image Commun 17(10):825–848
Zhang P, Wang X, Wang X, Fei C, Guo Z (2019) Infrared small target detection based on spatial-temporal enhancement using quaternion discrete cosine transform. IEEE Access 7:54712
Zhu S, Zeng B (2008) Coding of arbitrarily-shaped image blocks based on a constrained quantization. IEEE International Conference on Multimedia and Expo, pp. 193–196
Acknowledgments
This paper was supported partly by the National Nature Science Foundation of China (no. U1701266, no. 61372173 and no. 61671163), the Team Project of the Education Ministry of the Guangdong Province (no. 2017KCXTD011), the Guangdong Higher Education Engineering Technology Research Center for Big Data on Manufacturing Knowledge Patent (no. 501130144), the Guangdong Province Intellectual Property Key Laboratory Project (no. 2018B030322016) and Hong Kong Innovation and Technology Commission, Enterprise Support Scheme (no. S/E/070/17).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Huang, Z., Ling, B.WK. Mathematical model for shape description in DCT domain. Multimed Tools Appl 80, 10101–10112 (2021). https://doi.org/10.1007/s11042-020-09463-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-09463-8