Skip to main content
Log in

Mathematical model for shape description in DCT domain

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

This paper derives a mathematical relationship between the shape information of an object and their discrete cosine transform (DCT) coefficients. Here, each column of the object is put into a matrix. As the lengths of different columns in the object are different, the lengths of the columns of the matrix are reset to the maximum value among these lengths of the columns of the object and the rest elements in the columns of the matrix are set to zero. The mathematical relationship between the shape information of the object and the DCT coefficients is derived. By substituting the DCT coefficients in the derived model, the shape of the object can be obtained directly in the DCT domain. Since most of the images are coded in the DCT domain, the derived result can significantly improve the efficiency for identifying the objects.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  1. Foi A, Katkovnik V, Egiazarian K (2007) Pointwise shape-adaptive DCT for high-quality denoising and deblocking of grayscale and color images. IEEE Trans Image Process 16(5):1395–1411

    Article  MathSciNet  Google Scholar 

  2. Gonzalez RC, Woods RE (2000) Digital image processing, 3rd edn. Hall, Prentice

    Google Scholar 

  3. Guo H, Wu X, Cai S, Li N, Cheng J, Chen Y-L (2016) Quaternion discrete cosine transformation signature analysis in crowd scenes for abnormal event detection. Neurocomputing 204:106–115

    Article  Google Scholar 

  4. Ng WK, Lin Z (2000) A new shape-adaptive DCT for coding of arbitrarily shaped image segments. IEEE International Conference on Acoustics, Speech, and Signal Processing 4:2115–2118

    Google Scholar 

  5. Pang C-Y, Zhou R-G, Hu B-Q, Hu W-W, El-Rafei A (2019) Signal and image compression using quantum discrete cosine transform. Inf Sci 473:121–141

    Article  MathSciNet  Google Scholar 

  6. Persoon E, King-Sun F (1986) Shape discrimination using fourier descriptors. IEEE Trans Pattern Anal Mach Intell 8(3):388–397

    Article  Google Scholar 

  7. Shen G, Zeng B, Liou ML (1999) A new padding technique for coding of arbitrarily-shaped image/video segments. IEEE International Conference on Image Processing 2:838–842

    Google Scholar 

  8. Shen G, Zeng B, Liou ML (2001) Arbitrarily shaped transform coding based on a new padding technique. IEEE Transactions on Circuits and Systems for Video Technology 11(1):67–79

    Article  Google Scholar 

  9. Tahir N Md, Hussain A, Mustafa MM, Samad SA and Husin H (2007) Fourier descriptor for pedestrian shape recognition using support vector machine. IEEE International Symposium on Signal Processing and Information Technology, pp. 636–641

  10. Zahn CT, Roskies RZ (1972) Fourier descriptors for plane closed curves. IEEE Trans Comput 21(3):269–281

    Article  MathSciNet  Google Scholar 

  11. Zhang D, Lu G (2002) Shape-based image retrieval using generic fourier descriptor. Signal Process Image Commun 17(10):825–848

    Article  Google Scholar 

  12. Zhang P, Wang X, Wang X, Fei C, Guo Z (2019) Infrared small target detection based on spatial-temporal enhancement using quaternion discrete cosine transform. IEEE Access 7:54712

    Article  Google Scholar 

  13. Zhu S, Zeng B (2008) Coding of arbitrarily-shaped image blocks based on a constrained quantization. IEEE International Conference on Multimedia and Expo, pp. 193–196

Download references

Acknowledgments

This paper was supported partly by the National Nature Science Foundation of China (no. U1701266, no. 61372173 and no. 61671163), the Team Project of the Education Ministry of the Guangdong Province (no. 2017KCXTD011), the Guangdong Higher Education Engineering Technology Research Center for Big Data on Manufacturing Knowledge Patent (no. 501130144), the Guangdong Province Intellectual Property Key Laboratory Project (no. 2018B030322016) and Hong Kong Innovation and Technology Commission, Enterprise Support Scheme (no. S/E/070/17).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bingo Wing-Kuen Ling.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Huang, Z., Ling, B.WK. Mathematical model for shape description in DCT domain. Multimed Tools Appl 80, 10101–10112 (2021). https://doi.org/10.1007/s11042-020-09463-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-020-09463-8

Keywords

Navigation