Mish-DCTGAN based combined image super-resolution and deblurring approach for blurry license plates

Pattanaik, Anmol; Balabantaray, Rakesh Chandra

doi:10.1007/s41870-023-01322-7

Mish-DCTGAN based combined image super-resolution and deblurring approach for blurry license plates

Original Research
Published: 12 June 2023

Volume 15, pages 2767–2775, (2023)
Cite this article

International Journal of Information Technology Aims and scope Submit manuscript

Anmol Pattanaik¹ &
Rakesh Chandra Balabantaray¹

116 Accesses
2 Citations
Explore all metrics

Abstract

Nowadays, there is a growing desire for high definition images with fine textures, yet images taken in natural settings frequently suffer from sophisticated fuzzy artifacts. Due to the fact that these obtrusive abnormalities significantly reduce the visual quality of images, deblurring methods have been developed from a variety of perspectives. Blind motion Deblurring is a fundamental and difficult challenge in image processing and computer vision. It attempts to restore a clear image from a blurred version, despite the fact that it has no knowledge of the blurring process. Numerous existing methods are employed to address these types of challenges, but they are incapable of handling the high frequency characteristics present in natural images, as real-world images are frequently low resolution and blurred in various ways. This article presents a technique for recognising vehicle licence plates captured by surveillance cameras under natural circumstances, which is important in the domain of intelligent transportation systems. These observed plate images are frequently of low resolution and suffer from considerable edge loss, posing a significant barrier to existing blind deblurring algorithms. We present a discrete cosine transform (DCT) generative adversarial network (DCTGAN) based approach with a Mish activation function called Mish-DCTGAN to jointly process image super-resolution and non-uniform deblurring. We evaluated our proposed approach to licence plate (LP) datasets and compared the results with other existing methodologies. Mish-DCTGAN achieves the best performance in terms of PSNR and SSIM, as demonstrated by our testing results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Enhancement of license plate recognition performance using Xception with Mish activation function

Article 14 October 2022

Automatic License Plate Recognition for Distorted Images Using SRGAN

ProDeblurGAN: Progressive Growing of GANs for Blind Motion Deblurring in Face Recognition

Data availability

The data will be provided on request.

References

Chakrabarti Ayan (2016) A neural approach to blind motion deblurring. In European conference on computer vision. pp 221–235. Springer, Cham
Su S, Delbracio M, Wang J, Sapiro G, Heidrich W, Wang O (2017) Deep video deblurring for hand-held cameras. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 1279–1288. IEEE
Chang X, Huang P. Y, Shen Y. D, Liang X, Yang Y, Hauptmann AG (2018) Rcaa: relational context-aware agents for person search. In Proceedings of the European Conference on Computer Vision (ECCV). pp 84–100
Liu AA, Nie WZ, Gao Y, Su YT (2016) Multi-modal clique-graph matching for view-based 3d model retrieval. IEEE Trans Image Process 25(5):2103–2116
Article MathSciNet MATH Google Scholar
Mao X, Shen C, Yang YB (2016) Image restoration using very deep convolutional encoder-decoder networks with symmetric skip connections. Adv Neural Inf Process Syst 29:401–409
Google Scholar
Liu J, Zhai G, Liu A, Yang X, Zhao X, Chen CW (2018) IPAD: intensity potential for adaptive de-quantization. IEEE Trans Image Process 27(10):4860–4872
Article MathSciNet Google Scholar
Liu AA, Su YT, Nie WZ, Kankanhalli M (2016) Hierarchical clustering multi-task learning for joint human action grouping and recognition. IEEE Trans Pattern Anal Mach Intell 39(1):102–114
Article Google Scholar
Chang X, Ma Z, Yang Y, Zeng Z, Hauptmann AG (2016) Bi-level semantic representation analysis for multimedia event detection. IEEE Trans Cybern 47(5):1180–1197
Article Google Scholar
Chang X, Yang Y, Xing E, Yu Y (2015) Complex event detection using semantic saliency and nearly-isotonic SVM. In International Conference on Machine Learning. pp 1348–1357, PMLR
Ma S, Liu J, Wen Chen C (2017) A-lamp: adaptive layout-aware multi-patch deep convolutional neural network for photo aesthetic assessment. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 4535–4544
Min X, Gu K, Zhai G, Liu J, Yang X, Chen CW (2017) Blind quality assessment based on pseudo-reference image. IEEE Trans Multimed 20(8):2049–2062
Article Google Scholar
Hradiš M, Kotera J, Zemcık P, Šroubek F (2015) Convolutional neural networks for direct text deblurring. In Proceedings of BMVC. pp 61–73
Ledig C, Theis L, Huszár F, Caballero J, Cunningham A, Acosta A, Shi W (2017) Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition pp 4681–4690
Zhu JY, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE international conference on computer vision pp 2223–2232
Ledig C, Theis L, Huszár F, Caballero J, Cunningham A, Acosta A, Shi W (2017) Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 4681–4690
Kupyn O, Budzan V, Mykhailych M, Mishkin D, Matas J (2018) Deblurgan: blind motion deblurring using conditional adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 8183–8192
Schmidt U, Schelten K, Roth S (2011) Bayesian deblurring with integrated noise estimation. In CVPR 2011. pp 2625–2632 IEEE
Schmidt U, Rother C, Nowozin S, Jancsary J, Roth S (2013) Discriminative non-blind deblurring. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 604–611
Xu L, Zheng S, Jia J (2013) Unnatural l0 sparse representation for natural image deblurring. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1107–1114
Babacan SD, Molina R, Do MN, Katsaggelos AK (2012) Bayesian blind deconvolution with general sparse image priors. In European conference on computer vision. pp 341–355, Springer, Berlin, Heidelberg
Perrone D, Favaro P (2014) Total variation blind deconvolution: the devil is in the details. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 2909–2916
Xu L, Jia J (2010) Two-phase kernel estimation for robust motion deblurring. In European conference on computer vision. Springer, Berlin
Book Google Scholar
Xu L, Zheng S, Jia J (2013) Unnatural l0 sparse representation for natural image deblurring. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1107–1114
Fergus R, Singh B, Hertzmann A, Roweis ST, Freeman WT (2006) Removing camera shake from a single photograph. In ACM SIGGRAPH 2006 Papers. pp 787–794
Boracchi G, Foi A (2012) Modeling the performance of image restoration from motion blur. IEEE Trans Image Process 21(8):3502–3517
Article MathSciNet MATH Google Scholar
Xu L, Zheng S, Jia J (2013) Unnatural l0 sparse representation for natural image deblurring. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1107–1114
Dong C, Loy CC, He K, Tang X (2015) Image super-resolution using deep convolutional networks. IEEE Trans Pattern Anal Mach Intell 38(2):295–307
Article Google Scholar
Kim J, Lee JK, Lee KM (2016) Accurate image super-resolution using very deep convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1646–1654
Lai WS, Huang JB, Ahuja N, Yang MH (2017) Deep Laplacian pyramid networks for fast and accurate super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 624–632
Ledig C, Theis L, Huszár F, Caballero J, Cunningham A, Acosta A, Shi W (2017) Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 4681–4690
Tao X, Gao H, Shen X, Wang J, Jia J (2018) Scale-recurrent network for deep image deblurring. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 8174–8182
Lim B, Son S, Kim H, Nah S, Mu Lee K (2017) Enhanced deep residual networks for single image super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops. pp 136–144
Wang X, Yu K, Wu S, Gu J, Liu Y, Dong C, Change Loy C (2018) Esrgan: enhanced super-resolution generative adversarial networks. In Proceedings of the European conference on computer vision (ECCV) workshops. pp 0–0
Xu X, Sun D, Pan J, Zhang Y, Pfister H, Yang MH (2017) Learning to super-resolve blurry face and text images. In Proceedings of the IEEE international conference on computer vision. pp 251–260
Zhang X, Wang F, Dong H, Guo Y (2018) A deep encoder-decoder networks for joint deblurring and super-resolution. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). pp 1448–1452, IEEE
Tomosada H, Kudo T, Fujisawa T, Ikehara M (2021) GAN-based image deblurring using DCT discriminator. In 2020 25th International Conference on Pattern Recognition (ICPR). pp 3675–3681, IEEE
Tomosada H, Kudo T, Fujisawa T, Ikehara M (2021) GAN-based image deblurring using DCT loss with customized datasets. IEEE Access. pp 135224–135233
Cheon M, Kim JH, Choi JH, Lee JS (2018) Generative adversarial network-based image super-resolution using perceptual content losses. In Proceedings of the European Conference on Computer Vision (ECCV) Workshops
Dong L, Zhou Y, Jiang J (2022) Generative synthesis of logos across DCT domain. Neurocomputing. pp 163–172
Ghedia NS, Vithalani CH (2021) Outdoor object detection for surveillance based on modified GMM and adaptive thresholding. Int J Inf Technol 13(1):185–193
Google Scholar
Shekar BH, Raveeshwara S (2022) Contour feature learning for locating text in natural scene images. Int J Inf Technol 14(4):1719–1724
Google Scholar
Bhatt MS, Patalia TP (2019) Content-based high-resolution satellite image classification. Int J Inf Technol 11:127–140
Google Scholar
Hisham B, Hamouda A (2021) Arabic sign language recognition using Ada-Boosting based on a leap motion controller. Int J Inf Technol 13:1221–1234
Google Scholar

Download references

Acknowledgements

I acknowledge the support of CLIA lab for this research.

Author information

Authors and Affiliations

International Institute of Information Technology Bhubaneswar, Bhubaneswar, India
Anmol Pattanaik & Rakesh Chandra Balabantaray

Authors

Anmol Pattanaik
View author publications
You can also search for this author in PubMed Google Scholar
Rakesh Chandra Balabantaray
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anmol Pattanaik.

Ethics declarations

Conflict of interest

The authors declare that there are no conflicts of interest.

Additional information

Supported by organization IIIT BBSR.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Pattanaik, A., Balabantaray, R.C. Mish-DCTGAN based combined image super-resolution and deblurring approach for blurry license plates. Int. j. inf. tecnol. 15, 2767–2775 (2023). https://doi.org/10.1007/s41870-023-01322-7

Download citation

Received: 16 December 2022
Accepted: 25 May 2023
Published: 12 June 2023
Issue Date: June 2023
DOI: https://doi.org/10.1007/s41870-023-01322-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Mish-DCTGAN based combined image super-resolution and deblurring approach for blurry license plates

Abstract

Access this article

Similar content being viewed by others

Enhancement of license plate recognition performance using Xception with Mish activation function

Automatic License Plate Recognition for Distorted Images Using SRGAN

ProDeblurGAN: Progressive Growing of GANs for Blind Motion Deblurring in Face Recognition

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Mish-DCTGAN based combined image super-resolution and deblurring approach for blurry license plates

Abstract

Access this article

Similar content being viewed by others

Enhancement of license plate recognition performance using Xception with Mish activation function

Automatic License Plate Recognition for Distorted Images Using SRGAN

ProDeblurGAN: Progressive Growing of GANs for Blind Motion Deblurring in Face Recognition

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation