Skip to main content
Log in

Mish-DCTGAN based combined image super-resolution and deblurring approach for blurry license plates

  • Original Research
  • Published:
International Journal of Information Technology Aims and scope Submit manuscript

Abstract

Nowadays, there is a growing desire for high definition images with fine textures, yet images taken in natural settings frequently suffer from sophisticated fuzzy artifacts. Due to the fact that these obtrusive abnormalities significantly reduce the visual quality of images, deblurring methods have been developed from a variety of perspectives. Blind motion Deblurring is a fundamental and difficult challenge in image processing and computer vision. It attempts to restore a clear image from a blurred version, despite the fact that it has no knowledge of the blurring process. Numerous existing methods are employed to address these types of challenges, but they are incapable of handling the high frequency characteristics present in natural images, as real-world images are frequently low resolution and blurred in various ways. This article presents a technique for recognising vehicle licence plates captured by surveillance cameras under natural circumstances, which is important in the domain of intelligent transportation systems. These observed plate images are frequently of low resolution and suffer from considerable edge loss, posing a significant barrier to existing blind deblurring algorithms. We present a discrete cosine transform (DCT) generative adversarial network (DCTGAN) based approach with a Mish activation function called Mish-DCTGAN to jointly process image super-resolution and non-uniform deblurring. We evaluated our proposed approach to licence plate (LP) datasets and compared the results with other existing methodologies. Mish-DCTGAN achieves the best performance in terms of PSNR and SSIM, as demonstrated by our testing results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data availability

The data will be provided on request.

References

  1. Chakrabarti Ayan (2016) A neural approach to blind motion deblurring. In European conference on computer vision. pp 221–235. Springer, Cham

  2. Su S, Delbracio M, Wang J, Sapiro G, Heidrich W, Wang O (2017) Deep video deblurring for hand-held cameras. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 1279–1288. IEEE

  3. Chang X, Huang P. Y, Shen Y. D, Liang X, Yang Y, Hauptmann AG (2018) Rcaa: relational context-aware agents for person search. In Proceedings of the European Conference on Computer Vision (ECCV). pp 84–100

  4. Liu AA, Nie WZ, Gao Y, Su YT (2016) Multi-modal clique-graph matching for view-based 3d model retrieval. IEEE Trans Image Process 25(5):2103–2116

    Article  MathSciNet  MATH  Google Scholar 

  5. Mao X, Shen C, Yang YB (2016) Image restoration using very deep convolutional encoder-decoder networks with symmetric skip connections. Adv Neural Inf Process Syst 29:401–409

    Google Scholar 

  6. Liu J, Zhai G, Liu A, Yang X, Zhao X, Chen CW (2018) IPAD: intensity potential for adaptive de-quantization. IEEE Trans Image Process 27(10):4860–4872

    Article  MathSciNet  Google Scholar 

  7. Liu AA, Su YT, Nie WZ, Kankanhalli M (2016) Hierarchical clustering multi-task learning for joint human action grouping and recognition. IEEE Trans Pattern Anal Mach Intell 39(1):102–114

    Article  Google Scholar 

  8. Chang X, Ma Z, Yang Y, Zeng Z, Hauptmann AG (2016) Bi-level semantic representation analysis for multimedia event detection. IEEE Trans Cybern 47(5):1180–1197

    Article  Google Scholar 

  9. Chang X, Yang Y, Xing E, Yu Y (2015) Complex event detection using semantic saliency and nearly-isotonic SVM. In International Conference on Machine Learning. pp 1348–1357, PMLR

  10. Ma S, Liu J, Wen Chen C (2017) A-lamp: adaptive layout-aware multi-patch deep convolutional neural network for photo aesthetic assessment. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 4535–4544

  11. Min X, Gu K, Zhai G, Liu J, Yang X, Chen CW (2017) Blind quality assessment based on pseudo-reference image. IEEE Trans Multimed 20(8):2049–2062

    Article  Google Scholar 

  12. Hradiš M, Kotera J, Zemcık P, Šroubek F (2015) Convolutional neural networks for direct text deblurring. In Proceedings of BMVC. pp 61–73

  13. Ledig C, Theis L, Huszár F, Caballero J, Cunningham A, Acosta A, Shi W (2017) Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition pp 4681–4690

  14. Zhu JY, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE international conference on computer vision pp 2223–2232

  15. Ledig C, Theis L, Huszár F, Caballero J, Cunningham A, Acosta A, Shi W (2017) Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 4681–4690

  16. Kupyn O, Budzan V, Mykhailych M, Mishkin D, Matas J (2018) Deblurgan: blind motion deblurring using conditional adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 8183–8192

  17. Schmidt U, Schelten K, Roth S (2011) Bayesian deblurring with integrated noise estimation. In CVPR 2011. pp 2625–2632 IEEE

  18. Schmidt U, Rother C, Nowozin S, Jancsary J, Roth S (2013) Discriminative non-blind deblurring. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 604–611

  19. Xu L, Zheng S, Jia J (2013) Unnatural l0 sparse representation for natural image deblurring. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1107–1114

  20. Babacan SD, Molina R, Do MN, Katsaggelos AK (2012) Bayesian blind deconvolution with general sparse image priors. In European conference on computer vision. pp 341–355, Springer, Berlin, Heidelberg

  21. Perrone D, Favaro P (2014) Total variation blind deconvolution: the devil is in the details. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 2909–2916

  22. Xu L, Jia J (2010) Two-phase kernel estimation for robust motion deblurring. In European conference on computer vision. Springer, Berlin

    Book  Google Scholar 

  23. Xu L, Zheng S, Jia J (2013) Unnatural l0 sparse representation for natural image deblurring. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1107–1114

  24. Fergus R, Singh B, Hertzmann A, Roweis ST, Freeman WT (2006) Removing camera shake from a single photograph. In ACM SIGGRAPH 2006 Papers. pp 787–794

  25. Boracchi G, Foi A (2012) Modeling the performance of image restoration from motion blur. IEEE Trans Image Process 21(8):3502–3517

    Article  MathSciNet  MATH  Google Scholar 

  26. Xu L, Zheng S, Jia J (2013) Unnatural l0 sparse representation for natural image deblurring. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1107–1114

  27. Dong C, Loy CC, He K, Tang X (2015) Image super-resolution using deep convolutional networks. IEEE Trans Pattern Anal Mach Intell 38(2):295–307

    Article  Google Scholar 

  28. Kim J, Lee JK, Lee KM (2016) Accurate image super-resolution using very deep convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1646–1654

  29. Lai WS, Huang JB, Ahuja N, Yang MH (2017) Deep Laplacian pyramid networks for fast and accurate super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 624–632

  30. Ledig C, Theis L, Huszár F, Caballero J, Cunningham A, Acosta A, Shi W (2017) Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 4681–4690

  31. Tao X, Gao H, Shen X, Wang J, Jia J (2018) Scale-recurrent network for deep image deblurring. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 8174–8182

  32. Lim B, Son S, Kim H, Nah S, Mu Lee K (2017) Enhanced deep residual networks for single image super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops. pp 136–144

  33. Wang X, Yu K, Wu S, Gu J, Liu Y, Dong C, Change Loy C (2018) Esrgan: enhanced super-resolution generative adversarial networks. In Proceedings of the European conference on computer vision (ECCV) workshops. pp 0–0

  34. Xu X, Sun D, Pan J, Zhang Y, Pfister H, Yang MH (2017) Learning to super-resolve blurry face and text images. In Proceedings of the IEEE international conference on computer vision. pp 251–260

  35. Zhang X, Wang F, Dong H, Guo Y (2018) A deep encoder-decoder networks for joint deblurring and super-resolution. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). pp 1448–1452, IEEE

  36. Tomosada H, Kudo T, Fujisawa T, Ikehara M (2021) GAN-based image deblurring using DCT discriminator. In 2020 25th International Conference on Pattern Recognition (ICPR). pp 3675–3681, IEEE

  37. Tomosada H, Kudo T, Fujisawa T, Ikehara M (2021) GAN-based image deblurring using DCT loss with customized datasets. IEEE Access. pp 135224–135233

  38. Cheon M, Kim JH, Choi JH, Lee JS (2018) Generative adversarial network-based image super-resolution using perceptual content losses. In Proceedings of the European Conference on Computer Vision (ECCV) Workshops

  39. Dong L, Zhou Y, Jiang J (2022) Generative synthesis of logos across DCT domain. Neurocomputing. pp 163–172

  40. Ghedia NS, Vithalani CH (2021) Outdoor object detection for surveillance based on modified GMM and adaptive thresholding. Int J Inf Technol 13(1):185–193

    Google Scholar 

  41. Shekar BH, Raveeshwara S (2022) Contour feature learning for locating text in natural scene images. Int J Inf Technol 14(4):1719–1724

    Google Scholar 

  42. Bhatt MS, Patalia TP (2019) Content-based high-resolution satellite image classification. Int J Inf Technol 11:127–140

    Google Scholar 

  43. Hisham B, Hamouda A (2021) Arabic sign language recognition using Ada-Boosting based on a leap motion controller. Int J Inf Technol 13:1221–1234

    Google Scholar 

Download references

Acknowledgements

I acknowledge the support of CLIA lab for this research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anmol Pattanaik.

Ethics declarations

Conflict of interest

The authors declare that there are no conflicts of interest.

Additional information

Supported by organization IIIT BBSR.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pattanaik, A., Balabantaray, R.C. Mish-DCTGAN based combined image super-resolution and deblurring approach for blurry license plates. Int. j. inf. tecnol. 15, 2767–2775 (2023). https://doi.org/10.1007/s41870-023-01322-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41870-023-01322-7

Keywords

Navigation