Abstract
Nowadays, there is a growing desire for high definition images with fine textures, yet images taken in natural settings frequently suffer from sophisticated fuzzy artifacts. Due to the fact that these obtrusive abnormalities significantly reduce the visual quality of images, deblurring methods have been developed from a variety of perspectives. Blind motion Deblurring is a fundamental and difficult challenge in image processing and computer vision. It attempts to restore a clear image from a blurred version, despite the fact that it has no knowledge of the blurring process. Numerous existing methods are employed to address these types of challenges, but they are incapable of handling the high frequency characteristics present in natural images, as real-world images are frequently low resolution and blurred in various ways. This article presents a technique for recognising vehicle licence plates captured by surveillance cameras under natural circumstances, which is important in the domain of intelligent transportation systems. These observed plate images are frequently of low resolution and suffer from considerable edge loss, posing a significant barrier to existing blind deblurring algorithms. We present a discrete cosine transform (DCT) generative adversarial network (DCTGAN) based approach with a Mish activation function called Mish-DCTGAN to jointly process image super-resolution and non-uniform deblurring. We evaluated our proposed approach to licence plate (LP) datasets and compared the results with other existing methodologies. Mish-DCTGAN achieves the best performance in terms of PSNR and SSIM, as demonstrated by our testing results.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01322-7/MediaObjects/41870_2023_1322_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01322-7/MediaObjects/41870_2023_1322_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01322-7/MediaObjects/41870_2023_1322_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01322-7/MediaObjects/41870_2023_1322_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01322-7/MediaObjects/41870_2023_1322_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01322-7/MediaObjects/41870_2023_1322_Fig6_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs41870-023-01322-7/MediaObjects/41870_2023_1322_Fig7_HTML.png)
Similar content being viewed by others
Data availability
The data will be provided on request.
References
Chakrabarti Ayan (2016) A neural approach to blind motion deblurring. In European conference on computer vision. pp 221–235. Springer, Cham
Su S, Delbracio M, Wang J, Sapiro G, Heidrich W, Wang O (2017) Deep video deblurring for hand-held cameras. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 1279–1288. IEEE
Chang X, Huang P. Y, Shen Y. D, Liang X, Yang Y, Hauptmann AG (2018) Rcaa: relational context-aware agents for person search. In Proceedings of the European Conference on Computer Vision (ECCV). pp 84–100
Liu AA, Nie WZ, Gao Y, Su YT (2016) Multi-modal clique-graph matching for view-based 3d model retrieval. IEEE Trans Image Process 25(5):2103–2116
Mao X, Shen C, Yang YB (2016) Image restoration using very deep convolutional encoder-decoder networks with symmetric skip connections. Adv Neural Inf Process Syst 29:401–409
Liu J, Zhai G, Liu A, Yang X, Zhao X, Chen CW (2018) IPAD: intensity potential for adaptive de-quantization. IEEE Trans Image Process 27(10):4860–4872
Liu AA, Su YT, Nie WZ, Kankanhalli M (2016) Hierarchical clustering multi-task learning for joint human action grouping and recognition. IEEE Trans Pattern Anal Mach Intell 39(1):102–114
Chang X, Ma Z, Yang Y, Zeng Z, Hauptmann AG (2016) Bi-level semantic representation analysis for multimedia event detection. IEEE Trans Cybern 47(5):1180–1197
Chang X, Yang Y, Xing E, Yu Y (2015) Complex event detection using semantic saliency and nearly-isotonic SVM. In International Conference on Machine Learning. pp 1348–1357, PMLR
Ma S, Liu J, Wen Chen C (2017) A-lamp: adaptive layout-aware multi-patch deep convolutional neural network for photo aesthetic assessment. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 4535–4544
Min X, Gu K, Zhai G, Liu J, Yang X, Chen CW (2017) Blind quality assessment based on pseudo-reference image. IEEE Trans Multimed 20(8):2049–2062
Hradiš M, Kotera J, Zemcık P, Šroubek F (2015) Convolutional neural networks for direct text deblurring. In Proceedings of BMVC. pp 61–73
Ledig C, Theis L, Huszár F, Caballero J, Cunningham A, Acosta A, Shi W (2017) Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition pp 4681–4690
Zhu JY, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE international conference on computer vision pp 2223–2232
Ledig C, Theis L, Huszár F, Caballero J, Cunningham A, Acosta A, Shi W (2017) Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 4681–4690
Kupyn O, Budzan V, Mykhailych M, Mishkin D, Matas J (2018) Deblurgan: blind motion deblurring using conditional adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 8183–8192
Schmidt U, Schelten K, Roth S (2011) Bayesian deblurring with integrated noise estimation. In CVPR 2011. pp 2625–2632 IEEE
Schmidt U, Rother C, Nowozin S, Jancsary J, Roth S (2013) Discriminative non-blind deblurring. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 604–611
Xu L, Zheng S, Jia J (2013) Unnatural l0 sparse representation for natural image deblurring. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1107–1114
Babacan SD, Molina R, Do MN, Katsaggelos AK (2012) Bayesian blind deconvolution with general sparse image priors. In European conference on computer vision. pp 341–355, Springer, Berlin, Heidelberg
Perrone D, Favaro P (2014) Total variation blind deconvolution: the devil is in the details. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp 2909–2916
Xu L, Jia J (2010) Two-phase kernel estimation for robust motion deblurring. In European conference on computer vision. Springer, Berlin
Xu L, Zheng S, Jia J (2013) Unnatural l0 sparse representation for natural image deblurring. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1107–1114
Fergus R, Singh B, Hertzmann A, Roweis ST, Freeman WT (2006) Removing camera shake from a single photograph. In ACM SIGGRAPH 2006 Papers. pp 787–794
Boracchi G, Foi A (2012) Modeling the performance of image restoration from motion blur. IEEE Trans Image Process 21(8):3502–3517
Xu L, Zheng S, Jia J (2013) Unnatural l0 sparse representation for natural image deblurring. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1107–1114
Dong C, Loy CC, He K, Tang X (2015) Image super-resolution using deep convolutional networks. IEEE Trans Pattern Anal Mach Intell 38(2):295–307
Kim J, Lee JK, Lee KM (2016) Accurate image super-resolution using very deep convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1646–1654
Lai WS, Huang JB, Ahuja N, Yang MH (2017) Deep Laplacian pyramid networks for fast and accurate super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 624–632
Ledig C, Theis L, Huszár F, Caballero J, Cunningham A, Acosta A, Shi W (2017) Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 4681–4690
Tao X, Gao H, Shen X, Wang J, Jia J (2018) Scale-recurrent network for deep image deblurring. In Proceedings of the IEEE conference on computer vision and pattern recognition. pp 8174–8182
Lim B, Son S, Kim H, Nah S, Mu Lee K (2017) Enhanced deep residual networks for single image super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops. pp 136–144
Wang X, Yu K, Wu S, Gu J, Liu Y, Dong C, Change Loy C (2018) Esrgan: enhanced super-resolution generative adversarial networks. In Proceedings of the European conference on computer vision (ECCV) workshops. pp 0–0
Xu X, Sun D, Pan J, Zhang Y, Pfister H, Yang MH (2017) Learning to super-resolve blurry face and text images. In Proceedings of the IEEE international conference on computer vision. pp 251–260
Zhang X, Wang F, Dong H, Guo Y (2018) A deep encoder-decoder networks for joint deblurring and super-resolution. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). pp 1448–1452, IEEE
Tomosada H, Kudo T, Fujisawa T, Ikehara M (2021) GAN-based image deblurring using DCT discriminator. In 2020 25th International Conference on Pattern Recognition (ICPR). pp 3675–3681, IEEE
Tomosada H, Kudo T, Fujisawa T, Ikehara M (2021) GAN-based image deblurring using DCT loss with customized datasets. IEEE Access. pp 135224–135233
Cheon M, Kim JH, Choi JH, Lee JS (2018) Generative adversarial network-based image super-resolution using perceptual content losses. In Proceedings of the European Conference on Computer Vision (ECCV) Workshops
Dong L, Zhou Y, Jiang J (2022) Generative synthesis of logos across DCT domain. Neurocomputing. pp 163–172
Ghedia NS, Vithalani CH (2021) Outdoor object detection for surveillance based on modified GMM and adaptive thresholding. Int J Inf Technol 13(1):185–193
Shekar BH, Raveeshwara S (2022) Contour feature learning for locating text in natural scene images. Int J Inf Technol 14(4):1719–1724
Bhatt MS, Patalia TP (2019) Content-based high-resolution satellite image classification. Int J Inf Technol 11:127–140
Hisham B, Hamouda A (2021) Arabic sign language recognition using Ada-Boosting based on a leap motion controller. Int J Inf Technol 13:1221–1234
Acknowledgements
I acknowledge the support of CLIA lab for this research.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there are no conflicts of interest.
Additional information
Supported by organization IIIT BBSR.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Pattanaik, A., Balabantaray, R.C. Mish-DCTGAN based combined image super-resolution and deblurring approach for blurry license plates. Int. j. inf. tecnol. 15, 2767–2775 (2023). https://doi.org/10.1007/s41870-023-01322-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41870-023-01322-7