Abstract
Image denoising is an inverse reconstruction problem in which the original image is recovered from its noisy observations. Several deep learning models have been developed for image denoising. Denoising performance is usually measured with metrics such as the structural similarity index (SSIM) and peak signal-to-noise ratio (PSNR); in this paper, we take a more pragmatic approach. We design and conduct experiments that evaluate deep image denoising methods by how much they improve the performance of popular computer vision (CV) algorithms applied after denoising. We comparatively analyze three methods: the fast and flexible denoising convolutional neural network (FFDNet), the feed-forward denoising CNN (DnCNN), and deep image prior (DIP)-based image denoising. The CV algorithms tested are face detection, face recognition, and object detection. Standard and augmented datasets were used in our experiments. Various types and amounts of noise were added to raw images from standard datasets (BSDS500, LFW, FDDB, and WGISD). Our findings indicate that image denoising does not improve the performance of CV algorithms when it is applied to the raw images of the datasets. However, image denoising is very effective in improving the performance of the CV methods when it is applied to noise-corrupted images of the datasets. In our experiments, the improvement reached up to 11.70% in accuracy for the face detection experiment.
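The evaluation setup described above, corrupting raw images with controlled amounts of noise and scoring the result with PSNR, can be sketched as follows. This is a minimal illustration and not the authors' code: the abstract does not state which noise models or levels were used, so the additive Gaussian noise, the sigma values, and the helper names `add_gaussian_noise` and `psnr` are all assumptions made for the example, and the random array stands in for a dataset image.

```python
import numpy as np


def add_gaussian_noise(image, sigma, rng):
    """Return a copy of an 8-bit image with zero-mean Gaussian noise added.

    `sigma` is the noise standard deviation on the 0-255 intensity scale
    (an assumed noise model; the paper's exact settings are not given here).
    """
    noisy = image.astype(np.float64) + rng.normal(0.0, sigma, size=image.shape)
    # Round and clip back to the valid 8-bit range.
    return np.clip(np.rint(noisy), 0, 255).astype(np.uint8)


def psnr(reference, test, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two images of equal shape."""
    diff = reference.astype(np.float64) - test.astype(np.float64)
    mse = np.mean(diff ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Stand-in for a raw dataset image (hypothetical 64x64 grayscale data).
    clean = rng.integers(0, 256, size=(64, 64), dtype=np.uint8)
    for sigma in (15, 25, 50):  # illustrative noise levels only
        noisy = add_gaussian_noise(clean, sigma, rng)
        print(f"sigma={sigma}: PSNR = {psnr(clean, noisy):.2f} dB")
```

In a task-driven evaluation of the kind the abstract describes, the noisy and denoised images would additionally be passed to the downstream detector or recognizer, and its accuracy compared across the raw, noisy, and denoised versions.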
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Shah, M., Kumar, P. (2022). On the Efficacy of Deep Image Denoising for Computer Vision Applications. In: Singh, P.K., Wierzchoń, S.T., Chhabra, J.K., Tanwar, S. (eds) Futuristic Trends in Networks and Computing Technologies. Lecture Notes in Electrical Engineering, vol 936. Springer, Singapore. https://doi.org/10.1007/978-981-19-5037-7_28
DOI: https://doi.org/10.1007/978-981-19-5037-7_28
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-5036-0
Online ISBN: 978-981-19-5037-7
eBook Packages: Computer Science, Computer Science (R0)