Abstract
Noise reduction is one of the most important topics in digital image processing and, although it has been studied for a long time, it remains the subject of active research. In this work, we present an extension of the Mean Shift technique, which efficiently reduces Gaussian noise, so that it is also able to cope with impulsive disturbances. Furthermore, the elaborated technique can be applied to enhance images corrupted by a mixture of strong Gaussian and impulsive noise, which severely decreases the quality of color digital images. By means of our approach, which is based on a novel similarity measure between a pixel and a patch located in the center of the processing block, even heavily disturbed images can be effectively restored, which enables the success of further stages of the image processing pipeline. We evaluate the efficiency of the proposed method using a publicly available database of test color images and, applying a set of standard quality metrics, compare the restored images with the results delivered by state-of-the-art denoising methods. Additionally, we compare our method with the Medoid and Quick Shift techniques, which accelerate the original Mean Shift algorithm, in terms of objective quality criteria and computational complexity. The results of the performed experiments indicate that the proposed technique is superior to the widely used denoising techniques and can be used as a robust extension of the Mean Shift procedure. In the paper, particular emphasis is placed on the ability of the presented algorithm to preserve and enhance image edges. The performed experiments, evaluated using Pratt’s index, quantitatively confirm the superiority of the proposed design over the Mean Shift and standard denoising methods. The preservation of edges, and even their sharpening, is a very important feature of our algorithm when the final goal is segmentation, which plays a crucial role in various computer vision tasks.
The proposed algorithm is intended for mixed noise reduction in color images, but it can also be applied in multispectral imaging and clustering of multidimensional data. To enable the comparison of our method with the standard denoising techniques and to facilitate its application in other image processing fields, we made its code freely available.
Introduction
Noise reduction has been one of the paramount topics in digital image processing for decades. Over recent years, mainly due to the ubiquity of multimedia devices and popular applications for capturing, uploading and sharing color images through social networks, research focused on improving the quality of digital images has rapidly gained in importance.
The resolution of smartphones, webcams, video recorders and other devices used for the acquisition of visual information has notably increased and, although less and less light falls on each photosensitive element, images are expected to be of excellent quality. Therefore, the development of effective and fast algorithms to reduce noise, improve contrast and color saturation, and increase tonal dynamics is gaining more and more significance1,2,3. The demand for new, efficient noise reduction algorithms is also driven by novel multi-channel imaging techniques, particularly in medicine and biology, astronomy, earth observation and industrial applications. In all imaging modalities noise suppression plays a crucial role because, in general, improving image quality by reducing distortion levels determines the success of further processing and analysis4,5.
In this paper we focus on the Mean Shift (MS) technique, an effective method of reducing noise-induced distortions that smooths out the noise while preserving image edges. Owing to the ability of MS to flatten color images, it is also used as a pre-processing stage in image segmentation tasks. The method works well when enhancing images corrupted by Gaussian noise, but it fails severely when impulsive noise is also present, as the outlying pixels are preserved. Hence, to exploit the capabilities of the MS method, a suitable impulsive noise removal technique very often has to be applied first, and only afterwards is MS performed to reduce the remaining Gaussian noise component.
To alleviate the problems caused by the impulsive noise corrupting the images, we propose a robust modification of the standard MS technique. Our method may work directly on the noisy color images distorted by mixed impulsive and Gaussian noise without the need to remove the impulses at first and then, reduce the remaining Gaussian noise component. In addition, it offers very satisfying denoising results. As the proposed Robust Mean Shift (ROMS) can be directly applied to restore images corrupted by even strong mixed Gaussian and impulsive noise, it can be used in various practical imaging tasks. The described technique has been validated on a database available for download at http://denoising.net/ and also accessible as electronic supplementary material6, containing a variety of color test images contaminated with different noise intensity levels, and the obtained results show that it significantly outperforms the classical MS algorithm and its modifications. It is worth stating at this point that the comparison of the proposed robustified MS with state-of-the-art techniques also confirms its satisfying denoising performance. Using the elaborated technique, noise is efficiently suppressed even in heavily corrupted images, edges are substantially sharpened and details are retained.
In this work we also prove that the application of the pixel to patch similarity concept introduced in7,8 makes it possible to successfully restore color images corrupted by mixed impulsive and Gaussian noise using the MS based procedure without the need for any preprocessing. We provide the results of simulations which can guide users when setting the parameters of the new filter. We also show that the proposed denoising framework is able to enhance image edges, which can be crucial in many practical applications.
The rest of the paper is structured as follows. First, in “Related work” we briefly describe previous techniques focused on the reduction of the mixture of Gaussian and impulsive noise. Afterwards, in “Mean shift and its fast modifications” the Mean Shift technique is presented and we provide a description of its modifications, whose main aim is to decrease its computational complexity. In “Robust mean shift” we put forward the structure of the proposed Robust Mean Shift (ROMS) algorithm and in “Efficiency evaluation and comparison with existing filtering designs” we evaluate the impact of the proposed filter parameters on its efficiency when restoring color images affected by mixed impulsive and Gaussian noise of varying intensity. We also compare the efficiency of the ROMS filter with various standard filtering methods using image quality measures as well as Pratt’s Figure of Merit. Finally, we draw some summarizing conclusions and discuss future work directions.
Related work
Various kinds of noise degrade the quality of digital images, introducing distortions of their original content. Common noise sources include malfunctioning pixel elements in the camera sensors, thermal noise of the photosensitive matrix, shot noise caused by fluctuations in the photon flux, faulty memory locations or bit errors in hardware, timing errors in analog-to-digital converters, noise generated by electromagnetic interference and atmospheric turbulence, as well as errors caused by imperfect optics and transmission. The resulting image degradations are frequently modelled as a mixture of additive white Gaussian noise, mainly responsible for dark and shot currents, and impulsive noise generating pixels with random channel intensity values3,9,10,11.
Standard Gaussian noise filtering schemes estimate the intensity of the processed pixel or its color channel values, considering its similarity to surrounding pixels belonging to a local processing block. The popular Bilateral Filter (BF)12,13 takes into account the radiometric and topographic closeness between pixels and can effectively suppress image distortions while preserving edges. Nonetheless, it fails in the presence of impulsive disturbances, as the corrupted pixels are being preserved. The ability of the bilateral filter to enhance edges was extended by incorporating a guidance image, which can be the processed image itself or an output of a robust filter capable of removing impulsive noise14,15, so that the mixed noise can be better suppressed.
The efficiency of BF was considerably increased by the use of additional information about the structure of a pixel neighborhood. The Non-Local Means filter (NLM)16 compares pixel patches instead of single pixels and, by using a local measure of similarity between the patches, preserves image details and edges better. However, impulsive noise severely distorts the distance between the patches and, since pixels are also compared to themselves as in the BF, NLM is unable to eliminate outlying pixels.
Impulsive noise also affects the efficiency of Block-Matching and 3D Filtering (BM3D)17,18, which exploits the local sparse representation of the image in the transform domain19,20,21,22,23 and dampens the Gaussian noise by operating on a 3D stack of local patches from the sliding filtering block and applying a collaborative filtering-based shrinkage strategy. The Anisotropic Diffusion filter24 also cannot handle the impulses, as the local gradients between an impulsive pixel and its neighbors are high, which remarkably slows down the diffusion process and preserves the outlying pixels25.
To alleviate the above-mentioned problems, numerous image restoration approaches were designed to first remove impulsive noise and then apply an efficient filter intended for the reduction of the Gaussian noise component26,27,28,29,30. The rationale behind such an approach is that the two types of noise degradation are distinct in nature and are easier to manage when considered separately. Nevertheless, the Gaussian noise, through its masking effect, diminishes the accuracy of impulse detection methods, and in the following processing stage the remaining noise, whose characteristics are generally far from the assumed Gaussian distribution, is not satisfactorily attenuated, leaving visible artifacts and some impulses still unfiltered.
The Mean Shift technique, whose extension is the subject of this paper, can be used to detect impulsive noise. As the MS preserves the outlying pixels, they can be removed in the successive filtering step and the remaining image disturbances can be reduced with a suitable standard technique designed to cope with Gaussian noise31,32,33. In34 the authors proposed first detecting the impulses using the Adaptive Center-Weighted Median Filter (ACWMF) and replacing them using the Adaptive Median Filter (AMF). Subsequently, the BM3D filter is applied to smooth out the Gaussian noise component and finally, the previously detected outliers are corrected using an inpainting method based on the median filter. The ACWMF and AMF impulsive noise filters were also used to detect the outliers when denoising the image using blind inpainting methods35,36,37,38,39 and were applied in35 to ensure the robustness of a method based on a sparse image representation model.
Another technique40 combines the direction weighted median filter aimed at eliminating impulsive noise with the BM3D. The filtering result is further refined by detecting the remaining noise, which was not removed in the second processing stage. In41,42 the impulsive noise was removed with an adaptive averaging filter and the remaining distortions were suppressed with BM3D. In43 the Robust Outlyingness Ratio, which is a local statistic capable of efficiently detecting outliers, was used to remove impulsive noise disturbances and then the NLM filter was applied, adapting its parameters to the remaining noise characteristic. In44 the impulses were first detected by the sorted quadrant median vector28 and the unfiltered noise was smoothed out by BF, whose parameters were tuned on the basis of an estimation of the mixture noise composition and its intensity. The method of mixed noise reduction described in45 first detects the impulses and the subsequent stages use filters utilizing the PCA technique.
The approach proposed in30 consists of a fuzzy impulsive noise removal filter followed by additive noise reduction and a final postprocessing step. Once the impulsive noise is suppressed, a method based on sparse representation and 3D-processing performed with the use of DCT is applied. In the end, BF and an edge restoration technique are employed. An approach based on fuzzy logic was elaborated in46 to cope with the problem of impulsive noise detection. The outlier detection scheme, which compares the central pixel with its neighbors, is designed to prevent the filtering of noise-free image pixels. Then, a weighted averaging scheme incorporating the measure of pixel impulsiveness is used to suppress the noisy pixel. A fuzzy-based switching technique for impulse detection was also devised in47. The proposed scheme builds fuzzy membership functions based on the local pixel similarity to its neighborhood and the results of median processing.
The efficiency of the two-stage approaches is generally not satisfactory, as the methods require setting proper values of parameters that depend on the mixed noise characteristics and intensity. They are also computationally more demanding and are prone to producing visible artifacts. Therefore, many robust methods capable of suppressing both impulsive and Gaussian noise in one step have been developed. Primarily, these methods combine techniques of outlier detection with existing filtering designs so that the undesired influence of impulses can be eliminated or at least diminished.
A reliable statistic used to estimate pixel impulsiveness is the Rank Ordered Absolute Difference (ROAD), which calculates the sum of distances between a pixel and its most similar neighbors from a filtering window48,49. The ROAD statistic was applied in the construction of the Trilateral Filter, which incorporates it into the BF framework50,51,52,53. An extension of the ROAD statistic was also applied for impulse detection in the method proposed in54, which restores the corrupted pixels with an interpolation scheme based on the radial basis function.
A robust modification of the NLM filter was proposed in55. As the patches are contaminated by impulsive noise, the noisy pixels are assigned a weight using the ROAD measure56,57. The weights are utilized when calculating the similarities between the patches, which are used to build the weighted average of pixels from a processing block. The proposed approach proved to be more effective when the spatial difference is considered, as in the BF58. The authors of59 propose applying the traditional NLM technique in the first stage and then removing the remaining distortions again with NLM, whose coefficients depend on the difference between the pre-denoised result and the noisy image, and also take into account the result of the first denoising step.
The patch-based method was also applied in60. First, the intensity of the mixed noise is estimated using the ROAD measure; then the central patch of the processing block is compared with all other patches and the most similar ones are chosen; finally, a given pixel is denoised by applying a maximum likelihood estimator. The extension of the ROAD statistic to color image processing was also developed in61,62. The ROAD statistics, obtained using various kinds of Minkowski distance in the RGB color space, are used both as a measure of the noise distortion of a pixel and as a measure of its similarity to neighboring pixels, and are exploited in the construction of a fuzzy filter whose parameters are tuned to the noise characteristic. Another simple fuzzy filtering approach is based on the weighted vector median filter and the similarity between pixels expressed by ROAD63. The authors of64 proposed a fuzzy filter which can remove mixed noise in color images. The elaborated method relies on fuzzy rules based on the aggregated distances between the processed pixel and its neighbors.
A wavelet multi-scale analysis, combined with the local averaging filter was applied in65 and in66 an edge-preserving image denoising framework based on wavelet transform was also proposed. To successfully suppress noise distortions, a locally adaptive patch-based thresholding scheme was applied. The discrete wavelet transform coupled with the double window median filter was also used in67. The method utilizes row and column windows to perform median filtering and the wavelet coefficients are thresholded to suppress the noise-induced disturbances. Another technique developed in68 first smooths the image with a Gaussian or bilateral filter and then, examines the difference between the noisy and filtered images. To suppress the noise and retain the image details, the wavelet decomposition and thresholding were performed. The final restoration is achieved by using wavelet reconstruction, which efficiently estimates the image noise component.
An algorithm using clustering-based sparse representation exploiting both sparsity and non-local self-similarity was designed in69. The performed experiments confirm the competitive performance of the new technique which can be applied to video denoising. In70 the image denoising is formulated as an optimization problem that is solved iteratively by a weighted basis pursuit in the closed affine subspace. The reconstruction of the extracted noisy patches is performed by sparse representation using two dictionaries built with the DCT. The weighted encoding with the sparse non-local regularization technique (WESNR) was also applied in71 to cope with mixed noise. The noise-corrupted image patches were encoded over a set of pre-learned local PCA dictionaries and coding residuals were weighted adaptively to evaluate the pixel corruption measure. Additionally, both image sparsity and non-local self-similarity priors were combined into a single sparse regularization term. Another method depicted in72 first performs the initial denoising with the filter described in55. Then, the image pixels are classified as corrupted by impulsive or Gaussian noise and the final output is obtained using a variational approach. A method based on the total variation73,74,75 with \(\ell _0\)-norm fidelity was described in76. Although designed for impulsive noise, it can efficiently reduce various mixtures of noise models, too.
Impulsive noise can be also detected using the methods developed within the framework of mathematical morphology77,78,79,80 which enables the construction of two-stage filtering designs. In81 various impulsive noise detectors based on morphological operators were evaluated and the remaining noise was suppressed using morphological smoothers. A combination of the fuzzy approach to noise reduction with morphological operations was presented in82. The experiments revealed that the elaborated approach yields very promising results for highly contaminated images.
This brief review of the literature shows that mixed noise reduction methods can be divided into two classes. The algorithms belonging to the first class detect the impulses first and then reduce the remaining Gaussian noise. The second class of algorithms allows for simultaneous removal of both types of disturbances by introducing robust mechanisms of the suppression of outliers introduced by impulsive noise.
The analysis of the denoising algorithms shows that modifications of methods which are intended to reduce Gaussian noise, generally achieve good efficiency. Therefore, in this work we present a modified Mean Shift algorithm based on the robust pixel to patch similarity measure, which can be used to extend other Gaussian noise reduction methods such as BF, NLM or BM3D to diminish their susceptibility to impulsive disturbances.
Mean shift and its fast modifications
The Mean Shift is a powerful nonparametric iterative technique which is used for finding the local modes of a given density function. It was introduced in83 and has been further investigated by the authors of84; however, its immense popularity in image processing is due to the seminal work presented in85. This mode-seeking algorithm is based on the widely used kernel smoothing technique86 and features some similarity with the popular k-means clustering algorithm and the image smoothing approaches based on bilateral filtering and nonlinear diffusion25,87,88. Therefore, it is frequently applied for edge-preserving noise reduction89, object tracking, and color image and video segmentation90,91,92.
The MS technique is fairly effective in enhancing color images disturbed by the low-intensity Gaussian noise. It may smooth out the undesired noise component while minimizing the loss of sharpness of the image edges. Alas, this technique fails completely when the image is also corrupted by heavy-tailed noise, as the introduced impulses are treated as local modes and can be removed only when depleting the image contrast, which leads to its unacceptable blurry appearance. The inability of MS to remove impulses can be used for their detection, which enables their further treatment by a selected interpolation method32,93,94. To circumvent the effect of outlying pixels preservation, the impulses can be first removed using a method adapted to the impulsive noise intensity and the remaining pixels are later restored using the MS iterative scheme95.
Let us recall the structure of the MS technique, assuming a sample \(\varvec{X}=\{\varvec{x}_1, \ldots , \varvec{x}_n\}\) which consists of n multivariate observations \(\varvec{x}_i\), (\(i=1,\ldots , n\)) in the l-dimensional space \(\mathbb {R}^l\). The multivariate density estimator \(f(\varvec{x})\) calculated at observation \(\varvec{x}\) is defined using a kernel \(\Phi (\varvec{x})\), which is a bounded and symmetric function, as85,96
\[ f(\varvec{x})=\frac{1}{n\sigma ^{l}}\sum _{i=1}^{n}\Phi \left( \frac{\varvec{x}-\varvec{x}_i}{\sigma }\right) , \tag{1} \]
where \(\sigma \) is the bandwidth (smoothing parameter).
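In practice, the radially symmetric kernel defined as \(\Phi (\varvec{x})=c_{\phi ,l}\cdot \phi (\Vert \varvec{x} \Vert ^2)\) is used and we obtain
\[ f(\varvec{x})=\frac{c_{\phi ,l}}{n\sigma ^{l}}\sum _{i=1}^{n}\phi \left( \left\Vert \frac{\varvec{x}-\varvec{x}_i}{\sigma }\right\Vert ^2\right) , \tag{2} \]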
where \(\Vert \cdot \Vert \) denotes the Euclidean norm, \(\phi \) is a function called profile and \(c_{\phi ,l}\) is the normalizing factor. The popular choice of the profile is \(\phi (x)=\exp \left( -x/2 \right) \), which provides the Gaussian kernel \( \Phi _{\text {G}}(\varvec{x})=(2\pi )^{-\frac {l}{2}} \exp \left( - \Vert \varvec{x} \Vert ^2/2 \right) . \)
Introducing the function \(\psi (x)=-\phi ^{'}(x)\) we can define a kernel \(\Psi (\varvec{x})=c_{\psi ,l}\, \psi \left( \Vert \varvec{x} \Vert ^2 \right) \), with \(c_{\psi ,l}\) being a normalizing constant. The kernel \(\Phi \) is called a shadow of \(\Psi \); thus the Epanechnikov profile is a shadow of the uniform (flat) profile \(\psi _{\text {U}}(x)= 1\) if \(0\le x\le 1\) and 0 otherwise, while the Gaussian kernel and its shadow have the same form.
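The gradient of the density estimator \(f(\varvec{x})\) defined in (2) is thus given by
\[ \nabla f(\varvec{x})=C\sum _{i=1}^{n}(\varvec{x}-\varvec{x}_i)\,\phi ^{'}\left( \left\Vert \frac{\varvec{x}-\varvec{x}_i}{\sigma }\right\Vert ^2\right) , \tag{3} \]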
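where \(C=2c_{\phi ,l}/(n\sigma ^{l+2})\) is a constant. As \(\phi ^{'}(x)=-\psi (x)\) we obtain
\[ \nabla f(\varvec{x})=C\sum _{i=1}^{n}(\varvec{x}_i-\varvec{x})\,\psi \left( \left\Vert \frac{\varvec{x}-\varvec{x}_i}{\sigma }\right\Vert ^2\right) , \tag{4} \]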
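which can be rewritten as
\[ \nabla f(\varvec{x})=C\left[ \sum _{i=1}^{n}\psi \left( \left\Vert \frac{\varvec{x}-\varvec{x}_i}{\sigma }\right\Vert ^2\right) \right] \left[ \frac{\sum _{i=1}^{n}\varvec{x}_i\,\psi \left( \left\Vert \frac{\varvec{x}-\varvec{x}_i}{\sigma }\right\Vert ^2\right) }{\sum _{i=1}^{n}\psi \left( \left\Vert \frac{\varvec{x}-\varvec{x}_i}{\sigma }\right\Vert ^2\right) }-\varvec{x}\right] . \tag{5} \]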
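The first term of (5) is proportional to the density estimate calculated using the kernel \(\Psi \) and the second one is called the Mean Shift, denoted as \(\varvec{m}(\varvec{x})\)
\[ \varvec{m}(\varvec{x})=\frac{\sum _{i=1}^{n}\varvec{x}_i\,\psi \left( \left\Vert \frac{\varvec{x}-\varvec{x}_i}{\sigma }\right\Vert ^2\right) }{\sum _{i=1}^{n}\psi \left( \left\Vert \frac{\varvec{x}-\varvec{x}_i}{\sigma }\right\Vert ^2\right) }-\varvec{x}=\varvec{y}-\varvec{x}, \tag{6} \]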
where \(\varvec{y}\) is a weighted mean of \(\{\varvec{x}_i\}\) and the weights are determined by the appropriate values of the \(\psi \) function. The mean shift vector \(\varvec{m}(\varvec{x})\), which is the difference between the normalized weighted average obtained using the kernel \(\psi \) and the vector \(\varvec{x}\), is oriented in the direction of the gradient of the density function, or in other words, points towards its maximum increase.
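The mean shift procedure works in an iterative way. The observation \(\varvec{x}\) is shifted to \(\varvec{y}\) as shown in (6) and thereby, we can formulate the iterative mode finding scheme as85,97
\[ \varvec{x}^{(t+1)}=\frac{\sum _{i=1}^{n}\varvec{x}_i\,\psi \left( \left\Vert \frac{\varvec{x}^{(t)}-\varvec{x}_i}{\sigma }\right\Vert ^2\right) }{\sum _{i=1}^{n}\psi \left( \left\Vert \frac{\varvec{x}^{(t)}-\varvec{x}_i}{\sigma }\right\Vert ^2\right) }, \tag{7} \]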
where \(t\ge 1\) denotes the iteration number and \(\varvec{x}^{(1)}\) is the starting point. This iterative technique converges to a local mode85 and the process is terminated when the magnitude of the mean shift vector satisfies \(\Vert \varvec{m}(\varvec{x})\Vert <\epsilon \), where \(\epsilon \) is a small scalar value.
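The mean shift procedure defined by (7) can be formulated as a locally weighted least squares estimator98,99,100,101
\[ \varvec{x}^{(t+1)}=\mathop {\mathrm {arg\,min}}\limits _{\varvec{x}}\sum _{i=1}^{n}\Vert \varvec{x}-\varvec{x}_i\Vert ^2\,\psi \left( \frac{\Vert \varvec{x}^{(t)}-\varvec{x}_i\Vert ^2}{\sigma ^2}\right) . \tag{8} \]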
Calculating the gradient of the minimized expression and equating it to zero we obtain expression (7)100 and in the iteration \((t+1)\), a point \(\varvec{x}=\varvec{x}^{(t+1)}\) is being found, which minimizes the cost function \(\sum _{i=1}^{n} \Vert \varvec{x}-\varvec{x}_i\Vert ^2 \psi \left( \Vert \varvec{x}^{(t)} -\varvec{x}_i \Vert ^2 /{\sigma ^2 }\right) \). As can be readily noticed, this procedure is well known from the theory of M-estimators. The process of finding the successive points \(\varvec{x}^{(t)}\) is time-consuming as many iterations can be needed to obtain the \(\epsilon \)-convergence and in each iteration the distances between \(\varvec{x}^{(t)}\) and all points \(\{\varvec{x}_i\}\) need to be computed.
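The iterative mode finding scheme described above can be illustrated with a short sketch. It is not the authors' implementation: it assumes the Gaussian profile, drops normalizing constants (they cancel in the weighted mean), and the function name `mean_shift_mode` as well as the synthetic two-cluster sample are introduced here only for illustration.

```python
import numpy as np

def mean_shift_mode(points, start, sigma, eps=1e-3, max_iter=500):
    """Shift `start` to the weighted mean of `points` until the shift is below eps."""
    x = np.asarray(start, dtype=float)
    for _ in range(max_iter):
        # Squared distances to all samples, scaled by the bandwidth.
        d2 = np.sum((points - x) ** 2, axis=1) / sigma ** 2
        # Gaussian profile (assumed): psi(u) is proportional to exp(-u / 2).
        w = np.exp(-0.5 * d2)
        y = w @ points / w.sum()
        # The mean shift vector is y - x; stop when its magnitude is small.
        if np.linalg.norm(y - x) < eps:
            return y
        x = y
    return x

# Hypothetical test data: two well separated 2-D clusters.
rng = np.random.default_rng(0)
points = np.vstack([
    rng.normal(0.0, 0.3, size=(200, 2)),
    rng.normal(5.0, 0.3, size=(200, 2)),
])
mode_a = mean_shift_mode(points, start=[0.5, 0.5], sigma=0.5)
mode_b = mean_shift_mode(points, start=[4.5, 4.5], sigma=0.5)
```

Each call recomputes the distances from the current position to all n samples in every iteration, which is the cost the text refers to.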
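To speed up the process of finding the local modes, the Medoid Shift algorithm, which is based on the iterative procedure formulated in (8), has been proposed100,102. This modification assumes that only the points that belong to the set \(\varvec{X}\) can be taken as successive observations that minimize the cost function, and the trajectory is determined by
\[ \varvec{x}^{(t+1)}=\mathop {\mathrm {arg\,min}}\limits _{\varvec{x}\in \varvec{X}}\sum _{i=1}^{n}\Vert \varvec{x}-\varvec{x}_i\Vert ^2\,\psi \left( \frac{\Vert \varvec{x}^{(t)}-\varvec{x}_i\Vert ^2}{\sigma ^2}\right) , \tag{9} \]
or by
\[ \varvec{x}^{(t+1)}=\mathop {\mathrm {arg\,min}}\limits _{\varvec{x}\in \varvec{X}}\sum _{i=1}^{n}d^2(\varvec{x},\varvec{x}_i)\,\psi \left( \frac{d^2(\varvec{x}^{(t)},\varvec{x}_i)}{\sigma ^2}\right) , \tag{10} \]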
where \(d^2(\varvec{x}^{(t)},\varvec{x}_i)\) is the squared dissimilarity measure between the corresponding vectors, equal to \(\Vert \varvec{x}^{(t)}-\varvec{x}_i\Vert ^2\) when the Euclidean norm is used and \(\varvec{x}^{(1)}\) is again the starting point.
As a result, the trajectory of \(\varvec{x}^{(t)}\) obtained in successive iterations is constrained to pass only through the points that belong to the initial set \(\varvec{X}\). Although much faster than the original MS technique, this algorithm is also time-consuming as all the distances from the current position \(\varvec{x}^{(t)}\) to the points \(\varvec{x}_1,\ldots ,\varvec{x}_n\) belonging to \(\varvec{X}\) have to be calculated to determine the next iteration output \(\varvec{x}^{(t+1)}\). Notwithstanding, the computation of the distances between the points belonging to \(\varvec{X}\) has to be performed only once and when stored they can be reused in the iteration process described by (10).
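The remark that the pairwise distances need to be computed only once can be made concrete with a minimal sketch of a medoid-shift style procedure. The Gaussian profile, the function name `medoid_shift` and the synthetic data are assumptions made for this example; the squared Euclidean distance plays the role of \(d^2\).

```python
import numpy as np

def medoid_shift(points, sigma, max_iter=100):
    """Return, for each sample, the index of the sample its trajectory ends at."""
    # Pairwise squared Euclidean distances, computed once and reused.
    diff = points[:, None, :] - points[None, :, :]
    d2 = np.sum(diff ** 2, axis=2)
    # Profile values for every pair (Gaussian profile assumed).
    w = np.exp(-0.5 * d2 / sigma ** 2)
    # cost[t, c] = sum_i d2[c, i] * w[t, i]: cost of moving from sample t
    # to candidate sample c; only samples are allowed as candidates.
    cost = w @ d2.T
    nxt = np.argmin(cost, axis=1)
    # Follow the precomputed pointers until every trajectory stops moving.
    idx = np.arange(len(points))
    for _ in range(max_iter):
        new_idx = nxt[idx]
        if np.array_equal(new_idx, idx):
            break
        idx = new_idx
    return idx

# Hypothetical test data: two clusters of 40 samples each.
rng = np.random.default_rng(1)
points = np.vstack([
    rng.normal(0.0, 0.3, size=(40, 2)),
    rng.normal(5.0, 0.3, size=(40, 2)),
])
assignment = medoid_shift(points, sigma=0.5)
```

Because the next position of every sample depends only on the stored distance matrix, the iterations reduce to index lookups; the price is the quadratic memory needed for that matrix.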
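Another approach aiming to speed up the iterative mode finding procedure was proposed in99. The authors put forward a method named Quick Shift (QS), which is defined by
\[ \varvec{x}^{(t+1)}=\mathop {\mathrm {arg\,min}}\limits _{\varvec{x}_i\in \varvec{X}:\; f(\varvec{x}_i)>f(\varvec{x}^{(t)})}d^2(\varvec{x}^{(t)},\varvec{x}_i), \tag{11} \]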
where \(f(\varvec{x}_i)\) and \(f(\varvec{x}^{(t)})\) are the values of the density function defined by (1) at \(\varvec{x}_i\) and \(\varvec{x}^{(t)}\) and \(d^2\) is again a squared dissimilarity measure.
Using the Quick Shift, each initial point is moved to its nearest neighbor, for which the density function is increased and this process is repeated. As might be observed, if a global mode is present, then it is the final point of every trajectory that originates from the initial starting point \(\varvec{x}^{(1)}\). Therefore, some constraints regarding the maximum value of dissimilarity between the successive points in a trajectory can be predefined, so that the paths created through the iterative scheme are terminated before the global mode is reached.
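The linking rule and the effect of the distance constraint can be sketched as follows. This is an illustrative example only: the Gaussian kernel, the names `quick_shift`, `follow` and `max_dist`, and the synthetic data are assumptions, not part of the original method description.

```python
import numpy as np

def quick_shift(points, sigma, max_dist):
    """Link each sample to its nearest neighbor of strictly higher density."""
    diff = points[:, None, :] - points[None, :, :]
    d2 = np.sum(diff ** 2, axis=2)
    # Density estimate at every sample (Gaussian kernel assumed; constants
    # are dropped because only comparisons of densities are needed).
    density = np.exp(-0.5 * d2 / sigma ** 2).sum(axis=1)
    parent = np.arange(len(points))
    for i in range(len(points)):
        higher = density > density[i]
        # The distance limit terminates paths before the global mode.
        allowed = higher & (d2[i] <= max_dist ** 2)
        if allowed.any():
            candidates = np.flatnonzero(allowed)
            parent[i] = candidates[np.argmin(d2[i, candidates])]
    return parent

def follow(parent, i):
    """Walk the links from sample i to the end of its trajectory."""
    while parent[i] != i:
        i = parent[i]
    return int(i)

# Hypothetical test data: two clusters of 40 samples each.
rng = np.random.default_rng(2)
points = np.vstack([
    rng.normal(0.0, 0.3, size=(40, 2)),
    rng.normal(5.0, 0.3, size=(40, 2)),
])
limited = quick_shift(points, sigma=0.5, max_dist=2.0)
unlimited = quick_shift(points, sigma=0.5, max_dist=100.0)
roots_limited = {follow(limited, i) for i in range(len(points))}
roots_unlimited = {follow(unlimited, i) for i in range(len(points))}
```

With the limit in place the trajectories end in one sample per cluster, whereas without an effective limit all of them are expected to end in the single sample of highest density.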
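In regard to color images, the mean shift procedure operates in a 5-dimensional space (\(l=5\)). Two dimensions describe the pixel position, and another three are needed for the RGB color channels. The spatial and radiometric dimensions, expressed by vectors \(\varvec{\xi }\) and \(\varvec{\eta }\), have to be treated in a different way, which is reflected in the definition of the radial kernel, which is composed of two parts
\[ \Phi (\varvec{x})\propto \phi \left( \left\Vert \frac{\varvec{\xi }}{\sigma _{\text {s}}}\right\Vert ^2\right) \phi \left( \left\Vert \frac{\varvec{\eta }}{\sigma _{\text {r}}}\right\Vert ^2\right) , \tag{12} \]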
where \(\sigma _{\text {s}}\) and \(\sigma _{\text {r}}\) are kernel bandwidths (smoothing parameters) corresponding to the spatial (s) and radiometric (r) components.
Due to the properties of natural images, the correlation between pixels decreases with their topological distance on the image domain. This is incorporated into the spatial kernel, which is mostly a Gaussian function or is simplified by a flat kernel. Usually, the mean shift operates using a square processing block \(\mathcal {B}\) of size \((2r+1)\times (2r+1)\), where the radius r determines the ability of MS to determine the local modes, so that it can be viewed as a parameter influencing the extrema finding resolution. The processing block \(\mathcal {B}\) contains \(N=(2r+1)^2\) pixels which are indexed by \(j=1, \ldots , N\).
The second parameter \(\sigma _{\text {r}}\) influences the role of the pixel color components. If its value is small, only similar pixels are considered, whereas for large values, owing to oversmoothing, the local modes cannot be distinguished. The role of the size of the processing block is also clear in the case of the QS algorithm, for which the limited size of the processing block can prevent the digital paths from reaching the global mode.
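The MS algorithm iteratively calculates the new position \(\varvec{\xi }_i^{(t+1)}\) of the center of a processing block \(\mathcal {B}_i^{(t+1)}\) and its updated RGB values expressed by \(\varvec{\eta }_i^{(t+1)}\). The index i determines the position of a central pixel of the processing block. The procedure starts from the pixel position \(\varvec{\xi }_{i}^{(1)}\) at the center of the initial block \(\mathcal {B}_i^{(1)}\) which consists of pixels indexed by \(j=1,\ldots ,N\), where \(N=(2r+1)^2\) is the number of pixels in the sliding block. The RGB components of the initial pixel at location \(\varvec{\xi }_i^{(1)}\) are contained in the vector \(\varvec{\eta }_i^{(1)}\) and the same procedure is applied to each image pixel \(\varvec{x}_i\). Thus, the MS performs a series of iterations
\[ \varvec{\xi }_i^{(t+1)}=\frac{\sum _{j=1}^{N}w_{ij}^{(t)}\,\varvec{\xi }_j}{\sum _{j=1}^{N}w_{ij}^{(t)}},\qquad \varvec{\eta }_i^{(t+1)}=\frac{\sum _{j=1}^{N}w_{ij}^{(t)}\,\varvec{\eta }_j}{\sum _{j=1}^{N}w_{ij}^{(t)}}, \tag{13} \]
where
\[ w_{ij}^{(t)}=\psi \left( \left\Vert \frac{\varvec{\xi }_i^{(t)}-\varvec{\xi }_j}{\sigma _{\text {s}}}\right\Vert ^2\right) \psi \left( \left\Vert \frac{\varvec{\eta }_i^{(t)}-\varvec{\eta }_j}{\sigma _{\text {r}}}\right\Vert ^2\right) , \tag{14} \]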
and \(\sigma _{\text {r}}\), \(\sigma _{\text {s}}\) are smoothing parameters and each block \(\mathcal {B}_i^{(t)}\) contains pixels indexed from \(j=1\) to N.
The algorithm stops in iteration \((t+1)\) when \(\Vert \varvec{x}^{(t+1)} -\varvec{x}^{(t)} \Vert <\epsilon \), where \(\epsilon \) is a predefined small value (set in our experiments to \(10^{-3}\)). When this condition is satisfied, the local mode is reached and the magnitude of the mean shift vector is very close to 0. Then the final value of \(\varvec{\eta }_i^{(t)}\) is assigned to the pixel at the initial position \(\varvec{\xi }_i\). The structure of the standard MS algorithm is presented as pseudocode in Alg. 1.
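The per-pixel procedure can be sketched in a few lines. This is not a transcription of Alg. 1: it assumes Gaussian spatial and radiometric weights, clips the block at the image border, and uses the hypothetical function name `mean_shift_pixel` and a synthetic step-edge image. It also shows the behavior discussed earlier, namely that an impulse is treated as its own mode and is preserved.

```python
import numpy as np

def mean_shift_pixel(img, row, col, r, sigma_s, sigma_r, eps=1e-3, max_iter=50):
    """Run the joint spatial-range mean shift started at pixel (row, col)."""
    h, w, _ = img.shape
    xi = np.array([row, col], dtype=float)   # spatial position
    eta = img[row, col].astype(float)        # RGB values
    for _ in range(max_iter):
        # Block centered at the rounded position, clipped to the image.
        cr, cc = int(round(xi[0])), int(round(xi[1]))
        r0, r1 = max(cr - r, 0), min(cr + r + 1, h)
        c0, c1 = max(cc - r, 0), min(cc + r + 1, w)
        rows, cols = np.mgrid[r0:r1, c0:c1]
        pos = np.stack([rows.ravel(), cols.ravel()], axis=1).astype(float)
        rgb = img[r0:r1, c0:c1].reshape(-1, 3).astype(float)
        # Product of spatial and radiometric weights (Gaussian, assumed).
        ws = np.exp(-0.5 * np.sum((pos - xi) ** 2, axis=1) / sigma_s ** 2)
        wr = np.exp(-0.5 * np.sum((rgb - eta) ** 2, axis=1) / sigma_r ** 2)
        wgt = ws * wr
        new_xi = wgt @ pos / wgt.sum()
        new_eta = wgt @ rgb / wgt.sum()
        shift = np.sqrt(np.sum((new_xi - xi) ** 2) + np.sum((new_eta - eta) ** 2))
        xi, eta = new_xi, new_eta
        if shift < eps:
            break
    return eta

# Hypothetical test image: a vertical step edge with Gaussian noise.
rng = np.random.default_rng(3)
img = np.empty((21, 21, 3))
img[:, :10] = 50.0
img[:, 10:] = 200.0
img += rng.normal(0.0, 5.0, size=img.shape)
clean_out = mean_shift_pixel(img, 10, 5, r=3, sigma_s=3.0, sigma_r=20.0)

# The same pixel replaced by an impulse: the outlier survives the filtering.
noisy = img.copy()
noisy[10, 5] = [255.0, 0.0, 0.0]
impulse_out = mean_shift_pixel(noisy, 10, 5, r=3, sigma_s=3.0, sigma_r=20.0)
```

In the second call the radiometric weights of all neighbors are negligible compared with the weight of the impulse itself, so the weighted mean stays at the impulse value.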
Robust mean shift
The new approach to the problem of mixed Gaussian and impulsive noise suppression in color images is based on the previously discussed MS technique and the recently introduced Robust Local Similarity Filter (RLSF)7,8, which exhibits a very satisfying ability to reduce mixed Gaussian and impulsive noise in color images. The RLSF is based on the bilateral filter and a modified ROAD statistic.
In order to process an image pixel \(\varvec{x}_i\) and calculate the filter output \(\varvec{x}_i^{'}\), a weighted average of pixels from a block \(\mathcal {B}_i\) centered at \(\varvec{x}_i\) is computed. The weighting function utilizes the Robust Similarity Measure (RSM) defined as
where \(d_{j(k)}\), \(k=1,\ldots ,9\), is the k-th smallest Euclidean distance between vector \(\varvec{x}_j\) from the processing block \(\mathcal {B}_i\) and the pixels from a window \(\mathcal {W}_i\) of size \(3 \times 3\) at the center of this block and \(\alpha \in [1,9]\) is a parameter which denotes the number of closest neighbors taken for the average.
The RLSF filter output is defined as
with
where \(\varvec{x}_j\), \(j=1,\ldots , N\), are the pixels in the processing block \(\mathcal {B}_i\), \(d_{j(k)}\) is the Euclidean distance between \(\varvec{x}_j\) and the central pixel \(\varvec{x}_i\) of \(\mathcal {B}_i\) and \(\lfloor \cdot \rceil \) denotes rounding to the closest integer.
Thereby, only those pixels from the block \(\mathcal {B}_i\) that are most similar to the small central window \(\mathcal {W}_i\) are taken into account in the weight computation. The applied similarity measure is not affected by impulsive noise, as the outlying pixels are not considered when calculating the measure R using Eq. (15).
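The behaviour of the similarity measure can be sketched as follows. This is an illustration under stated assumptions, not the reference implementation: R is taken as the mean of the \(\alpha \) smallest squared Euclidean distances between a block pixel and the nine pixels of the central window, and the Gaussian-shaped weight \(\exp (-R/2\sigma ^2)\) is assumed. The function names `rsm` and `rlsf_output` are hypothetical.

```python
import math

def sq_dist(a, b):
    """Squared Euclidean distance between two RGB tuples."""
    return sum((p - q) ** 2 for p, q in zip(a, b))

def rsm(pixel, window, alpha=3):
    """Mean of the alpha smallest squared distances from pixel to the window pixels."""
    dists = sorted(sq_dist(pixel, wpx) for wpx in window)
    return sum(dists[:alpha]) / alpha

def rlsf_output(block, window, alpha=3, sigma=30.0):
    """Weighted average of the block pixels; weights decrease with the RSM.

    The Gaussian-shaped weight is an assumption of this sketch.
    """
    weights = [math.exp(-rsm(px, window, alpha) / (2 * sigma ** 2)) for px in block]
    total = sum(weights)
    return tuple(sum(wgt * px[k] for wgt, px in zip(weights, block)) / total
                 for k in range(3))

# A flat 3x3 window whose centre was hit by an impulse.
clean, impulse = (100, 100, 100), (255, 0, 0)
window = [clean] * 4 + [impulse] + [clean] * 4
print(rsm(clean, window), rsm(impulse, window))
print(rlsf_output(window, window))
```

In this toy case a clean pixel finds at least three identical pixels in the window, so its measure is zero and its weight is maximal, whereas the impulse is close only to itself and receives a negligible weight.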
In the RLSF filter design we neglect the topological distance between pixels, as we observed that it has little impact on the final results. Such a procedure also proved beneficial in the well-known NLM algorithm, which assumes self-similarity of image features, and omitting the spatial component speeds up the algorithm. When designing the R measure we also used squared Euclidean distances instead of the distances used in the original ROAD definition, as the measure is then more sensitive to outliers and easier to compute, because the square roots need not be evaluated.
The RLSF technique can be incorporated into the Mean Shift algorithm. It is worth noticing that the mean shift vector can be computed using the RSM, and spatial coordinates of the output are contained in the vector \(\varvec{\xi }_i^{'}\) calculated using the scheme defined in Eq. (16). Nonetheless, instead of directly comparing the pixels \(\varvec{x}_j\) from the block \(\mathcal {B}_i\) with its center pixel \(\varvec{x}_i\), we compare them with the central small window \(\mathcal {W}_i\) and calculate the weight as given by (17). In each iteration, like in the MS, the new values of the RGB channels \(\varvec{\eta }_i^{'}\) and new position \(\varvec{\xi }_i^{'}\) of the processed pixel \(\varvec{x}_i\) are calculated and the processing block \(\mathcal {B}_i\) is moved so that it matches the new output pixel position. Of course, for the shifting of the block the new coordinates have to be rounded to the nearest integer, using the \(\lfloor \cdot \rceil \) operation. Then, the new central window \(\mathcal {W}\) in the shifted block is determined and its center is replaced by the RGB values of the RLSF filter output defined in (16).
Accordingly, the main difference of the new filtering technique when compared to the Mean Shift is the adoption of a robust similarity measure and the replacement, in each iteration, of the center of the processing block by the RLSF filter output. Then, the iteration process is continued until convergence is achieved. In the final step, the RGB values of the original pixel \(\varvec{x}_i\) are replaced by \(\varvec{\eta }_i^{'}\) obtained using the RLSF, analogously to the classic MS technique. The structure of the new algorithm is shown using pseudocode in Alg. 2 and also explained in Fig. 2.
The replacement of the central pixel of the processing block (center of window \(\mathcal {W}\)) is advantageous as it smooths out the Gaussian noise component through weighted averaging and it is able to remove the outliers when they occupy the window center. Adopting this approach, the impulsive pixels do not influence the trajectory of the block center (as the outlying values are not considered by the robust similarity measure) until the stopping criterion is satisfied. This allows us to apply the MS procedure directly on the images contaminated with mixed Gaussian and impulsive noise and if needed to perform the final segmentation procedure as suggested in103.
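A compact sketch of this iteration for a single pixel of a grayscale image is given below. It is only an illustration: the mean of the \(\alpha \) smallest squared differences as the similarity measure, the Gaussian-shaped weight, the `max_iter` safeguard and the name `robust_mean_shift_pixel` are assumptions, and the actual procedure is the one specified in Alg. 2.

```python
import math

def robust_mean_shift_pixel(img, row, col, r=2, alpha=3, sigma=30.0,
                            eps=1e-3, max_iter=50):
    """Robust mean shift started at (row, col) of a grayscale image (list of rows)."""
    h, w = len(img), len(img[0])
    xi = (float(row), float(col))        # current position estimate
    eta = float(img[row][col])           # current intensity estimate

    for _ in range(max_iter):
        cr, cc = round(xi[0]), round(xi[1])   # block centre rounded to the grid

        def value(i, j):
            # The block (and window) centre carries the previous filter output.
            return eta if (i, j) == (cr, cc) else float(img[i][j])

        # Central 3x3 window (clipped at the image border).
        window = [value(i, j)
                  for i in range(max(0, cr - 1), min(h, cr + 2))
                  for j in range(max(0, cc - 1), min(w, cc + 2))]
        k = min(alpha, len(window))
        w_sum = acc_r = acc_c = acc_v = 0.0
        for i in range(max(0, cr - r), min(h, cr + r + 1)):
            for j in range(max(0, cc - r), min(w, cc + r + 1)):
                v = value(i, j)
                d = sorted((v - q) ** 2 for q in window)
                wgt = math.exp(-(sum(d[:k]) / k) / (2 * sigma ** 2))
                w_sum += wgt
                acc_r += wgt * i
                acc_c += wgt * j
                acc_v += wgt * v
        if w_sum == 0.0:
            break
        new_xi = (acc_r / w_sum, acc_c / w_sum)
        new_eta = acc_v / w_sum
        shift = math.sqrt((new_xi[0] - xi[0]) ** 2 + (new_xi[1] - xi[1]) ** 2
                          + (new_eta - eta) ** 2)
        xi, eta = new_xi, new_eta
        if shift < eps:                  # convergence reached
            break
    return eta
```

For a flat region with an impulse in the block center, the impulse obtains a very small weight in the first iteration, so the center is replaced by a value close to that of its neighbors and the following iterations operate on an impulse-free window.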
To better explain the proposed approach, Fig. 1 illustrates the first two iterations of the Robust Mean-Shift algorithm. For simplicity, a grayscale image is used. The block and window indexes denote the iteration number. In the first iteration, a sample pixel of intensity 6 (marked blue) is compared to the central window. The three closest pixel intensities are 8, 5 and 4. For illustration, we assume that the new calculated processing block will be \(\mathcal {B}_2\). This leads to a new central window \(\mathcal {W}_2\) with the central pixel \(\varvec{\eta }^{'}\) from the previous step (calculated using the RLSF) and unchanged pixels from the new processing block.
Efficiency evaluation and comparison with existing filtering designs
To evaluate the efficiency of the proposed filter, two sets of color test images contaminated with different types of noise were prepared. The chosen test images are depicted in Fig. 3. The images of the first set, containing mixed noise, were first distorted by Gaussian noise with standard deviation in the range 10–50 (with step 10), and subsequently 10–50% of the pixels (with the same step) were replaced by random-valued impulsive noise, so that every RGB channel of a corrupted pixel was assigned a value drawn from a uniform distribution in the range [0, 255]. The second set of images was degraded only by random-valued impulsive noise. To simplify the notation, noise level p denotes a Gaussian noise contamination with standard deviation p combined with \(p\%\) impulsive pixels. Such a combination of Gaussian and impulsive noise produces images with realistic levels of distortion and was used in our previous papers7,8,104, which enables us to compare the obtained results with those achieved using our other methods.
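The noise model can be sketched as follows. The helper name `add_mixed_noise`, the rounding and clipping of the Gaussian-distorted values to [0, 255], and the selection of impulsive pixels with an independent probability of \(p\%\) per pixel (rather than an exact fraction) are assumptions of this illustration.

```python
import random

def add_mixed_noise(img, p, rng=None, gaussian=True):
    """Contaminate an RGB image (list of rows of (R, G, B) tuples) at noise level p.

    gaussian=True : Gaussian noise with standard deviation p, then impulses.
    gaussian=False: random-valued impulsive noise only.
    """
    rng = rng or random.Random()
    noisy = []
    for row in img:
        new_row = []
        for px in row:
            if gaussian:
                # additive Gaussian noise, rounded and clipped (assumption)
                px = tuple(min(255, max(0, round(c + rng.gauss(0, p)))) for c in px)
            if rng.random() < p / 100:
                # random-valued impulse: each channel uniform in [0, 255]
                px = tuple(rng.randint(0, 255) for _ in range(3))
            new_row.append(px)
        noisy.append(new_row)
    return noisy

# Noise level p = 30 on a small uniform test image, with a fixed seed.
image = [[(128, 128, 128)] * 20 for _ in range(20)]
noisy = add_mixed_noise(image, 30, random.Random(0))
```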
The restoration efficiency has been assessed mainly using the commonly used PSNR and MAE quality measures105,106,107 defined as
where \(\varvec{x}_{i}\) are the original image pixels, \(\varvec{y}_{i}\) are the restored samples and n is the number of image pixels. To better express the ability to filter out impulses, a relaxed Mean Squared Error measure (\(\text {MSE}_{\text {R}}\)) is used. The so-called Impulse Removal Index (IRI) is defined as108
where \(\varvec{x}_j\) are the original (clean) pixels contained in the filtering window \(\mathcal {W}_i\) at image position i and \(\varvec{y}_i\) is the filtering output.
Thus, IRI measures the deviation between the processed pixel and the set of original pixels located in the corresponding operational window of size \(3\!\times \!3\). If the pixel is corrupted by impulsive noise, which is not removed, then the IRI will grow markedly. However, IRI will not increase when the corrupted pixel is replaced by a pixel close to one of its noise-free neighbors. In consequence, the IRI is a reliable measure of the ability of a filter to detect and replace corrupted picture elements. Additionally, the IRI measure does not penalize pixel shifts that make the edges sharper, whereas PSNR does penalize them.
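The difference between the standard and the relaxed error can be illustrated with a short sketch. It uses the usual color-image conventions (MSE and MAE averaged over all pixels and the three channels, PSNR with a peak value of 255), which are assumed here to match the definitions above. The function `relaxed_mse` computes only the relaxed error, the minimum squared distance between a restored pixel and the clean pixels in the corresponding \(3\!\times \!3\) window, and its normalization is likewise an assumption.

```python
import math

def sq_dist(a, b):
    return sum((p - q) ** 2 for p, q in zip(a, b))

def mse(orig, rest):
    n = len(orig) * len(orig[0])
    return sum(sq_dist(x, y) for ro, rr in zip(orig, rest)
               for x, y in zip(ro, rr)) / (3 * n)

def psnr(orig, rest):
    m = mse(orig, rest)
    return math.inf if m == 0 else 10 * math.log10(255 ** 2 / m)

def mae(orig, rest):
    n = len(orig) * len(orig[0])
    return sum(abs(p - q) for ro, rr in zip(orig, rest)
               for x, y in zip(ro, rr) for p, q in zip(x, y)) / (3 * n)

def relaxed_mse(orig, rest):
    """Mean over pixels of the smallest squared distance between the restored
    pixel and the clean pixels in the 3x3 window at the same position."""
    h, w = len(orig), len(orig[0])
    total = 0.0
    for i in range(h):
        for j in range(w):
            total += min(sq_dist(rest[i][j], orig[a][b])
                         for a in range(max(0, i - 1), min(h, i + 2))
                         for b in range(max(0, j - 1), min(w, j + 2)))
    return total / (3 * h * w)

# A step edge displaced by one pixel: penalized by MSE, not by the relaxed error.
black, white = (0, 0, 0), (255, 255, 255)
original = [[black, black, white, white]]
restored = [[black, white, white, white]]
print(mse(original, restored), relaxed_mse(original, restored))
```

The example shows why a measure of this kind tolerates slightly shifted or sharpened edges: the displaced pixel coincides with one of its clean neighbors, so it contributes nothing to the relaxed error.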
Furthermore, the Structural similarity (SSIM) measure and the Multi-Scale Similarity (MSSIM) were used to better express the image restoration quality in consistency with subjective ratings109,110. We converted these quality metrics into a logarithmic form in order to better compare the enhancement results
Because the SSIM measure works on grayscale images, the luminance of the images was calculated using the following conversion rule: \(L = 0.299\text {R} + 0.587\text {G} + 0.114\text {B}\).
The filter efficiency has been tested on 100 color test images of resolution 640\(\times \)480, depicted in Fig. 4, available at http://denoising.net/ and also included in6, to determine the best tuning parameters of the new method. Initially, the influence of the size of the processing block \(\mathcal {B}\) has been investigated. The plots in Fig. 5 show the dependence of the PSNR and the IRI measure on the radius r of the block \(\mathcal {B}\) containing \((2r+1)^2\) pixels, on the tuning parameter \(\sigma \) and on the number of close neighbors \(\alpha \). As an example, we chose the well-known color test image PEPPERS contaminated at levels \(p=10\) and \(p=30\).
For low noise levels, a processing block of size \(3\!\times \!3\) (\(r=1\)), which coincides with the central window \(\mathcal {W}\), is large enough, whereas for higher noise levels a block of \(5\!\times \!5\) (\(r=2\)) gives much better results. The choice of the second parameter, \(\alpha \), does not significantly influence the quality measures, and the three nearest pixels (\(\alpha =3\)) are optimal for both low and high contamination levels. The third parameter, \(\sigma \), depends on neither r nor \(\alpha \), but it does depend on the noise level. It reduces the impact of Gaussian noise in the filtering process; thus, the stronger the Gaussian noise component, the higher the value of \(\sigma \) that is needed. The recommended value of \(\sigma \) is 30 for low noise levels and 50 for higher noise intensity.
In Tables 1, 2, 3, 4 the new filtering design has been compared with a set of filters used for the reduction of noise in color images using the PSNR, IRI, SSIM and MSSIM restoration quality measures. The following filters were chosen for comparisons:
- Mean-Shift Filter (MS)85,
- Quick-Shift Filter (QS)99,
- Medoid-Shift Filter (MEDS)100,
- Bilateral Filter (BF)87,
- Non-Local Means Filter (NLM)16,
- Vector Median Filter (VMF)111,
- Block-Matching and 3D filtering (BM3D)17,
- Patch-based Approach to Remove Impulse-Gaussian Noise (PARIGI)112,
- Weighted Encoding with Sparse Nonlocal Regularization (WESNR)71,
- TV-based restoration method with \(\ell _0\)TV-norm data fidelity (\(\ell _0\)TV)76,
- Guided Bilateral Filter (GBF)113.
For the MS filter, we tested three variants that treat the weight assigned to the central pixel of the processing block in different ways. Since the central pixel of the block is compared with itself, its weight always takes the maximum value 1, independently of the pixel's RGB channels and the block structure. When this pixel is corrupted, it still receives the highest weight, even though its role in the averaging process should be diminished, so the impulses are mostly retained. The only way to force the algorithm to decrease the influence of the central pixel is to set a high value of the parameter \(\sigma _{\text {r}}\), which, however, causes considerable blurring. Therefore, to make a valid comparison of the new ROMS filter with the MS, three versions of the latter have been evaluated114,115:
- \(\text {MS}_{\text {STD}}\): standard version, as described in85, considering all pixels in the filtering block (including the central pixel),
- \(\text {MS}_{\text {NC}}\): the central pixel of the block is omitted in the calculation of the weights in Eq. (17) (no central pixel considered),
- \(\text {MS}_{\text {MAX}}\): the maximum weight calculated for the pixels in the block (excluding the central pixel) is assigned to the central pixel114.
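The three variants differ only in the weight given to the central pixel, which the following sketch makes explicit. The helper name `centre_weight` and the representation of the block weights as a flat list are hypothetical and serve only this illustration.

```python
def centre_weight(weights, centre, variant="STD"):
    """Return a copy of the block weights with the centre entry set per variant.

    weights : similarity weights of all block pixels
    centre  : index of the central pixel in that list
    """
    out = list(weights)
    if variant == "STD":
        out[centre] = 1.0            # similarity of the pixel to itself
    elif variant == "NC":
        out[centre] = 0.0            # central pixel not considered
    elif variant == "MAX":
        # largest weight found among the remaining block pixels
        out[centre] = max(w for k, w in enumerate(weights) if k != centre)
    else:
        raise ValueError(f"unknown variant: {variant}")
    return out

weights = [0.2, 0.5, 1.0, 0.1, 0.3]
for variant in ("STD", "NC", "MAX"):
    print(variant, centre_weight(weights, 2, variant))
```

With the NC and MAX variants, a corrupted central pixel no longer dominates the weighted average merely because it is compared with itself.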
The analysis of the PSNR results summarized in Table 1 indicates that the ROMS filter has much better denoising capabilities than the Mean Shift and its 3 modifications regarding the treatment of the central pixel. The performance of the MS modifications does not differ significantly in terms of PSNR and IRI, both for low and high contamination. However, when analysing the results in terms of the SSIM and MSSIM measures, which better express the perceived quality of the restored images, the version \(\text {MS}_{\text {NC}}\) is superior to the standard MS implementation. In addition, the Quick Shift and Medoid Shift were not able to effectively suppress the mixed noise, although the latter proved to be generally more effective when analyzing the PSNR results. Similar conclusions can be drawn while analyzing the results in terms of the IRI (Table 2), SSIM and MSSIM measures (Tables 3, 4).
The results presented in the tables show that the proposed denoising scheme outperforms the other filters taken for comparison when the test images are distorted by mixed noise of high intensity. For lower contamination levels, only the WESNR filter achieved slightly better performance for some images. The IRI results show that the ROMS filter efficiently removes the impulsive noise, outperforming the other methods. The satisfying properties of the ROMS are also confirmed by the SSIM and MSSIM results.
The very satisfying performance of the ROMS filter, expressed using the objective quality measures, can also be confirmed visually. Figures 7 and 8 show that the ROMS filter can cope with both pure impulsive and mixed noise, and that it exhibits significantly better denoising efficiency than the competing methods. The impulsive noise is much better attenuated, and the Gaussian component is efficiently reduced. The image details are well preserved, and the edges are even sharper than in the undistorted test images.
The restoration results obtained using the WESNR show that the filter is prone to generating color artifacts, especially for high contamination levels. The enhancement results achieved using the PARIGI filter reveal that this filter tends to retain impulsive pixels and is not able to suppress the Gaussian noise for high contamination levels.
It can also be observed that the standard Mean Shift is forced to increase the smoothing parameter to suppress the impulses, which makes the images blurry. The QS and MEDS techniques produce much sharper images; however, the edges are jagged and exhibit a strong zipper effect. In contrast, the proposed robust modification does not experience any problems with the outliers injected by the noise process and is able to sharpen image edges, which is well expressed by the IRI measure. The visual comparison of the images presented in Figs. 7a) and 8a) shows that the proposed Robust Mean-Shift offers very satisfying image enhancement results when only impulsive noise is present, which makes this algorithm very versatile. The effectiveness of the proposed technique is also confirmed in Fig. 11, which presents the enhancement results for real noisy images depicting 2 works of art and a cDNA microarray116. As can be observed, the noise of unknown characteristics is well suppressed and the edges are sharpened.
The ability of the proposed algorithm to sharpen edges of the image objects is very beneficial when they are to be segmented or edge detection is to be performed. Therefore, we decided to evaluate the efficiency of ROMS to enhance image edges. To that end, we prepared a test color image SQUARES consisting of square areas with slowly varying colors depicted in Fig. 9. The position of the edges was known, so we could objectively evaluate the ability of the ROMS and other filters to restore them.
The SQUARES image was corrupted with impulsive and mixed noise of different intensity and restored using ROMS, MS, QS, MEDS and the other filters used for comparison. For the objective assessment of the edge preservation ability of the tested algorithms, the well-known Pratt’s Figure of Merit (FOM)117 was used.
This measure is defined as follows
where I and \(I'\) are the numbers of detected edge pixels in the clean and filtered images, \(d_i\) is the topological distance between the found edge pixel and its closest pixel belonging to the set of ideal edges. Additionally, a design constant \(\gamma =9\) was used to penalize displaced edges. The FOM values are in the interval [0, 1] and higher values indicate better edge detector performance.
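A minimal sketch of such a figure of merit is given below. It assumes that each detected edge pixel contributes \(1/(1+d_i^2/\gamma )\), so that \(\gamma =9\) corresponds to the scaling constant 1/9 of the classic Pratt formulation, that \(d_i\) is the Euclidean distance on the pixel grid, and that the sum is normalized by \(\max (I, I')\). How \(\gamma \) enters the formula used in this work should be checked against the definition above. The function name `pratt_fom` and the representation of edge maps as sets of coordinates are hypothetical.

```python
def pratt_fom(ideal_edges, detected_edges, gamma=9.0):
    """Pratt-style figure of merit for two edge maps given as sets of (row, col).

    Each detected edge pixel contributes 1 / (1 + d^2 / gamma), where d is its
    distance to the nearest ideal edge pixel (assumed form, see lead-in).
    """
    if not ideal_edges or not detected_edges:
        return 0.0
    total = 0.0
    for r, c in detected_edges:
        d2 = min((r - a) ** 2 + (c - b) ** 2 for a, b in ideal_edges)
        total += 1.0 / (1.0 + d2 / gamma)
    return total / max(len(ideal_edges), len(detected_edges))

# A vertical edge of ten pixels: perfectly detected, displaced by one column,
# and only half detected.
ideal = {(i, 5) for i in range(10)}
print(pratt_fom(ideal, ideal))
print(pratt_fom(ideal, {(i, 6) for i in range(10)}))
print(pratt_fom(ideal, {(i, 5) for i in range(5)}))
```

Under these assumptions both displaced and missing edge pixels lower the score, so the measure rewards edge maps that are simultaneously complete and well localized.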
Figures 9a) and 10a) show the test image SQUARES distorted with impulsive and mixed noise together with the results of its enhancement. As can be noticed, the restoration quality of ROMS exceeds that of the Mean Shift and its modifications (MEDS and QS) for both impulsive and mixed noise. The ROMS filter efficiently removes impulsive noise, the edges are not distorted by the zipper effect and no blotches are produced.
The obtained results also exhibit the tendency of the WESNR to produce color blotches both for impulsive and mixed noise. The inability of the PARIGI filter to suppress the strong Gaussian noise was also confirmed.
The very good edge-enhancing properties of ROMS are supported by the maps of the Vector Range edge detector118,119 presented in Figs. 9b) and 10b). The edges are continuous and do not deviate much from those obtained using the undistorted image, which is reflected by the FOM measures summarized in Table 5. The analysis of the FOM values shows that the ROMS considerably outperforms all competing techniques when the ability to suppress impulsive and mixed noise in flat areas while retaining their edges is considered.
Figure 6 compares the execution time of the ROMS filter with that of the standard MS. The proposed filter is slower than MS, but it is still fast enough for real-time processing tasks or video denoising, even for images in full HD resolution. The experiments were performed on a CUDA-compatible NVIDIA RTX 2080 Ti graphics card. The plots showing the dependence of the execution times on the block radius r were obtained by averaging the results of 1000 repetitions.
Conclusions
In the paper, a Robust Mean Shift technique, able to efficiently suppress mixed Gaussian and impulsive noise, has been presented. The new filtering framework makes use of a robust similarity measure which compares a pixel belonging to a processing block with a given number of the closest samples in a filtering window at its center. Thereby, the introduced similarity measure is resilient to noise disturbances and allows the construction of robust noise filtering designs.
The adoption of the Mean Shift concept proved to deliver very satisfying noise restoration results, even for images degraded by impulsive and mixed noise of high intensity. The obtained results were presented using objective quality measures and indicated that the new filter outperforms the Mean Shift and also Medoid Shift and Quick Shift methods, which are its modifications developed to make the algorithm faster.
The proposed ROMS outperforms some widely used denoising methods that rely on the Euclidean distance, such as BF, NLM, BM3D and VMF, in terms of standard restoration quality metrics. The proposed robust similarity measure between a pixel and a patch in the center of the processing block substantially diminishes the influence of the impulsive pixels. In this way, the susceptibility of the Euclidean distance to outliers can be considerably decreased, which makes it possible to construct new methods based on classical, well-researched algorithms.
The proposed method is especially effective for images contaminated with mixed noise of high intensity. As the pixels from the filtering block are compared with the patch of pixels in its center, some tiny details are treated as noise and removed. This can be seen as a drawback when the preservation of tiny details in images with low contamination is desired. Another feature of the proposed filter is its tendency to sharpen image edges. Edge enhancement is beneficial in most applications; however, the creation of overly strong edges can lead to image oversegmentation. To decrease the impact of this effect, the filter parameter \(\sigma \) can be increased, which leads to softer edge profiles without diminishing the ability of the filter to suppress the mixed noise.
The visual analysis of the experiments performed on a set of standard color images proved that the new filter can efficiently remove the impulsive noise while smoothing out the Gaussian noise component. The beneficial feature of the new filtering design is its ability to preserve and even enhance the image edges. An analysis performed applying the Vector Range edge detector and the Pratt’s Figure of Merit (FOM) revealed that the new technique is able to precisely locate the edges and significantly outperforms the designs based on the Mean Shift and its modifications and also other competitive denoising techniques.
The described method can be used for the enhancement of both gray scale and color images and their temporal sequences. It can be easily extended to work on multispectral images corrupted by a mixture of Gaussian and heavy tailed noise. Possible applications include underwater imagery, virtual enhancement of works of art, restoration of old photographs, filtering of microscopy images and enhancement of multispectral satellite, astronomical and medical data acquired from various modalities. Additionally, the elaborated approach can be applied for image segmentation tasks and also for multi-dimensional data clustering120.
Future work will focus on incorporating the robust pixel-to-patch similarity measure into the structure of the Medoid and Quick Shift, so that they can cope better with impulsive and mixed noise corrupting color images. Preliminary experimental results show that the adoption of an additional weight assigned to the pixels from the processing block, expressing their degree of corruption, considerably improves the denoising effectiveness of the developed filtering framework. In this way, a new family of very fast filters utilizing the Medoid and Quick Shift concepts will be elaborated.
Data availability
A working implementation of the algorithm is available at https://github.com/dkusnik/RMS.
References
Chatterjee, P. & Milanfar, P. Is denoising dead?. IEEE Trans. Image Process 19, 895–911 (2010).
Lukac, R., Smolka, B., Martin, K., Plataniotis, K. & Venetsanopoulos, A. Vector filtering for color imaging. IEEE Signal Process. Mag. 22, 74–86 (2005).
Mafi, M. et al. A comprehensive survey on impulse and Gaussian denoising filters for digital images. Signal Process. 157, 236–260 (2019).
Huang, S.-C. et al. An advanced noise reduction and edge enhancement algorithm. Sensors 21, 25 (2021).
Fevralev, D. et al. Efficiency analysis of color image filtering. EURASIP J. Adv. Signal Process 41, 25 (2011).
Malinski, L. & Smolka, B. Fast adaptive switching technique of impulsive noise removal in color images. J. Real-Time Image Process 20, 1077–1098 (2019).
Kusnik, D. & Smolka, B. On the robust technique of mixed Gaussian and impulsive noise reduction in color digital images. In Information, Intelligence, Systems and Applications (IISA), 2015 6th International Conference on, 1–6 (2015).
Smolka, B. & Kusnik, D. Robust local similarity filter for the reduction of mixed Gaussian and impulsive noise in color digital images. Signal Image Video Process 9, 49–56 (2015).
Boncelet, C. Image noise models. In Handbook of Image and Video Processing, Communications, Networking and Multimedia (ed. Bovik, A.) 397–410 (Academic Press, 2005).
López-Rubio, E. Restoration of images corrupted by Gaussian and uniform impulsive noise. Pattern Recognit. 43, 1835–1846 (2010).
Zhang, J. et al. A survey on computational spectral reconstruction methods from rgb to hyperspectral imaging. Sci. Rep. 12, 25 (2022).
Tomasi, C. & Manduchi, R. Bilateral filtering for gray and color images. In Sixth International Conference on Computer Vision, 839–846 (1998).
Elad, M. On the origin of the bilateral filter and ways to improve it. IEEE Trans. Image Process 11, 1141–1151 (2002).
He, K., Sun, J. & Tang, X. Guided image filtering. IEEE Trans. Pattern Anal. Mach. Intell. 35, 1397–1409 (2013).
Zhang, Q., Shen, X., Xu, L. & Jia, J. Rolling guidance filter. In Computer Vision–ECCV 2014 (eds Fleet, D. et al.) 815–830 (Springer, 2014).
Buades, A., Coll, B. & Morel, J. M. A non-local algorithm for image denoising. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 2, 60–65 (2005).
Dabov, K., Foi, A., Katkovnik, V. & Egiazarian, K. Image denoising by sparse 3-D transform-domain collaborative filtering. IEEE Trans. Image Process 16, 2080–2095 (2007).
Yahya, A. A. et al. BM3D image denoising algorithm based on an adaptive filtering. Multimed. Tools Appl. 79, 20391–20427 (2020).
Kravchenko, V., Ponomaryov, V., Pustovoit, V. & Enriquez, A. P. 3D filtering of images corrupted by additive-multiplicative noise. Doklady Math. 494, 71–75 (2020).
Zhou, Y. et al. An image denoising algorithm for mixed noise combining nonlocal means filter and sparse representation technique. J. Vis. Commun. Image Represent. 41, 74–86 (2016).
Kravchenko, V., Ponomaryov, V. & Pustovoit, V. 3D filtering of images corrupted by mixed additive-impulsive noise. Doklady Phys. 63, 321–325 (2018).
Xu, Y., Zhao, Y. & Lu, P. Mixed noise reduction via sparse error constraint representation of high frequency image for wildlife image. Multimed. Tools Appl. https://doi.org/10.1007/s11042-022-13247-7 (2022).
Ponomaryov, V. & Palacios-Enriquez, A. Sparse approach in filtering of color images corrupted by mixture noises. WSEAS Trans. Signal Process 16, 81–88 (2020).
Perona, P. & Malik, J. Scale-space and edge detection using anisotropic diffusion. IEEE Trans. Pattern Anal. Mach. Intell. 12, 629–639 (1990).
Barash, D. Fundamental relationship between bilateral filtering, adaptive smoothing, and the nonlinear diffusion equation. IEEE Trans. Pattern Anal. Mach. Intell. 24, 844–847 (2002).
Abreu, E., Lightstone, M., Mitra, S. & Arakawa, K. A new efficient approach for the removal of impulse noise from highly corrupted images. IEEE Trans. Image Process 5, 1012–1025 (1996).
Li, R. & Zhang, Y.-J. A hybrid filter for the cancellation of mixed Gaussian noise and impulse noise. In International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint, vol. 1, 508–512 (2003).
Lin, C. H., Tsai, J. S. & Chiu, C. T. Switching bilateral filter with a texture/noise detector for universal noise removal. In IEEE International Conference on Acoustics, Speech and Signal Processing, 1434–1437 (2010).
Kim, D.-G. et al. Hybrid deep learning framework for reduction of mixed noise via low rank noise estimation. IEEE Access 10, 46738–46752 (2022).
Palacios-Enriquez, A., Ponomaryov, V., Reyes-Reyes, R. & Sadovnychiy, S. Sparse technique for images corrupted by mixed Gaussian-impulsive noise. Circ. Syst. Signal Process 37, 5389–5416 (2018).
Jin, L., Zhu, Z., Song, E. & Ma, G. Impulse noise reduction in color images based on mean shift and channel suppression. J. Electron. Imaging 27, 043011 (2018).
Kwon, Y.-M. & Lim, M.-J. Selective removal of impulse noise preserving edge information. In Computer Applications for Database, Education, and Ubiquitous Computing (eds Kim, T.-H. et al.) 227–232 (Springer, 2012).
Shao, Y., Jiang, J. & Hong, X. An improved nonlocal means-based correction strategy for mixed noise removal. IET Image Process. 20, 1–14 (2022).
Xing, M. & Gao, G. An efficient method to remove mixed Gaussian and random-valued impulse noise. PLoS One 17, 1–20 (2022).
Chen, L., Liu, L. & Philip, C. A robust bi-sparsity model with non-local regularization for mixed noise reduction. Inf. Sci. 354, 101–111 (2016).
Dong, B., Ji, H., Li, J., Shen, Z. & Xu, Y. Wavelet frame based blind image inpainting. Appl. Comput. Harmon. Anal. 32, 268–279 (2012).
Cai, J. F., Chan, R. H. & Nikolova, M. Two-phase approach for deblurring images corrupted by impulse plus Gaussian noise. Inverse Probl. Imaging 2, 187–204 (2008).
Nikolova, M. A variational approach to remove outliers and impulse noise. J. Math. Imaging Vis. 20, 99–120 (2004).
Yan, M. Restoration of images corrupted by impulse noise and mixed Gaussian impulse noise using blind inpainting. SIAM J. Imaging Sci. 6, 25 (2013).
Yamaguchi, T., Suzuki, A. & Ikehara, M. Detail preserving mixed noise removal by DWM filter and BM3D. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. E100.A, 2451–2457 (2017).
Stanković, I., Djurović, I. & Daković, M. Adaptive average BM3D filter for reconstruction of images with combined noise. In Mediterranean Conference on Embedded Computing (MECO), 1–4 (2018).
Djurović, I. Combination of the adaptive Kuwahara and BM3D filters for filtering mixed Gaussian and impulsive noise. Signal Image Video Process 11, 753–760 (2017).
Xiong, B. & Yin, Z. A universal denoising framework with a new impulse detector and nonlocal means. IEEE Trans. Image Process. 21, 1663–1675 (2012).
Langampol, K., Srisomboon, K., Patanavijit, V. & Lee, W. Smart switching bilateral filter with estimated noise characterization for mixed noise removal. Math. Probl. Eng 2019, 25 (2019).
Awad, A. Denoising images corrupted with impulse, Gaussian, or a mixture of impulse and Gaussian noise. Eng. Sci. Technol. Int. J. 22, 746–753 (2019).
Melange, T., Nachtegael, M., Schulte, S. & Kerre, E. E. A fuzzy filter for the removal of random impulse noise in image sequences. Image Vis. Comput 29, 407–419 (2011).
Mukhopadhyay, S. & Mandal, J. K. A fuzzy switching median filter of impulses in digital imagery (FSMF). Circ. Syst. Signal Process 33, 2193–2216 (2014).
Garnett, R., Huegerich, T., Chui, C. & He, W. A universal noise removal algorithm with an impulse detector. Image Process. IEEE Trans. 14, 1747–1754 (2005).
Rawat, A. K. & Singh, J. Improved median filter using road for removal of impulse noise. Int. J. Comput. Appl. 94, 29–33 (2014).
Liu, Y.-H., Gao, K., Ni, G.-Q. & Ge, S.-L. A universal denoising algorithm with trilateral filter and impulse detector. In Yoshizawa, T., Wei, P. & Zheng, J. (eds.) International Conference on Optical Instruments and Technology: Optoelectronic Imaging and Process Technology, vol. 7513, 646–655. International Society for Optics and Photonics (SPIE, 2009).
Xu, G., Tan, J. & Zhong, J. An improved trilateral filter for image denoising using an effective impulse detector. Int. Congress Image Signal Process. 1, 90–94 (2011).
Smolka, B. & Malik, K. Fast technique for mixed Gaussian and impulsive noise suppression in color images. AFRICON 2013, 1–5 (2013).
Liu, X. et al. Mixed-model noise removal in 3D MRI via rotation-and-scale invariant non-local means. In Processing and Analysis of Biomedical Information (eds Lepore, N. et al.) 33–41 (Springer, 2019).
Veerakumar, T., Prasad, R., Jagannath, K., Subudhi, B. N. & Esakkirajan, S. Impulse noise removal using adaptive radial basis function interpolation. Circ. Syst. Signal Process. 36, 1192–1223 (2017).
Li, B., Liu, Q., Xu, J. & Luo, X. A new method for removing mixed noises. Sci. China Inf. Sci. 54, 51–59 (2011).
Dong, Y., Chan, R. H. & Xu, S. A detection statistic for random-valued impulse noise. IEEE Trans. Image Process. 16, 1112–1120 (2007).
Saranya, G., Porkumaran, K. & Prabakar, S. Mixed noise removal of a color image using simple fuzzy filter. In International Conference on Green Computing Communication and Electrical Engineering (ICGCCEE), 1–6 (2014).
Hu, H., Li, B. & Liu, Q. Non-local filter for removing a mixture of Gaussian and impulse noises. In International Conference on Computer Vision Theory and Application, VISAPP, vol 2012, 145–150 (2012).
Zhong, H., Yang, C. & Zhang, X. A new weight for nonlocal means denoising using method noise. IEEE Signal Process. Lett. 19, 535–538 (2012).
Delon, J., Desolneux, A. & Guillemot, T. Parigi: A patch-based approach to remove impulse-Gaussian noise from images. Image Process. On Line 6, 130–154 (2016).
Camarena, J., Gregori, V., Morillas, S. & Sapena, A. A simple fuzzy method to remove mixed Gaussian-impulsive noise from color images. IEEE Trans. Fuzzy Syst. 21, 971–978 (2013).
Lukac, R., Smolka, B. & Plataniotis, K. Sharpening vector median filters. Signal Process. 87, 2085–2099 (2007).
Arivarasi, A. & Manickavasagam, S. A simple fuzzy method to remove mixed Gaussian-impulsive noise from colour images. In International Conference on Information Communication and Embedded Systems (ICICES2014), 1–5 (2014).
Taguchi, A. & Hamada, T. Removal of mixed noise on color image processing by using fuzzy rules. In Proc. SPIE—The Int. Soc. for Opt. Eng. (2000).
Chen, B., Qin, Q. & Zhang, X. G. Image de-noising in mixed noises based on wavelet transform. In Materials Engineering and Automatic Control, vol. 562 of Advanced Materials Research 1861–1865 (Trans Tech Publications Ltd, 2012).
Jain, P. & Tyagi, V. LAPB: Locally adaptive patch-based wavelet domain edge-preserving image denoising. Inf. Sci. 294, 164–181 (2015).
Prathik, A., Anuradha, J. & Uma, K. A novel filter for removing image noise and improving the quality of image. Int. J. Cloud Comput. 11, 14–26 (2022).
Kumar, B. S. Image denoising based on Gaussian/bilateral filter and its method noise thresholding. Signal Image Video Process. 7, 1159–1172 (2013).
Luo, H., Chu, H. & Xu, Y. Clustering-based image sparse denoising in wireless multimedia sensor networks. Circ. Syst. Signal Process. 34, 1027–1040 (2015).
Kuang, Y., Zhang, L. & Yi, Z. Image denoising via sparse dictionaries constructed by subspace learning. Circ. Syst. Signal Process. 33, 2151–2171 (2014).
Jiang, J., Zhang, L. & Yang, J. Mixed noise removal by weighted encoding with sparse nonlocal regularization. IEEE Trans. Image Process. 23, 2651–2662 (2014).
Zhou, Y., Ye, Z. & Xiao, Y. A restoration algorithm for images contaminated by mixed Gaussian plus random-valued impulse noise. J. Vis. Commun. Image Represent. 24, 283–294 (2013).
Rudin, L. I., Osher, S. & Fatemi, E. Nonlinear total variation based noise removal algorithms. Phys. D Nonlinear Phenom. 60, 259–268 (1992).
Li, S., Zhang, B., Yang, X. & Zhu, W. Edge-guided second-order total generalized variation for Gaussian noise removal from depth map. Sci. Rep. 10, 25 (2020).
Fang, Z., Liming, T., Liang, W. & Hanxin, L. A nonconvex \(\text{TV}_q\)-\(l_1\) regularization model and the ADMM based algorithm. Sci. Rep. 12, 25 (2022).
Yuan, G. & Ghanem, B. \(\ell_0\)TV: A sparse optimization method for impulse noise image restoration. IEEE Trans. Pattern Anal. Mach. Intell. 41, 352–364 (2019).
Harikiran, J. Impulse noise reduction using mathematical morphology. In Computer Networks and Information Technologies (eds Das, V. V. et al.) 539–543 (Springer, 2011).
Ito, Y., Sato, T., Yamashita, N., Lu, J. & Takashi, Y. Impulse noise detector using mathematical morphology, 4261–4264 (2006).
Singh, A. et al. An efficient morphological salt-and-pepper noise detector. Int. J. Adv. Netw. Appl. 873, 2–5 (2011).
Lin, P.-H., Chen, B.-H., Cheng, F.-C. & Huang, S.-C. A morphological mean filter for impulse noise removal. J. Disp. Technol. 12, 344–350 (2016).
Mendiola-Santibáñez, J. & Terol-Villalobos, I. Filtering of mixed Gaussian and impulsive noise using morphological contrast detectors. IET Image Process. 8, 131–141 (2014).
Youlian, Z., Cheng, H., Lifang, Z. & Lingjiao, P. Mixed noise reduction method based on fuzzy morphological filtering. In The 26th Chinese Control and Decision Conference (2014 CCDC), 2970–2973 (2014).
Fukunaga, K. & Hostetler, L. The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Trans. Inf. Theory 21, 32–40 (1975).
Cheng, Y. Mean shift, mode seeking, and clustering. IEEE Trans. Pattern Anal. Mach. Intell. 17, 790–799 (1995).
Comaniciu, D. & Meer, P. Mean shift: A robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24, 603–619 (2002).
Wand, M. & Jones, M. Kernel Smoothing. Chapman & Hall/CRC Monographs on Statistics and Applied Probability (Taylor & Francis, 1994).
Tomasi, C. & Manduchi, R. Bilateral filtering for gray and color images. In Sixth International Conference on Computer Vision vol 1998, 839–846 (1998).
Paris, S., Kornprobst, P., Tumblin, J. & Durand, F. Bilateral filtering: Theory and applications. Found. Trends Comput. Graph. Vis. 4, 1–73 (2009).
Shim, S.-O., Malik, A. S. & Choi, T.-S. Noise reduction using mean shift algorithm for estimating 3D shape. Imaging Sci. J. 59, 267–273 (2011).
Bradski, G. R. Computer vision face tracking for use in a perceptual user interface. Intel Technol. J. Q2, 2 (1998).
Comaniciu, D., Ramesh, V. & Meer, P. Real-time tracking of non-rigid objects using mean shift. In Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000, vol. 2, 142–149 (IEEE Comput. Soc, 2000).
Collins, R. Mean-shift blob tracking through scale space. In 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., vol. 2, II–234 (2003).
Devi, M. S. & Soranamageswari, M. Super pixel segmentation with neuro-fuzzy filtering based complex impulse noise removal for color images. Digit. Image Process. 13, 25 (2021).
Jin, L. Complex impulse noise removal from color images based on super pixel segmentation. J. Vis. Commun. Image Represent. 48, 54–65 (2017).
Jin, L., Zhu, Z., Song, E. & Ma, G. Impulse noise reduction in color images based on mean shift and channel suppression. J. Electron. Imaging 27, 1–15 (2018).
Fashing, M. & Tomasi, C. Mean shift is a bound optimization. IEEE Trans. Pattern Anal. Mach. Intell. 27, 471–474 (2005).
Morales, R. R., Dominguez, D., Torres, E. & Sossa, J. H. Image segmentation through an iterative algorithm of the mean shift. In Advances in Image Segmentation Vol. 3 (ed. Ho, P.-G.P.) (IntechOpen, 2012).
Chu, C. K., Glad, I. K., Godtliebsen, F. & Marron, J. S. Edge-preserving smoothers for image processing. J. Am. Stat. Assoc. 93, 526–541 (1998).
Vedaldi, A. & Soatto, S. Quick shift and kernel methods for mode seeking. In Computer Vision—ECCV 2008 (eds Forsyth, D. et al.) 705–718 (Springer, Berlin, 2008).
Sheikh, Y., Khan, E. & Kanade, T. Mode-seeking by medoidshifts. In Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on, 1–8 (2007).
Gu, I. & Gui, V. Joint space-time-range mean shift-based image and video segmentation. In Advances in Image and Video Segmentation, 113–139 (2006).
Koontz, W. L. G., Narendra, P. M. & Fukunaga, K. A graph-theoretic approach to nonparametric cluster analysis. IEEE Trans. Comput. 25, 936–944 (1976).
Comaniciu, D. & Meer, P. Mean shift: A robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24, 603–619 (2002).
Smolka, B. & Kusnik, D. On the application of the reachability distance in the suppression of mixed Gaussian and impulsive noise in color images. Multimed. Tools Appl. 79, 32857–32879 (2020).
Smolka, B., Plataniotis, K. & Venetsanopoulos, A. Nonlinear techniques for color image processing. In Nonlinear Signal and Image Processing: Theory, Methods, and Applications, 445–505 (CRC Press, 2004).
Smolka, B. & Venetsanopoulos, A. Noise reduction and edge detection in color images. In Color Image Processing: Methods and Applications, 75–100 (CRC Press, 2006).
Wang, Z., Bovik, A. C., Sheikh, H. R. & Simoncelli, E. P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 13, 600–612 (2004).
Smolka, B. Robust sharpening vector median filter. In 2018 International Automatic Control Conference (CACS), 1–6 (2018).
Wang, Z., Bovik, A. C., Sheikh, H. R. & Simoncelli, E. P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 13, 600–612 (2004).
Wang, Z., Simoncelli, E. & Bovik, A. Multiscale structural similarity for image quality assessment. In The Thirty-Seventh Asilomar Conference on Signals, Systems & Computers, 2003, vol. 2, 1398–1402 (2003).
Astola, J., Haavisto, P. & Neuvo, Y. Vector median filters. Proc. IEEE 78, 678–689 (1990).
Delon, J. & Desolneux, A. A patch-based approach for removing impulse or mixed Gaussian-impulse noise. SIAM J. Imaging Sci. 6, 1140–1174 (2013).
He, K., Sun, J. & Tang, X. Guided image filtering. IEEE Trans. Pattern Anal. Mach. Intell. 35, 1397–1409 (2012).
Wu, Y., Tracey, B., Natarajan, P. & Noonan, J. P. James-Stein type center pixel weights for non-local means image denoising. IEEE Signal Process. Lett. 20, 411–414 (2013).
Salmon, J. On two parameters for denoising with non-local means. IEEE Signal Process. Lett. 17, 269–272 (2010).
Lukac, R., Plataniotis, K., Smolka, B. & Venetsanopoulos, A. A multichannel order-statistic technique for cDNA microarray image processing. IEEE Trans. Nanobiosci. 3, 272–285 (2004).
Pratt, W. Digital Image Processing (Wiley, 1978).
Trahanias, P. & Venetsanopoulos, A. Color edge detection using vector order statistics. IEEE Trans. Image Process. 2, 259–264 (1993).
Koschan, A. & Abidi, M. Detection and classification of edges in color images. IEEE Signal Process. Mag. 22, 64–73 (2005).
Cariou, C., Le Moan, S. & Chehdi, K. A novel mean-shift algorithm for data clustering. IEEE Access 10, 14575–14585 (2022).
Acknowledgements
This work was supported by Silesian University of Technology, Poland research grant BK-2021 and was also funded by the National Science Centre, Poland (grant 2017/25/B/ST6/02219).
Ethics declarations
Competing interests
The authors declare no competing interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kusnik, D., Smolka, B. Robust mean shift filter for mixed Gaussian and impulsive noise reduction in color digital images. Sci Rep 12, 14951 (2022). https://doi.org/10.1038/s41598-022-19161-0