Abstract
We propose a new variational model in Sobolev-Orlicz spaces with non-standard growth conditions of the objective functional and discuss its applications to the simultaneous contrast enhancement and denoising of color images. The characteristic feature of the proposed model is that we deal with a constrained non-convex minimization problem that lives in variable Sobolev-Orlicz spaces where the variable exponent is unknown a priori and it depends on a particular function that belongs to the domain of the objective functional. In contrast to the standard approach, we do not apply any spatial regularization to the image gradient. We discuss the consistency of the variational model, give the scheme for its regularization, derive the corresponding optimality system, and propose an iterative algorithm for practical implementations.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
A very promising approach to the image quality enhancement is to reduce the influence of the noise and improve the perceptibility of objects in the scene by increasing the brightness difference between objects and their background. In recent years, many contrast enhancement techniques have been proposed for digital images. Some approaches allow to improve the image contrast just in low light conditions [39, 44]. Other methods, called sharpening, focus on enforcing strong contours to remove the obtained blur, e.g., by the Gaussian convolution [38]. However, this kind of enhancement concerns only strong image contours while the contrast enhancement attempts to modify the gray level of objects not only in the contours neighborhood. In recent years, many different techniques have been proposed for the reconstruction of noise-affected digital images and their contrast enhancement. We refer to Ref. [40], where the authors focused on the problem of contrast enhancement of natural images captured with a digital camera, and gave a sufficiently complete overview of the existing methods with a detailed analysis of all pros and cons.
In this paper, we mainly focus on the development of a variational approach for simultaneous contrast enhancement of color images and their denoising. With that in mind, we propose a new variational model in Sobolev-Orlicz spaces with non-standard growth conditions of the objective functional and discuss its applications to the simultaneous fusion and denoising of each spectral channel for input color images. In contrast to Ref. [24], we do not provide the color image restoration using the saturation-value Total Variation (TV), but instead, we are working just with the RGB representation of color images. However, as follows from the results of numerical simulations, the proposed approach does not strongly modify the histogram of the original image. This enables the model to preserve the global lighting sensation and to show that the hue of the main objects does not drastically change with the illumination. One of the most important advantages of this approach is the fact that the proposed model allows to synthesize at a high level of accuracy noise- and blur-free color images, that were captured in extremely low light conditions. This situation is typical for most remote sensing problems. Indeed, real-life satellite images frequently suffer from different types of noise, blur, and other atmosphere artifacts that can affect the radiation recovered by the sensors. As a result, such images lose their efficiency for the crop field monitoring problems and their utilization can lead to erroneous results and inferences.
The characteristic feature of the proposed model is that we deal with a constrained minimization problem with a special objective functional that lives in variable Sobolev-Orlicz spaces. This functional contains a spatially variable exponent characterizing the growth conditions and it can be seen as a replacement for the 1-norm in the TV regularization. Moreover, the variable exponent, which is associated with the non-standard growth, is unknown a priori and it depends on a particular function that belongs to the domain of the objective functional.
The idea of using a spatially varying exponent in a TV-like regularization method for image denoising dates back as early as 1997 [7] and it was put into practice in 2006 [11]. Both papers as well as some subsequent articles try to tackle variants of the problem
where the exponent depends directly on the image u, e.g.,
Here, \(\left( G_\sigma *v\right) (x)\) determines the convolution of the function v with the two-dimensional Gaussian filter kernel \(G_\sigma .\)
It has been demonstrated that this model possesses some favorable properties, particularly, when the edge preservation and the effective noise suppression are primary goals in image reconstruction. Furthermore, this model has been introduced specifically to address the issue of staircasing [42], which refers to the regularizer’s inclination towards piecewise constant functions. The appearance of the staircase effect is a notable drawback of the classical TV model. However, the non-convex model (1) did not gain significant attention for a long period due to its high numerical complexity and the absence of a rigorous mathematical substantiation of its consistency. Only particular solutions to this problem have been derived for a smoothed version of the integrand, using a weak notion of solution (see, for instance, Ref. [45]).
A recently developed alternative variant is the TV-like method [33] (see also Refs. [1, 12]), which computes the variable exponent p in an offline step and keeps it as a fixed parameter in the final optimization problem. This approach allows the exponent to vary based on the spatial location, enabling users to locally select whether to preserve edges or smooth intensity variations. However, there are only two natural types of imaging problems where this approach can be applied:
-
single-channel imaging where first the exponent is computed from the given data and then is applied as prior in the subsequent minimization problem;
-
dual-channel imaging where the secondary channel provides the exponent map that is used for regularization of the primary channel.
Thus, this circumstance imposes significant limitations from a practical point of view, especially in the case of multispectral satellite noisy images, where different channels can differ drastically.
The main purpose of this paper is to describe a robust approach for the simultaneous contrast enhancement and denoising of non-smooth multispectral images using an energy functional with non-standard growth, in particular a special form of anisotropic diffusion tensor for the regularization term and a term which is inspired by the variational model of Bertalmío et al. [5]. Following this approach, we aim to increase the perceptibility of objects in the scene and the noise robustness of the proposed model albeit it makes such variational problems completely non-smooth, non-convex, and, hence, significantly more difficult from a minimization point of view.
We consider a variational problem for the energy functional with non-standard growth p(x), where the principle edge information for the contrast enhancement is mainly accumulated. Namely, for the simultaneous denoising and contrast enhancement of color images, we propose to solve the following constrained minimization problems:
for each spectral channel of an input image separately, where the objective functional is non-convex and takes the form
Here, \(Q_i(v)\) stands for the fidelity term and its specific form is described in detail in Sect. 3 together with the operator \(R_{\eta }.\) The principle point that should be emphasized is the fact that we do not predefine the exponent p(x) a priori using for that the original image, but instead, we associate this characteristic with the current state of their spectral channels. We take it as follows:
Therefore, in contrast to the well-known approach, coming from the pioneering papers [2, 10], the principle difference of the models (5) and (2) is that we do not apply in (5) any spatial regularization of gradient \(\nabla u.\) Because of this, the model (3)–(4) becomes an ill-posed problem from the mathematical point of view and can produce many unexpected phenomena. In particular, to our best knowledge, we have no results of the existence and consistency of the optimization problem (3)–(4). To overcome this problem, we could apply some regularization of the variable exponent p(x) in the form like (2).
However, it is well known that optimization problem (3)–(4) with the spatially regularized gradient has several serious practical and theoretical difficulties. The first one is that the spatial regularization of the gradient in the form (2) leads to the loss of accuracy in the case when the signal is noisy, with white noise (see, for instance, Ref. [10]). Then, the noise introduces very large, in theory unbounded, oscillations of the gradient \(\nabla u.\) As a result, the conditional smoothing introduced by the model will not help, since all these noise edges will be kept.
The second drawback of the model with the regularized gradient is the fact that the space-invariant Gaussian smoothing inside the divergent term tends to push the edges in u away from their original locations. We refer to Ref. [41] where this issue was studied in detail. This effect, known as the edge dislocation, can be detrimental, especially in the context of the boundary detection problem and its application to remote the sensing and monitoring.
In view of this, our prime interest in this paper is to study the optimization problem (3)–(4) without the space-invariant Gaussian smoothing of the variable exponent p(x). In summary, the main contributions of our paper can be enumerated as follows.
-
The variational statement for the simultaneous contrast enhancement and denoising of multispectral images in the form of the minimization problem in Sobolev-Orlicz spaces with non-standard growth conditions of the objective functional.
-
Rigorous substantiation of the well-posedness of the variational problem with the non-standard growth functional.
-
The proof of existence results to the approximation variational problems.
-
The iterative algorithm for numerical implementations.
-
Derivation of the first order necessary conditions for the original problem and their substantiation.
-
Numerical experiments to study the performance of the new approach.
The remainder of the paper is organized as follows. In Sect. 2, we give preliminaries and some auxiliaries results. In Sect. 3, we present a novel variational problem for the denoising and contrast enhancement of non-smooth RGB images which can be viewed as an improved version of the variational model that has been recently proposed in Ref. [17]. Section 4 is devoted to the derivation of optimality conditions to the original problem and their substantiation. In Sect. 5, we discuss the possible ways for the relaxation of the minimization problem and its approximation. Specifically, we introduce a family of special minimization problems and show that each of these problems is solvable and their solutions are compact in an appropriate topology. We also discuss their approximating properties and give a convergence criterion of such sequence to an optimal solution of the original problem. Finally, for illustration, we give in Sect. 6 some results of numerical experiments.
2 Preliminaries
Let us recall some useful notations. For vectors \(\xi \in {\mathbb {R}}^2\) and \(\eta \in {\mathbb {R}}^2,\) \(\left( \xi ,\eta \right) =\xi ^\textrm{t}\eta \) denotes the standard vector inner product in \({\mathbb {R}}^2,\) where \(^\textrm{t}\) stands for the transpose operator. The norm \(|\xi |\) is the Euclidean norm given by \(|\xi |=\sqrt{(\xi ,\xi )}.\) Let \(\Omega \subset {\mathbb {R}}^2\) be a bounded open set with a Lipschitz boundary \(\partial \Omega \) and nonzero Lebesgue measure. For any subset \(E\subset \Omega \) we denote by |E| its two-dimensional Lebesgue measure \({\mathcal {L}}^2(E).\) Let \({\overline{E}}\) denote the closure of E, and \(\partial E\) stands for its boundary. Let \(\Omega \subset {\mathbb {R}}^2\) be a bounded connected open set with a sufficiently smooth boundary \(\partial \Omega .\)
2.1 Functional Spaces
For the convenience of the reader, we collect here the basic facts on functional spaces that will be used in the sequel. Let X denote a real Banach space with norm \(\Vert \cdot \Vert _X,\) and let \(X^\prime \) be its dual. Let \(\left<\cdot ,\cdot \right>_{X^\prime ;X}\) be the duality form on \(X^\prime \times X.\) By \(\rightharpoonup \) and \({\mathop {\rightharpoonup }\limits ^{*}}\) we denote the weak and weak\(^*\) convergence in normed spaces, respectively. For given \(1 \leqslant p \leqslant +\infty ,\) the space \(L^p(\Omega ;{\mathbb {R}}^2)\) is defined by
where \(\Vert f\Vert _{L^p(\Omega ;{\mathbb {R}}^2)}=\left( \int _{\Omega } |f(x)|^p\,{\textrm{d}}x\right) ^{1/p}\) for \(1 \leqslant p<+\infty .\) The inner product of two functions f and g in \(L^p(\Omega ;{\mathbb {R}}^2)\) with \(p\in [1,\infty )\) is given by
We denote by \(C_c^\infty ({\mathbb {R}}^2)\) a locally convex space of all infinitely differentiable functions with compact support in \({\mathbb {R}}^2.\) We recall here some functional spaces that will be used throughout this paper. We define the Banach space \(H^{1}(\Omega )\) as the closure of \(C^\infty _c({\mathbb {R}}^2)\) with respect to the norm
We denote by \(\left( H^{1}(\Omega )\right) ^\prime \) the dual space of \(H^{1}(\Omega ).\) Hereinafter, \(W^{1,1}(\Omega )\) stands for the Banach space of all functions \(u\in L^1(\Omega )\) with respect to the norm
Given a real Banach space X, we will denote by C([0, T]; X) the space of all continuous functions from [0, T] into X. We recall that a function \(u\!:[0,T]\rightarrow X\) is said to be Lebesgue measurable if there exists a sequence \(\left\{ u_k\right\} _{k\in {\mathbb {N}}}\) of step functions (i.e., \(u_k=\sum _{j=1}^{n_k} a_j^k \chi _{A_j^k}\) for a finite number \(n_k\) of Borel subsets \(A_j^k\subset [0,T]\) and with \(a_j^k\in X\)) converging to u almost everywhere with respect to the Lebesgue measure in [0, T].
Then, for \(1\leqslant p<\infty ,\) \(L^p(0,T;X)\) is the space of all measurable functions \(u\!:[0,T]\rightarrow X\) such that
while \(L^\infty (0,T;X)\) is the space of measurable functions such that
A full presentation of this topic can be found in Ref. [21].
Let us recall that, for \(1\leqslant p\leqslant \infty ,\) \(L^p(0,T;X)\) is a Banach space. Moreover, if X is separable and \(1\leqslant p<\infty ,\) then the dual space of \(L^p(0,T;X)\) can be identified with \(L^{p^\prime }(0,T;X^\prime ).\)
2.2 Basic Facts on the Lebesgue and Sobolev Spaces with Variable Exponents
Let \(p{:}~\Omega \rightarrow [p^{-},p^{+}]\subset (1,+\infty ),\) with \(p^\pm =\text {const},\) be a given measurable function. Denote by \(L^{p(\cdot )}(\Omega )\) the set of all measurable functions f(x) on \(\Omega \) such that \(\int _\Omega |f(x)|^{p(x)}\,{\textrm{d}}x<\infty .\) Then, \(L^{p(\cdot )}(\Omega )\) is a reflexive separable Banach space with respect to the Luxemburg norm
Moreover, in this case, the set \(C^\infty _0(\Omega )\) is dense in \(L^{p(\cdot )}(\Omega ).\) The relation between the modular \(\int _\Omega |f(x)|^{p(x)}\,{\textrm{d}}x\) and the norm follows from the definition
If \(p(\cdot )=\text {const}>1\), then these inequalities transform into equalities. The following estimates are also well known (see, for instance, Refs. [22, 46]): if \(f\in L^{p(\cdot )}(\Omega )\), then
The next result can be viewed as an analogous of the Hölder inequality in Lebesgue spaces with variable exponents. If \(f\in L^{p(\cdot )}(\Omega )\) and \(g\in L^{p^\prime (\cdot )}(\Omega )\) with
then \(\left( f,g\right) \in L^1(\Omega )\) and
Let \(p_1(\cdot )\) and \(p_2(\cdot )\) be measurable functions on \(\Omega \) such that \(p_i(x)\in [p_i^{-},p_i^{+}]\subset (1,+\infty )\) a.e. in \(\Omega .\) In case \(p_1(x)\geqslant p_2(x)\) a.e. in \(\Omega ,\) the inclusion \(L^{p_1(\cdot )}(\Omega )\subset L^{p_2(\cdot )}(\Omega )\) is continuous and
with a constant \(C=C(|\Omega |,p_1^{\pm },p_2^\pm ).\)
The variable Sobolev space \(W^{1,p(\cdot )}(\Omega )\) is defined as the set of functions
equipped with the norm
Unlike classical Sobolev spaces, smooth functions are not necessarily dense in \(W^{1,p(\cdot )}(\Omega ).\) Therefore, we define \(H^{1,p(\cdot )}(\Omega )\) as the closure of the set \(C^\infty ({\overline{\Omega }})\) in \(W^{1,p(\cdot )}(\Omega )\)-norm.
Let \(C^{\textrm{log}}({\overline{\Omega }})\) be the set of functions continuous on \({\overline{\Omega }}\) with the logarithmic modulus of continuity, i.e.,
where \(\omega \geqslant 0\) satisfies the condition: \(\limsup _{\tau \rightarrow 0^{+}} \omega (\tau )\log \frac{1}{\tau }=C<+\infty ,\) \(C=\text {const}.\) It is well known that for \(p\in C^{\textrm{log}}({\overline{\Omega }})\) the set \(C^\infty ({\overline{\Omega }})\) is dense in \(W^{1,p(\cdot )}(\Omega )\) and the space \(W^{1,p(\cdot )}(\Omega )\) coincides with the closure of \(C^\infty ({\overline{\Omega }})\) with respect to the norm (12), i.e., in this case \(W^{1,p(\cdot )}(\Omega )=H^{1,p(\cdot )}(\Omega ).\) In particular, if there exists \(\delta \in (0,1]\) such that \(p\in C^{0,\delta }({\overline{\Omega }}),\) then the set \(C^\infty ({\overline{\Omega }})\) is dense in \(W^{1,p(\cdot )}(\Omega ).\) Indeed, since
it follows from the Hölder continuity of \(p(\cdot )\) that
with \(\omega (t)=C/\log (|t|^{-1}).\)
Let \(p(\cdot ), q(\cdot )\in C({\overline{\Omega }})\) be such that \(p(x)\in [p^{-},p^{+}]\subset (1,2]\) and \(q(x)<\frac{2p(x)}{2-p(x)}\) in \({\overline{\Omega }}.\) Then, the embedding \(W^{1,p(\cdot )}(\Omega )\subset L^{q(\cdot )}(\Omega )\) is continuous and compact. Moreover, according to (11), we have a continuous embedding \(W^{1,p(\cdot )}(\Omega )\subset W^{1,p^{-}}(\Omega ).\)
For a more detailed presentation of the theory of these spaces, we refer to the monograph [22].
2.3 On the Dual Sobolev Space \(H^{-1}(\Omega )\)
Let \(H^1_0(\Omega )\) be the standard Sobolev space, i.e., \(H^1_0(\Omega )\) is the closure of \(C^1_0(\Omega )\) with respect to the norm
It is well known that for any \(u^*\in H^{-1}(\Omega )\) there can be found a vector-function \(g=\left[ g_1,g_2\right] \) in \(L^2(\Omega ;{\mathbb {R}}^2)\) such that
Therefore, it is clear now that
On the other hand, due to the Lax-Milgram Theorem, the Dirichlet boundary value problem
has a unique solution \(y=(-\Delta )^{-1}u^*\in H^1_0(\Omega )\) for each \(u^*\in H^{-1}(\Omega ).\) Moreover, in view of the energy equality
which holds for the weak solution of the Dirichlet problem (15), we can deduce the following a priori estimate for the weak solution of the Dirichlet problem (15):
Combining this result with (14), we get
Hence, the norm in \(H^{-1}(\Omega )\) can be defined as follows:
2.4 Level Sets, Directional Gradients, and Texture Indexes
Let \(u{:}~\Omega \rightarrow \overline{{\mathbb {R}}}\) be a given function. Then, for each \(\lambda \in {\mathbb {R}}\), we can define the upper level set of u as follows:
To describe this set, we assume that \(u\in W^{1,1}(\Omega ).\) It was proven in Ref. [3] that if \(u\in W^{1,1}(\Omega )\), then its upper level sets \(Z_\lambda (u)\) are sets of finite perimeter. Therefore, the boundaries of level sets can be described by a countable family of Jordan curves with finite length, i.e., by continuous maps from the circle into the plane \({\mathbb {R}}^2\) without crossing points. As a result, at almost all points of almost all level sets of \(u\in W^{1,1}(\Omega )\) we may define a unit normal vector \(\theta (x).\) This vector field formally satisfies the following relations:
In the sequel, we will refer to \(\theta \) as the vector field of unit normals to the topographic map of a function u.
If \(\theta \in L^\infty (\Omega ,{\mathbb {R}}^2)\) is a vector field of unit normals to the topographic map of some function \(u(\cdot ),\) then for any function \(v\in W^{1,1}(\Omega )\), we can define the so-called directional gradient of v following the rule (see Refs. [8, 9])
where \(\eta \in (0,1)\) is a given threshold. Since, for each function \(v\in W^{1,1}(\Omega ),\) the expression \(R_\eta \nabla v\) can be reduced to \((1-\eta ^2)\nabla v\) in those places of \(\Omega \) where \(\nabla v\) is collinear to the unit normal \(\theta ,\) and to \(\nabla v\) if \(\nabla v\) is orthogonal to \(\theta ,\) we have the following estimate:
In what follows, with each function \(u\in W^{1,1}(\Omega ),\) we associate the so-called texture index \(p(|\nabla u|)\) using the rule (see Refs. [18,19,20] for comparison)
where \(0<\delta \ll 1\) is a given threshold. It is clear now that
for each \(u\in W^{1,1}(\Omega ).\)
3 Statement of the Problem
In this section, we present a novel variational problem for the denoising and contrast enhancement of non-smooth RGB images which can be viewed as an improved version of the variational model recently proposed in Ref. [17].
Let \(f=[f_1,f_2,f_3]^\textrm{t}\in L^2(\Omega ;{\mathbb {R}}^3)\) be an original color image. Let \(\theta =\theta (f_i)=\left[ \begin{array}{ll}\theta _{1}\\ \theta _{2}\end{array}\right] \in L^\infty (\Omega ,{\mathbb {R}}^2)\) be a vector field of unit normals to the topographic map of the spectral channel \(f_i,\)
This vector field can be defined by the rule \(\theta (x)=\frac{\nabla U(t,x)}{|\nabla U(t,x)|}\) with \(t>0\) small enough, where U(t, x) is a solution of the following initial-boundary value problem:
with a relaxed version of the one-dimensional-Laplace operator in the principle part of (24). Here, \(\kappa >0\) is a sufficiently small positive value.
Let \(\eta \in (0,1)\) be a given threshold. Taking into account the definition of the Directional Total Variation (see Ref. [8]), we define the linear operator \(R_{\eta }\!:{\mathbb {R}}^2\rightarrow {\mathbb {R}}^2\) as follows:
Since \(R_{\eta }\nabla v\) reduces to \((1-\eta ^2)\nabla v\) in those regions where the gradient \(\nabla v\) is co-linear to \(\theta ,\) and to \(\nabla v,\) where \(\nabla v\) is orthogonal to \(\theta ,\) this operator does not enforce gradients in the direction \(\theta .\)
Remark 1
In the sequel, to reduce the number of parameters in the proposed model, we will set \(\delta =\kappa \) in (24) and (21), and \(\eta =1-\kappa \) in (19).
We also introduce the following set:
where \(p(\cdot )\) is given by (21), and \(H^{1,p(|\nabla I|)}(\Omega )\) is the variable Sobolev space that can be defined as follows:
Further, for a given gray-scale image \(I\in L^2(\Omega ),\) we define its average local contrast measure D(I) as follows (for comparison, we refer to Ref. [40]):
where \(\kappa >0\) is the same parameter as in (24), and \(W\in L^2(\Omega \times \Omega )\) is a symmetric non-negative kernel such that
A typical example of this function is the Gaussian kernel,
As a result, the proposed variational approach for the contrast enhancement and denoising of color images can be stated as follows.
For each spectral channel \(f_i\) \((i=1,2,3)\) of a given image \(f=[f_1,f_2,f_3]^\textrm{t}\in L^2(\Omega ;{\mathbb {R}}^3),\) we generate a new one \(f^0_i\in L^2(\Omega )\) as a solution of the following constrained minimization problem:
$$\begin{aligned} J_i(f^0_i)=\inf _{v\in \Xi _i} J_i(v), \end{aligned}$$(31)where
$$\begin{aligned} J_i(v)=\int _{\Omega } |R_{\eta }\nabla v(x)|^{p(|\nabla v|)}\,{\textrm{d}}x +\frac{\mu }{2}\left\| v -f_i\right\| ^2_{H^{-1}(\Omega )}+\frac{\lambda }{4} \left[ D(v)-cD(f_i)\right] ^2.\nonumber \\ \end{aligned}$$(32)Here, \(\lambda >0\) and \(\mu \in (0,1)\) are tuning parameters. The parameter \(\lambda \) manages the trade-off between the fidelity term \(\frac{\mu }{2}\left\| v -f_i\right\| ^2_{H^{-1}(\Omega )}\) and the contrast term \(\frac{\lambda }{4}\left[ D(v)-cD(f_i)\right] ^2.\) As for the multiplier \(c>0,\) we always suppose that \(c>1\) and it provides a control of the contrast level expected for the result.
Before proceeding further, we provide some qualitative analysis of the variational problem (31)–(32). To begin with, we notice that, for each feasible solution \(v\in \Xi ,\) the following two-side estimate:
holds with \(0<\delta \ll 1.\) Moreover, since \(\eta \in (0,1)\) and \(\eta \gg 0,\) we see that
As a result, for any \(v\in \Xi _i,\) we have
Thus, the first term in the cost functional (32) can be considered as a regularizing term. As for the second term in (32), we make use of the following observation.
Remark 2
The model (32) is aimed not only at the contrast enhancement, but also to remove the additive noise in the so-called structured images, i.e., in images where the portion of high oscillatory edges is rather significant. In most cases, the satellite images with crop fields typically contain many high oscillatory edges (boundaries of the crop locations). Moreover, “the portion of noise” in such images can be different from channel to channel. Because of this, an important question is to separate pure noise from high oscillatory edges in each spectral channel. To handle this problem, Meyer [37] suggested to replace the standard \(L^2\)-fidelity term \(\frac{\mu }{2}\Vert v -f_i\Vert ^2_{L^2(\Omega )},\) which is a typical component in the standard denoising models, by a weaker norm. As a plausible option of such weakening, Lieu and Vese [34] (see also Schönlieb [43]) have proposed to involve \(H^{-1}(\Omega )\)-norm instead of \(\Vert \cdot \Vert _{L^2(\Omega )}.\) Thus, from this point of view, it is plausible to interpret the second term in (32) as the fidelity term.
Before we move on to the existence issues, we make use of the following result concerning the lower semicontinuity property of the modular \(\int _\Omega |f(x)|^{p(x)}\,{\textrm{d}}x\) with respect to the weak convergence in \(L^{p_k(\cdot )}(\Omega ).\) The proof of this assertion has been mainly inspired by the elegant proof of Lemma 1 in Ref. [13] (see also Ref. [47, Lemma 13.3] for comparison).
Proposition 1
Let \(\left\{ p_k\right\} _{k\in {\mathbb {N}}}\subset [p^{-},p^{+}]\) be a given sequence such that
Let \(\left\{ v_k\in W^{1,p_k(\cdot )}(\Omega )\right\} _{k\in {\mathbb {N}}}\) be a sequence such that
and let \(R_\eta \!\!:{\mathbb {R}}^2\rightarrow {\mathbb {R}}^2\) be the operator defined in (27) with some \(\theta \in L^\infty (\Omega ;{\mathbb {R}}^2).\) Then, \(\nabla v\in L^{p(\cdot )}(\Omega ;{\mathbb {R}}^2)\) and
Proof
By Young’s inequality we have for \(\xi ,\zeta \in {\mathbb {R}}^2\) and \(1<p<\infty ,\)
If now \(\zeta \) is a function in \(L^\infty (\Omega ;{\mathbb {R}}^2)\) and we make \(p=p_k\) in (40) and use the assumption \(p^{-}\leqslant p_k(x)\leqslant p^{+}\) for all \(k\in {\mathbb {N}},\) then we derive
Using (27) and assumptions (37) and (38), we can pass to the limit in (42) as \(k\rightarrow \infty .\) As a result, we have
Then, we consider the following function:
Inserting this function \(\zeta \) into (43), we get
This implies
Since \(\frac{1}{p^\prime (x)-1}+1=p(x),\) it follows that
To conclude the proof, it remains to notice that the announced inequality (40) follows by letting \(n\rightarrow \infty \) in (44). As for the inclusion \(\nabla v\in L^{p(\cdot )}(\Omega ;{\mathbb {R}}^2)\) it is a direct consequence of assumption (39) and estimate (34).
Before proceeding to the existence issues, we provide, in the next section, a formal analysis of the optimality system for the problem (31)–(32).
4 Optimality Conditions
Let \(f^0_i\in \Xi _i,\) with \(i=1,2,3,\) be a point of local minimum in the problem (31)–(32), i.e., there exists a positive value \(\tau >0\) such that
For simplicity, we assume that the two-side inequality
holds almost everywhere in \(\Omega .\) Then, condition (45) can be rewritten as follows: for any smooth function \(\varphi \in C^\infty ({\overline{\Omega }}),\) the inequality
holds. Hence, the scalar function
has a minimum at \(\sigma =0.\)
Thus, to characterize the given feasible solution \(f_i^{0}\in \Xi _i\) to the optimization problem (31)–(32), we make use of the Ferma’s Theorem. To do so, we show that the objective functional \(J_i(v)\) is Gâteaux differentiable at \(v=f_i^{0},\) that is, there exists a linear bounded functional \(D_G J_i(f_i^{0})\in \left[ H^{1,p[\nabla f_i^0]}(\Omega )\right] ^\prime ={\mathcal {L}}\left( H^{1,p[\nabla f_i^0]}(\Omega ),{\mathbb {R}}\right) \) such that
where \(|r_i(h,\sigma )|=o(|\sigma |)\) as \(\sigma \rightarrow 0.\) Then, the condition \(0\in {\textrm{Argmin}} \psi (\sigma )\) can be interpreted as
Keeping in mind the fact that the set of feasible solutions \(\Xi _i\) to the problem (31)–(32) has an empty topological interior, we begin with the following auxiliary results, where \(F^\prime (u)[h]\) stands for the directional derivative of a functional \(F\!\!:X\rightarrow {\mathbb {R}}\) at the point \(u\in X\) along a vector \(h\in X,\) i.e.,
Proposition 2
Let \(f\in L^2(\Omega )\) be a given distribution and let
Then,
Proof
The announced result immediately follows from the definition of the directional derivative and the following chain of transformations:
Proposition 3
Let \(p\!:\Omega \rightarrow [p^{-},p^{+}]\subset (1,2],\) with \(p^\pm =\text {const},\) be a given exponent and let
Then, for each \(u\in W^{1,p(\cdot )}(\Omega ),\) we have
Proof
Let \(u,h\in W^{1,p(\cdot )}(\Omega )\) be given functions. We notice that
as \(\sigma \rightarrow 0\) almost everywhere in \(\Omega .\) Furthermore, by convexity,
we have
Taking into account that
and
we see that the right-hand side of the inequality (51) is an \(L^1(\Omega )\)-function. Therefore,
by the Lebesgue-dominated convergence theorem. From this, the representation (50) follows.
Proposition 4
Let \(p\!:\Omega \rightarrow [p^{-},p^{+}]\subset (1,2],\) with \(p^\pm =\text {const},\) be a given exponent and let
where the linear operator \(R_{\eta }\!\!:{\mathbb {R}}^2\rightarrow {\mathbb {R}}^2\) is defined by the rule (27). Then, for each \(u\in W^{1,p(\cdot )}(\Omega ),\) we have
Proof
The representation (52) immediately follows from the definition of the directional derivative and Proposition 3.
Proposition 5
Let \(u\in \Xi _i\) be a given feasible solution, let
and let the functional \(F_3\!:W^{1,1+\delta }(\Omega )\rightarrow {\mathbb {R}}\) be defined as follows:
where \(v\in W^{1,p[\nabla u]}(\Omega )\) is a given function. Then, for each \(v\in W^{1,p[\nabla u]}(\Omega )\) and for all \(h\in W^{1,1+\delta }(\Omega ),\) we have
Proof
The representation (53) immediately follows from the definition of the directional derivative.
Proposition 6
Let \(u\in \Xi \) be a feasible solution, and let
where \(f_i\in L^2(\Omega )\) is a given spectral channel, \(c=\text {const}>1,\) and
Then, the directional derivative of \(F_4{:}~L^2(\Omega )\rightarrow {\mathbb {R}}\) at the given point u along a vector \(h\in L^2(\Omega )\) takes the form
Proof
The representation (54) immediately follows from the definition of the directional derivative and the following chain of transformations:
We are now able to show that the objective functional \(J_i(v)\) is Gâteaux differentiable at \(v=f_i^{0}.\) With that in mind, we utilize the representation (32). As a result, we see that
Thus, \(J_i^\prime (f_i^{0}){:}~C^\infty ({\overline{\Omega }})\rightarrow {\mathbb {R}}\) is a linear functional.
Let us show that each term in (55) can be extended by the continuity to the entire Sobolev space \(H^{1,p(|\nabla f_i^{0}|)}(\Omega ).\) To this end, it is enough to establish the existence of a constant \(M>0\) such that
Indeed, rewriting (55) in the form
where the one-to-one correspondence to (55) is preserved, we see that
where \(p^{+}=2,\) \((p^\prime )^{-}=p^{+}/(p^{+}-1)=2,\) and
by the assumption \(f_i^0\in \Xi _i.\) Thus, there exists a constant \(M_1>0\) such that
Arguing similarly, it can be shown that a constant \(M_2>0\) exists such that
As for the third term in (55), we notice that
by the L’Hôpital’s rule. Using similar arguments, we see that
Thus, we can deduce the existence of a constant \(M_2>0\) such that
It remains to notice that in view of the obvious inclusions
the existence of constants \(M_3\) and \(M_4\) such that
immediately follows from (55) and the Cauchy inequality.
Utilizing the estimates (57), (59), and (60), we finally arrive at the inequality (56) with \(M=\max \{M_1,M_2,M_3,M_4,M_5\}.\) Thus, the mapping \(\varphi \mapsto J_i^\prime (f_i^{0})[\varphi ]\) can be defined for all \(\varphi \in H^{1,p(|\nabla f_i^{0}|)}(\Omega )\) using the density of \(C^\infty ({\overline{\Omega }})\) in \(H^{1,p(|\nabla f_i^{0}|)}(\Omega )\) (see (29)) and the standard rule
where \(\left\{ \varphi _k\right\} _{k\in {\mathbb {N}}}\subset C^\infty _c({\mathbb {R}}^2)\) and \(\varphi _k\rightarrow \varphi \) strongly in \(H^{1,p(|\nabla f_i^{0}|)}(\Omega ).\) Hence, the objective functional \(J_i(v)\) is Gâteaux differentiable at \(v=f_i^{0},\) and
To get the final relations for optimality conditions, it remains to observe that identity (55) implies the following equalities in the sense of distributions:
where \(\nu \) denotes the unit outward normal to the boundary \(\partial \Omega .\)
5 Existence Issues and Regularization of the Original Optimization Problem
The main question we are going to discuss in this section is to find out whether the problem (31)–(32) admits at least one solution. With that in mind, we make use of the so-called indirect approach [16, 29]. The main idea of this approach is to show that the original minimization problem (31)–(32) can be efficiently approximated by a special family of optimization problems of a similar structure but with the spatial regularization of the exponent \(p(|\nabla u|)\) in the form
where \((\nabla K_{\varepsilon }*u)\) stands for the Steklov smoothing operator.
Let \(K{:}~{\mathbb {R}}^2\rightarrow {\mathbb {R}}\) be a positive compactly supported function such that
For any \({\varepsilon }>0,\) we set \(K_{\varepsilon }(x)={\varepsilon }^{-2}K\left( \frac{x}{{\varepsilon }}\right) .\) Then, the following properties of the convolution:
are well known [23]:
-
(i)
\(u_{\varepsilon }\in C^\infty (\Omega )\) for all \({\varepsilon }>0;\)
-
(ii)
\(u_{\varepsilon }(x)\rightarrow u(x)\) almost everywhere in \(\Omega ;\)
-
(iii)
if \(u\in L^p(\Omega )\) with \(1\leqslant p<\infty ,\) then \(u_{\varepsilon }\rightarrow u\) in \(L^p(\Omega ).\)
We introduce the following family of approximating problems to the problem (31)–(32):
where \({\varepsilon }\) is a small parameter which varies within a strictly decreasing sequence of positive numbers converging to 0,
and \(p_{\varepsilon }(|\nabla v|)\) is defined in (63).
Before proceeding further, we make use of a few technical results.
Lemma 1
[17, Lemma 1] Let \(\left\{ v_k\right\} _{k\in {\mathbb {N}}}\subset L^\infty (\Omega )\) be a sequence of measurable functions such that \(v_k(x)\rightarrow v(x)\) weakly-\(*\) in \(L^\infty (\Omega )\) for some \(v\in L^\infty (\Omega ).\) Let
be the corresponding sequence of exponents. Then,
where
Lemma 2
[40, Proposition B.2] The mapping \(v\mapsto \frac{\lambda }{4} \left[ D(v)-cD(f_i)\right] ^2\) is continuous from \(L^2(\Omega )\) endowed with the strong topology to \({\mathbb {R}}\) with pointwise convergence.
Proposition 7
[17] Let \(\left\{ p_{k,{\varepsilon }}\right\} _{k\in {\mathbb {N}}}\) be a sequence of exponents that satisfies all preconditions of Lemma 1. If a bounded sequence \(\left\{ f_k\in L^{p_{k,{\varepsilon }}(\cdot )}(\Omega )\right\} _{k\in {\mathbb {N}}}\) converges weakly in \(L^{1+{\delta }}(\Omega )\) to f, then \(f\in L^{p_{\varepsilon }(\cdot )}(\Omega ),\) \(f_k\rightharpoonup f\) in variable \(L^{p_{k,{\varepsilon }}(\cdot )}(\Omega ).\)
We are now in a position to prove the existence of minimizers for the proposed approximating problem (64)–(66).
Theorem 1
Let \(\Omega \) be an open bounded and connected sub-domain of \({\mathbb {R}}^2\) with a Lipschitz boundary. Let \(f_i\in L^2(\Omega )\) be a given spectral channel of an image arguably contaminated by additive Gaussian noise with zero mean. Then, for each \({\varepsilon }>0,\) the minimization problem (64)–(66) admits at least one solution \(f^0_{i,{\varepsilon }}\) in \(W^{1,p^{-}}(\Omega )\cap L^\infty (\Omega )\) such that \(I^0_{i,{\varepsilon }}\in H^{1,p[\nabla f^0_{i,{\varepsilon }}]}(\Omega ).\)
Proof
To begin with, let us notice that, for each \({\varepsilon }>0,\) the indicated minimization problem is consistent, i.e., \(J_{i,{\varepsilon }}(u)<+\infty \) for any \(u\in \Xi _{i,{\varepsilon }}.\) Since \(\Xi _{i,{\varepsilon }}\ne \varnothing \) and \(0\leqslant J_{i,{\varepsilon }}(v)<+\infty \) for all \(v\in \Xi _{i,{\varepsilon }},\) it follows that there exists a non-negative value \(\zeta \geqslant 0\) such that \(\zeta _{\varepsilon }=\inf \limits _{v\in \Xi _{i,{\varepsilon }}}J_{i,{\varepsilon }}(v).\) Let \(\left\{ v^{\varepsilon }_k\right\} _{k\in {\mathbb {N}}}\) be a minimizing sequence for (64)–(66), i.e.,
Without lost of generality, we can suppose that \( J_{i,{\varepsilon }}\left( v^{\varepsilon }_k\right) \leqslant \zeta _{\varepsilon }+1\) for all \(k\in {\mathbb {N}}.\) From this and estimate (36), we deduce
Hence, in view of (7) and (33), the sequence \(\left\{ v^{\varepsilon }_k\right\} _{k\in {\mathbb {N}}}\) is bounded in \(W^{1,p^{-}}(\Omega ).\) Therefore, there exists a subsequence of \(\left\{ v^{\varepsilon }_{k}\right\} _{k\in {\mathbb {N}}},\) still denoted by the same index, and a vector-function \(f_{i,{\varepsilon }}^0\in W^{1,p^{-}}(\Omega )\) such that
where, by the Sobolev embedding theorem, \((p^{-})^*=\frac{2p^{-}}{2-p^{-}}=\frac{2+2\delta }{1-\delta }>2.\)
In view of this and the smoothness of the kernel \(K_{\varepsilon },\) we see that the operator
is compact (see Lemma 1). Therefore, (71)–(72) imply that \(p_{\varepsilon }(|\nabla v^{\varepsilon }_{k}|) \rightarrow p_{\varepsilon }(|\nabla f^0_{i,{\varepsilon }}|)\) in \(C({\overline{\Omega }}).\) Passing then to a subsequence if necessary, we have (see Propositions 1 and 7):
Hence, \(f^0_{i,{\varepsilon }}\in W^{1,p_{\varepsilon }(|\nabla f^0_{i,{\varepsilon }}|)}(\Omega ).\)
Further we notice that, for each \(k\in {\mathbb {N}},\) \(\gamma _{i,0}\leqslant v^{\varepsilon }_k(x)\leqslant \gamma _{i,1}\) a.a. in \(\Omega .\) Then, it follows from (73) that the limit function \(f_{i,{\varepsilon }}^0(x)\) is also subjected to the same restriction. Thus, \(f^0_{i,{\varepsilon }}\in W^{1,p_{\varepsilon }(|\nabla f^0_{i,{\varepsilon }}|)}(\Omega )\cap L^\infty (\Omega )\) is a feasible solution to the minimization problem (64)–(66).
It remains to show that \(f^0_{i,{\varepsilon }}\) is a minimizer of this problem. Indeed, taking into account the properties (68), (72), and the fact that \(\theta \in L^\infty (\Omega ,{\mathbb {R}}^2),\) we see that the sequence
is bounded in the variable space \(L^{p_{\varepsilon }(|\nabla v^{\varepsilon }_k|)}(\Omega ;{\mathbb {R}}^2)\) and weakly convergent to \(|R_{\eta }\nabla f^0_{i,{\varepsilon }}|\) in \(L^{p^{-}}(\Omega ;{\mathbb {R}}^2).\) Hence, by Proposition 1, the following lower semicontinuous property:
holds. Combining this relation with the following ones:
which are direct consequences of Lemma 2 and compactness of the embedding \(L^2(\Omega )\subset H^{-1}(\Omega ),\) we finally obtain
Thus, \(f_{i,{\varepsilon }}^0\) is a minimizer to the problem (64)–(66).
Taking this existence result into account, we pass to the study of approximation properties of the problem (64)–(66). Namely, the main question we are going to discuss further is whether we can establish the convergence of minima of (64)–(66) to minima of (31)–(32) as \({\varepsilon }\) tends to zero. In other words, we aim to show that some optimal solutions to (31)–(32) can be approximated by the solutions of (64)–(66). To this end, we make use of some results of the variational convergence of minimization problems [26, 29,30,31] and begin with some auxiliaries (see also Refs. [14, 15, 30, 35, 36] for other aspects of this concept).
Lemma 3
Let \(\left\{ {\varepsilon }_k\right\} _{k\in {\mathbb {N}}}\) be a sequence of positive numbers converging to zero as \(k\rightarrow \infty .\) Let
be sequences such that
Then,
Proof
The following relations:
are a direct consequence of Lemma 2, compactness of the embedding \(L^2(\Omega )\subset H^{-1}(\Omega ),\) and condition (76). We also notice that, in view of representation
Proposition 1 and the initial assumptions (77)–(79) lead to the conclusion:
As a result, combining the last inequality with (81)–(82), we arrive at the announced relation (80)\(_2.\)
It remains to show that v is a feasible solution to the problem (31)–(32), i.e., \(v\in \Xi _i.\) To this end, we take into account the inclusion \(\nabla v\in L^{p(\cdot )}(\Omega ;{\mathbb {R}}^2)\) established above and the fact that \(v_k\in \Xi _{i,{\varepsilon }_k}\) for each \(k\in {\mathbb {N}}.\) Then, it follows from (75) that \(\gamma _{i,0}\leqslant v(x)\leqslant \gamma _{i,1}\) almost everywhere in \(\Omega ,\) and, therefore, \(v\in \Xi _i.\)
Lemma 4
For each feasible solution \(v\in \Xi _i\) to the original problem (31)–(32), there can be found a sequence \(\left\{ v_{\varepsilon }\right\} _{{\varepsilon }\rightarrow 0}\) such that
Proof
Let v be an arbitrary feasible solution to the problem (31)–(32). We define the sequence \(\left\{ v_{\varepsilon }\right\} _{{\varepsilon }\rightarrow 0}\) as a smooth mollification of v with the kernel \(K_{\varepsilon },\) i.e.,
Then, properties (84)–(86) are direct consequences of the Steklov smoothing procedure (see (i)–(iii)). Moreover, in view of (84) and the fact that \(\gamma _{i,0}\leqslant v(x)\leqslant \gamma _{i,1}\) a.e. in \(\Omega ,\) we can suppose that the same restriction for \(v_{\varepsilon }\)
holds true with \({\varepsilon }>0\) small enough.
Since \(v_{\varepsilon }\rightarrow v\) strongly in \(W^{1,p^{-}}(\Omega ),\) it follows without loss of generality that \(\nabla v_{\varepsilon }(x)\rightarrow \nabla v(x)\) almost everywhere in \(\Omega .\) As a result, this implies the pointwise convergence (88). Hence,
From this and the fact that \(|R_\eta \nabla v(x)|^{p(x)}\in L^1(\Omega ),\) we deduce
Thus, \(\left\| |R_\eta \nabla v_{\varepsilon }(\cdot )|^{p_{\varepsilon }(\cdot )}\right\| _{L^1(\Omega )}\leqslant C\) for some positive constant C not depending on \({\varepsilon }.\) Hence, in view of estimates (34), we get \(\nabla v_{\varepsilon }\in L^{p_{\varepsilon }(\cdot )}(\Omega ;{\mathbb {R}}^2)\) for \({\varepsilon }\) small enough. From this and (90), the assertion (83) follows. Moreover, the following equality:
immediately follows from (91).
It remains to observe that
by Lemma 2 and compactness of the embedding \(L^2(\Omega )\subset H^{-1}(\Omega ).\) As a result, we conclude from (92), (93), and (94) that, in fact, instead of the announced inequality (89), we have \(J_i(v)=\lim _{{\varepsilon }\rightarrow 0} J_{i,{\varepsilon }}(v_{\varepsilon }).\) The proof is complete.
We are now in a position to prove the main result of this section.
Theorem 2
Assume that the original minimization problem (31)–(32) has a non-empty set of minimizers. Let \(\left\{ f^0_{i,{\varepsilon }}\in \Xi _{i,{\varepsilon }}\right\} _{{\varepsilon }>0}\) be a sequence of solutions to the corresponding minimization problems (64)–(66). Let \(\left\{ p_{\varepsilon }:=1+\delta +\frac{a^2(1-\delta )}{a^2+\left| (\nabla K_{{\varepsilon }}*f^0_{i,{\varepsilon }})\right| ^2}\right\} _{{\varepsilon }>0}\) be the sequence of associated exponents. Assume that the sequence \(\left\{ p_{\varepsilon }\right\} _{{\varepsilon }>0}\) is compact with respect to the pointwise convergence in \(\Omega .\) Then, there exists an element \(f_i^0\in \Xi \) such that, up to a subsequence,
Proof
First, we observe that a given sequence of minimizers for approximating problems (64)–(66) is compact with respect to the convergence (95)–(97). Indeed, for an arbitrary test function \(\varphi \in C^\infty _c({\mathbb {R}}^2),\) we have
Let us assume that this function satisfies the pointwise constraints \(\gamma _{i,0} \leqslant \varphi (x) \leqslant \gamma _{i,1}\) in \(\Omega .\) Then, \(\varphi \in \Xi _{i,{\varepsilon }}\) for all \({\varepsilon }>0,\) and, therefore, we can suppose that
Hence,
Combining this issue with estimates (34), we see that the sequence \(\Big \{\! f^0_{i,{\varepsilon }}\in \Xi _{i,{\varepsilon }}\!\Big \}_{{\varepsilon }>0}\) is bounded in \(W^{1,p^{-}}(\Omega ).\) Hence, there exists a subsequence \(\Big \{\! f^0_{i,k}\in \Xi _{i,{\varepsilon }_k}\!\Big \}_{k\in {\mathbb {N}}}\) of \(\Big \{\! f^0_{i,{\varepsilon }}\in \Xi _{i,{\varepsilon }}\!\Big \}_{{\varepsilon }>0},\) and a function \(f_i^0\in W^{1,p^{-}}(\Omega )\) such that
where, by the Sobolev embedding theorem, \((p^{-})^*=\frac{2p^{-}}{2-p^{-}}=\frac{2+2\delta }{1-\delta }>2+\delta .\) From this, the conditions (95)–(97) follow, whereas (98) is a consequence of (34) and the boundedness property (100).
Thus, we may suppose that for the subsequence \(\left\{ f^0_{i,k}\in \Xi _{{\varepsilon }_k}\right\} _{k\in {\mathbb {N}}}\) all preconditions of Lemma 3 are fulfilled. Therefore, property (80) leads us to the conclusion that \(f_i^0\in \Xi _i\) and
where \(f_i^*\in \Xi \) is a minimizer for (31)–(32).
Then, Lemma 4 implies the existence of a realizing sequence \(\left\{ f^*_{i,{\varepsilon }}\in \Xi _{i,{\varepsilon }}\right\} _{{\varepsilon }>0}\) such that \(f^*_{i,{\varepsilon }}\rightarrow f^*_i\) as \({\varepsilon }\rightarrow 0\) in the sense of relations (84)–(88), and
Utilizing this fact, we have
From this and (102) we deduce that
Hence, we can combine (102) and (103) to get
Using these relations and the fact that the problem (31)–(32) is solvable, we may suppose that \(f_i^*=f_i^0.\) Since the equality (104) holds for all subsequences of \(\left\{ f^0_{i,{\varepsilon }}\right\} _{{\varepsilon }>0},\) which are convergent in the sense of relations (95)–(97), it follows that these limits coincide and, therefore, \(f_i^0\) is the limit of the whole sequence \(\left\{ f^0_{i,{\varepsilon }}\right\} _{{\varepsilon }>0}.\) Then, using the same argument for the sequence of minimizers as we did for the subsequence \(\left\{ f^0_{i,{\varepsilon }_k}\right\} _{\,k\in \,{\mathbb {N}}},\) we finally obtain
and this concludes the proof.
Remark 3
It is worth to emphasize a few principle issues from Theorem 2. The first one is that, in practice, the assumption concerning solvability of the original optimization problem is not so restricted and, in principle, it can be omitted. Indeed, any digital color image \(f=[f_1,f_2,f_3]^t\) is originally defined on some grid G. Therefore, each of its spectral channels \(f_i\Big |_G\) can be associated with some real-valued matrix. Hence, we can always suppose that the exponent \(p(|\nabla f_i|)\Big |_G\) is the restriction on the same grid of some Lipschitz-continuous function \(p(\cdot )\!\!:\Omega \rightarrow {\mathbb {R}}.\) Then, arguing as in the proof of Theorem 1, the solvability of the problem (31)–(32) can be easily established.
The second point, that should be emphasized here, is the assumption about the compactness property of the sequence of associated exponents \(\left\{ p_{\varepsilon }:=1+\delta +\frac{a^2(1-\delta )}{a^2+\left| (\nabla K_{{\varepsilon }}*f^0_{i,{\varepsilon }})\right| ^2}\right\} _{{\varepsilon }>0}\) with respect to the pointwise convergence in \(\Omega .\) Since this property is crucial in Theorem 2, we propose to consider it as an easily realized in practice criterion for the verification of whether the approximating sequence \(\left\{ f^0_{i,{\varepsilon }}\in \Xi _{i,{\varepsilon }}\right\} _{{\varepsilon }>0}\) leads to some optimal solution of the original problem.
6 Numerical Results
To illustrate the implementation of the proposed model (31)–(32) to the simultaneous denoising and contrast enhancement of color images, we make use of the optimality conditions in the form of (62). In other words, we have dropped the two-side constraints \(\gamma _{i,0}\leqslant v(x)\leqslant \gamma _{i,1}\) from the sets \(\Xi _i,\) and instead we control the fulfillment of this two-side constraint at each step of the numerical approximations.
Since, in practical implementations, it is reasonable to define the solution of the problem (31)–(32) using a “gradient descent” strategy, we can start with some initial image \(f=[f_1,f_2,f_3]^\textrm{t}\in L^2(\Omega ;{\mathbb {R}}^3)\) and pass to the following system of three initial-boundary value problems for quasi-linear parabolic equations with Neumann boundary conditions:
For numerical simulations, we set: \(\delta =\kappa \) in (24) and (21), and \(\eta =1-\kappa \) in (19), \(\kappa =0.001,\) \(\lambda =0.1,\) and \(\mu =2.\) As for the noise estimator \(a>0\) in (21), we use the choice of Black et al. [6], i.e.,
where \(\textrm{MAD}\) denotes the median absolute deviation of the corresponding spectral channel \(f_i\) of the original image \(f=[f_1,f_2,f_3]^\textrm{t}\) that can be computed as
and \({\textrm{median}}\left( \left| \nabla {\widetilde{S}}_i\right| \right) \) represents the median over the band \(S_i\!\!\!:G_H\rightarrow {\mathbb {R}}\) to the gradient amplitude.
To guarantee the stability of the proposed algorithm, we make use of the following condition:
There are numerous approaches to solve quasi-linear partial differential equations (see Refs. [4, 25] for various techniques). Since we are dealing with pixels in image processing, finite-difference approaches and an explicit scheme of the forward Euler method are arguably the best options. The number of iterations for each spectral channel can be defined experimentally. We used \(10^3\)-iterations. As for the size of the kernel W(x, y) used for D, this size manages the scale of the contrast enhancement. In our experiments, we used it equal to 3, 5, 7, 15, albeit it can be related to the size of the input image.
The most expensive computation is the one of D and \(\nabla D\) embedded in the computation of the right-hand side of the system (105). For acceleration of these computations, we can refer to Ref. [40], where the efficient Bernstein polynomials approximation has been proposed.
As follows from the result of numerical simulations (see Figs. 3, 5, 6, 7, 8, 9), parameters c, \(\mu ,\) \(\lambda ,\) and the size of \(\text {window}\) for the kernel W(x, y) are crucial for the contrast enhancement and these parameters have to be tuned in dependence on the desired result. In particular, we observe that at a large scale \((\text {window})\) and low contrast level c, the proposed model can produce an image with more details, but with the same lighting sensation as the original one. To show how the choice of the parameters c, \(\mu ,\) \(\lambda ,\) and \(\text {window}\) affect the results of contrast enhancement, we supplied all images in Figs. 1, 2, 3, 5, 6, 7, 8, and 9 by the histograms of their luma components which represent the perceptual brightness of the color images \(I\!\!:\Omega \rightarrow {\mathbb {R}}^3.\) To this end, we used the following representation for the luma: \(Y_I(x)=\alpha _R I_R(x)+ \alpha _G I_G(x)+ \alpha _B I_B(x)\) with
Here, \(I_R,\) \(I_G,\) and \(I_B\) stand for the intensities of a given image in R, G, and B spectral channels, respectively.
In particular, as follows from the obtained histograms, the proposed variational model is sufficiently sensitive to the choice of the weight coefficient c, whereas the size of \(\text {window}\) for the kernel W(x, y) affects the contrast enhancement in rather a mild manner (see Figs. 2 and 3).
7 Conclusions
We introduce a variational model with the non-standard growth condition for the restoration and contrast enhancement of multi-band images. We show that increasing the average local contrast measure improves the perceived contrast of the image. We have obtained sufficient conditions for the convergence of the minimization algorithm. The contrast scale and level in our model are adjustable, so that the proposed approach can be considered as fully adaptive. Our enhancement method for color images works directly on the RGB images without any pre- and/or post-processing. The automatic adaptation of the parameters to the content of the considered image could be a future direction of research.
References
Alaa, H., Alaa, N.E., Bouchriti, A., Charkaou, A.: An improved nonlinear anisotropic PDE with \(p(x)\)-growth conditions applied to image restoration and enhancement. Authorea (2022). https://doi.org/10.22541/au.165717367.72990650/v1
Alvarez, L., Lions, P.-L., Morel, J.-M.: Image selective smoothing and edge detection by nonlinear diffusion. SIAM J. Numer. Anal. 29, 845–866 (1992)
Ambrosio, L., Caselles, V., Masnou, S., Morel, J.M.: The connected components of sets of finite perimeter. Eur. J. Math. 3, 39–92 (2001)
Aubert, G., Kornprobst, P.: Mathematical Problems in Image Processing: Partial Differential Equations and the Calculus of Variations, vol. 147. Springer, New York (2006)
Bertalmío, M., Caselles, V., Provenzi, E., Rizzi, A.: Perceptual color correction through variational techniques. IEEE Trans. Image Process. 16(4), 1058–1072 (2007)
Black, M.J., Sapiro, G., Marimont, D.H., Heger, D.: Robust anisotropic diffusion. IEEE Trans. Image Process. 7(3), 421–432 (1998)
Blomgren, P., Chan, T.F., Mulet, P., Wong, C.: Total variation image restoration: numerical methods and extensions. In: Proceedings of the 1997 IEEE International Conference on Image Processing, Santa Barbara, CA, USA, 1997, vol. 42, pp. 384–387 (1997)
Bungert, L., Coomes, D.A., Ehrhardt, M.J., Rasch, J., Reisenhofer, R., Schönlieb, C.B.: Blind image fusion for hyperspectral imaging with the directional total variation. Inverse Probl. 34(4), 044003 (2018)
Bungert, L., Ehrhardt, M.J.: Robust image reconstruction with misaligned structural information. IEEE Access 8, 222944–222955 (2020)
Catté, F., Lions, P.L., Morel, J.-M., Coll, T.: Image selective smoothing and edge detection by nonlinear diffusion. SIAM J. Numer. Anal. 29(1), 182–193 (1992)
Chen, Y., Levine, S., Rao, M.: Variable exponent, linear growth functionals in image restoration. SIAM J. Appl. Math. 66(4), 1383–1406 (2006)
Chen, Y., Levine, S., Stanich, J.: Image restoration via nonstandard diffusion. Figshare (2014). https://www.mathcs.duq.edu/tech-reports/tr04-01.pdf
Chipot, M., de Oliveira, H.B.: Some results on the \(p(u)\)-Laplacian problem. Math. Annal. 375, 283–306 (2019)
D’Apice, C., De Maio, U., Kogut, P.I.: Gap phenomenon in homogenization of parabolic optimal control problem. IMA J. Math. Control Inf. 25, 461–480 (2008)
D’Apice, C., De Maio, U., Kogut, P.I.: Boundary velocity suboptimal control of incompressible flow in cylindrically perforated domain. Discrete Contin. Dyn. Syst. B 11(2), 283–314 (2009)
D’Apice, C., De Maio, U., Kogut, P.I.: An indirect approach to the existence of quasi-optimal controls in coefficients for multi-dimensional thermistor problem. In: Sadovnichiy, V.A., Zgurovsky, M. (eds.) Contemporary Approaches and Methods in Fundamental Mathematics and Mechanics, pp. 489–522. Springer, New York (2020)
D’Apice, C., Kogut, P.I., Kupenko, O., Manzo, R.: On variational problem with nonstandard growth functional and its applications to image processing. J. Math. Imaging Vis. 65(3), 472–491 (2023)
D’Apice, C., Kogut, P.I., Manzo, R., Uvarov, M.: Variational model with nonstandard growth conditions for restoration of satellite optical images using synthetic aperture radar. Eur. J. Appl. Math. 34(1), 77–105 (2023)
D’Apice, C., Kogut, P.I., Manzo, R.: On coupled two-level variational problem in Sobolev-Orlicz space. Differ. Integral Equ. 36(7/8), 621–660 (2023)
D’Apice, C., Kogut, P.I., Manzo, R.: A two-level variational algorithm in the Sobolev-Orlicz space to predict daily surface reflectance at LANDSAT high spatial resolution and MODIS temporal frequency. J. Comput. Appl. Math. 434, 1–23 (2023)
Dautray, R., Lion, J.L.: Mathematical Analysis and Numerical Methods for Science and Technology, vol. 5. Springer, Berlin (1985)
Diening, L., Harjulehto, P., Hästö, P., Ru̇ẑiĉka, M.: Lebesgue and Sobolev Spaces with Variable Exponents. Springer, New York (2011)
Evans, L.C., Gariepy, R.F.: Measure Theory and Fine Properties of Functions. CRC Press, Boca Raton (1992)
Jia, Z., Ng, M.K., Wang, W.: Color image restoration by saturation-value total variation. SIAM J. Imaging Sci. 12(2), 972–1000 (2019)
Karami, F., Meskine, D., Sadik, K.: A new nonlocal model for the restoration of textured images. J. Appl. Anal. Comput. 9(6), 2070–2095 (2019)
Kogut, P.I.: Variational S-convergence of minimization problems. Part I. Definitions and basic properties. Problemy Upravleniya i Informatiki (Avtomatika) 5, 29–42 (1996)
Kogut, P.I.: \(S\)-convergence of the conditional optimization problems and its variational properties. Problemy Upravleniya i Informatiki (Avtomatika) 4, 64–79 (1997)
Kogut, P.I.: On approximation of an optimal boundary control problem for linear elliptic equation with unbounded coefficients. Discret. Contin. Dyn. Syst. A 34(5), 2105–2133 (2014)
Kogut, P.I.: On optimal and quasi-optimal controls in coefficients for multi-dimensional thermistor problem with mixed Dirichlet-Neumann boundary conditions. Control Cybern. 48(1), 31–68 (2019)
Kogut, P.I., Kohut, Y., Manzo, R.: Existence result and approximation of an optimal control problem for the Perona-Malik equation. Ric. Mat. (2022). https://doi.org/10.1007/11587-022-00730-4
Kogut, P.I., Kupenko, O.P.: Approximation Methods in Optimization of Nonlinear Systems. De Gruyter Series in Nonlinear Analysis and Applications, vol. 32. Walter de Gruyter GmbH, Berlin (2019)
Kogut, P.I., Leugering, L.: On S-homogenization of an optimal control problem with control and state constraint. Z. fur Anal. ihre Anwend. 20(2), 395–429 (2001)
Kohr, H.: Total variation regularization with variable Lebesgue priors. arXiv:1702.08807 (2017)
Lieu, L.H., Vese, L.A.: Image restoration and decomposition via bounded total variation and negative Hilbert-Sobolev spaces. Appl. Math. Optim. 58, 167–193 (2008)
Manzo, R.: On Neumann boundary control problem for ill-posed strongly nonlinear elliptic equation with \(p\)-Laplace operator and \({L}^1\)-type of nonlinearity. Ric. Mat. 68(2), 769–802 (2019)
Manzo, R.: On Tikhonov regularization of optimal Neumann boundary control problem for an ill-posed strongly nonlinear elliptic equation with an exponential type of non-linearity. Differ. Integral Equ. 33(3/4), 139–162 (2020)
Meyer, Y.: Oscillating Patterns in Image Processing and Nonlinear Evolution Equations. University Lecture Series, vol. 2. AMS, Providence (2002)
Osher, S., Rudin, L.I.: Feature-oriented image enhancement using shock filters. SIAM J. Numer. Anal. 27(4), 458–474 (1990)
Piella, G.: Image fusion for enhanced visualization: a variational approach. Int. J. Comput. Vis. 83(1), 1–11 (2009)
Pierre, F., Aujol, J.-F., Bugeau, A., Steidl, G., Ta, V.-T.: Variational contrast enhancement of gray-scale and RGB images. J. Math. Imaging Vis. 57, 99–116 (2017)
Prasath, V.B.S., Urbano, J.M., Vorotnikov, D.: Analysis of adaptive forward-backward diffusion flows with applications in image processing. Inverse Probl. 31, 1–30 (2015)
Ring, W.: Structural properties of solutions to total variation regularization problems. ESAIM: Math. Model. Numer. Anal. 34(4), 799–810 (2000)
Schönlieb, C.B.: Total Variation Minimization with an \(H^{-1}\) Constraint. Research Gate Publication (2009)
Sugimura, D., Mikami, T., Yamashita, H., Hamamoto, T.: Enhancing color images of extremely low light scenes based on RGB/NIR images acquisition with different exposure times. IEEE Trans. Image Process. 24(11), 3586–3597 (2015)
Wunderli, T.: On time flows of minimizers of general convex functionals of linear growth with variable exponent in BV space and stability of pseudosolutions. J. Math. Anal. Appl. 364(2), 591–598 (2010)
Zhikov, V.V.: Solvability of the three-dimensional thermistor problem. Proc. Steklov Inst. Math. 281, 98–111 (2008)
Zhikov, V.V.: On variational problems and nonlinear elliptic equations with nonstandard growth conditions. J. Math. Sci. 175(5), 463–570 (2011)
Funding
Open access funding provided by Università degli Studi di Salerno within the CRUI-CARE Agreement. This work was supported by Visiting Professors Program-UNISA Call 2022.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors have no competing interests to declare that are relevant to the content of this article.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
D’Apice, C., Kogut, P.I., Manzo, R. et al. Variational Model with Nonstandard Growth Condition in Image Restoration and Contrast Enhancement. Commun. Appl. Math. Comput. (2024). https://doi.org/10.1007/s42967-024-00382-1
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42967-024-00382-1
Keywords
- Inverse problem
- Image contrast enhancement
- Denoising
- Constrained minimization problem
- Approximation methods
- Sobolev-Orlicz space
- Optimality system