1 Introduction

The notion of weak sharp minima plays an important role in the analysis of the perturbation behavior of certain classes of optimization problems, as well as in the convergence analysis of algorithms. Of particular note in this field is the paper by Burke and Ferris [1], which gave an extensive exposition of the notion and its impact on convex programming and convergence analysis. Since then, this notion has been studied extensively by many authors; see, for example, necessary or sufficient conditions for weak sharp minima in nonconvex programming [2, 3], and necessary and sufficient conditions for local weak sharp minima for sup-type (or lower-C1) functions [4, 5]. Recent developments on weak sharp minima and their relation to other issues can be found in [5–8].

A closed set $\bar{S} \subseteq \mathbb{R}^n$ is said to be a set of weak sharp minima for a function $f : \mathbb{R}^n \to \mathbb{R}$ relative to a closed set $S \subseteq \mathbb{R}^n$ with $\bar{S} \subseteq S$, if there is an $\alpha > 0$ such that

$$f(x) \ge f(y) + \alpha \operatorname{dist}(x, \bar{S}), \quad \forall x \in S \text{ and } y \in \bar{S},$$
(1.1)

where $\operatorname{dist}(x, \bar{S})$ denotes the Euclidean distance from $x$ to $\bar{S}$, i.e.,

$$\operatorname{dist}(x, \bar{S}) = \inf\{\|x - y\| \mid y \in \bar{S}\}.$$
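To make the definition concrete, here is a minimal numerical sketch (an illustration with assumed toy data, not part of the original development): take $f(x) = |x|$ over $S = [-1, 1]$ with $\bar{S} = \{0\}$, so that $\operatorname{dist}(x, \bar{S}) = |x|$ and (1.1) holds with any $\alpha \le 1$.

```python
import numpy as np

# Toy check of the weak sharp minima inequality (1.1):
# f(x) = |x| on S = [-1, 1] with solution set S_bar = {0}, so f(y) = 0 on
# S_bar and dist(x, S_bar) = |x|; (1.1) then reads |x| >= alpha * |x|.
alpha = 1.0
xs = np.linspace(-1.0, 1.0, 1001)      # sample points x in S
lhs = np.abs(xs)                       # f(x)
rhs = 0.0 + alpha * np.abs(xs)         # f(y) + alpha * dist(x, S_bar)
assert np.all(lhs >= rhs - 1e-12)
print("(1.1) verified on the sample with alpha =", alpha)
```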

A standard way to deal with weak sharp minima is to use the tools of variational analysis, such as subdifferentials and normal cones, or various generalized derivatives and tangent cones. In this paper, however, we study the concept of weak sharp minima from a new perspective. The nonconvex and convex cases are treated separately. Specifically, for the nonconvex case, we establish a close relationship between weak sharp minima and generalized semi-infinite max-min programming (see (2.2) below). To the best of our knowledge, these results do not appear explicitly in the literature. For the convex case, we use conjugate functions to characterize weak sharp minima. This gives a unified way to deal with different problems, such as convex inequality systems and affine convex inclusions. Finally, applications of weak sharp minima to the convergence analysis of algorithms for solving variational inequality problems are given.

We first recall some preliminary notions and results, which will be used throughout this paper. Given a set $A \subset \mathbb{R}^n$, we denote its closure and convex hull by $\operatorname{cl} A$ and $\operatorname{conv} A$, respectively. Its polar cone is

$$A^{0} = \{x \in \mathbb{R}^n \mid \langle x, y \rangle \le 0, \ \forall y \in A\}.$$

The indicator function and support function of A are defined by

$$\delta(x \mid A) = \begin{cases} 0, & \text{if } x \in A, \\ +\infty, & \text{otherwise,} \end{cases}$$

and

$$\sigma(w \mid A) = \sup\{\langle w, x \rangle \mid x \in A\}.$$

The conjugate function of a function $f : \mathbb{R}^n \to \mathbb{R}$ is

$$f^{*}(x^{*}) = \sup_{x \in \mathbb{R}^n}\{\langle x^{*}, x \rangle - f(x)\},$$

and the biconjugate function is defined as $f^{**}(x) = (f^{*})^{*}(x)$, i.e., the conjugate of $f^{*}$. The inf-convolution of $f_1$ and $f_2$ is

$$(f_1 \,\square\, f_2)(x) = \inf\{f_1(x_1) + f_2(x_2) \mid x = x_1 + x_2\}.$$
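For readers who prefer to compute, the conjugate can be approximated by brute force on a grid. The following sketch uses the test function $f(x) = x^2/2$ (an assumption chosen here for checkability, since it is self-conjugate):

```python
import numpy as np

# Brute-force conjugate f*(x*) = sup_x { <x*, x> - f(x) } on a 1-D grid.
# Test function (an assumption for this sketch): f(x) = x^2/2, whose
# conjugate is again x*^2/2, i.e., f is self-conjugate.
def conjugate_on_grid(f_vals, xs, x_star):
    return np.max(x_star * xs - f_vals)

xs = np.linspace(-10.0, 10.0, 20001)
f_vals = 0.5 * xs**2

for x_star in [-2.0, -0.5, 0.0, 1.0, 3.0]:
    approx = conjugate_on_grid(f_vals, xs, x_star)
    exact = 0.5 * x_star**2
    assert abs(approx - exact) < 1e-3
print("f(x) = x^2/2 is self-conjugate, as expected")
```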

The rest of the paper is organized as follows. The relationship between weak sharp minima and generalized semi-infinite programming is established in Section 2. In Section 3, we characterize weak sharpness by means of conjugate duality.

2 Nonconvex case

In this section, we show that the concept of weak sharp minima can be translated equivalently into a generalized semi-infinite max-min program. Given $\alpha > 0$, define a set-valued mapping by

$$S_{\alpha}(x) = \{y \in S \mid f(y) + \alpha\|x - y\| \le f(x)\}.$$
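For intuition, the following minimal sketch (assumed toy data, not from the paper) enumerates $S_{\alpha}(x)$ on a grid for $f(y) = |y|$ and $S = [-1, 1]$:

```python
import numpy as np

# Enumerate S_alpha(x) = { y in S : f(y) + alpha*||x - y|| <= f(x) } on a
# grid for the toy instance f(y) = |y|, S = [-1, 1] (assumed data).
def S_alpha(x, alpha, ys, f):
    return ys[f(ys) + alpha * np.abs(x - ys) <= f(x) + 1e-12]

ys = np.linspace(-1.0, 1.0, 2001)      # discretization of S
members = S_alpha(0.8, 0.5, ys, np.abs)
print("x = 0.8 itself belongs:", np.any(np.isclose(members, 0.8)))
print("the minimizer y = 0 belongs:", np.any(np.isclose(members, 0.0)))
```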

Clearly, this set is nonempty for every $x \in S$, since $x \in S_{\alpha}(x)$ for all $\alpha > 0$. Let $\bar{f}$ stand for the optimal value of $f$ over $S$. Some equivalent expressions of weak sharpness in terms of $S_{\alpha}$ are given below.

Theorem 2.1. Let f be a lower semi-continuous function. The following statements are equivalent:

(a). $\bar{S}$ is a set of weak sharp minima;

(b). There exists some $\alpha > 0$ such that $S_{\alpha}(x) \cap \bar{S} \ne \emptyset$ for all $x \in S$;

(c). There exists some $\alpha > 0$ such that, for any $x \in S$, one has

$$\min\{f(y) \mid y \in S_{\alpha}(x)\} = \bar{f}.$$

Proof. (a) ⇒ (b). If $\bar{S}$ is a set of weak sharp minima, it is easy to see that there exists $\alpha > 0$ such that

$$f(x) \ge \bar{f} + \alpha \operatorname{dist}(x, \bar{S}), \quad \forall x \in S.$$

If $x \in \bar{S}$, then $x \in S_{\alpha}(x)$ for any $\alpha > 0$ by definition. Thus, the conclusion is true. If $x \in S \setminus \bar{S}$, let $\bar{x} \in P_{\bar{S}}(x)$, the projection of $x$ onto $\bar{S}$. Then, the above inequality implies that

$$f(x) \ge f(\bar{x}) + \alpha\|x - \bar{x}\|,$$

i.e., $\bar{x} \in S_{\alpha}(x)$. Thus, $S_{\alpha}(x) \cap \bar{S} \ne \emptyset$.

(b) ⇒ (c). This is elementary.

(c) ⇒ (a). Choose $x \in S$. The definition of the infimum guarantees the existence of a sequence $\{y_n\} \subseteq S_{\alpha}(x)$ such that $f(y_n)$ approaches $\bar{f}$. Since $y_n \in S_{\alpha}(x)$, we have

$$\bar{f} + \alpha\|x - y_n\| \le f(y_n) + \alpha\|x - y_n\| \le f(x),$$
(2.1)

where the first inequality comes from the fact that $y_n \in S$ and $\bar{f}$ is the optimal value. Thus, $\alpha\|x - y_n\| \le f(x) - \bar{f}$, which implies the boundedness of $\{y_n\}$. Passing to a subsequence if necessary, we can assume that $\{y_n\}$ converges to a limit point $\bar{y}$. We claim that $\bar{y} \in S_{\alpha}(x)$, since $S_{\alpha}(x)$ is closed due to the lower semi-continuity of $f$. Using this property again, we have

$$f(\bar{y}) \le \lim_{n \to +\infty} f(y_n) = \bar{f}.$$

On the other hand, since $\bar{f}$ is the optimal value, $f(\bar{y}) \ge \bar{f}$. Hence, $f(\bar{y}) = \bar{f}$, i.e., $\bar{y} \in \bar{S}$. Taking limits in (2.1) yields $\bar{f} + \alpha\|x - \bar{y}\| \le f(x)$. Therefore,

$$\alpha \operatorname{dist}(x, \bar{S}) \le \alpha\|x - \bar{y}\| \le f(x) - \bar{f},$$

where the first inequality is due to the fact that $\bar{y} \in \bar{S}$. Since $x$ is an arbitrary element of $S$, the above inequality shows that $\bar{S}$ is a set of weak sharp minima.   □

The foregoing theorem shows that the concept of weak sharp minima can be converted into a class of optimization problems with the same optimal value. Based on this fact, we further derive the following result.

Theorem 2.2. Let f be a lower semi-continuous function. Then, the following statements are equivalent:

(a). $\bar{S}$ is a set of weak sharp minima;

(b). There exists $\alpha > 0$ such that $\bar{f}$ is the optimal value of the following generalized semi-infinite max-min program:

$$\max_{x \in S}\ \min_{y \in S_{\alpha}(x)} f(y).$$
(2.2)

Proof. It is easy to see that the identity

$$\max_{x \in S}\ \min_{y \in S_{\alpha}(x)} f(y) = \bar{f}$$

coincides with

$$\min_{y \in S_{\alpha}(x)} f(y) = \bar{f}, \quad \forall x \in S,$$

since $\bar{f}$ is the optimal value of $f$ over $S$. Therefore, the desired result follows from Theorem 2.1.   □
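As a sanity check of Theorem 2.2 (again on assumed toy data), the following sketch evaluates the max-min value in (2.2) on a grid for $f(y) = |y|$, $S = [-1, 1]$, $\bar{S} = \{0\}$, and $\alpha = 0.5$; the result is $\bar{f} = 0$, as the theorem predicts:

```python
import numpy as np

# Numerical check of Theorem 2.2 on assumed toy data: f(y) = |y|,
# S = [-1, 1], S_bar = {0}, f_bar = 0, alpha = 0.5. The max-min value of
# (2.2) should equal f_bar, since S_bar is weakly sharp for any alpha <= 1.
f = np.abs
alpha = 0.5
grid = np.linspace(-1.0, 1.0, 401)     # discretization of S

def inner_min(x):
    members = grid[f(grid) + alpha * np.abs(x - grid) <= f(x) + 1e-12]
    return np.min(f(members))          # S_alpha(x) contains x, so nonempty

value = max(inner_min(x) for x in grid)
print("max-min value:", value, "   expected f_bar = 0.0")
```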

To the best of our knowledge, the connection between weak sharp minima and generalized semi-infinite programming has not been stated explicitly in the literature. This result makes it possible to characterize weak sharpness by using the theory of generalized semi-infinite programming [9–11], and vice versa. In addition, the condition imposed in the foregoing theorem only requires the function to be lower semi-continuous, a rather weak condition in optimization. Hence, our result is applicable even in the case where the subgradient of $f$ does not exist, while in [2–6], $f$ is required, at least, to be subdifferentiable.

3 Convex case

We turn our attention in this section to the case where $f$ and $S$ are convex. In particular, we characterize the concept of weak sharp minima via conjugate functions. This enables us to deal with several different problems, such as convex inequality systems and affine convex inclusions. Denote by $B$ the unit ball in $\mathbb{R}^n$, i.e., $B = \{x \in \mathbb{R}^n \mid \|x\| \le 1\}$. The following simple result can be found in [12]. The proof is given here for completeness.

Lemma 3.1. Let $f$ be a closed convex function and $S$ be a closed convex set. Then, $\bar{S}$ is a set of weak sharp minima if and only if there exists some $\alpha > 0$ such that

$$(f^{*} \,\square\, \sigma_{S})(x) + \bar{f} \le \sigma_{\bar{S}}(x), \quad \forall x \in \alpha B.$$

Proof. Using the indicator function, it is easy to see that (1.1) is equivalent to the existence of $\alpha > 0$ such that

$$f(x) + \delta_{S}(x) \ge \bar{f} + \alpha \operatorname{dist}(x, \bar{S}), \quad \forall x \in \mathbb{R}^n.$$

Note that the function on the left-hand side is closed and convex, since $f$ is proper closed convex and $S$ is closed convex. Therefore, according to the Legendre-Fenchel transform [[13], Theorem 11.1], the above formula can be rewritten equivalently as

$$(f + \delta_{S})^{*}(x) \le (\bar{f} + \alpha \operatorname{dist}(\cdot, \bar{S}))^{*}(x), \quad \forall x \in \mathbb{R}^n,$$

which, together with the conjugacy correspondence between support functions and indicator functions and the fact that $\operatorname{dist}(x, \bar{S}) = (\sigma_{B} \,\square\, \delta_{\bar{S}})(x)$ [[14], Section 5], implies

$$(f^{*} \,\square\, \sigma_{S})(x) \le \alpha(\delta_{B} + \sigma_{\bar{S}})\!\left(\frac{x}{\alpha}\right) - \bar{f}.$$

Invoking the positive homogeneity of the support function [[14], Theorem 13.2] yields the desired result.   □
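To see the criterion of Lemma 3.1 in action, consider two toy instances (assumptions made here for checkability, not taken from [12]) with $S = \mathbb{R}$ and $\bar{S} = \{0\}$: then $\sigma_{S} = \delta_{\{0\}}$, so $f^{*} \,\square\, \sigma_{S} = f^{*}$, $\bar{f} = 0$, and $\sigma_{\bar{S}} \equiv 0$, and the criterion reduces to $f^{*}(x) \le 0$ on $\alpha B$:

```python
import numpy as np

# Lemma 3.1 on two assumed toy instances with S = R and S_bar = {0}:
# sigma_S = delta_{0}, so (f* box sigma_S) = f*, f_bar = 0, and
# sigma_{S_bar} = 0. The criterion becomes: f*(x) <= 0 for all x in alpha*B.
#   (1) f(x) = |x|:    f*(x) = 0 if |x| <= 1 else +inf
#                      -> holds for any alpha <= 1 (weakly sharp);
#   (2) f(x) = x^2/2:  f*(x) = x^2/2 > 0 for x != 0
#                      -> fails for every alpha > 0 (not weakly sharp).
def conj_abs(x):
    return 0.0 if abs(x) <= 1.0 else np.inf

def conj_quad(x):
    return 0.5 * x**2

alpha = 1.0
pts = np.linspace(-alpha, alpha, 101)
print("|x| weakly sharp:  ", all(conj_abs(x) <= 0.0 for x in pts))   # True
print("x^2/2 weakly sharp:", all(conj_quad(x) <= 0.0 for x in pts))  # False
```

The contrast is instructive: linear growth of $f$ away from $\bar{S}$ keeps $f^{*}$ at zero near the origin, while quadratic growth makes $f^{*}$ strictly positive there, so no sharpness modulus exists.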

Other deep characterizations of weak sharp minima can be found in [15, 16]. Since the concept of weak sharp minima is closely related to error bounds, we shall use the above result to study error bounds for convex inequality systems and affine convex inclusions, respectively.

3.1 Special cases

3.1.1 Convex inequality system

We first consider the following convex inequality system:

$$f_{i}(x) \le 0, \quad i \in I,$$
(3.1)

where each $f_i$ is a closed convex function and $I$ is an arbitrary (possibly infinite) index set. Let $f(x) = \max_{i \in I} f_i(x)$. Then, the solution set of (3.1) is $S = \{x \in \mathbb{R}^n \mid f(x) \le 0\}$. We say that (3.1) has a global error bound if there exists $\alpha > 0$ such that

$$\operatorname{dist}(x, S) \le \alpha f(x)_{+}, \quad \forall x \in \mathbb{R}^n,$$
(3.2)

where $f(x)_{+} = \max\{f(x), 0\}$.
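As a quick illustration (a toy instance assumed here), consider the single inequality $f(x) = 2x \le 0$: the solution set is $S = (-\infty, 0]$, $\operatorname{dist}(x, S) = \max\{x, 0\}$, and (3.2) holds with $\alpha = 1/2$:

```python
import numpy as np

# Global error bound (3.2) for the toy system f(x) = 2x <= 0:
# S = (-inf, 0], dist(x, S) = max(x, 0), f(x)_+ = 2*max(x, 0),
# so dist(x, S) <= alpha * f(x)_+ holds with alpha = 1/2.
xs = np.linspace(-5.0, 5.0, 1001)
dist_S = np.maximum(xs, 0.0)
f_plus = np.maximum(2.0 * xs, 0.0)
alpha = 0.5
assert np.all(dist_S <= alpha * f_plus + 1e-12)
print("(3.2) holds with alpha =", alpha)
```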

Theorem 3.2. The system (3.1) has a global error bound if and only if there exists α > 0 such that

$$\sigma_{S}(x) \ge \inf_{\lambda \in [0,1]} (\lambda f)^{*}(x), \quad \forall x \in \alpha B,$$

where

$$f^{*}(x) = \operatorname{cl}\left(\operatorname{conv}\{f_{i}^{*} \mid i \in I\}\right)(x).$$

Proof. Dividing (3.2) by $\alpha$ and taking conjugates on both sides yields

$$\sigma_{S}(x) \ge (f(\cdot)_{+})^{*}(x) = (\max\{f(\cdot), g(\cdot)\})^{*}(x), \quad \forall x \in \alpha B,$$

where we let $g(x) = 0$ for all $x$. According to [[14], Theorem 16.5], we know that

$$\begin{aligned} (f(\cdot)_{+})^{*}(x) &= \inf\{\lambda f^{*}(x_1) + (1 - \lambda) g^{*}(x_2) \mid x = \lambda x_1 + (1 - \lambda) x_2,\ \lambda \in [0,1]\} \\ &= \inf_{\lambda \in [0,1]} \lambda f^{*}\!\left(\frac{x}{\lambda}\right) \\ &= \inf_{\lambda \in [0,1]} (\lambda f)^{*}(x), \end{aligned}$$

where the second step follows from the fact that $g^{*} = \delta_{\{0\}}$. The expression for $f^{*}$ also follows from [[14], Theorem 16.5].   □
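The key step in the proof is the identity $(\lambda f)^{*}(x) = \lambda f^{*}(x/\lambda)$. The following brute-force check (with the assumed test function $f(x) = x^2/2$, for which $f^{*} = f$) confirms it numerically:

```python
import numpy as np

# Brute-force check of the identity (lambda*f)*(x) = lambda * f*(x/lambda),
# used in the second step above, for the assumed test function f(x) = x^2/2
# (so f* = f and (lambda*f)*(x) = x^2 / (2*lambda)).
xs = np.linspace(-50.0, 50.0, 200001)

def num_conj(g_vals, x_star):
    return np.max(x_star * xs - g_vals)

for lam in [0.25, 0.5, 1.0]:
    g_vals = lam * 0.5 * xs**2
    for x_star in [-1.0, 0.5, 2.0]:
        approx = num_conj(g_vals, x_star)
        exact = x_star**2 / (2.0 * lam)
        assert abs(approx - exact) < 1e-4
print("(lambda f)*(x) = lambda f*(x/lambda) verified numerically")
```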

The foregoing result is applicable even for the case where the algebraic interior of the system (3.1) is empty.

3.1.2 Affine convex inclusion

Consider the following affine convex inclusion:

$$Ax - b \in C,$$
(3.3)

where $A \in \mathbb{R}^{n \times n}$ is a real symmetric matrix and $C \subseteq \mathbb{R}^n$ is a nonempty, closed, and convex set. Denote by $S$ the solution set. The system (3.3) is said to have a global error bound if there exists $\alpha > 0$ such that

$$\alpha \operatorname{dist}(x, S) \le \operatorname{dist}(Ax - b \mid C), \quad \forall x \in \mathbb{R}^n.$$
(3.4)

Theorem 3.3. Let $A$ be an invertible matrix. Then, the affine convex inclusion (3.3) has a global error bound if and only if there exists $\alpha > 0$ with $\alpha \le 1/\|A^{-1}\|$ such that

$$\sigma_{S}(x) \ge \sigma_{C}(A^{-1}x) + \langle x, A^{-1}b \rangle, \quad \forall x \in \alpha B.$$

Proof. Let $f(x) = \operatorname{dist}(Ax - b \mid C)$. Taking conjugates on both sides of (3.4) yields

$$\sigma_{S}(x) \ge f^{*}(x), \quad \forall x \in \alpha B.$$

On the other hand, since $\alpha \le 1/\|A^{-1}\|$, it follows that

$$\|A^{-1}x\| \le \|A^{-1}\|\,\|x\| \le 1, \quad \forall x \in \alpha B.$$
(3.5)

Therefore, for $x \in \alpha B$, we have

$$\begin{aligned} f^{*}(x) &= \sup_{x^{*}}\{\langle x^{*}, x \rangle - f(x^{*})\} \qquad (\text{letting } y = Ax^{*} - b) \\ &= \sup_{y}\{\langle x, A^{-1}(y + b) \rangle - \operatorname{dist}(y \mid C)\} \\ &= \sup_{y}\{\langle A^{-1}x, y \rangle - \operatorname{dist}(y \mid C)\} + \langle x, A^{-1}b \rangle \\ &= (\delta_{B} + \sigma_{C})(A^{-1}x) + \langle x, A^{-1}b \rangle \\ &= \sigma_{C}(A^{-1}x) + \langle x, A^{-1}b \rangle, \end{aligned}$$

where the third step uses the symmetry of $A$ and the last step comes from (3.5). This completes the proof.   □
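A minimal numerical sketch of the error bound (3.4) (toy data assumed here): take the symmetric invertible matrix $A = \operatorname{diag}(2, 1)$, $b = 0$, and $C = \mathbb{R}^2_{-}$, so that $S = \mathbb{R}^2_{-}$ and both distances reduce to coordinatewise projections:

```python
import numpy as np

# Numerical check of the error bound (3.4) on assumed toy data: the
# symmetric invertible matrix A = diag(2, 1), b = 0, and C = R^2_-.
# Then S = { x : Ax in C } = R^2_-, and both distances are coordinatewise:
# dist(x, R^2_-) = ||max(x, 0)||. Here alpha = 1 is a valid modulus.
rng = np.random.default_rng(0)
A = np.diag([2.0, 1.0])
alpha = 1.0

for _ in range(1000):
    x = rng.uniform(-5.0, 5.0, size=2)
    dist_S = np.linalg.norm(np.maximum(x, 0.0))        # dist(x, S)
    dist_C = np.linalg.norm(np.maximum(A @ x, 0.0))    # dist(Ax - b, C)
    assert alpha * dist_S <= dist_C + 1e-12
print("(3.4) holds on 1000 random samples with alpha =", alpha)
```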

When $C$ is the negative orthant, the concept of global error bounds for affine convex inclusions is also referred to as Hoffman bounds, in honor of Hoffman's seminal work [17]. Historically, this is the most intensively studied case. We do not attempt a review of the enormous literature on this case, or even on the slightly more general polyhedral case. Rather, our focus is on the case where $C$ is only assumed to be convex.

As mentioned in the Introduction, the concept of weak sharp minima plays an important role in the convergence analysis of optimization algorithms. Hence, we investigate the impact of weak sharp minima on algorithms for solving the variational inequality problem (VIP), which is to find a vector $x^{*} \in X$ such that

$$\langle F(x^{*}), x - x^{*} \rangle \ge 0, \quad \forall x \in X,$$

where $X$ is a nonempty closed convex set in $\mathbb{R}^n$ and $F$ is a mapping from $X$ into $\mathbb{R}^n$. Denote by $X^{*}$ the solution set of (VIP). Due to the absence of an objective function in (VIP), Marcotte and Zhu [18] adopted the following geometric characterization as the definition of weak sharpness, i.e., the solution set $X^{*}$ of (VIP) is said to be weakly sharp if

$$-F(x^{*}) \in \operatorname{int} \bigcap_{x \in X^{*}} \left(T_{X}(x) \cap N_{X^{*}}(x)\right)^{0}, \quad \forall x^{*} \in X^{*}.$$
(3.6)

Here, we further introduce two extended versions: uniformly weak sharp minima and local weak sharpness. More precisely, we say that $X^{*}$ is a set of uniformly weak sharp minima of (VIP) if there exists $\alpha > 0$ such that

$$-F(x^{*}) + \alpha B \subseteq \bigcap_{x \in X^{*}} \left(T_{X}(x) \cap N_{X^{*}}(x)\right)^{0}, \quad \forall x^{*} \in X^{*}.$$
(3.7)

We say that $\bar{x} \in X^{*}$ is a locally weakly sharp solution of (VIP) if there exists $\delta > 0$ such that

$$-F(\bar{x}) \in \operatorname{int} \bigcap_{x \in X^{*} \cap B(\bar{x}, \delta)} \left(T_{X}(x) \cap N_{X^{*}}(x)\right)^{0}.$$
(3.8)

Clearly, (3.8) is weaker than (3.6), because the latter corresponds to $\delta = \infty$ and requires $\bar{x}$ to range over the whole solution set $X^{*}$.

Theorem 3.4. Let $\{x^k\} \subset X$ be an iterative sequence generated by some algorithm. If either

(i). X* is uniformly weakly sharp, and

$$\|F(x^k) - F(z^k)\| \to 0 \ \text{as } k \to \infty,$$
(3.9)

where $z^k \in P_{X^{*}}(x^k)$; or

(ii). $\{x^k\}$ converges to some $\bar{x} \in X^{*}$, $\bar{x}$ is locally weakly sharp, and $F$ is continuous over $X$;

then $x^k \in X^{*}$ for all $k$ sufficiently large if and only if

$$\lim_{k \to \infty} \|P_{T_{X}(x^k)}(-F(x^k))\| = 0.$$
(3.10)

Proof. The necessity is trivial, since $x^k \in X^{*}$ is equivalent to saying $-F(x^k) \in N_{X}(x^k)$, which further implies that $\|P_{T_{X}(x^k)}(-F(x^k))\| = 0$.

We now show the sufficiency. First assume that (i) holds. Suppose, on the contrary, that there exists a subsequence $\{x^k\}_{K}$ such that $x^k \notin X^{*}$ for all $k \in K$, where $K$ is an infinite subset of $\{1, 2, \ldots\}$. For any $k \in K$, there exists $z^k \in X^{*}$ (not necessarily unique) such that $\|x^k - z^k\| = \operatorname{dist}(x^k, X^{*})$, i.e., $z^k \in P_{X^{*}}(x^k)$. Note that $x^k - z^k \in \hat{N}_{X^{*}}(z^k)$ by [[13], Example 6.16] and that $\hat{N}_{X^{*}}(z^k) \subseteq N_{X^{*}}(z^k)$ by [[13], Proposition 6.5]. It then follows that $x^k - z^k \in N_{X^{*}}(z^k) \cap T_{X}(z^k)$ and $z^k - x^k \in T_{X}(x^k)$.

Invoking (3.7), there exists $\alpha > 0$ such that

$$-F(z^k) + \alpha B \subseteq \left(T_{X}(z^k) \cap N_{X^{*}}(z^k)\right)^{0},$$
(3.11)

which further implies

$$\left\langle -F(z^k) + \alpha \frac{x^k - z^k}{\|x^k - z^k\|},\ x^k - z^k \right\rangle \le 0.$$

Therefore,

$$\begin{aligned} \alpha &\le \left\langle F(z^k), \frac{x^k - z^k}{\|x^k - z^k\|} \right\rangle \\ &= \left\langle -F(x^k), \frac{z^k - x^k}{\|z^k - x^k\|} \right\rangle + \left\langle F(x^k) - F(z^k), \frac{z^k - x^k}{\|z^k - x^k\|} \right\rangle \\ &\le \max\{\langle -F(x^k), d \rangle \mid d \in T_{X}(x^k),\ \|d\| \le 1\} + \|F(x^k) - F(z^k)\| \\ &= \|P_{T_{X}(x^k)}(-F(x^k))\| + \|F(x^k) - F(z^k)\|. \end{aligned}$$

Taking the limit as $k \in K$ approaches $\infty$, it follows from (3.9) and (3.10) that $\alpha \le 0$, which is a contradiction.

If condition (ii) holds, we must have, as shown above, that $z^k$ converges to $\bar{x}$ as well, since $\|z^k - \bar{x}\| \le \|z^k - x^k\| + \|x^k - \bar{x}\| \le 2\|x^k - \bar{x}\|$. Hence, for $k$ large enough, we must have $z^k \in B(\bar{x}, \delta)$. Thus, (3.8) implies the existence of $\alpha > 0$ such that

$$-F(\bar{x}) + \alpha B \subseteq \left(T_{X}(z^k) \cap N_{X^{*}}(z^k)\right)^{0}.$$

Since $F$ is continuous, $\|F(x^k) - F(z^k)\| \le \|F(x^k) - F(\bar{x})\| + \|F(z^k) - F(\bar{x})\| \to 0$ as $k \to \infty$. Hence, repeating the argument following (3.11) with $z^k$ replaced by $\bar{x}$ (on the left-hand side of (3.11)) yields a contradiction. This completes the proof.   □
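To illustrate Theorem 3.4 (a sketch under assumed toy data, not the paper's algorithm), consider the VIP on $X = [-1, 1]$ with constant $F \equiv 1$, whose solution set $X^{*} = \{-1\}$ is weakly sharp. Running the basic projection method $x^{k+1} = P_{X}(x^k - tF(x^k))$ and monitoring the residual in (3.10) exhibits the finite termination predicted by the theorem:

```python
import numpy as np

# Toy VIP on X = [-1, 1] with F(x) = 1: the solution set X* = {-1} is
# weakly sharp. We run the projection method x^{k+1} = P_X(x^k - t*F(x^k))
# and monitor ||P_{T_X(x^k)}(-F(x^k))||, which hits 0 exactly when the
# iterate enters X*, i.e., finite termination.
def proj_X(x):                       # projection onto X = [-1, 1]
    return np.clip(x, -1.0, 1.0)

def residual(x, Fx):
    # Projection of -F(x) onto the tangent cone of the interval [-1, 1]:
    # T_X(x) = R in the interior, [0, inf) at x = -1, (-inf, 0] at x = 1.
    v = -Fx
    if np.isclose(x, -1.0):
        return abs(max(v, 0.0))
    if np.isclose(x, 1.0):
        return abs(min(v, 0.0))
    return abs(v)

x, t = 0.7, 0.25
for k in range(20):
    r = residual(x, 1.0)
    print(f"k={k:2d}  x={x:+.3f}  residual={r:.3f}")
    if r == 0.0:
        break
    x = proj_X(x - t * 1.0)
```

Since $F$ is constant here, condition (3.9) holds trivially, and the residual drops to zero at the exact iteration where $x^k$ enters $X^{*}$.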

Finally, let us compare our result with that given in [18], where the finite termination property is established under the assumptions that (i) $F$ is pseudomonotone$^{+}$, i.e., for any $x, y \in X$,

$$\langle F(x), y - x \rangle \ge 0 \ \Longrightarrow\ \langle F(y), y - x \rangle \ge 0,$$

and

$$\langle F(x), y - x \rangle \ge 0 \ \text{and}\ \langle F(y), y - x \rangle = 0 \ \Longrightarrow\ F(y) = F(x);$$

(ii) $X^{*}$ is a set of weak sharp minima; and (iii) $\operatorname{dist}(x^k \mid X^{*})$ converges to zero, and $F$ is uniformly continuous over some open set containing $\{x^k\}$ and $X^{*}$. Indeed, according to [[18], Theorem 3.1], we know that $F$ is constant over $X^{*}$ when $F$ is pseudomonotone$^{+}$. Using this fact, the concept of uniformly weak sharp minima reduces to weak sharp minima. Meanwhile, it is easy to see that condition (iii) given in [18] implies (3.9). In addition, we further consider the case where $x^k$ has a limit point, under a weaker version of weak sharpness.