1 Introduction

The notion of a strict local minimizer of order m plays an important role in the convergence analysis of iterative numerical methods (see, for example, [1]) and in stability analysis (see, for example, [2, 3]). Results and optimality conditions characterizing such minimizers for nonlinear constrained mathematical programming problems have been derived by Auslender [4], Studniarski [5], and Ward [6]. These results, in general, suggest that such minimizers are often exactly those that satisfy an “m-th derivative test”. In this paper, we present a different approach to identifying such minimizers.

In the past few years, considerable attention has been given to devising methods for solving nonlinear programming problems via unconstrained minimization techniques. One class of such methods which has emerged as very promising is the class of exact penalty function methods. The most popular nondifferentiable exact penalty function is the absolute value penalty function (the method based on it is also called the exact \(l_{1}\) penalty function method); it has been investigated in [7–26], among others. Most of the results established for the nondifferentiable exact \(l_{1}\) penalty function are devoted to the study of conditions ensuring that an optimal solution of a given convex optimization problem is also an unconstrained minimizer of the penalty function.

In this paper, we present a new characterization of the exact penalty method with the absolute value penalty function used to solve a class of nonconvex nondifferentiable optimization problems involving both inequality and equality constraints in which the functions involved are locally Lipschitz \((F,\rho )\)-convex functions of order m, not necessarily with respect to the same \(\rho \). Namely, we use the exact \(l_{1}\) penalty function method to find a strict global minimizer of order m in the considered nonconvex nondifferentiable optimization problem involving both inequality and equality constraints. Indeed, we associate a strict global minimizer of order m in the nonsmooth constrained extremum problem with a strict global minimizer of order m in an unconstrained optimization problem (called the penalized optimization problem) constructed in this approach, in which the absolute value penalty function is minimized. Further, we also establish the converse result, that is, that a strict global minimizer of order m in the penalized optimization problem with the exact \(l_{1}\) penalty function is also a strict global minimizer of order m in the nonlinear constrained optimization problem with both inequality and equality constraints. In this way, we prove that the sets of strict global minimizers of order m in both optimization problems coincide for a larger class of optimization problems than convex ones. The threshold of the penalty parameter above which this result holds is equal to the largest absolute value of the Lagrange multipliers. The results established in the paper are illustrated by suitable examples of nonconvex nondifferentiable optimization problems solved by using the exact \(l_{1}\) penalty function method.

2 Preliminaries and problem formulation

Throughout this section, X is a nonempty open subset of \(R^{n}\).

Definition 1

[27] The Clarke generalized subgradient of f at \(x\in X\), denoted \(\partial f\left( x\right) \), is defined by \(\partial f\left( x\right) =\left\{ \xi \in R^{n}:f^{\,0}(x;v)\ge \xi ^{T}v\text { for all }v\in R^{n}\right\} \), where \( f^{\,0}\left( x;v\right) \) is the Clarke generalized directional derivative of a locally Lipschitz function \(f:X\rightarrow R\) at \(x\in X\) in the direction \(v\in R^{n}\), given by \(f^{\,0}\left( x;v\right) =\limsup _{y\rightarrow x,\,t\downarrow 0}\frac{f\left( y+tv\right) -f\left( y\right) }{t}\).
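
For instance (a standard computation, included here only for illustration), for \(f(x)=\left| x\right| \) on R one obtains

$$\begin{aligned} f^{\,0}\left( 0;v\right) =\limsup _{y\rightarrow 0,\,t\downarrow 0}\frac{\left| y+tv\right| -\left| y\right| }{t}=\left| v\right| \text {,} \end{aligned}$$

so that \(\partial f\left( 0\right) =\left\{ \xi \in R:\left| v\right| \ge \xi v\text { for all }v\in R\right\} =\left[ -1,1\right] \).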

Definition 2

A functional \(F:X\times X\times R^{n}\rightarrow R\) is sublinear (with respect to the third component) if, for all \(x,u\in X\),

  1. (i)

    \(F\left( x,u;q_{1}+q_{2}\right) \le F\left( x,u;q_{1}\right) +F\left( x,u;q_{2}\right) \), \(\forall q_{1},q_{2}\in R^{n}\),

  2. (ii)

    \(F\left( x,u;\alpha q\right) =\alpha F\left( x,u;q\right) \), \( \forall \alpha \in R_{+}\), \(\forall q\in R^{n}\).

By (ii), it is clear that

$$\begin{aligned} F(x,u;0)=0. \end{aligned}$$
(1)
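
Two standard examples of such functionals (given here for illustration) are

$$\begin{aligned} F\left( x,u;q\right) =\eta \left( x,u\right) ^{T}q\quad \text {and}\quad F\left( x,u;q\right) =\left\| q\right\| \text {,} \end{aligned}$$

where \(\eta :X\times X\rightarrow R^{n}\) is an arbitrary map; the first satisfies (i) with equality, and the second satisfies (i) by the triangle inequality.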

Now, we give the definition of a nondifferentiable \((F,\rho )\)-convex function of order m (see [28]).

Definition 3

A locally Lipschitz function \(f:X\rightarrow R\) is said to be \((F,\rho )\)-convex of order m at \(u\in X\) on X if there exist a sublinear (with respect to the third component) functional \( F:X\times X\times R^{n}\rightarrow R\), an integer \(m\ge 1\) and a real number \(\rho \) such that the following inequality

$$\begin{aligned} f(x)-f(u)\ge F\left( x,u;\xi \right) +\rho \left\| x-u\right\| ^{m} \end{aligned}$$

holds for every \(\xi \in \partial f\left( u\right) \) and all \(x\in X\). If the above inequality is satisfied for all \(u\in X\), then f is said to be \( (F,\rho )\)-convex of order m on X.
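
As a simple illustration of Definition 3 (a direct verification, not taken from [28]), let \(X=R\), \(f(x)=x^{2}\), \(F\left( x,u;q\right) =q\left( x-u\right) \) and \(m=2\). Since \(\partial f\left( u\right) =\left\{ 2u\right\} \), the defining inequality holds with \(\rho =1\), in fact with equality:

$$\begin{aligned} f(x)-f(u)=x^{2}-u^{2}=2u\left( x-u\right) +\left( x-u\right) ^{2}=F\left( x,u;2u\right) +1\cdot \left\| x-u\right\| ^{2}\text {,} \end{aligned}$$

so f is \(\left( F,1\right) \)-convex of order 2 on R.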

In the paper, we consider the following constrained optimization problem:

$$\begin{aligned}&\text {minimize }f(x) \\&\text {subject to}\quad g_{i}(x)\le 0\text {, }i\in I=\left\{ 1,\ldots ,p\right\} \qquad \qquad \qquad \text {(P)}\\&h_{j}(x)=0\text {,} \ j\in J=\left\{ 1,\ldots ,s\right\} \text {,} \end{aligned}$$

where \(f:X\rightarrow R\) and \(g_{i}:X\rightarrow R\), \(i\in I\), \( h_{j}:X\rightarrow R\), \(j\in J\), are locally Lipschitz functions on a nonempty open set \(X\subset R^{n}\).

For the purpose of simplifying our presentation, we next introduce some notation which will be used frequently throughout this paper. Let \( D:=\left\{ x\in X:g_{i}(x)\le 0\text {, }i\in I\text {, }h_{j}(x)=0\text {, }j\in J\right\} \) be the set of all feasible solutions of problem (P). Further, by \(I\left( \overline{x}\right) =\left\{ i\in I:g_{i}\left( \overline{x}\right) =0\right\} \), we denote the set of active inequality constraints at a point \( \overline{x}\in D\).

The concept of a strict local minimizer of order m was defined by Cromme [1], under the name “strongly unique” minimizer, in a study of iterative numerical methods.

Definition 4

Let \(m\ge 1\) be an integer. We say that \(\overline{x}\) is a strict global minimizer of order m in the considered optimization problem (P) if there exists \(\beta >0\) such that

$$\begin{aligned} f(x)\ge f(\overline{x})+\beta \left\| x-\overline{x}\right\| ^{m} \end{aligned}$$

for all \(x\in D\).
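
To see how the order m matters (our illustration), let \(D=R\) and \(\overline{x}=0\). For \(f(x)=\left| x\right| \), the point \(\overline{x}=0\) satisfies Definition 4 with \(m=1\) and \(\beta =1\), since \(\left| x\right| \ge 0+1\cdot \left| x\right| ^{1}\) for all \(x\in R\). In contrast, for \(f(x)=x^{2}\), the point \(\overline{x}=0\) is a strict global minimizer of order 2 (with \(\beta =1\)), but not of order 1, since no \(\beta >0\) satisfies \(x^{2}\ge \beta \left| x\right| \) for all x near 0.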

It is well known (see, for example, [27, 29, 30]) that, if \(\overline{x}\in D\) is an optimal solution for problem (P), then the following conditions, known as the generalized form of the Karush–Kuhn–Tucker conditions, are satisfied:

Theorem 5

(Generalized Karush–Kuhn–Tucker necessary optimality conditions). Let \( \overline{x}\in D\) be an optimal solution of problem (P) and a suitable constraint qualification be satisfied at \(\overline{x}\). Then, there exist \( \overline{\lambda }\in R^{p}\), \(\overline{\mu }\in R^{s}\) such that

$$\begin{aligned} 0\in \partial f(\overline{x})+\sum _{i=1}^{p}\overline{\lambda }_{i}\partial g_{i}(\overline{x})+\sum _{j=1}^{s}\overline{\mu }_{j}\partial h_{j}( \overline{x})\text {,} \end{aligned}$$
(2)
$$\begin{aligned} \overline{\lambda }_{i}g_{i}(\overline{x})=0 \text {, } \quad i\in I\text {,} \end{aligned}$$
(3)
$$\begin{aligned} \overline{\lambda }_{i}\in R_{+}, \quad i\in I. \end{aligned}$$
(4)

Remark 6

Note that, following Hiriart-Urruty [30], a constraint qualification assuring the conclusion of the above theorem is the following one: there exists \(v\in R^{n}\) such that \(g_{i}^{0}\left( \overline{x};v\right) <0\) for all \( i\in I\left( \overline{x}\right) \). In the presence of inequality constraints, starting from Fritz John type optimality conditions, Clarke [27] established generalized Karush–Kuhn–Tucker necessary optimality conditions under the assumption of “calmness” of the optimization problem. This constraint qualification has the advantage of being present in most problems, even if it seems difficult to verify in general. Further, it is possible to use the following Cottle constraint qualification: either \(g_{i}\left( \overline{x}\right) <0\), \(i\in I\), or \(0\notin \mathrm{conv}\left\{ \partial g_{i}\left( \overline{x}\right) :i\in I\left( \overline{x}\right) \right\} \). Since the results in the paper are established for nondifferentiable optimization problems with generalized convex functions (that is, locally Lipschitz \(\left( F,\rho \right) \)-convex functions of order m), it is also possible to use the generalized Slater constraint qualification.

In the paper, we will assume that a suitable constraint qualification is satisfied at any optimal solution in the considered nonlinear constrained optimization problem (P).

Definition 7

The point \(\overline{x}\in D\) is said to be a Karush–Kuhn–Tucker point (a KKT point, for short) if there exist Lagrange multipliers \(\overline{\lambda }\in R^{p}\), \(\overline{\mu }\in R^{s}\) such that the conditions (2)–(4) are satisfied at \(\overline{x}\) with these Lagrange multipliers.

3 The exact \(l_{1}\) penalty method for optimization problems with locally Lipschitz \(\left( F,\rho \right) \)-convex functions of order m

The unconstrained optimization problem with the exact \(l_{1}\) penalty function constructed in the exact \(l_{1}\) penalty function method for the considered constrained minimization problem (P) can be written in the following form

$$\begin{aligned} \text {minimize }P(x,c)=f(x)+c\left[ \sum _{i\in I}g_{i}^{+}(x)+\sum _{j\in J}\left| h_{j}(x)\right| \right] , \quad \text {(P(}c\text {))} \end{aligned}$$
(5)

where, for a given inequality constraint \(g_{i}(x)\le 0\), the function \( g_{i}^{+}(x)\) is defined as follows:

$$\begin{aligned} g_{i}^{+}(x)=\left\{ \begin{array}{lll} 0 & \text {if} & g_{i}(x)\le 0\text {,} \\ g_{i}(x) & \text {if} & g_{i}(x)>0\text {.} \end{array} \right. \end{aligned}$$
(6)

We will call the unconstrained optimization problem \((\mathrm{P}(c))\) the penalized optimization problem (with the absolute value penalty function).
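
As a computational aside (our sketch, not part of the theory above; the problem data below are placeholder assumptions, not taken from the paper), the penalized objective (5)–(6) is straightforward to evaluate and to minimize with a derivative-free routine, since the penalty term is nonsmooth:

```python
import numpy as np
from scipy.optimize import minimize

def l1_penalty(f, gs, hs, c):
    """Exact l1 penalty function P(x, c) = f(x) + c*(sum_i g_i^+(x) + sum_j |h_j(x)|)."""
    def P(x):
        g_plus = sum(max(0.0, g(x)) for g in gs)  # terms g_i^+(x), cf. (6)
        h_abs = sum(abs(h(x)) for h in hs)        # terms |h_j(x)|
        return f(x) + c * (g_plus + h_abs)
    return P

# Placeholder problem (our assumption): minimize x1^2 + x2^2
# subject to 1 - x1 - x2 <= 0 and x1 - x2 = 0.
# The optimum is (1/2, 1/2) with Lagrange multipliers lambda = 1, mu = 0.
f = lambda x: x[0]**2 + x[1]**2
gs = [lambda x: 1.0 - x[0] - x[1]]
hs = [lambda x: x[0] - x[1]]

P = l1_penalty(f, gs, hs, c=2.0)  # c above the largest multiplier in absolute value
res = minimize(P, x0=np.zeros(2), method="Nelder-Mead")
print(res.x)  # approximately [0.5, 0.5], the constrained minimizer
```

Here the penalty parameter \(c=2\) exceeds the largest Lagrange multiplier in absolute value, in accordance with the threshold discussed below; the Nelder–Mead method is used because the penalized objective is not differentiable along the constraint boundaries.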

Theorem 8

Let \(\overline{x}\in D\) be a Karush–Kuhn–Tucker point of the constrained optimization problem (P), at which the Generalized Karush–Kuhn–Tucker conditions (2)–(4) are satisfied with the Lagrange multipliers \(\overline{\lambda }\in R^{p}\) and \( \overline{\mu }\in R^{s}\). Let \(J^{+}\left( \overline{x}\right) =\left\{ j\in J:\overline{\mu }_{j}>0\right\} \) and \(J^{-}\left( \overline{x}\right) =\left\{ j\in J:\overline{\mu }_{j}<0\right\} \). Furthermore, assume the following hypotheses are satisfied:

  1. (a)

    the objective function f is locally Lipschitz \(\left( F,\rho _{f}\right) \)-convex of order m at \(\overline{x}\) on X,

  2. (b)

    the inequality constraints \(g_{i}\), \(i\in I\left( \overline{x} \right) \), are locally Lipschitz \(\left( F,\rho _{g_{i}}\right) \)-convex of order m at \(\overline{x}\) on X,

  3. (c)

    the equality constraints \(h_{j}\), \(j\in J^{+}\left( \overline{x} \right) \), are locally Lipschitz \(\left( F,\rho _{h_{j}}^{+}\right) \)-convex of order m at \(\overline{x}\) on X,

  4. (d)

    the functions \(-h_{j}\), \(j\in J^{-}\left( \overline{x}\right) \), are locally Lipschitz \(\left( F,\rho _{h_{j}}^{-}\right) \)-convex of order m at \(\overline{x}\) on X,

  5. (e)

    \(\rho _{f}+\sum _{i\in I\left( \overline{x}\right) }\overline{ \lambda }_{i}\rho _{g_{i}}+\sum _{j\in J^{+}\left( \overline{x}\right) } \overline{\mu }_{j}\rho _{h_{j}}^{+}-\sum _{j\in J^{-}\left( \overline{x} \right) }\overline{\mu }_{j}\rho _{h_{j}}^{-}>0\).

If the penalty parameter c is sufficiently large (it is sufficient to set \(c\ge \max \left\{ \overline{\lambda }_{i}\text {, }i\in I \text {, }\left| \overline{\mu }_{j}\right| \text {, }j\in J\right\} \), where \(\overline{\lambda }_{i}\), \(i=1,\ldots ,p,\) \(\overline{\mu }_{j}\), \( j=1,\ldots ,s\), are the Lagrange multipliers associated with the constraints \( g_{i}\) and \(h_{j}\), respectively), then \(\overline{x}\) is also a strict global minimizer of order m in its associated penalized optimization problem (P(c)) with the exact \(l_{1}\) penalty function.

Proof

By assumption, \(\overline{x}\) is a Karush–Kuhn–Tucker point of the constrained optimization problem (P). Then, there exist the Lagrange multipliers \(\overline{\lambda }\in R^{p}\) and \(\overline{\mu }\in R^{s}\) such that the Generalized Karush–Kuhn–Tucker conditions (2)–(4) are satisfied at \(\overline{x}\). Since assumptions (a)–(d) are fulfilled, by Definition 3, the following inequalities

$$\begin{aligned} f(x)-f(\overline{x})\ge F\left( x,\overline{x};\xi \right) +\rho _{f}\left\| x-\overline{x}\right\| ^{m}, \quad \forall \xi \in \partial f(\overline{x})\text {,} \end{aligned}$$
(7)
$$\begin{aligned} g_{i}(x)-g_{i}(\overline{x})\ge F\left( x,\overline{x};\zeta _{i}\right) +\rho _{g_{i}}\left\| x-\overline{x}\right\| ^{m}, \quad \forall \zeta _{i}\in \partial g_{i}(\overline{x}), \quad i\in I\left( \overline{x }\right) \text {,} \end{aligned}$$
(8)
$$\begin{aligned} h_{j}(x)-h_{j}(\overline{x})\ge F\left( x,\overline{x};\varsigma _{j}\right) +\rho _{h_{j}}^{+}\left\| x-\overline{x}\right\| ^{m}, \quad \forall \varsigma _{j}\in \partial h_{j}(\overline{x}), \quad j\in J^{+}\left( \overline{x}\right) \text {,} \end{aligned}$$
(9)
$$\begin{aligned} -h_{j}(x)+h_{j}(\overline{x})\ge F\left( x,\overline{x};-\varsigma _{j}\right) +\rho _{h_{j}}^{-}\left\| x-\overline{x}\right\| ^{m}, \quad \forall \varsigma _{j}\in \partial h_{j}(\overline{x}), \quad j\in J^{-}\left( \overline{x}\right) \end{aligned}$$
(10)

hold for all \(x\in X\). Multiplying the inequalities (8) and (9) by the corresponding Lagrange multipliers and the inequalities (10) by \(-\overline{\mu }_{j}\), \(j\in J^{-}\left( \overline{x}\right) \), and then summing the obtained inequalities over the respective index sets, we get, for all \(x\in X \),

$$\begin{aligned}&\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}g_{i}(x)-\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}g_{i}(\overline{x})\nonumber \\&\quad \ge \sum \nolimits _{i\in I\left( \overline{x}\right) } \overline{\lambda }_{i}F\left( x,\overline{x};\zeta _{i}\right) +\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}\rho _{g_{i}}\left\| x-\overline{x}\right\| ^{m}, \quad \forall \zeta _{i}\in \partial g_{i}(\overline{x})\text {,} \qquad \end{aligned}$$
(11)
$$\begin{aligned}&\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(x)-\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(\overline{x})\nonumber \\&\quad \ge \sum \nolimits _{j\in J^{+}\left( \overline{x}\right) } \overline{\mu }_{j}F\left( x,\overline{x};\varsigma _{j}\right) +\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}\rho _{h_{j}}^{+}\left\| x-\overline{x}\right\| ^{m}, \quad \forall \varsigma _{j}\in \partial h_{j}(\overline{x})\text {,} \qquad \quad \end{aligned}$$
(12)
$$\begin{aligned}&\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(x)-\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(\overline{x})\ge \sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\left( -\overline{\mu }_{j}\right) F\left( x,\overline{x};-\varsigma _{j}\right) \nonumber \\&\quad -\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}\rho _{h_{j}}^{-}\left\| x-\overline{x}\right\| ^{m}, \quad \forall \varsigma _{j}\in \partial h_{j}(\overline{x})\text {.} \end{aligned}$$
(13)

Since the functional \(F:X\times X\times R^{n}\rightarrow R\) is sublinear (with respect to the third component), by (11)–(13), we have, for all \(x\in X\), respectively,

$$\begin{aligned}&\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}g_{i}(x)-\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}g_{i}(\overline{x})\ge F\left( x,\overline{x};\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda }_{i}\zeta _{i}\right) \nonumber \\&\quad +\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}\rho _{g_{i}}\left\| x-\overline{x}\right\| ^{m}, \quad \forall \zeta _{i}\in \partial g_{i}(\overline{x})\text {,} \end{aligned}$$
(14)
$$\begin{aligned}&\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(x)-\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(\overline{x})\ge F\left( x,\overline{x};\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu }_{j}\varsigma _{j}\right) \nonumber \\&\quad +\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}\rho _{h_{j}}^{+}\left\| x-\overline{x}\right\| ^{m}, \quad \forall \varsigma _{j}\in \partial h_{j}(\overline{x})\text {,} \end{aligned}$$
(15)
$$\begin{aligned}&\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(x)-\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(\overline{x})\ge F\left( x,\overline{x};\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu }_{j}\varsigma _{j}\right) \nonumber \\&\quad -\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}\rho _{h_{j}}^{-}\left\| x-\overline{x}\right\| ^{m}, \quad \forall \varsigma _{j}\in \partial h_{j}(\overline{x})\text {.} \end{aligned}$$
(16)

Adding both sides of (7) and (14)–(16), we get

$$\begin{aligned}&f(x)-f(\overline{x})+\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{ \lambda }_{i}g_{i}(x)-\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{ \lambda }_{i}g_{i}(\overline{x})+\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) } \overline{\mu }_{j}h_{j}(x) \\&\quad - \sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu }_{j}h_{j}( \overline{x})+\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(x)-\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(\overline{x})\ge F\left( x,\overline{x};\xi \right) \\&\quad +\,F\left( x,\overline{x};\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{ \lambda }_{i}\zeta _{i}\right) +F\left( x,\overline{x};\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu }_{j}\varsigma _{j}\right) +F\left( x,\overline{x};\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) } \overline{\mu }_{j}\varsigma _{j}\right) \\&\quad +\left( \rho _{f}+\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}\rho _{g_{i}}+\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}\rho _{h_{j}}^{+}-\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{ \mu }_{j}\rho _{h_{j}}^{-}\right) \left\| x-\overline{x}\right\| ^{m} \text {.} \end{aligned}$$

Since the functional \(F:X\times X\times R^{n}\rightarrow R\) is sublinear (with respect to the third component), we have, for all \(x\in X\),

$$\begin{aligned}&f(x)-f(\overline{x})+\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{ \lambda }_{i}g_{i}(x)-\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{ \lambda }_{i}g_{i}(\overline{x})+\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) } \overline{\mu }_{j}h_{j}(x) \\&\qquad -\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu }_{j}h_{j}( \overline{x})+\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(x)-\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(\overline{x}) \\&\quad \ge F\left( x,\overline{x};\xi +\sum \nolimits _{i\in I\left( \overline{x}\right) } \overline{\lambda }_{i}\zeta _{i}+\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu }_{j}\varsigma _{j}+\sum \nolimits _{j\in J^{-}\left( \overline{x} \right) }\overline{\mu }_{j}\varsigma _{j}\right) \\&\qquad +\left( \rho _{f}+\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}\rho _{g_{i}}+\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}\rho _{h_{j}}^{+}-\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{ \mu }_{j}\rho _{h_{j}}^{-}\right) \left\| x-\overline{x}\right\| ^{m} \text {.} \end{aligned}$$

Hence, by the Generalized Karush–Kuhn–Tucker condition (2), it follows that the inequality

$$\begin{aligned}&f(x)-f(\overline{x})+\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{ \lambda }_{i}g_{i}(x)-\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{ \lambda }_{i}g_{i}(\overline{x})+\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) } \overline{\mu }_{j}h_{j}(x) \\&\quad -\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu }_{j}h_{j}( \overline{x})+\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(x)-\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(\overline{x})\ge F\left( x,\overline{x};0\right) \\&\quad +\left( \rho _{f}+\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}\rho _{g_{i}}+\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}\rho _{h_{j}}^{+}-\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{ \mu }_{j}\rho _{h_{j}}^{-}\right) \left\| x-\overline{x}\right\| ^{m} \end{aligned}$$

holds for all \(x\in X\). Thus, (1) gives

$$\begin{aligned}&f(x)-f(\overline{x})+\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{ \lambda }_{i}g_{i}(x)-\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{ \lambda }_{i}g_{i}(\overline{x})+\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) } \overline{\mu }_{j}h_{j}(x) \\&\qquad -\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu }_{j}h_{j}( \overline{x})+\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(x)-\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{\mu } _{j}h_{j}(\overline{x}) \\&\quad \ge \left( \rho _{f}+\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}\rho _{g_{i}}+\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}\rho _{h_{j}}^{+}-\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{ \mu }_{j}\rho _{h_{j}}^{-}\right) \left\| x-\overline{x}\right\| ^{m} \text {.} \end{aligned}$$

By the Generalized Karush–Kuhn–Tucker condition (3), and since the Lagrange multipliers \(\overline{\lambda }_{i}\) associated with the inactive inequality constraints (that is, with \(i\notin I\left( \overline{x}\right) \)) are equal to 0, the sums above can be extended to all of I and J. Hence, we have, for all \(x\in X\),

$$\begin{aligned}&f(x)+\sum \nolimits _{i=1}^{p}\overline{\lambda }_{i}g_{i}(x)+\sum \nolimits _{j=1}^{s}\overline{ \mu }_{j}h_{j}(x)\ge f(\overline{x})+\sum \nolimits _{j=1}^{s}\overline{\mu }_{j}h_{j}( \overline{x}) \\&\quad + \left( \rho _{f}+\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}\rho _{g_{i}}+\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}\rho _{h_{j}}^{+}-\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{ \mu }_{j}\rho _{h_{j}}^{-}\right) \left\| x-\overline{x}\right\| ^{m} \text {.} \end{aligned}$$

Using \(\overline{x}\in D\) together with (6) and the definition of the absolute value, we obtain that the inequality

$$\begin{aligned}&f(x)+\sum \nolimits _{i=1}^{p}\overline{\lambda }_{i}g_{i}^{+}(x)+\sum \nolimits _{j=1}^{s}\left| \overline{\mu }_{j}h_{j}(x)\right| \ge f(\overline{x} )+\sum \nolimits _{i=1}^{p}\overline{\lambda }_{i}g_{i}^{+}(\overline{x} )\\&\quad +\sum \nolimits _{j=1}^{s}\left| \overline{\mu }_{j}h_{j}(\overline{x})\right| +\left( \rho _{f}+\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}\rho _{g_{i}}+\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}\rho _{h_{j}}^{+}\right. \\&\quad \left. -\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{ \mu }_{j}\rho _{h_{j}}^{-}\right) \left\| x-\overline{x}\right\| ^{m} \end{aligned}$$

holds for all \(x\in X\). By assumption, the penalty parameter c satisfies the condition \(c\ge \max \left\{ \overline{\lambda }_{i}\text {, } i\in I\text {, }\left| \overline{\mu }_{j}\right| \text {, }j\in J\right\} \). Since \(\overline{x}\in D\), the inequality above gives, for all \( x\in X\),

$$\begin{aligned}&f(x)+c\left[ \sum \nolimits _{i=1}^{p}g_{i}^{+}(x)+\sum \nolimits _{j=1}^{s}\left| h_{j}(x)\right| \right] \\&\quad \ge f(\overline{x})+c\left[ \sum \nolimits _{i=1}^{p}g_{i}^{+}(\overline{x})+\sum \nolimits _{j=1}^{s}\left| h_{j}( \overline{x})\right| \right] \\&\qquad +\left( \rho _{f}+\sum \nolimits _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}\rho _{g_{i}}+\sum \nolimits _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}\rho _{h_{j}}^{+}-\sum \nolimits _{j\in J^{-}\left( \overline{x}\right) }\overline{ \mu }_{j}\rho _{h_{j}}^{-}\right) \left\| x-\overline{x}\right\| ^{m} \text {.} \end{aligned}$$

By definition of the objective function in the penalized optimization problem (P(c)) with the exact \(l_{1}\) penalty function, it follows that the inequality

$$\begin{aligned} P\left( x,c\right) \ge P\left( \overline{x},c\right) +\beta \left\| x- \overline{x}\right\| ^{m}\text {,} \end{aligned}$$
(17)

holds for all \(x\in X\), where

$$\begin{aligned} \beta =\rho _{f}+\sum _{i\in I\left( \overline{x}\right) }\overline{\lambda } _{i}\rho _{g_{i}}+\sum _{j\in J^{+}\left( \overline{x}\right) }\overline{\mu } _{j}\rho _{h_{j}}^{+}-\sum _{j\in J^{-}\left( \overline{x}\right) }\overline{ \mu }_{j}\rho _{h_{j}}^{-}\text {.} \end{aligned}$$
(18)

Since \(\beta >0\), by (17) and Definition 4, we conclude that \(\overline{x}\) is a strict global minimizer of order m in the penalized optimization problem (P(c)) with the absolute value penalty function. Thus, the conclusion of the theorem is established. \(\square \)

The result below follows directly from Theorem 8.

Corollary 9

Let \(\overline{x}\) be a strict global minimizer of order m for the considered constrained optimization problem (P) and let a suitable constraint qualification be satisfied at \(\overline{x}\). Furthermore, assume that all assumptions of Theorem 8 are fulfilled. Then \(\overline{x}\) is also a strict global minimizer of order m in the penalized optimization problem (P(c)) with the absolute value penalty function.

In the example below, we consider a nonconvex nonsmooth optimization problem with \(\left( F,\rho \right) \)-convex functions of order 3, for which assumption (e) of Theorem 8 is not satisfied. We show that a strict global minimizer of order 3 in the considered nonconvex nonsmooth optimization problem is not a strict global minimizer of order 3 in its penalized optimization problem (P(c)) with the absolute value penalty function.

Example 10

Consider the following nonsmooth constrained optimization problem

$$\begin{aligned}&f(x)=x^{3}+\frac{1}{2}\left| x\right| \rightarrow \min \\&g(x)=-x\le 0. \qquad \qquad \qquad \text {(P1)} \end{aligned}$$

Note that \(D=\left\{ x\in R:x\ge 0\right\} \) and \(\overline{x}=0\) is a strict global minimizer of order 3 in the considered nonsmooth optimization problem. It can be shown, by Definition 3, that the objective function f is locally Lipschitz \( \left( F,\rho _{f}\right) \)-convex of order 3 at \(\overline{x}\) on R and the constraint function g is locally Lipschitz \(\left( F,\rho _{g}\right) \)-convex of order 3 at \(\overline{x}\) on R, where the functional \( F:R\times R\times R\rightarrow R\) is defined by \(F\left( x,\overline{x};\alpha \right) =\alpha \left| x\right| \) and \(\rho _{f}=-1\), \(\rho _{g}=0\). Note that assumption (e) in Theorem 8 is not satisfied in this case. Nevertheless, we use the exact \(l_{1}\) penalty function method to solve the considered nonsmooth optimization problem (P1). Therefore, we construct the following unconstrained optimization problem

$$\begin{aligned} P(x,c)=x^{3}+\frac{1}{2}\left| x\right| +c\max \left\{ 0,-x\right\} \rightarrow \min . \quad \text {(P1(}c\text {))} \end{aligned}$$

Further, it is not difficult to see that, for any \(c>0\), \(\overline{x}=0\) is not a strict global minimizer of order 3 in the above unconstrained optimization problem generated in the exact \(l_{1}\) penalty method (more precisely, it is not a strict global minimizer of any order). This follows from the fact that the downward order of growth of f exceeds the upward order of growth of g at \(\overline{x}\) when moving from \(\overline{x}\) towards smaller values. Indeed, note that \(P(x,c)\rightarrow -\infty \) as \(x\rightarrow -\infty \) for any \(c>0\), and hence \(\inf _{x\in R}P(x,c)=-\infty \). As follows from this example, although the functions constituting the original optimization problem are locally Lipschitz \(\left( F,\rho \right) \)-convex of order 3 at \(\overline{x}\) on R, the feasible point \(\overline{x}=0\), being a strict global minimizer of order 3 in the given constrained optimization problem (P1), is not a strict global minimizer of order 3 in its associated penalized optimization problem generated in the exact \(l_{1}\) penalty function method. This is a consequence of the fact that assumption (e) in Theorem 8 is not satisfied in the considered case. Hence, as this example shows, assumption (e) in Theorem 8 is essential for the result of this theorem and it cannot be omitted.
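
This unboundedness is easy to confirm numerically (our sketch, using the data of (P1)):

```python
def P1(x, c):
    # Penalized objective for (P1): f(x) = x**3 + 0.5*|x|, g(x) = -x <= 0
    return x**3 + 0.5 * abs(x) + c * max(0.0, -x)

for x in [-1e1, -1e2, -1e3]:
    print(x, P1(x, c=100.0))  # the cubic term eventually dominates the linear penalty
```

For any fixed \(c>0\), the cubic term eventually dominates the linear penalty term \(-cx\), so \(P(x,c)\rightarrow -\infty \) as \(x\rightarrow -\infty \).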

Now, under some stronger assumptions, we prove the converse result.

Theorem 11

Let the point \(\overline{x}\) be a strict global minimizer of order m for the penalized optimization problem (P(c)) with the absolute value penalty function. Also, let \(\widetilde{x}\) be any Karush–Kuhn–Tucker point of the original mathematical programming problem (P) and the Generalized Karush–Kuhn–Tucker necessary optimality conditions be satisfied at \(\widetilde{x}\) with the Lagrange multipliers \(\widetilde{\lambda }_{i}\), \(i=1,\ldots ,p,\) \(\widetilde{\mu }_{j}\), \(j=1,\ldots ,s\), associated with the constraints \(g_{i}\) and \(h_{j}\), respectively. Let \(J^{+}\left( \widetilde{x}\right) =\left\{ j\in J:\widetilde{\mu }_{j}>0\right\} \), \(J^{-}\left( \widetilde{x}\right) =\left\{ j\in J:\widetilde{\mu }_{j}<0\right\} \) and \(J\left( \widetilde{x}\right) =J^{+}\left( \widetilde{x}\right) \cup J^{-}\left( \widetilde{x}\right) \). Furthermore, assume that:

  1. (a)

    the objective function f is locally Lipschitz \(\left( F,\rho _{f}\right) \)-convex of order m at \(\widetilde{x}\) on X,

  2. (b)

    the inequality constraints \(g_{i}\), \(i\in I\left( \widetilde{x} \right) \), are locally Lipschitz \(\left( F,\rho _{g_{i}}\right) \)-convex of order m at \(\widetilde{x}\) on X,

  3. (c)

    the equality constraints \(h_{j}\), \(j\in J^{+}\left( \widetilde{x} \right) \), are locally Lipschitz \(\left( F,\rho _{h_{j}}^{+}\right) \)-convex of order m at \(\widetilde{x}\) on X,

  4. (d)

    the functions \(-h_{j}\), \(j\in J^{-}\left( \widetilde{x}\right) \), are locally Lipschitz \(\left( F,\rho _{h_{j}}^{-}\right) \)-convex of order m at \(\widetilde{x}\) on X,

  5. (e)

    \(\rho _{f}+\sum _{i\in I\left( \widetilde{x}\right) }\widetilde{ \lambda }_{i}\rho _{g_{i}}+\sum _{j\in J^{+}\left( \widetilde{x}\right) } \widetilde{\mu }_{j}\rho _{h_{j}}^{+}-\sum _{j\in J^{-}\left( \widetilde{x} \right) }\widetilde{\mu }_{j}\rho _{h_{j}}^{-}>0\),

  6. (f)

    the set D of all feasible solutions in the constrained optimization problem (P) is compact.

If the penalty parameter c is sufficiently large (namely, it is sufficient that c satisfies the condition \(c>\max \left\{ \widetilde{\lambda }_{i} \text {, }i\in I\text {, }\left| \widetilde{\mu }_{j}\right| \text {, } j\in J\right\} \)), then \(\overline{x}\) is also a strict global minimizer of order m for problem (P).

Proof

Assume that \(\overline{x}\) is a strict global minimizer of order m for the penalized optimization problem (P(c)) with the absolute value penalty function. Then, by the definition of the penalized problem (P(c)) and Definition 4, there exists \(\beta >0\) such that the following inequality

$$\begin{aligned}&f(x)+c\left( \sum \nolimits _{i=1}^{p}g_{i}^{+}(x)+\sum \nolimits _{j=1}^{s}\left| h_{j}(x)\right| \right) \\&\quad \ge f(\overline{x})+c\left( \sum \nolimits _{i=1}^{p}g_{i}^{+}(\overline{x} )+\sum \nolimits _{j=1}^{s}\left| h_{j}(\overline{x})\right| \right) +\beta \left\| x-\overline{x}\right\| ^{m} \end{aligned}$$

holds for all \(x\in X\). Since, by (6) and the definition of the absolute value, \(g_{i}^{+}(\overline{x})\ge 0\) and \(\left| h_{j}(\overline{x})\right| \ge 0\), we get that the following inequality

$$\begin{aligned} f(x)+c\left( \sum _{i=1}^{p}g_{i}^{+}(x)+\sum _{j=1}^{s}\left| h_{j}(x)\right| \right) \ge f(\overline{x})+\beta \left\| x- \overline{x}\right\| ^{m} \end{aligned}$$

holds for all \(x\in X\). Therefore, it is also satisfied for all \(x\in D\). Hence, by (6), the inequality

$$\begin{aligned} f(x)\ge f(\overline{x})+\beta \left\| x-\overline{x}\right\| ^{m} \end{aligned}$$
(19)

holds for all \(x\in D\). The inequality above means that the objective function f is bounded below on the set D. Since f is continuous on the compact set D, by Weierstrass' theorem, f attains its minimum on D at some point \(\widetilde{x}\).

Now, we prove that \(\overline{x}\) is a strict global minimizer of order m in the constrained optimization problem (P). First, we show that \(\overline{x}\) is feasible in the given extremum problem (P). By means of contradiction, suppose that \(\overline{x}\) is not feasible in problem (P). As we have established above, the considered constrained optimization problem (P) has an optimal solution \(\widetilde{x}\). Since a suitable constraint qualification is satisfied at \(\widetilde{x}\), there exist Lagrange multipliers \(\widetilde{\lambda }\in R^{p}\) and \(\widetilde{\mu }\in R^{s}\) such that the Generalized Karush–Kuhn–Tucker necessary optimality conditions (2)–(4) are satisfied at \(\widetilde{x}\). By hypotheses (a)–(d) and Definition 3, the inequalities

$$\begin{aligned} f(\overline{x})-f(\widetilde{x})\ge F\left( \overline{x},\widetilde{x};\xi \right) +\rho _{f}\left\| \overline{x}-\widetilde{x}\right\| ^{m}, \quad \forall \xi \in \partial f(\widetilde{x})\text {,} \end{aligned}$$
(20)
$$\begin{aligned} g_{i}(\overline{x})-g_{i}(\widetilde{x})\ge F\left( \overline{x},\widetilde{ x};\zeta _{i}\right) +\rho _{g_{i}}\left\| \overline{x}-\widetilde{x} \right\| ^{m}, \quad \forall \zeta _{i}\in \partial g_{i}(\widetilde{x} )\text {, } \quad i\in I\left( \widetilde{x}\right) \text {,} \end{aligned}$$
(21)
$$\begin{aligned} h_{j}(\overline{x})-h_{j}(\widetilde{x})\ge F\left( \overline{x},\widetilde{ x};\varsigma _{j}\right) +\rho _{h_{j}}^{+}\left\| \overline{x}- \widetilde{x}\right\| ^{m}, \quad \forall \varsigma _{j}\in \partial h_{j}(\widetilde{x})\text {, } \quad j\in J^{+}\left( \widetilde{x}\right) \text {,} \ \end{aligned}$$
(22)
$$\begin{aligned} -h_{j}(\overline{x})+h_{j}(\widetilde{x})\ge F\left( \overline{x}, \widetilde{x};-\varsigma _{j}\right) +\rho _{h_{j}}^{-}\left\| \overline{x }-\widetilde{x}\right\| ^{m}, \quad \forall \varsigma _{j}\in \partial h_{j}(\widetilde{x})\text {, } \quad j\in J^{-}\left( \widetilde{x}\right) \end{aligned}$$
(23)

hold. Multiplying inequalities (21) and (22) by the corresponding Lagrange multipliers and inequalities (23) by \(-\widetilde{\mu }_{j}\), \(j\in J^{-}\left( \widetilde{x}\right) \), then adding the obtained inequalities to (20) and using the sublinearity of the functional F (with respect to the third component), it follows that

$$\begin{aligned}&f(\overline{x})-f(\widetilde{x})+\sum \nolimits _{i\in I\left( \widetilde{x}\right) } \widetilde{\lambda }_{i}g_{i}(\overline{x})-\sum \nolimits _{i\in I\left( \widetilde{x} \right) }\widetilde{\lambda }_{i}g_{i}(\widetilde{x})+\sum \nolimits _{j\in J^{+}\left( \widetilde{x}\right) }\widetilde{\mu }_{j}h_{j}(\overline{x}) \\&\qquad - \sum \nolimits _{j\in J^{+}\left( \widetilde{x}\right) }\widetilde{\mu }_{j}h_{j}( \widetilde{x})+\sum \nolimits _{j\in J^{-}\left( \widetilde{x}\right) }\widetilde{\mu } _{j}h_{j}(\overline{x})-\sum \nolimits _{j\in J^{-}\left( \widetilde{x}\right) } \widetilde{\mu }_{j}h_{j}(\widetilde{x}) \\&\quad \ge F\left( \overline{x},\widetilde{x};\xi +\sum \nolimits _{i\in I\left( \widetilde{x} \right) }\widetilde{\lambda }_{i}\zeta _{i}+\sum \nolimits _{j\in J\left( \widetilde{x} \right) }\widetilde{\mu }_{j}\varsigma _{j}\right) \\&\qquad + \left( \rho _{f}+\sum \nolimits _{i\in I\left( \widetilde{x}\right) }\widetilde{\lambda }_{i}\rho _{g_{i}}+\sum \nolimits _{j\in J^{+}\left( \widetilde{x}\right) }\widetilde{ \mu }_{j}\rho _{h_{j}}^{+}-\sum \nolimits _{j\in J^{-}\left( \widetilde{x}\right) } \widetilde{\mu }_{j}\rho _{h_{j}}^{-}\right) \left\| \overline{x}- \widetilde{x}\right\| ^{m}. \end{aligned}$$

Since the Generalized Karush–Kuhn–Tucker necessary optimality condition (2) is satisfied at \(\widetilde{x}\), by (1), the above inequality implies

$$\begin{aligned}&f(\overline{x})-f(\widetilde{x})+\sum \nolimits _{i\in I\left( \widetilde{x}\right) } \widetilde{\lambda }_{i}g_{i}(\overline{x})-\sum \nolimits _{i\in I\left( \widetilde{x} \right) }\widetilde{\lambda }_{i}g_{i}(\widetilde{x})+\sum \nolimits _{j\in J^{+}\left( \widetilde{x}\right) }\widetilde{\mu }_{j}h_{j}(\overline{x}) \nonumber \\&\qquad - \sum \nolimits _{j\in J^{+}\left( \widetilde{x}\right) }\widetilde{\mu }_{j}h_{j}( \widetilde{x})+\sum \nolimits _{j\in J^{-}\left( \widetilde{x}\right) }\widetilde{\mu } _{j}h_{j}(\overline{x})-\sum \nolimits _{j\in J^{-}\left( \widetilde{x}\right) } \widetilde{\mu }_{j}h_{j}(\widetilde{x}) \nonumber \\&\quad \ge \left( \rho _{f}+\sum \nolimits _{i\in I\left( \widetilde{x}\right) }\widetilde{\lambda }_{i}\rho _{g_{i}}+\sum \nolimits _{j\in J^{+}\left( \widetilde{x}\right) }\widetilde{ \mu }_{j}\rho _{h_{j}}^{+}-\sum \nolimits _{j\in J^{-}\left( \widetilde{x}\right) } \widetilde{\mu }_{j}\rho _{h_{j}}^{-}\right) \left\| \overline{x}- \widetilde{x}\right\| ^{m}\text {.} \nonumber \\ \end{aligned}$$
(24)

Since the Generalized Karush–Kuhn–Tucker necessary optimality condition (3) is satisfied at \(\widetilde{x}\), we have

$$\begin{aligned} \sum _{i=1}^{p}\widetilde{\lambda }_{i}g_{i}(\widetilde{x})=0\text {.} \end{aligned}$$
(25)

Using the feasibility of \(\widetilde{x}\) in the original optimization problem (P) together with (6) and (25), we obtain

$$\begin{aligned} \sum _{i=1}^{p}\widetilde{\lambda }_{i}g_{i}(\widetilde{x})=\sum _{i=1}^{p} \widetilde{\lambda }_{i}g_{i}^{+}(\widetilde{x})\text {, } \quad \sum _{j=1}^{s} \widetilde{\mu }_{j}h_{j}(\widetilde{x})=\sum _{j=1}^{s}\left| \widetilde{ \mu }_{j}h_{j}(\widetilde{x})\right| \text {.} \end{aligned}$$
(26)

Further, by (6) and the definition of the absolute value, we have

$$\begin{aligned} \sum _{i=1}^{p}\widetilde{\lambda }_{i}g_{i}^{+}(\overline{x})\ge \sum _{i=1}^{p}\widetilde{\lambda }_{i}g_{i}(\overline{x})\text {,} \quad \sum _{j=1}^{s}\left| \widetilde{\mu }_{j}\right| \left| h_{j}(\overline{x})\right| \ge \sum _{j=1}^{s}\widetilde{\mu }_{j}h_{j}(\overline{x})\text {.} \end{aligned}$$
(27)

Combining (24)–(27), we get

$$\begin{aligned}&f(\overline{x})+\sum \nolimits _{i\in I\left( \widetilde{x}\right) }\widetilde{\lambda } _{i}g_{i}^{+}(\overline{x})+\sum \nolimits _{j\in J\left( \widetilde{x}\right) }\left| \widetilde{\mu }_{j}\right| \left| h_{j}(\overline{x} )\right| \nonumber \\&\quad \ge f(\widetilde{x})+\sum \nolimits _{i\in I\left( \widetilde{x}\right) }\widetilde{\lambda }_{i}g_{i}^{+}(\widetilde{x})+\sum \nolimits _{j\in J\left( \widetilde{x}\right) }\left| \widetilde{\mu }_{j}\right| \left| h_{j}(\widetilde{x} )\right| \nonumber \\&\qquad +\left( \rho _{f}+\sum \nolimits _{i\in I\left( \widetilde{x}\right) }\widetilde{\lambda }_{i}\rho _{g_{i}}+\sum \nolimits _{j\in J^{+}\left( \widetilde{x}\right) }\widetilde{ \mu }_{j}\rho _{h_{j}}^{+}-\sum \nolimits _{j\in J^{-}\left( \widetilde{x}\right) } \widetilde{\mu }_{j}\rho _{h_{j}}^{-}\right) \left\| \overline{x}- \widetilde{x}\right\| ^{m}\text {.}\nonumber \\ \end{aligned}$$
(28)

Since \(c>\max \left\{ \widetilde{\lambda }_{i}\text {, }i\in I\text {, } \left| \widetilde{\mu }_{j}\right| \text {, }j\in J\right\} \), and since \(\overline{x}\) is not feasible in problem (P), at least one of the terms \(g_{i}^{+}(\overline{x})\), \(\left| h_{j}(\overline{x})\right| \) is positive; hence, using also \(\widetilde{x}\in D\), we obtain

$$\begin{aligned}&f(\overline{x})+c\left( \sum \nolimits _{i\in I\left( \widetilde{x}\right) }g_{i}^{+}( \overline{x})+\sum \nolimits _{j\in J\left( \widetilde{x}\right) }\left| h_{j}( \overline{x})\right| \right) \nonumber \\&\quad > f(\widetilde{x})+c\left( \sum \nolimits _{i\in I\left( \widetilde{x}\right) }g_{i}^{+}( \widetilde{x})+\sum \nolimits _{j\in J\left( \widetilde{x}\right) }\left| h_{j}( \widetilde{x})\right| \right) \nonumber \\&\quad +\left( \rho _{f}+\sum \nolimits _{i\in I\left( \widetilde{x}\right) }\widetilde{\lambda }_{i}\rho _{g_{i}}+\sum \nolimits _{j\in J^{+}\left( \widetilde{x}\right) }\widetilde{ \mu }_{j}\rho _{h_{j}}^{+}-\sum \nolimits _{j\in J^{-}\left( \widetilde{x}\right) } \widetilde{\mu }_{j}\rho _{h_{j}}^{-}\right) \left\| \overline{x}- \widetilde{x}\right\| ^{m}\text {.}\nonumber \\ \end{aligned}$$
(29)

We denote \(\widetilde{\beta }=\rho _{f}+\sum _{i\in I\left( \widetilde{x} \right) }\widetilde{\lambda }_{i}\rho _{g_{i}}+\sum _{j\in J^{+}\left( \widetilde{x}\right) }\widetilde{\mu }_{j}\rho _{h_{j}}^{+}-\sum _{j\in J^{-}\left( \widetilde{x}\right) }\widetilde{\mu }_{j}\rho _{h_{j}}^{-}\). By assumption (e), it follows that \(\widetilde{\beta }>0\). Thus, by the definition of the objective function in problem (P(c)), the following inequality

$$\begin{aligned} P\left( \overline{x},c\right) >P\left( \widetilde{x},c\right) +\widetilde{ \beta }\left\| \overline{x}-\widetilde{x}\right\| ^{m} \end{aligned}$$
(30)

holds. Hence,

$$\begin{aligned} P\left( \overline{x},c\right) -\widetilde{\beta }\left\| \overline{x}- \widetilde{x}\right\| ^{m}>P\left( \widetilde{x},c\right) . \end{aligned}$$

Since \(\widetilde{\beta }>0\) and \(\overline{x}\ne \widetilde{x}\) (indeed, \(\overline{x}\) is infeasible, whereas \(\widetilde{x}\in D\)), it follows that the inequality

$$\begin{aligned} P\left( \overline{x},c\right) +\beta \left\| \overline{x}-\widetilde{x} \right\| ^{m}>P\left( \widetilde{x},c\right) \end{aligned}$$

holds for every \(\beta >0\). But the inequality above contradicts the assumption that \(\overline{x}\) is a strict global minimizer of order m in the penalized optimization problem (P(c)) with the exact absolute value penalty function, which requires that \(P\left( \widetilde{x},c\right) \ge P\left( \overline{x},c\right) +\beta \left\| \widetilde{x}-\overline{x}\right\| ^{m}\) for some \(\beta >0\). Thus, we have established that \(\overline{x}\) is feasible in the constrained optimization problem (P). Then the conclusion of the theorem, that is, the result that \(\overline{x}\) is a strict global minimizer of order m in the given constrained optimization problem (P), follows directly from (19). \(\square \)

The following result follows directly from Corollary 9 and Theorem 11:

Corollary 12

Let all hypotheses of Corollary 9 and of Theorem 11 be fulfilled. Then, the set of strict global minimizers of order m for the given constrained optimization problem (P) and the set of strict global minimizers of order m for its associated penalized optimization problem (P(c)) with the exact absolute value penalty function coincide.

Now, we illustrate the results established in the paper with the help of nonconvex nonsmooth optimization problems involving locally Lipschitz \(\left( F,\rho \right) \)-convex functions of order m.

Example 13

Consider the following nonsmooth constrained optimization problem

$$\begin{aligned} \begin{array}{l} f(x)=\arctan \left( x^{2}\right) +\left| x\right| +1\rightarrow \min \\ g(x)=\ln \left( x^{2}-\left| x\right| +1\right) \le 0. \end{array} \quad \text { (P2)} \end{aligned}$$

Note that \(D=\left\{ x\in R:-1\le x\le 1\right\} \) and it is not difficult to show by Definition 4 that \(\overline{x}=0\) is a strict global minimizer of order 1 for the considered nonsmooth optimization problem (P2). Therefore, the Generalized Karush–Kuhn–Tucker necessary optimality conditions (2)–(4) are fulfilled at this point with the Lagrange multiplier \(\overline{\lambda }\) satisfying the following condition: \(0\in \partial f\left( \overline{x}\right) +\overline{\lambda }\partial g\left( \overline{x}\right) \), where \(\partial f\left( \overline{x}\right) =\left[ -1,1\right] \) and \(\partial g\left( \overline{x}\right) =\left[ -1,1\right] \). If we set \(F\left( x,\overline{x};\vartheta \right) =\frac{1}{8}\left| x-\overline{x}\right| \vartheta \), \(\rho _{f}=\frac{7}{8}\), \(\rho _{g}=-\frac{9}{8}\), then, by Definition 3, the objective function f is \(\left( F,\rho _{f}\right) \)-convex of order 1 at \(\overline{x}\) on R and the constraint function g is locally Lipschitz \(\left( F,\rho _{g}\right) \)-convex of order 1 at \( \overline{x}\) on R, and, moreover, the condition \(\rho _{f}+\overline{\lambda }\rho _{g}>0\) is satisfied for \(\overline{\lambda }\) sufficiently small (in particular, for \(\overline{\lambda }=0\)). Since we solve problem (P2) by using the exact \(l_{1}\) penalty function method, we construct the following unconstrained optimization problem

$$\begin{aligned} P(x,c)=\arctan \left( x^{2}\right) +\left| x\right| +1+c\max \left\{ 0,\ln \left( x^{2}-\left| x\right| +1\right) \right\} . \qquad \text {(P2(}c\text {))} \end{aligned}$$

Then, by Theorem 8, it follows that, for any penalty parameter c satisfying \(c>\overline{\lambda }\), the point \( \overline{x}=0\) is also a strict global minimizer of order 1 for the penalized optimization problem (P2(c)) with the absolute value penalty function given above. Furthermore, since the objective function f is \( \left( F,\rho _{f}\right) \)-convex of order 1 and the constraint function g is locally Lipschitz \(\left( F,\rho _{g}\right) \)-convex of order 1 at any Karush–Kuhn–Tucker point of problem (P2) on R, by Theorem 11, \(\overline{x}=0\), being a strict global minimizer of order 1 for the penalized optimization problem (P2(c)) for all penalty parameters \(c\ge 0\), is also a strict global minimizer of order 1 for the considered optimization problem (P2). Thus, in fact, the sets of strict global minimizers of order 1 for the optimization problems (P2) and (P2(c)) coincide.
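
This equivalence can be checked numerically (our sketch, using the data of (P2)):

```python
import numpy as np
from scipy.optimize import minimize_scalar

def P2(x, c):
    # Penalized objective for (P2): f(x) = arctan(x**2) + |x| + 1,
    # constraint g(x) = ln(x**2 - |x| + 1) <= 0
    g = np.log(x**2 - abs(x) + 1.0)
    return np.arctan(x**2) + abs(x) + 1.0 + c * max(0.0, g)

res = minimize_scalar(lambda x: P2(x, c=1.0), bounds=(-5.0, 5.0), method="bounded")
print(res.x, res.fun)  # approximately x = 0 with P = 1, as predicted by Theorem 8
```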

In the next example, we consider a nonconvex nonsmooth optimization problem in which not all of the involved functions are \(\left( F,\rho \right) \)-convex functions of the same order with respect to a common sublinear functional \( F:R\times R\times R\rightarrow R\). We show that, in this case, there is no equivalence between the considered nonsmooth optimization problem and its associated penalized optimization problem with the absolute value penalty function in the sense discussed in the paper. In other words, the sets of strict global minimizers of the same order are not the same in these two optimization problems.

Example 14

Consider the following nonsmooth constrained optimization problem

$$\begin{aligned} \begin{array}{l} f(x)=x^{3}+\left| x\right| \rightarrow \min \\ g(x)=x^{2}+x\le 0\text {.} \end{array} \qquad \text {(P3)} \end{aligned}$$

Note that \(D=\left\{ x\in R:-1\le x\le 0\right\} \) and \(\overline{x}=0\) is a strict global minimizer of order 3 for the considered optimization problem. It can be proved that the objective function f is not \(\left( F,\rho _{f}\right) \)-convex of order m on R with respect to any sublinear (with respect to the third component) functional \(F:R\times R\times R\rightarrow R\) (see [28]). In other words, there do not exist real numbers \(\rho _{f}\), \(\rho _{g}\) (with \(\rho _f > 0\) or \(\rho _g > 0\)) and the same sublinear functional \(F:R\times R\times R\rightarrow R\) with respect to which the objective function f is \(\left( F,\rho _{f}\right) \)-convex on R and the constraint function g is \(\left( F,\rho _{g}\right) \)-convex on R. Since we use the exact \(l_{1}\) penalty method to solve the considered optimization problem (P3), we construct the following unconstrained optimization problem:

$$\begin{aligned} P(x,c)=x^{3}+\left| x\right| +c\max \left\{ 0,x^{2}+x\right\} \rightarrow \min . \qquad \text {(P3(}c\text {))} \end{aligned}$$

Then note that \(\overline{x}=0\) is not a strict global minimizer of order 3 for its associated penalized optimization problem (P3(c)) with the exact \( l_{1}\) penalty function for any penalty parameter c, in particular, for any penalty parameter c satisfying the condition \(c>\overline{\lambda }_{1}\). Indeed, it is not difficult to see that the downward order of growth of f exceeds the upward order of growth of g at \(\overline{x}\) when moving from \( \overline{x}\) towards smaller values. In fact, \(P(x,c)\rightarrow -\infty \) as \(x\rightarrow -\infty \) for any \(c>0\), and hence \(\inf _{x\in R}P(x,c)=-\infty \). This follows from the fact that the functions constituting the constrained optimization problem (P3) are not locally Lipschitz \(\left( F,\rho \right) \)-convex of order m with respect to the same (sublinear with respect to the third component) functional \(F:R\times R\times R\rightarrow R\).
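
As in Example 10, this can be confirmed numerically (our sketch, using the data of (P3)):

```python
def P3(x, c):
    # Penalized objective for (P3): f(x) = x**3 + |x|, g(x) = x**2 + x <= 0
    return x**3 + abs(x) + c * max(0.0, x**2 + x)

for x in [-1e2, -1e3, -1e4]:
    print(x, P3(x, c=10.0))  # x**3 dominates c*x**2, so P(x, c) -> -infinity
```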

Now, we give an example of a nonsmooth optimization problem in which the objective function is coercive [23], but is not \(\left( F,\rho _{f}\right) \)-convex of order m on R with respect to any sublinear (with respect to the third component) functional \(F:R\times R\times R\rightarrow R\) (see [28]). We show that also in such a case there is no equivalence between the considered nonsmooth optimization problem and its associated penalized optimization problem with the absolute value penalty function in the sense discussed in the paper. In other words, the coercivity of the objective function is not sufficient to ensure that the sets of strict global minimizers of the same order are the same in these two optimization problems.

Example 15

Consider the following nonsmooth constrained optimization problem

$$\begin{aligned} \begin{array}{l} f(x)=2\left| x+1\right| -2\left| x\right| +\left| x-1\right| \rightarrow \min \\ g(x)=-x\le 0\text {.} \end{array} \qquad \text {(P4)} \end{aligned}$$

Note that \(D=\left\{ x\in R:x\ge 0\right\} \) and it can be shown, by Definition 4, that \(\overline{x}=1\) is a strict global minimizer of order 1 in the considered nonsmooth optimization problem (P4). Further, note that the objective function f is coercive (see [23]), but it is not \(\left( F,\rho _{f}\right) \)-convex of order 1 on R with respect to any sublinear (with respect to the third component) functional \(F:R\times R\times R\rightarrow R\) (see [28]). Since we use the exact \(l_{1}\) penalty method to solve the considered optimization problem (P4), we construct the following unconstrained optimization problem:

$$\begin{aligned} P(x,c)=2\left| x+1\right| -2\left| x\right| +\left| x-1\right| +c\max \left\{ 0,-x\right\} \rightarrow \min . \qquad \text {(P4(}c\text {))} \end{aligned}$$

Then note that \(\overline{x}=1\) is not a strict global minimizer of order 1 in its associated penalized optimization problem (P4(c)) with the exact \( l_{1}\) penalty function for every penalty parameter c satisfying the condition \(c>\overline{\lambda }_{1}=1\). Indeed, it can be shown, by Definition 4, that \(\overline{x}=-1\) is a strict global minimizer of order 1 in (P4(c)) for penalty parameters \(c\in [1,2)\). In other words, the threshold of penalty parameters above which there is equivalence between the sets of strict global minimizers of order 1 in problems (P4) and (P4(c)) is not equal to the largest absolute value of the Lagrange multipliers. This follows from the fact that the objective function in the constrained optimization problem (P4) is not locally Lipschitz \(\left( F,\rho \right) \)-convex of order 1 with respect to any functional \(F:R\times R\times R\rightarrow R\) sublinear with respect to the third component.
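
The failure of the usual threshold can be observed numerically (our sketch, using the data of (P4); a simple grid search is used because the penalized objective has several local minimizers):

```python
import numpy as np

def P4(x, c):
    # Penalized objective for (P4): f(x) = 2|x+1| - 2|x| + |x-1|, g(x) = -x <= 0
    return 2*abs(x + 1) - 2*abs(x) + abs(x - 1) + c * max(0.0, -x)

xs = np.linspace(-5.0, 5.0, 100001)
for c in [1.5, 3.0]:
    x_min = xs[int(np.argmin([P4(x, c) for x in xs]))]
    print(c, x_min)  # c = 1.5: minimizer -1.0 (infeasible); c = 3.0: minimizer 1.0
```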