On Local Convexity of Quadratic Transformations
- 534 Downloads
Abstract
In this paper, we improve Polyak’s local convexity result for quadratic transformations. Extension and open problems are also presented.
Keywords
Convexity Quadratic transformation Joint numerical range1 Introduction
Though results on the convexity of complex quadratic functions were already there since 1918, see Toeplitz [19] and Hausdorff [8], the first such result for real case is due to Dines [4] in 1941. It states that if \(f_1, f_2\) are homogeneous quadratic functions then the set \(F_2\) is convex. In 1971, Yakubovich [23, 24] used this basic result to prove the famous S-lemma, see [17] for a survey. Brickman [3] proved in 1961 that if \(f_1, f_2\) are homogeneous quadratic functions and \(n\geqslant 3\) then the set \(\{(f_1(x), f_2(x)) : x\in {\mathbb{R}}^n, \Vert x\Vert = 1\} \subseteq {\mathbb{R}}^2\) is convex. Fradkov [5] proved in 1973 that if matrices \(A_1, \cdots , A_m\) commute and \(f_1,\cdots ,f_m\) are homogeneous, then \(F_m\) is convex. In 1995, it was showed by Ramana and Goldman [18] that the identification of the convexity of \(F_m\) is NP-hard. In the same paper, the quadratic maps, under which the image of every linear subspace is convex, was also investigated. Based on Brickman’s result, Polyak [14] proved in 1998 that if \(n\geqslant 3\) and \(f_1, f_2, f_3\) are homogeneous quadratic functions such that \(\mu _1A_1+\mu _2A_2+\mu _2A_3\succ 0\) (where notation \(A\succ 0\) means that \(A\) is positive definite) for some \(\mu \in {\mathbb{R}}^3\), then the set \(F_3\) is convex. Moreover, as shown in the same paper, when \(n\geqslant 2\) and there exists \(\mu \in {\mathbb{R}}^2\) such that \(\mu _1A_1+\mu _2A_2\succ 0\), the set \(F_2\) is convex. In 2007, Beck [1] showed that if \(m\leqslant n, A_1\succ 0\) and \(A_2=\cdots =A_m=0\), then \(F_m\) is convex. However, if \(A_1\succ 0, A_2=\cdots =A_{n+1}=0\) and \(a_2,\cdots ,a_{n+1}\) are linearly independent, then \(F_{n+1}\) is not convex. When \(m=2\), Beck’s result reduces to be a corollary of Polyak’s result. Very recently, Xia et al. [22] used the new developed S-lemma with equality to establish the necessary and sufficient condition for the convexity of \(F_2\) for \(A_2=0\) and arbitrary \(A_1\).
More generally, Polyak [15, 16] succeeded in proving a nonlinear image of a small ball in a Hilbert space is convex, provided that not only the derivative of the map is Lipschitz continuous on the ball, but also the derivative at the center of the ball is surjective and the norm of its adjoint mapping is bounded from zero. Later, Uderzo [21] extended the result to a certain subclass of uniformly convex Banach spaces. When focusing on quadratic transformations, Polyak’s result reads as follows:
Theorem 1.1
In this paper, we improve the above Polyak’s result for quadratic transformations (i.e., Theorem 1.1) by strengthening the constant \(L\). Then, Theorem 1.1 is extended to the image of the ball of the same radius \(\varepsilon \) centered at any point \(a\) satisfying \(\Vert a\Vert <2(\varepsilon ^*-\varepsilon )\). Furthermore, we propose two new approaches for possible improvement of \(L\).
The paper is organized as follows. In Sect. 1, we improve and extend Theorem 1.1. In Sect. 2, we discuss further possible improvements. In the final conclusion section, we propose two open questions.
Throughout the paper, all vectors are column vectors. Let \(v(\cdot )\) denotes the optimal value of problem \((\cdot )\). Notation \(A\succeq 0\) implies that the matrix \(A\) is positive semidefinite. \({\rm {vec}(A)}\) denotes the vector obtained by stacking the columns of \(A\) one underneath the other. The trace of \(A\) is denoted by trace\((A)=\sum _{i=1}^nA_{ii}\). \(|A|\) is the matrix having entries \(|A_{ij}|\). The Kronecker product and the inner product of the matrices \(A\) and \(B\) are denoted by \(A\otimes B\) and \(A\bullet B={\rm trace}(AB^{\rm T})=\sum _{i,j=1}^na_{ij}b_{ij}\), respectively. The identity matrix is denoted by \(I\). \(\Vert x\Vert =\sqrt{x^{\rm T}x}\) is the \(\ell _2\)-norm of the vector \(x\).
2 Main Results
In this section, we first improve Theorem 1.1 and then extend it to the ball of the same radius centered at any point close enough to the zero point.
Theorem 2.1
Proof
Theorem 2.2
Proof
Remark 2.3
Theorem 2.1 is a special case of Theorem 2.2 by setting \(a=0\).
3 Further Possible Improvements
Except for the upper bound \(L_{\rm new}\) (2.1), we further consider the other two relaxations of (3.1). We first need two lemmas.
Lemma 3.1
Lemma 3.2
Remark 3.3
Now, we apply Lemmas 3.1 and 3.2 to establish two new relaxations of \(L_f\) (3.1).
Example 3.4
Figure 1 shows the images of the \(\varepsilon \)-disks for \((E_1)\) and \((E_2)\), respectively. It follows that \(\widetilde{L}_{\rm new}\) is not tight and the convexity loses when \(\varepsilon \) is large enough.
Images of \(\varepsilon \)-disks for \((E_1)\) with \(\varepsilon =1/(2\widetilde{L}_{\rm new})\approx 0.039\,4,\,0.06,\,0.14\) in the left subgraph and for \((E_2)\) with \(\varepsilon =1/(2\widetilde{L}_{\rm new})\approx 0.036\,2,\,0.04,\,0.08\) in the right subgraph
4 Concluding Remarks
Notes
Acknowledgments
The author is grateful to the anonymous referee whose comments improved this paper.
References
- [1]Beck, A.: On the convexity of a class of quadratic mappings and its application to the problem of finding the smallest ball enclosing a given intersection of balls. J. Glob. Optim. 39(1), 113–126 (2007)CrossRefMATHGoogle Scholar
- [2]Bhatia, R.: Matrix Analysis. Springer-Verlag, New York (1997)CrossRefGoogle Scholar
- [3]Brickman, L.: On the field of values of a matrix. Proc. AMS 12, 61–66 (1961)MathSciNetCrossRefMATHGoogle Scholar
- [4]Dines, L.L.: On the mapping of quadratic forms. Bull. AMS 47, 494–498 (1941)MathSciNetCrossRefGoogle Scholar
- [5]Fradkov, A.L.: Duality theorems for certain nonconvex extremum problems. Sib. Math. J. 14, 247–264 (1973)MathSciNetCrossRefMATHGoogle Scholar
- [6]M. Grant, S. Boyd, CVX: Matlab software for disciplined convex programming, version 1.21 (2010). http://cvxr.com/cvx
- [7]Gu, Y.: The distribution of eigenvalues of a matrix. Acta Math. Appl. Sin. 17(4), 501–511 (1994)Google Scholar
- [8]Hausdorff, F.: Der Wervorrat einer Bilinearform. Mathematische Zeitschrift 3, 314–316 (1919)MathSciNetCrossRefMATHGoogle Scholar
- [9]He, S., Li, Z., Zhang, S.: Approximation algorithms for homogeneous polynomial optimization with quadratic constraints. Math. Program. Ser. B 125, 353–383 (2010)MathSciNetCrossRefMATHGoogle Scholar
- [10]Horn, R.A., Johnson, C.R.: Matrix Analysis. Cambridge University Press, Cambridge (1985)CrossRefMATHGoogle Scholar
- [11]Horn, R.A., Johnson, C.R.: Topics in Matrix Analysis. Cambridge University Press, Cambridge (1991)CrossRefMATHGoogle Scholar
- [12]Lasserre, J.B.: Global optimization with polynomials and the problem of moments. SIAM J. Optim. 11, 796–817 (2001)MathSciNetCrossRefMATHGoogle Scholar
- [13]Lim, L.-H.: Singular values and eigenvalues of tensors: a variational approach. In: Proceedings of the IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, vol. 1, pp. 129–132 (2005)Google Scholar
- [14]Polyak, B.T.: Convexity of quadratic transformations and its use in control and optimization. J. Optim. Theory Appl. 99, 553–583 (1998)MathSciNetCrossRefMATHGoogle Scholar
- [15]Polyak, B.T.: Convexity of nonlinear image of a small ball with applications to optimization. Set-Valued Anal. 9, 159–168 (2001)MathSciNetCrossRefMATHGoogle Scholar
- [16]Polyak, B.T.: The convexity principle and its applications. Bull. Br az. Math.Soc. (N.S.) 34(1), 59–75 (2003)MathSciNetCrossRefMATHGoogle Scholar
- [17]Pólik, I., Terlaky, T.: A servey of S-lemma. SIAM Rev. 49(3), 371–418 (2007)MathSciNetCrossRefMATHGoogle Scholar
- [18]Ramana, M., Goldman, A.J.: Quadratic maps with convex images, Report 36-94, Rutgers Center for Operations Research, Rutgers, The State University of New Jersey (1994)Google Scholar
- [19]Toeplitz, O.: Das algebraische Analogen zu einem Satz von Fejer. Mathematische Zeitschrift 2, 187–197 (1918)MathSciNetCrossRefMATHGoogle Scholar
- [20]Toh, K.C., Todd, M.J., Tutuncu, R.H.: SDPT3—a Matlab software package for semidefinite programming. Optim. Methods Softw. 11, 545–581 (1999)MathSciNetCrossRefGoogle Scholar
- [21]Uderzo, A.: On the Polyak convexity principle and its application to variational analysis. Nonlinear Anal. 91, 60–71 (2013)MathSciNetCrossRefMATHGoogle Scholar
- [22]Y. Xia, S. Wang, R.L. Sheu, S-Lemma with equality and its applications. arXiv:1403.2816v2 (2014) http://arxiv.org/abs/1403.2816
- [23]Yakubovich, V.A.: S-procedure in nonlinear control theory. Vestnik Leningrad. Univ. 1, 62–77 (1971). (in Russian)Google Scholar
- [24]Yakubovich, V.A.: S-procedure in nonlinear control theory, Vestnik Leningrad. Univ. 4, 73–93 (1977) (English translation)Google Scholar
