1 Introduction

Entanglement is a fundamental aspect of quantum mechanics. The set of separable states (i.e., nonentangled states) on the Hilbert space \(\mathbb {C}^{n} \otimes \mathbb {C}^{m}\) is defined as:

$$\begin{aligned} \text {Sep}(n,m) = {{\,\mathrm{\mathbf{conv}}\,}}\left\{ x x^{\dagger } \otimes yy^{\dagger } : x \in \mathbb {C}^n, |x| = 1, y \in \mathbb {C}^m, |y| = 1\right\} . \end{aligned}$$

Here \(x^{\dagger } = {\bar{x}}^{\mathsf {T}}\) indicates conjugate transpose, \(|x|^2 = x^{\dagger } x = \sum _{i=1}^n |x_i|^2\) and \({{\,\mathrm{\mathbf{conv}}\,}}\) denotes the convex hull. The set \(\text {Sep}(n,m)\) lives in the space \(\mathbf {H}^{nm}\) of Hermitian matrices of size \(nm \times nm\), and it is full-dimensional in the subspace of matrices of trace equal to one.

A fundamental computational task in quantum information is to decide membership in the convex set \(\text {Sep}(n,m)\). One of the first tests designed to check whether a state \(\rho \in \mathbf {H}^{nm}\) is separable is the Peres-Horodecki criterion [Per96, HHH96] (also known as the Positive Partial Transpose (PPT) criterion). It is based on the observation that for any \(\rho \in \text {Sep}(n,m)\), \((I\otimes \mathsf {T})(\rho )\) is positive semidefinite where I is the identity map, and \(\mathsf {T}\) the transpose map. Indeed one can easily verify that if \(\rho = xx^{\dagger } \otimes yy^{\dagger }\) then \((I \otimes \mathsf {T})(\rho ) = xx^{\dagger } \otimes (yy^{\dagger })^{\mathsf {T}} = xx^{\dagger } \otimes {\bar{y}} {\bar{y}}^{\dagger } \succeq 0\). In other words we have the inclusion \(\text {Sep}(n,m) \subseteq \text {PPT}(n,m)\) where

$$\begin{aligned} \text {PPT}(n,m) = \Bigl \{ \rho \in \mathbf {H}^{nm} : \rho \succeq 0, \;\; (I \otimes \mathsf {T})(\rho ) \succeq 0, \text { and } {{\,\mathrm{Tr}\,}}[\rho ]=1\Bigr \}. \end{aligned}$$
(1)

It is known, from earlier work of Woronowicz [Wor76], that we have equality \(\text {Sep}(n,m) = \text {PPT}(n,m)\) if, and only if \(n+m\le 5\). Thus, the smallest cases where \(\text {Sep}(n,m) \ne \text {PPT}(n,m)\) are \((n,m) = (4,2)\) and \((n,m) = (3,3)\).

Semidefinite programming The description of the set \(\text {PPT}(n,m)\) in Eq. (1) allows us to decide membership, and optimize linear functions on \(\text {PPT}(n,m)\), via semidefinite programming. Semidefinite programming is a fundamental tool in optimization that has played a crucial role in recent developments in quantum information theory. We say that a convex set C has a semidefinite representation (also called a semidefinite lift) of size r if it can be expressed as

$$\begin{aligned} C = \pi (S) \end{aligned}$$
(2)

where \(\pi :\mathbb {R}^D\rightarrow \mathbb {R}^d\) is a linear map and \(S \subset \mathbb {R}^D\) is a convex set defined using a linear matrix inequality

$$\begin{aligned} S = \{w \in \mathbb {R}^D : M_0 + w_1 M_1 + \dots + w_D M_D \succeq 0\} \end{aligned}$$
(3)

where \(M_0,\ldots ,M_D\) are Hermitian matrices of size \(r\times r\). A set S of the form (3) is known as a spectrahedron. In this paper we are most interested in when a semidefinite representation of finite size exists, and call this simply a semidefinite representation throughout. If a convex set C admits a semidefinite representation, then optimizing a linear function on C can be cast as a semidefinite program. Equation (1) gives a semidefinite representation of \(\text {PPT}(n,m)\).

Horodecki’s criterion The set of separable states has the following well-known description due to the Horodeckis [HHH96]:

$$\begin{aligned} \text {Sep}(n,m) = \left\{ \rho \in \mathbf {H}^{nm} : {{\,\mathrm{Tr}\,}}[\rho ] = 1 \text { and } (I \otimes \Phi )(\rho ) \succeq 0 \;\; \forall \Phi : \text {M}_m \rightarrow \text {M}_n \text { positive}\right\} . \end{aligned}$$
(4)

Here \(\text {M}_k = \mathbb {C}^{k\times k}\) and a \(\mathbb {C}\)-linear map \(\Phi : \text {M}_m\rightarrow \text {M}_n\) is positive if it is Hermitian preserving and if \(\Phi (X) \succeq 0\) for all \(X \succeq 0\). When \(n=m\), the relaxation \(\text {PPT}(n,m)\) corresponds to having only the identity and transpose maps in Eq. (4), which are both positive. A recent result of Skowronek [Sko16] shows that when \(n=m=3\), there is no finite family of positive maps \(\Phi _1,\ldots ,\Phi _k:\text {M}_3 \rightarrow \text {M}_3\) such that \(\text {Sep}(3,3) = \left\{ \rho \in \mathbf {H}^{9} : (I \otimes \Phi _i) (\rho ) \succeq 0 \; \forall i=1,\ldots ,k\right\} \). Note that the right-hand side of the previous equation is a specific semidefinite representable set. Thus Skowronek’s result rules out certain specific semidefinite representations for \(\text {Sep}(3,3)\).Footnote 1

DPS hierarchy In [DPS04], Doherty, Parrilo and Spedalieri proposed a complete hierarchy of approximations to the set of separable states based on semidefinite programming. The first level of the hierarchy coincides with the PPT test, and subsequent levels form tighter and tighter convex relaxations of the set of separable states. If we denote the convex relaxation at level k by \(\text {DPS}_k(n,m)\) we have (dropping the (nm)):

$$\begin{aligned} \text {Sep}\subseteq \dots \subseteq \text {DPS}_k \subseteq \text {DPS}_{k-1} \subseteq \dots \subseteq \text {DPS}_1 = \text {PPT}. \end{aligned}$$

The key property of the DPS hierarchy is that each set \(\text {DPS}_k\) has a semidefinite representation of size \(\min (n,m)^{O(k)}\). The hierarchy is known to be complete, meaning that if \(\rho \notin \text {Sep}\), then there exists a finite k such that \(\rho \notin \text {DPS}_k\). The integer k however depends on the state \(\rho \) and it is known that, unless \(n+m\le 5\), there is no finite k such that \(\text {DPS}_k(n,m) = \text {Sep}(n,m)\) [DPS04, Section VIII.B].

Contributions The main result of this paper is

Theorem 1

If \(\text {Sep}(n,m) \ne \text {PPT}(n,m)\) then \(\text {Sep}(n,m)\) has no (finite) semidefinite representation. In other words, \(\text {Sep}(n,m)\) has no semidefinite representation when \(n+m > 5\).

Remark 1

Some remarks concerning Theorem 1:

  • Note that Theorem 1 contains as a special case the fact that whenever \(\text {Sep}(n,m) \ne \text {PPT}(n,m)\), then there is no representation of \(\text {Sep}(n,m)\) as \(\{\rho \in \mathbf {H}^{nm} : {{\,\mathrm{Tr}\,}}[\rho ] = 1 \text { and } (I \otimes \Phi _i)(\rho ) \succeq 0, \; \forall i=1,\ldots ,k\}\) where \(\Phi _1,\ldots ,\Phi _k:\text {M}_m\rightarrow \text {M}_n\) is a finite family of positive maps. This is because the latter set is semidefinite representable (in fact it is a spectrahedron). Let us mention that if one is only interested in approximating the set \(\text {Sep}(n,m)\), Aubrun and Szarek [AS17b] gave a lower bound on the number k of positive maps needed.

  • Our result also includes as a special case the fact that there is no finite k such that \(\text {DPS}_k(n,m) = \text {Sep}(n,m)\), when \(n+m > 5\). We note that our result is a strict generalization of this fact. Indeed, the failure of the DPS hierarchy to converge in a finite number of levels does not preclude by itself the existence of another semidefinite program that represents \(\text {Sep}(n,m)\) exactly. There are well-known examples of convex sets where the sum-of-squares hierarchy (of which DPS can be seen as a particular instance) is never exact and yet a finite semidefinite representation does exist, see e.g., [NPS10, Example 3.7].

  • Observe that if \(\text {Sep}(n,m)\) has no semidefinite representation, then the same is true for \(\text {Sep}(N,m)\) for \(N \ge n\). This is because \(\text {Sep}(n,m)\) can be realized as a linear section of \(\text {Sep}(N,m)\) as follows:

    $$\begin{aligned} \text {Sep}(n,m) \simeq \left\{ \rho \in \text {Sep}(N,m) : ({{\,\mathrm{Tr}\,}}_2 \rho )_{ii} = 0 \; \forall i=n+1,\ldots ,N \right\} \end{aligned}$$

    where \({{\,\mathrm{Tr}\,}}_2 \rho \) is the result of tracing out the second subsystem from \(\rho \). Indeed, setting \(({{\,\mathrm{Tr}\,}}_2 \rho )_{ii} = 0\) implies that in any representation of \(\rho \) as \(\rho = \sum _{k} p_k x_k x_k^{\dagger } \otimes y_k y_k^{\dagger }\), the vectors \(x_k\) must satisfy \((x_{k})_{i} = 0\) for all \(i=n+1,\ldots ,N\), i.e., that \(x_k \in \mathbb {C}^n \times \{0\}^{N-n} \simeq \mathbb {C}^n\). To prove our theorem it thus suffices to prove that \(\text {Sep}(3,3)\) and \(\text {Sep}(4,2)\) have no semidefinite representations.

Helton–Nie conjecture The question of finding semidefinite representations for convex sets has attracted a lot of attention in the optimization community [Nem06, GPT13]. Helton and Nie [HN09] gave sufficient conditions for a set to have a semidefinite representation, and conjectured that any convex semialgebraic set has a semidefinite representation. (A set is semialgebraic set if it can be described using a finite boolean combinations of polynomial equations and inequalities. One can verify that \(\text {Sep}(n,m)\) is a semialgebraic set, see Sect. 2.3.) In his breakthrough paper, Scheiderer [Sch18] disproved this conjecture and exhibited convex semialgebraic sets that have no semidefinite representations.

Our proof of Theorem 1 is inspired from the arguments of Scheiderer. Compared to the paper of Scheiderer the present paper has two main contributions. First, the proof we give simplifies the arguments of Scheiderer and does not rely on any specialized results from algebraic geometry. We only use basic results from analysis (Taylor expansions), and some standard facts about semialgebraic sets and functions which are elementary to state. The proof should thus be accessible to readers in quantum information and optimization. The second contribution is the application of the method of proof for \(\text {Sep}(n,m)\) which is defined in terms of complex numbers. This turns out to cause certain difficulties as certain standard facts about real polynomials are not true about Hermitian polynomials, particularly on the relation between homogeneous polynomials and their dehomogenizations (see Sect. 5 and Appendix A for more details).

Main technical result Our main technical result, Theorem 2 below and of which Theorem 1 is a corollary, gives a general way to construct a convex set with no semidefinite representation from a nonnegative Hermitian polynomial that is not a sum of squares. We recall that a Hermitian polynomial p(z) is a polynomial with complex coefficients in the indeterminates \((z,{\bar{z}})=(z_1,\ldots ,z_n,{\bar{z}}_1,\ldots ,{\bar{z}}_n)\) such that \(p(z) \in \mathbb {R}\) for all \(z \in \mathbb {C}^n\). A Hermitian polynomial is a sum of squares if it can be written as a sum of squares of Hermitian polynomials. (More details about Hermitian polynomials are given in Sect. 2.) For the statement of the theorem, we use the monomial notation \(z^{u} = \prod _{i=1}^n z_i^{u_i}\) for \(u \in \mathbb {N}^n\).

Theorem 2

(General theorem). Let \(p(z) = \sum _{(u,v) \in A} p_{uv} z^u {\bar{z}}^v\) be a Hermitian polynomial supported on \(A \subset \mathbb {N}^n\times \mathbb {N}^n\), and assume that p is nonnegative on \(\mathbb {C}^n\) but not a sum of squares. Assume furthermore that A is downward closed, i.e., if \((u,v) \in A\) then all \((u',v') \in \mathbb {N}^n \times \mathbb {N}^n\) with \(0 \le u' \le u\) and \(0\le v' \le v\) are in A. Define the monomial map \(\mathbf {m}_A:\mathbb {C}^n \rightarrow \mathbb {C}^{|A|}\), \(z\mapsto \left[ z^{u} {\bar{z}}^{v}\right] _{(u,v) \in A}\) for \(z \in \mathbb {C}^n\). Then the convex set

$$\begin{aligned} \mathcal {C}_A = {{\,\mathrm{\mathbf{cl}}\,}}{{\,\mathrm{\mathbf{conv}}\,}}\left\{ \mathbf {m}_A(z) : z \in \mathbb {C}^n \right\} \end{aligned}$$
(5)

is not semidefinite representable, where \({{\,\mathrm{\mathbf{cl}}\,}}\) denotes topological closure.

The set of separable states is of the form (5) for well-chosen set A. Indeed, dropping the normalization condition and letting \(\mathcal {SEP}(n,m)\) be the convex cone of separable states, we have:

$$\begin{aligned} \begin{aligned} \mathcal {SEP}(n,m)&= {{\,\mathrm{\mathbf{conv}}\,}}\left\{ \left[ x_i {\bar{x}}_j y_k {\bar{y}}_l\right] _{\begin{array}{c} 1 \le i,j \le n\\ 1 \le k,l \le m \end{array}} : (x,y) \in \mathbb {C}^n \times \mathbb {C}^m \right\} \\&= {{\,\mathrm{\mathbf{conv}}\,}}\left\{ \left[ x^{\alpha } {\bar{x}}^{\beta } y^{\gamma } {\bar{y}}^{\delta }\right] _{|\alpha |=|\beta |=|\gamma |=|\delta |=1} : (x,y) \in \mathbb {C}^n \times \mathbb {C}^m \right\} \end{aligned} \end{aligned}$$

where for \(\zeta \in \mathbb {N}^k\) we let \(|\zeta | = \sum _{i=1}^k \zeta _i\). This shows that \(\mathcal {SEP}(n,m) = \mathcal {C}_A\) where

$$\begin{aligned} A = \left\{ (\alpha ,\beta ,\gamma ,\delta ) \in (\mathbb {N}^n \times \mathbb {N}^n) \times (\mathbb {N}^m \times \mathbb {N}^m) : |\alpha |=|\beta |=|\gamma |=|\delta |=1\right\} . \end{aligned}$$
(6)

The attentive reader will notice that this set A is not downward closed, and so does not satisfy the condition of Theorem 2. As a matter of fact, to prove Theorem 1 we apply Theorem 2 with a dehomogenization of A which satisfies the downward closed condition, and then homogenize back to get the desired convex cone. The details are explained in Sect. 5.

Overview of proof We briefly sketch the main ideas for the proof of Theorem 2.

  • We first show that if the set \(\mathcal {C}_A\) has a semidefinite representation, then there exists a finite number of functions \(f_1,\ldots ,f_r:\mathbb {R}^{2n} \simeq \mathbb {C}^n \rightarrow \mathbb {R}\) such that any nonnegative Hermitian polynomial supported on \(A \cup \{(\mathbf {0},\mathbf {0})\}\) can be written as a sum of squares from \({{\,\mathrm{span}\,}}_{\mathbb {R}}(f_1,\ldots ,f_r)\). This characterization of semidefinite representations via sums of squares is not new: it follows from the factorization theorem of Gouveia, Parrilo and Thomas [GPT13] and its sum-of-squares interpretation see e.g., [Faw16]. We note that a similar characterization is also used in Scheiderer’s paper, see [Sch18, Theorem 3.4].

  • One of the main observations needed to prove Theorem 2 is to note that the functions \(f_1,\ldots ,f_r\) can be chosen to be semialgebraic. (We recall the precise definition of semialgebraic functions in Sect. 2.) One key property of such functions that turns out to be particularly important is that they are smooth almost everywhere. Combining this property with a simple observation regarding smooth sum of squares decompositions of homogeneous polynomials allows us to prove Theorem 2 already in the special case where p is a homogeneous polynomial. This allows us to prove that \(\text {Sep}(n,m)\) is not semidefinite representable when \((n,m) = (5,3)\) or (4, 4). The complete proof of Theorem 2 which allows us to cover the cases \((n,m)=(4,2)\) and (3, 3) for separable states, requires an additional technical argument using Puiseux expansions for univariate continuous semialgebraic functions.

Real version of Theorem 2 We note that one can state an analogue of Theorem 2 dealing with real polynomials instead of Hermitian polynomials. The proof is similar, and we state it below just for convenience and for future reference.

Theorem 3

(Main theorem for real polynomials). Let \(p(x) = \sum _{u} p_u x^u \in \mathbb {R}[x]\) where \(A \subset \mathbb {N}^n\) finite, be a real polynomial that is nonnegative on \(\mathbb {R}^n\) but not a sum of squares. Assume furthermore that A is downward closed, i.e., if \(u \in A\) then all \(u' \in \mathbb {N}^n\) with \(0 \le u' \le u\) are in A. Define the monomial map \(\mathbf {m}_A(x) = \left[ x^{u} \right] _{u \in A}\) for \(x \in \mathbb {R}^n\). Then the convex set

$$\begin{aligned} {{\,\mathrm{\mathbf{cl}}\,}}{{\,\mathrm{\mathbf{conv}}\,}}\left\{ \mathbf {m}_A(x) : x \in \mathbb {R}^n \right\} \end{aligned}$$

is not semidefinite representable.

The theorem above can be used to recover the result of Scheiderer [Sch18, Corollary 4.25], that the cone \(P_{n,2d}\) of nonnegative (real) forms in n variables of degree 2d is not semidefinite representable when it is distinct from \(\Sigma _{n,2d}\), the cone of sums of squares. Indeed, it suffices to take p in Theorem 3 to be a dehomogenization of a nonnegative form that is not a sum of squares, and to use the well-known fact that a convex set has a semidefinite representation if and only if its dual has one.

Organization The paper is organized as follows. In Sect. 2 we set some of the notations and present some background material on Hermitian polynomials, sums of squares, and semialgebraic sets and functions that are useful for the proof of the main theorem. In Sect. 3 we review the connection between the existence of semidefinite programming representations, and sums of squares. The proof of Theorem 2 is in Sect. 4 and the proof of Theorem 1 in Sect. 5.

2 Preliminaries

We recall in this section some results on Hermitian polynomials, the duality \(\text {Sep}\)/nonnegative polynomials and \(\text {PPT}\)/sums of squares and semialgebraic sets and functions.

2.1 Hermitian polynomials

For \(z \in \mathbb {C}^n\), we denote the elementwise complex conjugate of z by \({\bar{z}} = ({\bar{z}}_1,\ldots ,{\bar{z}}_n)\). If \(u \in \mathbb {N}^n\) we define the monomial \(z^u = z_1^{u_1} \dots z_n^{u_n}\). A Hermitian polynomial p(z) is a polynomial in z and \({\bar{z}}\) of the form

$$\begin{aligned} p(z) = \sum _{(u,v) \in A} p_{uv} z^u {\bar{z}}^v \qquad (A \subset \mathbb {N}^n \times \mathbb {N}^n) \end{aligned}$$
(7)

such that \(p(z) \in \mathbb {R}\) for all \(z \in \mathbb {C}^n\). This is equivalent to saying that \(p_{uv} = \overline{p_{vu}}\) for all uv. The support of p is \({{\,\mathrm{supp}\,}}(p) = \{ (u,v) : p_{uv} \ne 0\} \subset \mathbb {N}^n \times \mathbb {N}^n\). The Hermitian polynomial p is nonnegative if \(p(z) \ge 0\) for all \(z \in \mathbb {C}^n\). Further, we say that p is a sum of squares if we can write

$$\begin{aligned} p = \sum _{k} q_k^2 \end{aligned}$$
(8)

for Hermitian polynomials \(q_k\). If p(z) is a Hermitian polynomial we will often consider the real polynomial \(P(a,b) = p(a+ib)\) in \(\mathbb {R}[a_1,\ldots ,a_n,b_1,\ldots ,b_n]\). One can check that p is a sum-of-squares if and only if P is a sum-of-squares of real polynomials.

Remark 2

(Sums of squares for Hermitian polynomials). Another common definition of a Hermitian polynomial p(z) being a sum-of-squares is that p can be written as \(p(z) = \sum _{k} |g_k(z)|^2\) where \(g_k\) are (holomorphic) polynomials in z only (and not in \({\bar{z}}\)). Clearly if p has such a representation then it is a sum-of-squares in the sense (8) since then \(p = \sum _{k} \text {Re}[g_k]^2 + \text {Im}[g_k]^2\) and \(\text {Re}[g_k]\) and \(\text {Im}[g_k]\) are both Hermitian polynomials. The converse however is not true. It is possible that a polynomial p has a representation (8) and cannot be written as a sum of modulus squares of holomorphic polynomial mappings. See e.g., [DP09] for more on this distinction. In this paper we only work with the definition (8) of sums of squares.

2.2 \(\text {Sep}\), \(\text {PPT}\), nonnegative polynomials, and sums of squares

For convenience, we will work in this paper with the cone of separable states, where we drop the normalization condition:

$$\begin{aligned} \mathcal {SEP}(n,m) = {{\,\mathrm{\mathbf{conv}}\,}}\left\{ x x^{\dagger } \otimes yy^{\dagger } : x \in \mathbb {C}^n, y \in \mathbb {C}^m \right\} . \end{aligned}$$

One can verify that \(\text {Sep}(n,m)\) is the compact slice \(\text {Sep}(n,m)=\mathcal {SEP}(n,m) \cap \{\rho : {{\,\mathrm{Tr}\,}}\rho = 1\}\).Footnote 2 Let also \(\mathcal {PPT}\) be the cone of states that have positive partial transpose, i.e.,

$$\begin{aligned} \mathcal {PPT}(n,m) = \left\{ \rho \in \mathbf {H}^{nm} : \rho \succeq 0 \text { and } (I \otimes \mathsf {T})(\rho ) \succeq 0 \right\} \end{aligned}$$

so that \(\text {PPT}(n,m) = \mathcal {PPT}(n,m) \cap \left\{ \rho : {{\,\mathrm{Tr}\,}}\rho = 1\right\} \).

Dual of \(\text {Sep}\) For any integer k, let \(\text {M}_k = \mathbb {C}^{k\times k}\). A \(\mathbb {C}\)-linear map \(\Phi : \text {M}_n \rightarrow \text {M}_m\) that is Hermitian preserving is positive if \(\Phi (\rho ) \succeq 0\) whenever \(\rho \succeq 0\). Equivalently, \(\Phi \) is positive if the degree-four Hermitian polynomial \(p(x,y) = y^{\dagger } \Phi \left( xx^{\dagger } \right) y\) is nonnegative on \(\mathbb {C}^{n+m} \simeq \mathbb {C}^n \times \mathbb {C}^m\). It is well-known that the dual of \(\mathcal {SEP}(n,m)\) can be identified, via the Choi isomorphismFootnote 3, with the cone of positive maps \(\text {M}_n \rightarrow \text {M}_m\) (see e.g., [AS17a, Table 2.2]). Equivalently, the dual of \(\mathcal {SEP}(n,m)\) can be identified with nonnegative degree-four Hermitian polynomials of the form

$$\begin{aligned} p(x,y) = \sum _{\begin{array}{c} 1\le i,j \le n\\ 1\le k,l \le m \end{array}} p_{ijkl} x_i {\bar{x}}_j y_k {\bar{y}}_l \qquad (x \in \mathbb {C}^n, y \in \mathbb {C}^m) \end{aligned}$$
(9)

where \(p_{ijkl} = \Phi (E_{ij})_{lk}\). Polynomials of the form (9) have a biquadratic structure: they are quadratic independently in each block of variables x and y. The duality between \(\mathcal {SEP}\) and nonnegative polynomials of the form (9) is in fact immediate from the definition of \(\mathcal {SEP}\).

Dual of \(\text {PPT}\) Using the identification above, it turns out that the dual of \(\mathcal {PPT}(n,m)\) corresponds to polynomials p(xy) that are sums of squares. Indeed, it is well-known (see again [AS17a, Table 2.2]) that the dual cone of \(\mathcal {PPT}(n,m)\) can be identified, via the Choi isomorphism, with the cone of maps \(\Phi : \text {M}_n \rightarrow \text {M}_m\) that are decomposable, i.e., that can be written \(\Phi = S_1 + S_2 \circ \mathsf {T}\) where \(S_1\) and \(S_2\) are two completely positive maps, and \(\mathsf {T}\) is the transpose map. Recall that a map \(S:\text {M}_n\rightarrow \text {M}_m\) is completely positive if there exist matrices \(V_t\) such that \(S(X) = \sum _{t} V_t^{*} X V_t\). One can verify that a map \(\Phi \) is decomposable if and only if, the associated Hermitian polynomial (9) is a sum of squares. We did not find any reference for this equivalence, so we include a proof here. (The proofs we found in the literature are only for the direction \(\Rightarrow \) in Proposition 1. The proof of Proposition 1 is a special case of a more general result in [FF20], joint with Kun Fang, where it is shown that the dual of \(\text {DPS}_k\) can be identified with a sum-of-squares condition of degree k.)

Proposition 1

A map \(\Phi :\text {M}_n \rightarrow \text {M}_m\) is decomposable if, and only if, the Hermitian polynomial \(p(x,y) = y^{\dagger } \Phi \left( xx^{\dagger } \right) y\) is a sum of squares.

Proof

If \(S(\rho ) {=} \sum _{t} V_t \rho V_t^{\dagger }\) is a completely positive map then \(y^{\dagger } S(xx^{\dagger }) y = \sum _{t} y^{\dagger } V_t xx^{\dagger } V_t^{\dagger } y = \sum _{t} |{\bar{y}}^{\mathsf {T}} V_t x|^2\) is a sum-of-squares. Also for the transpose map \(\mathsf {T}\), we have \(y^{\dagger }(S \circ \mathsf {T}) ( xx^{\dagger } ) y = y^{\dagger } S({\bar{x}} {\bar{x}}^{\dagger }) y = \sum _{t} |y^{\mathsf {T}} \bar{V_t} x|^2\) is also a sum-of-squares. It follows that if \(\Phi \) is decomposable then \(p(x,y) = y^{\dagger } \Phi \left( xx^{\dagger } \right) y\) is a sum-of-squares.

We now prove the converse. Assume \(p(x,y) = y^{\dagger } \Phi (xx^{\dagger }) y\) is a sum-of-squares, i.e., \(p(x,y) = \sum _{t} q_t(x,y)^2\) for some Hermitian polynomials \(q_t\). We need to show that \(\Phi \) is decomposable. Because of the biquadratic structure of p we see that \(q_t\) cannot have monomials \(x_i^2, \bar{x_i}^2\), \(x_i x_j, \bar{x_i} \bar{x_j}, x_i \bar{x_j}\) or \( \bar{x_i} x_j\) and similarly for the y’s. Thus this means that \(q_t\) must have the form

$$\begin{aligned} q_t(x,y) = \underbrace{x^{\mathsf {T}} M_t y}_{g_t} + \underbrace{{\bar{x}}^{\mathsf {T}} \bar{M_t} {\bar{y}}}_{\bar{g_t}} + \underbrace{x^{\mathsf {T}} N_t {\bar{y}}}_{h_t} + \underbrace{{\bar{x}}^{\mathsf {T}} N_t y}_{\bar{h_t}} \end{aligned}$$

where \(M \in \mathbb {C}^{n\times n}\) and \(N \in \mathbb {C}^{m\times m}\). Squaring \(q_t\) we get

$$\begin{aligned} q_t^2 = g_t^2 + 2|g_t|^2 + 2g_t h_t + 2g_t \bar{h_t} + \bar{g_t}^2 + 2\bar{g_t} h_t + 2\bar{g_t} \bar{h_t} + h_t^2 + 2 |h_t|^2 + \bar{h_t}^2. \end{aligned}$$

When summing \(\sum _{k} q_t^2\) we see that the only terms that can produce monomials of the form \(x_i \bar{x_j} y_k \bar{y_{l}}\) (the monomials that appear in p) are the terms \(2|g_t|^2\) and \(2|h_t|^2\). The sum of all the other terms must thus be equal to 0. At the end we get (including the constant 2 in \(M_t\) and \(N_t\)):

$$\begin{aligned} p = \sum _{t} |x^{\mathsf {T}} M_t y|^2 + |x^{\mathsf {T}} N_t {\bar{y}}|^2. \end{aligned}$$

From here it easily follows that \(\Phi = S_1 + S_2 \circ \mathsf {T}\) where \(S_1(\rho ) = \sum _{t} N_t \rho N_t^{\dagger }\) and \(S_2(\rho ) = \sum _{t} \bar{M_t} \rho \bar{M_t}^{\dagger }\). \(\square \)

The following diagram summarizes the discussion above.

2.3 Semialgebraic sets and functions

Semialgebraic sets A semialgebraic subset of \(\mathbb {R}^n\) is a subset that can be defined by a finite boolean combination of polynomial equations (\(P = 0\)) and inequalities (\(P > 0\)) where \(P \in \mathbb {R}[x_1,\ldots ,x_n]\). For example a set of the form \(\{w \in \mathbb {R}^D : M_0 + w_1 M_1 + \dots + w_D M_D \succeq 0\}\) is semialgebraic since the condition that a matrix is positive semidefinite can be expressed by a finite number of polynomial inequalities. The set of separable states can be shown to be semialgebraic. One can prove this using the celebrated and powerful result of Tarski stating that the projection of a semialgebraic set is semialgebraic. A consequence of Tarski’s theorem is that the convex hull of a semialgebraic set \(S \subset \mathbb {R}^n\) is semialgebraic. Indeed this is because we can write \({{\,\mathrm{\mathbf{conv}}\,}}(S)\) as the projection on the x component of the following semialgebraic set

$$\begin{aligned} \begin{aligned} \Bigl \{&(x,\lambda ,s_1,\ldots ,s_{n+1}) \in \mathbb {R}^n \times \mathbb {R}^{n+1} \times (\mathbb {R}^n)^{n+1} : \\&\qquad \lambda _1,\ldots ,\lambda _{n+1} \ge 0, s_1,\ldots ,s_{n+1} \in S, \; x = \sum _{i=1}^{n+1} \lambda _i s_i \text { and } \sum _{i=1}^{n+1} \lambda _i = 1 \Bigr \}. \end{aligned} \end{aligned}$$

(Note that, by Carathéodory theorem any element in \({{\,\mathrm{\mathbf{conv}}\,}}(S)\) is a convex combination of at most \(n+1\) points in S.) To see why the set \(\text {Sep}(n,m)\) is a semialgebraic subset of \(\mathbf {H}^{nm} \simeq \mathbb {R}^{2 (nm)^2}\) first note that the following set

$$\begin{aligned} \left\{ (\rho , x , y) \in \mathbf {H}^{nm} \times \mathbb {C}^n \times \mathbb {C}^m \text { s.t. } \rho = xx^{\dagger } \otimes yy^{\dagger } \text { and } |x|^2 = |y|^2 = 1 \right\} \end{aligned}$$
(10)

is a semalgebraic subset of \(\mathbf {H}^{nm} \times \mathbb {C}^n \times \mathbb {C}^m \simeq \mathbb {R}^{2(nm)^2} \times \mathbb {R}^{2n} \times \mathbb {R}^{2m}\) since the equations can all be written as real polynomial equations in the real and imaginary components. By Tarski’s theorem it follows that the projection of (10) on the \(\mathbf {H}^{nm}\) component, which is precisely the set of pure product states, is semialgebraic. Thus \(\text {Sep}\) is semialgebraic as the convex hull of a semialgebraic set.

Semialgebraic functions A function \(f:\mathbb {R}^n\rightarrow \mathbb {R}^m\) is called semialgebraic if its graph \(\{(x,f(x)) : x \in \mathbb {R}^n\} \subseteq \mathbb {R}^n\times \mathbb {R}^m\) is a semialgebraic set. Even though semialgebraic functions form a very broad class of functions, they are tame and possess nice regularity properties. Examples of semialgebraic functions are polynomials, rational functions, or power functions (with rational exponent). Functions that are not semialgebraic are e.g., \(\exp (x)\), or the indicator function of the rationals in \(\mathbb {R}\). We state two basic results about semialgebraic functions that will be crucial for us.

Theorem 4

(Almost everywhere smoothness of semialgebraic functions). Let \(f:\mathbb {R}^n\rightarrow \mathbb {R}\) be a semialgebraic function. Then f is smooth (\(C^{\infty }\)) everywhere except possibly on the zero set of a polynomial \(P \in \mathbb {R}[x_1,\ldots ,x_n] \setminus \{0\}\).

Proof

See e.g., [HP16, Theorem 1.7]. \(\square \)

Clearly Theorem 4 is not true for general functions, cf. the indicator function of the rationals in \(\mathbb {R}\). The second result that we will need concerns semialgebraic functions in one variable.

Theorem 5

(Puiseux expansion for one-dimensional semialgebraic functions) Assume \(f:(0,\eta ) \rightarrow \mathbb {R}\) where \(\eta > 0\), is a semialgebraic continuous function that is bounded. Then f can be extended by continuity to the interval \([0,\eta )\). Additionally, there exists an integer m such that the map \(t\mapsto f(t^m)\) is \(C^{\infty }\) on \([0,\epsilon )\) for some \(0< \epsilon < \eta \).

Proof

See [BPR06, Section 3.3]. \(\square \)

We note that the theorem above is not true for arbitrary functions. For example the function \(x\mapsto \sin (1/x)\) is bounded and continuous on any interval \((0,\eta )\) but cannot be extended by continuity at 0. We finally record the following result which will also be needed for our proof. It simply says that any linear map, restricted to a semialgebraic set always admits a semialgebraic inverse.

Theorem 6

Let \(S \subset \mathbb {R}^N\) be a semialgebraic set and let \(\pi :\mathbb {R}^N \rightarrow \mathbb {R}^n\) be a linear map. Then there exists a semialgebraic function \(F: \pi (S) \rightarrow S\) that satisfies \(\pi (F(x)) = x\) for all \(x \in \pi (S)\).

Proof

See [HP16, Lemma 1.5]. \(\square \)

3 Semidefinite Programming Lifts

We are now ready to start the proof of Theorem 2 (and thus of Theorem 1 too). The first thing we need is a necessary condition for the existence of a semidefinite representation for a given convex set C. The condition we state in Theorem 7 below is very similar to [GPT13, Theorem 1]Footnote 4. Recall that, for \(A \subset \mathbb {N}^n \times \mathbb {N}^n\) we denote by \(\mathbf {m}_A(z)\) the monomial map:

$$\begin{aligned} \mathbf {m}_A(z) = \left[ z^u {\bar{z}}^v \right] _{(u,v) \in A}. \end{aligned}$$

We also denote by \({{\,\mathrm{\mathbf{cl}}\,}}S\) the (topological) closure of a set S.

Theorem 7

Let \(A \subset \mathbb {N}^n \times \mathbb {N}^n\). Assume that

$$\begin{aligned} \mathcal {C}_A = {{\,\mathrm{\mathbf{cl}}\,}}{{\,\mathrm{\mathbf{conv}}\,}}\left\{ \mathbf {m}_A(z) : z \in \mathbb {C}^n \right\} \end{aligned}$$

has a semidefinite representation of size k. Then there exists \(2k^2+1\) semialgebraic functions \(f_j:\mathbb {C}^n \rightarrow \mathbb {R}\) (\(j=1,\ldots ,2k^2+1\)) such that any nonnegative Hermitian polynomial p supported on \(A \cup \{(\mathbf {0},\mathbf {0})\}\) is a sum-of-squares from \(V = {{\,\mathrm{span}\,}}_{\mathbb {R}}(f_1,\ldots ,f_{2k^2+1})\), i.e., \(p = \sum _{j} h_j^2\) for some \(h_j \in V\). Furthermore, the magnitude of the coefficients expressing the \(h_j\) in terms of the basis \((f_1,\ldots ,f_{2k^2+1})\) are all bounded by \(\phi (\Vert p\Vert )\) where \(\Vert p\Vert \) is the largest magnitude of the coefficients of p, and \(\phi \) is some polynomial that only depends on the semidefinite representation of \(\mathcal {C}_A\).

The main difference between the statement above and the one in [GPT13] (see also [Faw16, Theorem 5, Chapter 2]) is that here the functions \(f_1,\ldots ,f_{2k^2+1}\) are semialgebraic. (We say that a function \(f:\mathbb {C}^n\rightarrow \mathbb {R}\) is semialgebraic if the function \(F:\mathbb {R}^n \times \mathbb {R}^n \rightarrow \mathbb {R}\) defined by \(F(a,b) = f(a+ib)\) is semialgebraic.) This observation will be crucial to us. We note that a statement similar to the theorem above appears as Theorem 3.4 in [Sch18]. Instead of working with semialgebraic functions, Scheiderer works with polynomial functions on an algebraic variety X.

Proof

Assume that \(\mathcal {C}_A = \pi (S)\) where S is a spectrahedron defined using a linear matrix inequality of size \(k\times k\):

$$\begin{aligned} S = \left\{ w \in \mathbb {R}^N : M(w) := M_0 + M_1 w_1 + \dots + M_N w_N \succeq 0 \right\} . \end{aligned}$$

We can assume without loss generality that S has nonempty interior in \(\mathbb {R}^N\). This in turn implies, using standard results about spectrahedra, that there exists \(\tilde{w} \in S\) such that \(M(\tilde{w})\) is positive definite (possibly after changing M), see e.g., [RG95, Section 2.4].

For any \(z \in \mathbb {C}^n\) there exists \(w(z) \in S\) such that \(\pi (w(z)) = \mathbf {m}_A(z)\). Since \(M(w(z)) \succeq 0\) we can find \(F(z) \in \mathbb {C}^{k\times k}\) such that \(M(w(z)) = F(z) F(z)^{\dagger }\). Furthermore, by Theorem 6, the function \(z \in \mathbb {C}^n \mapsto F(z) \in \mathbb {C}^{k\times k}\) can be taken to be semialgebraic.

Let p(z) be a Hermitian polynomial supported on \(A \cup \{(\mathbf {0},\mathbf {0})\}\), i.e., \(p(z) = \langle \tilde{p}, \mathbf {m}_A(z) \rangle + c\) for some \(c \in \mathbb {R}\), where \(\tilde{p}\) denotes the coefficients of the polynomial p(z) in the monomial basis. Since \(p \ge 0\) we get \(\langle \tilde{p}, \mathbf {m}_A(z) \rangle + c \ge 0\) for all \(z \in \mathbb {C}^n\). This implies that \(\langle \tilde{p}, \sigma \rangle + c \ge 0\) for all \(\sigma \in \mathcal {C}_A\). We can lift this linear inequality to an inequality on the spectrahedron S, i.e., we have \(\langle \tilde{p}, \pi (w) \rangle + c \ge 0\) for all \(w \in S\), in other words

$$\begin{aligned} \langle \pi ^*(\tilde{p}) , w \rangle + c \ge 0 \quad \forall w \in \mathbb {R}^N \text { s.t. } M(w) \succeq 0. \end{aligned}$$

By Farkas’ lemma/duality for SDPs, this means that there exists \(B \succeq 0\) and \(b \ge 0\) such that

$$\begin{aligned} \langle \pi ^*(\tilde{p}), w \rangle + c = \langle B , M(w) \rangle + b \qquad \forall w \in \mathbb {R}^N. \end{aligned}$$
(11)

Plugging \(w = w(z)\) we get

$$\begin{aligned} \langle \tilde{p}, \pi (w(z)) \rangle + c = \langle B , M(w(z)) \rangle + b = \langle B , F(z) F(z)^{\dagger } \rangle + b. \end{aligned}$$

Since \(\pi (w(z)) = \mathbf {m}_A(z)\) and \(\langle \tilde{p}, \mathbf {m}_A(z) \rangle = p(z)\) we get finally that \(p(z) + c = \langle B , F(z) F(z)^{\dagger }\rangle + b\) for all \(z \in \mathbb {C}^n\). Factorizing \(B = DD^{\dagger }\) we get

$$\begin{aligned} p(z) + c = {{\,\mathrm{Tr}\,}}\left[ DD^{\dagger } F(z) F(z)^{\dagger }\right] +b = \sum _{ij=1}^{k} |D^{\dagger } F(z)|^2_{ij}+b\\ = \sum _{ij=1}^{k} \text {Re}[ (D^{\dagger } F(z))_{ij} ]^2 + \text {Im}[ (D^{\dagger } F(z))_{ij} ]^2+b. \end{aligned}$$

If we define the \(2k^2+1\) semialgebraic functions to be the constant function and the \(z \mapsto \text {Re}[F_{ij}(z)]\) and \(z\mapsto \text {Im}[F_{ij}(z)]\), we get the desired claim.

For the last statement of the theorem, we show that the coefficients of B (and thus of D) are bounded by a polynomial in the coefficients of p. To get this, we can simply plug the value \(w=\tilde{w}\) that makes M(w) positive definite in (11). If we denote by \(\lambda > 0\) the smallest eigenvalue of \(M(\tilde{w})\) we get \(\langle \pi ^*(\tilde{p}), \tilde{w} \rangle + c = \langle B , M(\tilde{w}) \rangle + b \ge \lambda {{\,\mathrm{Tr}\,}}[B] + b \ge \lambda \Vert B\Vert + b\), thus \(\max (\Vert B\Vert ,b) \le (\langle \pi ^*(\tilde{p}), \tilde{w} \rangle + c)/\min (\lambda ,1) \le O(\max \{\Vert p\Vert , |c| \})\). \(\square \)

4 Proof of Theorem 2

We are now ready to prove our main theorem, Theorem 2. We first recall a piece of notation that we will use throughout the proof: for any function \(f:\mathbb {C}^n \rightarrow \mathbb {R}\) we associate the function of real variables \(F:\mathbb {R}^n \times \mathbb {R}^n \rightarrow \mathbb {R}\), denoted by a capital letter, defined by \(F(a,b) = f(a+ib)\). We will also sometimes think of a vector \(z \in \mathbb {C}^n\) as \(z \in \mathbb {R}^{2n}\) and write for instance F(z).

Assume that \(\mathcal {C}_A\) has an SDP representation. Then, from Theorem 7 there exist semialgebraic functions \(f_1,\ldots ,f_r:\mathbb {C}^n \rightarrow \mathbb {R}\) such that the following is true:

$$\begin{aligned} \begin{array}{c} \text {Any nonnegative Hermitian polynomial supported on}\ A \cup \{(\mathbf {0},\mathbf {0})\}\\ \text {is a sum-of-squares of functions from} {{\,\mathrm{span}\,}}_{\mathbb {R}}(f_1,\ldots ,f_r). \end{array} \end{aligned}$$
(*)

We will now prove that the functions \(f_i\) can be taken to be smooth at the origin. (By this we mean that the associated functions \(F_i : \mathbb {R}^n \times \mathbb {R}^n \rightarrow \mathbb {R}\) are smooth at (0, 0).) This will follow from our assumption that A is downward closed. Since the \(F_i\) are semialgebraic we know from Theorem 4 that each \(F_i\) is smooth almost everywhere. Thus we can find a common point \(z_0=(a_0,b_0) \in \mathbb {R}^n\times \mathbb {R}^n\) such that all functions \(F_i\) are smooth at \(z_0\). Now let \(\tilde{f_i}(z) = f_i(z-z_0)\) for all \(i \in \{1,\ldots ,r\}\). We claim that these semialgebraic functions still satisfy the property (\(*\)). Indeed if q is a nonnegative Hermitian polynomial supported on \(A \cup \{(\mathbf {0},\mathbf {0})\}\) then \(q(z+z_0)\) is nonnegative and is also supported on A, since A is downward closed. It follows that \(q(z+z_0)\) is a sum-of-squares from \({{\,\mathrm{span}\,}}_{\mathbb {R}}(f_1,\ldots ,f_r)\). But this implies that q(z) is a sum-of-squares from \({{\,\mathrm{span}\,}}_{\mathbb {R}}(\tilde{f}_1,\ldots ,\tilde{f}_r)\).

In the rest of the proof we will thus assume that the \(F_1,\ldots ,F_r\) are smooth at the origin. If p is homogeneous we are almost done by the following simple observation: if P is a real homogeneous polynomial and \(P = \sum _{j} H_j^2\) for some functions \(H_j:\mathbb {R}^{m} \rightarrow \mathbb {R}\) that are smooth at the origin, then P is a sum-of-squares of polynomials. This can be proved by a simple Taylor expansion; for example by applying the following proposition to the identity \(t^{2k} P(z) = P(tz) = \sum _{j} H_j(tz)^2\) and observing that \(\left. \frac{d^k}{dt^k} H_j(tz)\right| _{t=0}\) is a degree k polynomial in \(z \in \mathbb {R}^{2n}\) (it is the k’th term in the Taylor expansion of \(H_j\) at z).

Proposition 2

Assume that \(g_j:[0,\eta ) \rightarrow \mathbb {R}\) are smooth functionsFootnote 5 and that there exists \(a \in \mathbb {R}\) such that \(at^{2k} = \sum _{j} g_j(t)^2\) for all \(t \in [0,\eta )\). Then \(a = \sum _{j} \left( \frac{g_j^{(k)}(0)}{k!}\right) ^2\).

Proof

If we Taylor expand the \(g_j\) at 0 we get \(at^{2k} = \sum _{j} (g_j(0) + t g_j'(0) + \dots + t^{k} g_j^{(k)}(0) / k! + o(t^k))^2\). By equating powers of t we get that \(g_j(0) = \dots = g_j^{(k-1)}(0) = 0\) and that \(a = \sum _{j} (g_j^{(k)}(0) / k!)^2\) as desired. \(\square \)

The case where p is not necessarily homogeneous requires an additional argument. The following argument is inspired from [Sch18, Proposition 4.18]. Let \(2d = \deg p\), and for any \(t \in \mathbb {R}\) consider the Hermitian polynomial \(p_{t}(z) = t^{2d} p(z/t)\). This Hermitian polynomial is nonnegative and is also supported on A. Thus we know from property (\(*\)) that there exist real coefficients \(a_j(t) \in \mathbb {R}^{r}\) s.t.

$$\begin{aligned} P_{t}(z) = \sum _{j} \left( a_j(t)^{\mathsf {T}} F(z) \right) ^2 \qquad \forall z \in \mathbb {R}^{2n} \end{aligned}$$
(12)

where we let \(F(z) = (F_1(z),\ldots ,F_{r}(z))\). The functions \(a_j(t)\) are defined by a semialgebraic relation and so can be taken to be semialgebraic. As such the \(a_j\) must be continuous on some \((0,\eta )\) for \(\eta > 0\). From the last part of the statement of Theorem 7 we also know that the \(a_j\) must be bounded on \((0,\eta )\). Thus, by Theorem 5 we know that the \(a_j\) can be extended by continuity to \([0,\eta )\) and that for large enough m, \(a_j(t^m)\) is smooth on \([0,\eta ')\) for some \(0< \eta ' < \eta \). From (12) we get:

$$\begin{aligned} P_{t^m}(t^{m} z) = \sum _{j} (a_j(t^m)^{\mathsf {T}} F(t^{m} z))^2. \end{aligned}$$

But note that \(P_{t^m}(t^{m} z) = t^{2dm} P(z)\). Thus

$$\begin{aligned} t^{2dm} P(z) = \sum _{j} (a_j(t^m)^{\mathsf {T}} F(t^m z))^2. \end{aligned}$$

If we let \(g_j(t) = a_j(t^m)^{\mathsf {T}} F(t^m z)\) we know that the \(g_j\) are smooth on \([0,\eta ')\), since the F are smooth at the origin. We can apply the observation of Proposition 2 to get that

$$\begin{aligned} P(z) = \sum _{j} \left( \frac{g_j^{(dm)}(0)}{(dm)!}\right) ^2. \end{aligned}$$

But from the definition of \(g_j\), \(g_j^{(dm)}(0)\) is a polynomial of degree d in z (this can be seen e.g., by considering a Taylor expansion of F at the origin and of \(t\mapsto a_j(t^m)\) at \(t=0\)). This contradicts the assumption that p(z) is not a sum of squares of polynomials.

5 Proof of Theorem 1

The case \(\text {Sep}(3,3)\): We prove that \(\mathcal {SEP}(3,3)\) has no semidefinite representation. Define the Choi polynomial [Cho75] by

$$\begin{aligned} \begin{aligned} p(x,y)&= |x_1|^2 |y_1|^2 + |x_2|^2 |y_2|^2 + |x_3|^2 |y_3|^2 \\&\quad - 2 ( \text {Re}[x_1 \bar{x_2} y_1 \bar{y_2}] + \text {Re}[x_2 \bar{x_3} y_2 \bar{y_3}] + \text {Re}[x_1 \bar{x_3} y_1 \bar{y_3}] )\\&\quad + 2 ( |x_1|^2 |y_2|^2 + |x_2|^2 |y_3|^2 + |x_3|^2 |y_1|^2). \end{aligned} \end{aligned}$$

It was shown in [Cho75] (see also [Cho80, Appendix B]) that \(p(x,y) \ge 0\) for all \((x,y) \in \mathbb {C}^3 \times \mathbb {C}^3\), and yet \(p(x,y)\) is not a sum of squares. In fact the real polynomial \(p(x,y)\) when \((x,y) \in \mathbb {R}^3 \times \mathbb {R}^3\) is not a sum of squares. It follows, by a simple homogenization argument, that the Hermitian polynomial \({\hat{p}}(x_1,x_2,y_1,y_2) = p(x_1,x_2,1,y_1,y_2,1)\) is not a sum of squares. Note that the support of \({\hat{p}}\) satisfies

$$\begin{aligned} {{\,\mathrm{supp}\,}}{\hat{p}} {\subset } {\hat{A}} {=} \left\{ (\alpha ,\beta ,\gamma ,\delta ) {\in } (\mathbb {N}^{2} {\times } \mathbb {N}^{2}) {\times } (\mathbb {N}^{2} {\times } \mathbb {N}^{2}) : |\alpha |{\le } 1, |\beta |{\le } 1, |\gamma |{\le } 1, |\delta |{\le } 1\right\} . \end{aligned}$$

Since \({\hat{A}}\) is downward closed it follows from Theorem 2 that

$$\begin{aligned} \begin{aligned} \mathcal {C}_{{\hat{A}}}&= {{\,\mathrm{\mathbf{cl}}\,}}{{\,\mathrm{\mathbf{conv}}\,}}\left\{ \mathbf {m}_{{\hat{A}}}(x,y) : (x,y) \in \mathbb {C}^{2} \times \mathbb {C}^{2} \right\} \\&={{\,\mathrm{\mathbf{cl}}\,}}{{\,\mathrm{\mathbf{conv}}\,}}\left\{ \left[ x^{\alpha } y^{\beta } {\bar{x}}^{\gamma } {\bar{y}}^{\delta }\right] _{|\alpha |\le 1, |\beta |\le 1, |\gamma |\le 1, |\delta |\le 1} : (x,y) \in \mathbb {C}^{2} \times \mathbb {C}^{2} \right\} \end{aligned} \end{aligned}$$
(13)

does not have a semidefinite representation. To conclude that \(\mathcal {SEP}(3,3)\) has no semidefinite representation, it remains to note that \(\mathcal {C}_{{\hat{A}}}\) is a hyperplane section of \(\mathcal {SEP}(3,3)\). Indeed, first recall that \(\mathcal {SEP}(3,3)\) can be written as

$$\begin{aligned} \mathcal {SEP}(3,3) = {{\,\mathrm{\mathbf{cl}}\,}}{{\,\mathrm{\mathbf{conv}}\,}}\left\{ \mathbf {m}_{A}(x,y) : (x,y) \in \mathbb {C}^3 \times \mathbb {C}^3 \right\} \end{aligned}$$

where

$$\begin{aligned} A = \left\{ (\alpha ,\beta ,\gamma ,\delta ) \in (\mathbb {N}^{3} \times \mathbb {N}^{3}) \times (\mathbb {N}^{3} \times \mathbb {N}^{3}) : |\alpha |= 1, |\beta |= 1, |\gamma |= 1, |\delta |= 1\right\} . \end{aligned}$$

It is easy to see that there is a one-to-one correspondence between \({\hat{A}}\) and A. In terms of the monomial map \(\mathbf {m}\) this simply means that \(\mathbf {m}_{A}\) is the homogenization of \(\mathbf {m}_{{\hat{A}}}\). For example under an appropriate ordering of the monomials we have \(\mathbf {m}_{A}(x_1,x_2,x_3,y_1,y_2,y_3) = |x_3|^2 |y_3|^2 \mathbf {m}_{{\hat{A}}}\left( \frac{x}{x_{3}}, \frac{y}{y_3}\right) \) where we let \(x=(x_1,x_2)\) and \(y=(y_1,y_2)\). It thus follows that \(\mathcal {SEP}(3,3)\) can be written as

$$\begin{aligned} \mathcal {SEP}(3,3) = {{\,\mathrm{\mathbf{cl}}\,}}{{\,\mathrm{\mathbf{conv}}\,}}\left\{ |x_3|^2 |y_3|^2 \mathbf {m}_{{\hat{A}}}\left( \frac{x}{x_{3}}, \frac{y}{y_3}\right) : (x,y) \in \mathbb {C}^3 \times \mathbb {C}^3 \right\} . \end{aligned}$$

It can then be readily verified that \(\mathcal {C}_{{\hat{A}}}\) is a hyperplane section of \(\mathcal {SEP}(3,3)\) where the appropriate coordinate (corresponding to the monomial \(|x_3|^2 |y_3|^2\)) is set to 1.

The case \(\text {Sep}(4,2)\): Following the same approach as above, we need to exhibit a Hermitian polynomial \({\hat{p}}(x_1,x_2,x_3,y)\) supported on

$$\begin{aligned} {\hat{A}} = \left\{ (\alpha ,\beta ,\gamma ,\delta ) \in (\mathbb {N}^3 \times \mathbb {N}) \times (\mathbb {N}^3 \times \mathbb {N}) : |\alpha |\le 1, |\beta |\le 1, |\gamma |\le 1, |\delta |\le 1\right\} \end{aligned}$$
(14)

that is nonnegative but not a sum-of-squares. Since \(\text {Sep}(4,2) \ne \text {PPT}(4,2)\) we know that there exists a Hermitian homogeneous polynomial \(p(x,y)\) on \((x,y) \in \mathbb {C}^4 \times \mathbb {C}^2\) of the form

$$\begin{aligned} p(x,y) = \sum _{ijkl} p_{ijkl} x_i \bar{x_j} y_k \bar{y_l} \quad (x \in \mathbb {C}^4, y \in \mathbb {C}^2) \end{aligned}$$

that is nonnegative but not a sum of squares. Note that such a p satisfies \(p(\lambda x, \mu y) = |\lambda |^2 |\mu |^2 p(x,y)\) for any \((\lambda ,\mu ) \in \mathbb {C}^2\). To get the desired polynomial \({\hat{p}}\) it would suffice to dehomogenize the polynomial \(p\) by setting one of the x variables to 1, and one of the y variables to 1. It turns out, however, that one cannot guarantee in general that this dehomogenized polynomial is not a sum of squares. (We give an explicit example in Appendix A.) The reason we could dehomogenize the Choi polynomial in the (3, 3) case was that the Choi polynomial is not a sum of squares when the variables are real. One cannot expect this to be true for our polynomial in (4,2) variables as it is known that any biquadratic real polynomial in (n, 2) variables is a sum of squares [Cal73]. Nevertheless we show in Appendix A that by choosing an appropriate polynomial \(p\), and an appropriate dehomogenization we can get a polynomial \({\hat{p}}(x,y)\) supported on \({\hat{A}}\) of Eq. (14) that is not a sum-of-squares. This implies that \(\mathcal {C}_{{\hat{A}}}\) is not semidefinite representable. Using a similar argument as for the (3, 3) case we get that \(\mathcal {SEP}(4,2)\) is not semidefinite representable.