Transcendental Properties of Entropy-Constrained Sets

Blakaj, Vjosa; Wolf, Michael M.

doi:10.1007/s00023-022-01227-4

Transcendental Properties of Entropy-Constrained Sets

Original Paper
Open access
Published: 27 August 2022

Volume 24, pages 349–362, (2023)
Cite this article

Download PDF

You have full access to this open access article

Annales Henri Poincaré Aims and scope Submit manuscript

Transcendental Properties of Entropy-Constrained Sets

Download PDF

2229 Accesses
4 Citations
2 Altmetric
Explore all metrics

Abstract

For information-theoretic quantities with an asymptotic operational characterization, the question arises whether an alternative single-shot characterization exists, possibly including an optimization over an ancilla system. If the expressions are algebraic and the ancilla is finite, this leads to semialgebraic level sets. In this work, we provide a criterion for disproving that a set is semialgebraic based on an analytic continuation of the Gauss map. Applied to the von Neumann entropy, this shows that its level sets are nowhere semialgebraic in dimension $d\ge 3$, ruling out algebraic single-shot characterizations with finite ancilla (e.g., via catalytic transformations). We show similar results for related quantities, including the relative entropy, and discuss under which conditions entropy values are transcendental, algebraic, or rational.

Relative entropy optimization and its applications

Article 01 April 2016

Entropy Measures and Views of Information

A functional equation related to generalized entropies and the modular group

Article Open access 02 March 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Semialgebraic sets are ubiquitous in quantum information theory. This is to some extent due to the fact that a large part of the theory is formulated in finite-dimensional vector spaces and that many often used constraints (like positivity of operators) are semialgebraic. Another reason for the prolificness of semialgebraic sets is the diverse closure properties of such sets—especially the Tarski-Seidenberg theorem (1.4 and 2.2 in [1]), which shows that arbitrary quantifiers ($\exists $, $\forall $) over sets that are themselves semialgebraic can be used without leaving the semialgebraic world. This world can be left, however, when using non-algebraic functions or limits, for instance those related to unbounded dimensions or to an asymptotic number of copies in information theoretic contexts. To put it boldly, quantification over an integer variable is the nemesis of the semialgebraic world.

The present work is devoted to deciding whether or not certain sets are semialgebraic. We will first derive a general criterion that is especially suited for sets that are implicitly defined as preimages of functions and then apply it to the level sets of the entropy function and related quantities.

We want to begin, however, with a motivating example in which the question ‘semialgebraic or not?’ arises rather naturally. Consider the following case of catalytic state transformations, which was studied in [2, 3]:

For an initial state $\rho '$ on $\mathbb {C}^{d}$, define $\mathcal {S}_n$ as the set of all states $\rho $ on $\mathbb {C}^{d}$ with the property that for any $\epsilon >0$ there is a state $\sigma $ on $\mathbb {C}^{n}$ and a unitary U on $\mathbb {C}^d\otimes \mathbb {C}^n$ such that the reduced states of $\rho _{12}:=U(\rho '\otimes \sigma )U^*$ satisfy $\rho _2=\sigma $ and $\Vert \rho _1-\rho \Vert _1\le \epsilon $. In other words, $\mathcal {S}_n$ is the set of states that can be reached (approximately) from $\rho '$ with the help of an n-dimensional ‘catalyst’. As a second set, consider the set $\mathcal {S}$ of all states $\rho $ on $\mathbb {C}^{d}$ whose entropy is larger or equal to the one of $\rho '$. It follows from the subadditivity of the entropy [2, 3] that

$$\begin{aligned} \mathcal {S}_n\subseteq \mathcal {S}, \end{aligned}$$

(1)

and the question arises whether equality holds in Eq.(1) for some n depending on d. There are various ways of addressing this question and coming to the conclusion that this is not the case despite the fact that $\mathcal {S}_\infty =\mathcal {S}$ [3]. The arguably simplest one would be to exclude equality by arguing that $\mathcal {S}_n$ is semialgebraic (courtesy of Tarski-Seidenberg), while $\mathcal {S}$ is not. After all, the definition of $\mathcal {S}$ involves the logarithm, which is the paradigm of a non-algebraic function. While, as we see later, this way of reasoning is essentially correct, a more careful argument is required. Consider for instance $d=2$. In this case, $\mathcal {S}$ has an alternative semialgebraic characterization as the set of states for which $\mathrm {tr}\left[ \rho ^2\right] \le c$ for a suitable constant c. Hence, for $d=2$ entropy-constrained sets are semialgebraic and the transcendental nature of the logarithm only becomes relevant in the interrelation (and not in the intra-relation) of level sets. As we will show below, this changes when $d>2$ where $\mathcal {S}$ becomes indeed transcendental. This does not only rule out equality in Eq.(1) but any kind of semialgebraic characterization of entropy constrained sets.

The use of transcendentality as a simple argument for ruling out specific representations has a long but difficult to trace history. Here we want to mention at least [4], where it was shown that the ground state of the antiferromagnetic spin 1/2 Heisenberg chain cannot be a finitely correlated state, since its energy is transcendental.

An interesting problem that is at least superficially related to the one we consider is the question whether almost-entropic regions, which are specified by linear information inequalities for the Shannon entropy, are semialgebraic [5].

Outline of the paper. After laying out the notation and collecting some preliminaries, Theorem 1 will provide a general result that discriminates semialgebraic sets from non-semialgebraic ones. It is guided by two simple observations: (i) analytic continuations of algebraic functions have at most a finite number of branches, and (ii) if a set is given as the level set of some differentiable function, then the normal space of this set is particularly easy to access via the gradient of the defining function. The resulting theorem is then applied to the von Neumann entropy in Sect. 5 and to related sets (involving other constraints or the relative entropy) in Sect. 6. The interlude in Sect. 4 shows that a related albeit slightly weaker result can also be obtained using transcendental number theory. The main observation in Sect. 4 is a dichotomy of entropy values: these tend to be either transcendental or rational—avoiding the irrational algebraic regime in between.

2 Preliminaries

We call a function $f:\mathbb {R}^n\rightarrow \mathbb {R}^k$ algebraic over a subfield $\mathbb {F}\subseteq \mathbb {R}$ if for each of its k component functions $f_i$ there is a polynomial $p_i\in \mathbb {F}[y,x_1,\ldots , x_n]$ such that $y=f_i(x)\Leftrightarrow p_i(y,x_1,\ldots ,x_n)=0$. We will usually apply this concept only locally, i.e., to a neighborhood of a point on the graph of f.

By $H_d\subseteq \mathbb {C}^{d\times d}$ we denote the space of Hermitian $d\times d$ matrices and by $P_d\subseteq H_d$ the set of positive definite matrices with non-degenerate spectrum. The subset of trace-one matrices will be denoted by $D_d \subseteq P_d$. That is, $D_d$ is the set of density matrices that are non-degenerate and have full rank. We can identify $H_d$ with $\mathbb {R}^{d^2}$ by collecting all relevant real and imaginary parts of matrix entries into one vector. The resulting map is then a vector-space isomorphism and we write $\nu :\mathbb {R}^{d^2}\rightarrow H_d$ for its inverse. $P_d$ is an open subset of $H_d$ with respect to the usual topology that is induced by any norm on $H_d$. Accordingly, $\nu ^{-1}(P_d)$ becomes an open subset of $\mathbb {R}^{d^2}$ and thus a smooth $d^2$-dimensional submanifold. A function $f:H_d\rightarrow \mathbb {R}^k$ will be called algebraic, if $f\circ \nu $ is algebraic.

A subset of $\mathbb {R}^n$ is called semialgebraic if it can be defined by a finite number of polynomial equations and inequalities. Unless otherwise stated, all the involved polynomials are over $\mathbb {R}$. A map between semialgebraic sets is called semialgebraic if its graph is a semialgebraic set.

3 Properties of Semialgebraic Sets

In this section, we will have a closer look at general properties of semialgebraic sets—aiming at criteria that allow us to show that certain subsets of an Euclidean space are not semialgebraic.

Every semialgebraic set $S\subseteq \mathbb {R}^n$ is the disjoint union

$$\begin{aligned} \dot{\bigcup }_{i=0}^p M_i=S \end{aligned}$$

(2)

of finitely many Nash submanifolds of $\mathbb {R}^n$, i.e., submanifolds that are at once smooth and semialgebraic (see 2.3 in [6]). We can choose this decomposition (a.k.a. stratification) such that each $M_i$ is connected, but we can also choose it such that i is the manifold-dimension of $M_i$. A Nash map between Nash submanifolds of Euclidean spaces is a map that is both smooth and semialgebraic. For more details on semialgebraic sets, Nash manifolds, and Nash maps we refer to [1, 7].

Denoting by $N_x M_i$ the normal space of $M_i$ at $x\in M_i$, we define the normal bundle of S as

$$\begin{aligned} NS:=\bigcup _{i=0}^p\big \{(x,y)|x\in M_i,\; y\in N_x M_i\big \} \subseteq \mathbb {R}^n \times \mathbb {R}^n . \end{aligned}$$

In the following, our main focus will be on the so-called Gauss map, which maps a point on a manifold to its normal space. One reason behind this is that if the manifold is implicitly defined as the preimage of some function (such as the level sets of the entropy), the Gauss map is often easier to handle than the manifold itself. Moreover, it has the following simple but crucial property:

Lemma 1

If $S\subseteq \mathbb {R}^n$ is semialgebraic, then for any corresponding Nash submanifold $M_i\subseteq S$ the map from $x\in M_i$ to the orthogonal projector P(x) onto $N_x M_i$ is a Nash map. In particular, the normal bundle NS is semialgebraic.

Proof

Consider the squared distance function $\eta :\mathbb {R}^n\ni x\mapsto \frac{1}{2}\inf \{\Vert x-y\Vert ^2 | y\in M_i\} $. Since $M_i$ is a smooth submanifold of $\mathbb {R}^n$, according to Theorem 3.1 in [8], the Hessian $\nabla ^2\eta (x)$ represents the orthogonal projection P(x) onto the normal space $N_x M_i$ and it depends smoothly on $x\in M_i$. Moreover, as $M_i$ is in addition semialgebraic, Proposition 2.2.8 in [1] implies that $\eta $ is semialgebraic. Since derivatives of semialgebraic functions remain semialgebraic (Sect. 4 in [9] or Proposition 2.9.1 in [1]), $x\mapsto \nabla ^2\eta (x)=P(x)$ is semialgebraic and smooth, hence a Nash map. The normal bundle NS is a union of p graphs of such maps and therefore a semialgebraic set. $\square $

Theorem 1

Let M be a Nash submanifold of $\mathbb {R}^n$, and $P(x)\in \mathbb {R}^{n\times n}$ the orthogonal projector onto the normal space of M at $x\in M$. For any pair of Nash maps $g:I\subseteq \mathbb {R}\rightarrow M$, I an open interval, and $h:\mathbb {R}^{n\times n}\rightarrow \mathbb {R}$ define $f:=h\circ P\circ g:I\rightarrow \mathbb {R}$. Then

(1)
f is analytic and algebraic over $\mathbb {R}$,
(2)
the global analytic function obtained from f by analytic continuation has a compact Riemann surface and, in particular, a finite number of branches.

Proof

Lemma 1 implies that f is a composition of Nash maps and thus itself a Nash map.^{Footnote 1} Proposition 8.1.8 in [1] states that a function from a semialgebraic set into $\mathbb {R}$, like f, is a Nash map if and only if it is analytic and algebraic over $\mathbb {R}$. This proves (1).

In order to arrive at (2) we use that any real analytic function f on an interval I can be extended uniquely to a complex analytic function in an open neighborhood of $I\times \{i0\}\subseteq \mathbb {C}$ and further to a global analytic function by analytic continuation. Regarding the polynomial $p\in \mathbb {R}[y,x]$ that governs the algebraic relation $p(f(x),x)=0$ as an element in $\mathbb {C}[y,x]$, the complex analytic extension of f is still algebraic: the same polynomial relation interpreted over $\mathbb {C}$ locally defines a unique function ([10], Chap. 8) that has to coincide with the (extension of) f by the identity principle. Algebraic analytic functions are known to have at most a finite number of branches (Theorem 4, p.306 in [10]) and a compact Riemann surface (I.§2 in [11]). $\square $

In order to show transcendentality of entropy-constrained sets, the idea is to use the last part of (2) in Theorem 1 and to seek a contradiction to the fact that the logarithm-function has an infinite number of branches.

4 Transcendental Entropy Values

Before we discuss transcendental sets and functions, we will make a brief excursion to the related topic of transcendental numbers and analyze under which conditions the entropy of a single state is transcendental, algebraic or rational. $\overline{\mathbb {Q}}$ will denote the field of algebraic numbers. The main tool from transcendental number theory on which our results are based on is:

Lemma 2

(Baker’s theorem [12]) Let $\lambda _1,\ldots ,\lambda _n\in \overline{\mathbb {Q}}\setminus \{0,1\}$ be such that $\ln \lambda _1,\ldots ,\ln \lambda _n$ are linearly independent over $\mathbb {Q}$. Then $1, \ln \lambda _1,\ldots ,\ln \lambda _n$ are linearly independent over $\overline{\mathbb {Q}}$.

Equipped with this instrument we obtain the following dichotomies:

Theorem 2

Let $\rho \in \mathbb {C}^{d\times d}$ be a density matrix and $S(\rho ):=-\mathrm {tr}\left[ \rho \log _b\rho \right] $ where $b>1$ is the base of the logarithm.

(1)
If $b=e$ (i.e., $\log _b=\ln $) and the eigenvalues of $\rho $ are algebraic, then $S(\rho )$ is either zero or transcendental.
(2)
If b is algebraic and the eigenvalues of $\rho $ are rational, then $S(\rho )$ is either rational or transcendental.

Remark: Note that the eigenvalues are in particular algebraic if the matrix elements of $\rho $ are, since then $\det (\lambda \mathbb {1}-\rho )=0$ becomes a polynomial in $\overline{\mathbb {Q}}[\lambda ]$.

Proof

Let $\lambda _1,\ldots ,\lambda _d$ be the eigenvalues of $\rho $.

(1):
A consequence of Lemma 2 is that, under the assumptions of the lemma, any $\overline{\mathbb {Q}}$-linear combination of $\ln \lambda _1,\ldots ,\ln \lambda _n$ is either zero or transcendental. This can be seen by induction over n. For $n=1$ it follows immediately from Lemma 2. The induction step from n to $n+1$ can be shown by contradiction: suppose some $\overline{\mathbb {Q}}$-linear combination
$$\begin{aligned} \beta :=\sum _{i=1}^{n+1}\alpha _i\ln \lambda _i ,\quad \alpha _i\in \overline{\mathbb {Q}} \end{aligned}$$
(3)
is a non-zero algebraic number $\beta $. Then by Lemma 2 one of the $\ln \lambda _i$ is a $\mathbb {Q}$-linear combination of the other n and can thus be replaced by them in Eq.(3), which would contradict the induction hypothesis.
(2):
If we define $a:=\prod _i \lambda _i^{-\lambda _i}$ where the product runs over all nonzero eigenvalues, then by assumption $a\in \overline{\mathbb {Q}}$ and $S(\rho )=(\ln a)/(\ln b)$. Hence, if $S(\rho )\in \overline{\mathbb {Q}}$, then $\ln a$ and $\ln b$ would be $\overline{\mathbb {Q}}$-linearly dependent. By Lemma 2 this would imply that they are $\mathbb {Q}$-linearly dependent, which in turn implies that their fraction $S(\rho )$ must be rational. $\square $

A simple consequence of Theorem 2 is that when using the natural logarithm, level sets of the entropy cannot be semialgebraic over $\overline{\mathbb {Q}}$ unless the entropy is zero. Next, we show that something similar is true over $\mathbb {R}$.

5 Entropy-Surfaces are Nowhere Semialgebraic

In this section, we present the application of Theorem 1 to von Neumann entropy level sets. The sets of states that have extremal entropy (0 or $\ln d$) are evidently (semi)algebraic in any dimension d. Moreover, as we have discussed in introduction, all level sets of the entropy are semialgebraic for $d=2$. The following shows that this is no longer the case for $d>2$.

Theorem 3

For any $d\ge 3$ and $c\in (0,\ln d)$ the set of $d\times d$ density operators whose von Neumann entropy is equal to c is nowhere semialgebraic. That is, if ${\mathcal S}:=\{\rho \in H_d|\rho \ge 0,\mathrm {tr}\left[ \rho \right] =1,-\mathrm {tr}\left[ \rho \ln \rho \right] =c\}$, then for any open set $V\subset H_d$ the set $\mathcal {S}\cap V$ is not semialgebraic unless it is empty.

Remarks: [1.] Theorem and proof use the natural logarithm. However, the same statement holds true as well for any other base, since changing the base of the logarithm is equivalent to changing the value of c.

[2.] Here and in the following section, we only consider the case of equality “$=c$”. Since boundaries of semialgebraic sets are semialgebraic, the same result holds true for the inequalities “$<c$”, “$\le c$”, “$>c$” and “$\ge c$”.

Proof

Since we consider neighborhoods of states of constant entropy, we can restrict ourselves to sufficiently small subsets and thereby assume w.l.o.g. that $(\mathcal {S}\cap V)\subset D_d$. In this way, we can regard $\mathcal {S}\cap V$ as a smooth $(d^2-2)$-dimensional submanifold of $D_d$, since any $c\in (0,\ln d)$ is a regular value of the entropy function.

Next, we employ the map $\Phi :D_d\rightarrow \mathbb {R}^{d-1}\times \mathbb {R}^{d^2-d}$ from Lemma 4 in “Appendix A” that maps any $\rho \in D_d$ onto a vector whose first $d-1$ components are distinct eigenvalues of $\rho $. As proven in Lemma 4, we can choose $\Phi $ such that it becomes an algebraic diffeomorphism onto its range when restricted to a sufficiently small neighborhood. Consequently, if $(\mathcal {S}\cap V)$ is semialgebraic, then its image $M:=\Phi (\mathcal {S}\cap V)$ would be a Nash submanifold of $\mathbb {R}^n$, $n:={d^2-1}$, to which Theorem 1 applied (cf. Fig. 1). We now assume that this is the case and seek for a contradiction.

Let $\rho $ be any state in $\mathcal {S}\cap V$. The manifold M is the preimage of c under the map $F:\mathbb {R}^n\rightarrow \mathbb {R}$,

$$\begin{aligned} F(x):= -\Bigg (1-\sum _{i=1}^{d-1} x_i\Bigg )\ln \Bigg (1-\sum _{i=1}^{d-1} x_i\Bigg ) - \sum _{i=1}^{d-1} x_i\ln x_i, \end{aligned}$$

in a neighborhood of $\Phi (\rho )$.

Standard results in differential geometry tell us that the normal space of M at $x\in M$ is the one-dimensional space spanned by the gradient $\nabla F(x)$. The components of the gradient are $\nabla F(x)_i=\ln \big (1-\sum _{j=1}^{d-1} x_j\big )-\ln x_i$ for $i<d$ and zero otherwise. In order to apply Theorem 1 we define a Nash map $h:\mathbb {R}^{n\times n}\rightarrow \mathbb {R}$, $h(X):=\sqrt{X_{11}/X_{22}}$. Composing this with the projector P(x) onto the normal space we obtain (under the assumption that $d\ge 3$):

$$\begin{aligned} h\circ P(x) = \left| \frac{\nabla F(x)_1}{\nabla F(x)_2}\right| . \end{aligned}$$

(4)

We can assume that the components of x, i.e., the eigenvalues of $\Phi ^{-1}(x)$, are in ascending order for all x in the considered neighborhood. In this way, the absolute value in Eq.(4) can be neglected, as the quotient is positive.

Now consider a path on M parameterized on an open interval I by $g:\mathbb {R}\supset I\rightarrow M$ that goes through $\xi :=\Phi (\rho )$ so that $g(\lambda _1):=(\lambda _1,\lambda _2,\xi _3,\ldots ,\xi _n)$ where $\lambda _2=\lambda _2(\lambda _1)$ is implicitly defined by demanding $g(I)\subset M$. Here, the implicit function theorem guarantees the existence of $\lambda _2(\lambda _1)$ as solution to $F(\lambda _1,\lambda _2,\xi _3,\ldots ,\xi _n)=c$. If M is a Nash manifold, then $\lambda _1\mapsto \lambda _2(\lambda _1)$ is algebraic and g is a Nash map on a neighborhood of $\xi _1$ at which $g(\xi _1)=\Phi (\rho )$.

According to Theorem 1 the analytic continuation of the function

$$\begin{aligned} f(\lambda _1):=h\circ P\circ g(\lambda _1)=\frac{\ln \left( \frac{\lambda _1}{w-\lambda _1-\lambda _2(\lambda _1)}\right) }{\ln \left( \frac{\lambda _2(\lambda _1)}{w-\lambda _1-\lambda _2(\lambda _1)}\right) },\quad \text {where}\quad w:=1-\sum _{j=3}^{d-1}\xi _j \end{aligned}$$

(5)

must lead to a global analytic function with only finitely many branches—if the hypothesis of M being semialgebraic is valid. In the remaining part of the proof, we will show that this is not that case and that f gives rise to an infinite number of branches.

Let $z,y: I \subseteq \mathbb {R}\longrightarrow \mathbb {R}$ denote the functions $z(\lambda _1):= \frac{\lambda _1}{w-\lambda _1-\lambda _2(\lambda _1)}$ and $y(\lambda _1):=\frac{\lambda _2(\lambda _1)}{w-\lambda _1-\lambda _2(\lambda _1)}$, respectively. Under the assumption that $\lambda _1\mapsto \lambda _2(\lambda _1)$ is an algebraic function, and using the closure properties of the set of algebraic functions (Sect. 4 in [9], Theorem 6.4 in [13]), z and y are algebraic functions as well. As such, they can be regarded as global analytic functions defined on the entire complex plane up to a finite number of points (Theorem 3.1, [14]) and with at most finitely many branches.

Consider the analytic continuation of f along a closed path $\gamma $ in the complex plane that starts at $\xi _1$, bypasses all of the finitely many singularities and is such that the image $z(\gamma )$ goes around the origin once and returns to the initial function value. Since algebraic functions have algebraic inverses, such a path always exists. As y has a finite number of branches, there is a $k\in \mathbb {N}$ such that also y returns to the same function value if we run through $\gamma $ k-times in a row. After continuously tracing the path $\gamma $ nk times for any $n\in {\mathbb {Z}}$, the branches of the complex logarithm give rise to a change of the value of the function f from the initial $f(\xi _1)$ to

$$\begin{aligned} \frac{2\pi i k n+\ln z(\xi _1)}{2\pi i m(n) +\ln y(\xi _1)}, \end{aligned}$$

(6)

where $m:{\mathbb {Z}}\rightarrow {\mathbb {Z}}$ is some function that takes into account how many times the image of the path under y has enclosed the origin. As shown in Lemma 5 in “Appendix B”, if we run over all $n\in {\mathbb {Z}}$, then Eq.(6) represents an infinite number of function values as long as $f(\xi _1)$ is irrational. In case $f(\xi _1)$ happens to be rational, we apply the same argument albeit with the starting point $\xi _1$ slightly shifted to a point $\tilde{\xi }_1\in I$ for which $f(\tilde{\xi }_1)\not \in \mathbb {Q}$. That such a $\tilde{\xi }_1$ exists in any neighborhood of $\xi _1$ is implied by the fact that f is continuous and non-constant, since $f'(\xi _1)\ne 0$, which is proven in Lemma 6 in “Appendix B”. $\square $

From the proof of the above theorem, we can see that the same outcome holds true for the classical case where the Shannon entropy is used instead of the von Neumann entropy.

6 Additional Constraints and Relative Entropy

In this section, we want to extend the result by first allowing for an additional constraint and then showing a similar result for the relative entropy. Additional constraints could concern the distance of the state to a specific target, its energy or the expectation value of any observable. Mathematically, we will describe those using a differentiable function $h:P_d\rightarrow \mathbb {R}$ for which we denote by $\nabla h(\rho )\in H_d$ the gradient, i.e., the operator that is related to the Fréchet derivative $h'(\rho )$ via $h'(\rho ):H_d\ni X\mapsto \mathrm {tr}\left[ X\nabla h(\rho )\right] $.

Theorem 4

For $d\ge 3$, $c_1\in (0,\ln d)$, $c_2\in \mathbb {R}$ and $h\in C^1(P_d,\mathbb {R})$ let $\rho $ be an element of

$$ {\mathcal S}:=\{\rho \in P_d\;|\;\mathrm {tr}\left[ \rho \right] =1,-\mathrm {tr}\left[ \rho \ln \rho \right] =c_1, h(\rho )=c_2\},$$

with $[\rho ,\nabla h(\rho )]\ne 0$. Then $\mathcal {S}$ is not semialgebraic in any neighborhood of $\rho $.

Remark: The condition $[\rho ,\nabla h(\rho )]\ne 0$ is stronger than necessary but conveniently serves the purpose of the proof. It can be interpreted as a first-order-way of imposing that $h(\rho )$ does not solely depend on the spectrum of $\rho $.

Proof

Without loss of generality, we can consider a sufficiently small neighborhood in which $[\rho ,\nabla h(\rho )]\ne 0$ holds for each of its elements. We begin with convincing ourselves that within such a neighborhood $\mathcal {S}$ is a regular $C^1$-submanifold of $P_d$. To this end, we regard $\mathcal {S}$ as the preimage of $c:=(1,c_1,c_2)$ under $f:P_d\rightarrow \mathbb {R}^3$, $f(\rho ):=(\mathrm {tr}\left[ \rho \right] ,S(\rho ),h(\rho ))$. Then c is a regular value, and thus $\mathcal {S}$ a regular $C^1$-submanifold, if the three involved gradients $\mathbb {1},\nabla h(\rho )$ and $\nabla S(\rho )=-\mathbb {1}-\ln \rho $ (Lemma VI.9 in [15]) are linearly independent. This is, however, guaranteed by $[\rho ,\nabla h(\rho )]\ne 0$.

Next, we adopt the viewpoint of $\mathcal {S}$ as an intersection of two manifolds, namely the level set of the von Neumann entropy studied in Theorem 3 and the manifold $h^{-1}(\{c_2\})$. More precisely, we will consider a submersion of those manifolds into a space that is relevant for the argument in the proof of Theorem 3. The considered submersion is given by $\psi :P_d\rightarrow \mathbb {R}^d$, $\psi _i(\rho ):=\Psi _i(\rho )$, where $\Psi $ is the diagonalizing algebraic diffeomorphism from Lemma 3. We claim for the tangent space of one of the submersed manifolds that

$$\begin{aligned} T_{\psi (\rho )}\Big [\psi \circ h^{-1}\big (\{c_2\}\big )\Big ]=\psi '(\rho )\;T_\rho \Big [h^{-1}\big (\{c_2\}\big )\Big ] =\mathbb {R}^d. \end{aligned}$$

(7)

Before we prove Eq.(7), let us see why Eq.(7) completes the proof of the theorem: the tangent space $T_{\psi (\rho )}\big [\psi (\mathcal {S})\big ]$ of the intersected manifold is equal to the intersection of the tangent spaces of the individual manifolds.^{Footnote 2} However, by Eq.(7), the tangent space associated to the additional constraint is the entire space $\mathbb {R}^d$ so that this ‘intersection’ becomes void and $T_{\psi (\rho )}\big [\psi (\mathcal {S})\big ]$ is, in fact, the tangent space of the manifold already studied in Theorem 3. Consequently, the same argument applies.

Now let us show Eq.(7). The first equality just uses the Fréchet derivative as pushforward between tangent spaces. For the second equality we exploit that $\psi _i'(\rho ):X\mapsto \langle \varphi _i|X|\varphi _i\rangle $, where $\{|\varphi _i\rangle \}$ is the eigenbasis of $\rho $ (see proof of Lemma 3). Moreover, $T_\rho \big [h^{-1}\big (\{c_2\}\big )\big ]$ is the orthogonal complement of $\nabla h(\rho )$. Seeking for a contradiction, suppose the last equation in Eq.(7) does not hold. Then, there would exist a non-zero $b\in \mathbb {R}^d$ such that $\sum _i b_i \langle \varphi _i|X|\varphi _i\rangle =0$ holds for any $X\perp \nabla h(\rho )$. In other words, $B:=\sum _i b_i|\varphi _i\rangle \langle \varphi _i|$ would have to be proportional to $\nabla h(\rho )$. This is impossible since $[\rho ,B]=0$, whereas $[\rho ,\nabla h(\rho )]\ne 0$. $\square $

As an application of Theorem 4 let us show transcendentality of the level sets of the relative entropy. The relative entropy of two density operators $\rho ,\sigma $ on $\mathbb {C}^d$ is defined as $S(\rho |\!|\sigma ):=\mathrm {tr}\left[ \rho \ln \rho \right] -\mathrm {tr}\left[ \rho \ln \sigma \right] $ whenever $\mathrm{supp}(\rho )\subseteq \mathrm{supp}(\sigma )$ and $S(\rho |\!|\sigma ):=\infty $ otherwise.

Corollary 1

For any $c>0$, $d\ge 3$, any positive definite density matrix $\sigma \in H_d$ and any open subset $U\subseteq H_d$ the set

$$\begin{aligned} \mathcal {R}:= \big \{\rho \in D_d\;|\; S(\rho |\!|\sigma )=c\big \}\cap U \end{aligned}$$

(8)

is not semialgebraic in $H_d$ unless it is empty.

Proof

We use that $S(\rho |\!|\sigma )=-S(\rho )-\mathrm {tr}\left[ \rho \ln \sigma \right] $ and that $\rho \in D_d$ excludes the extremal values 0 and $\ln d$ of the von Neumann entropy. If $\sigma =\mathbb {1}/d$, then the set $\mathcal {R}$ reduces to a level set of the von Neumann entropy so that Theorem 3 applies. If $\sigma \ne \mathbb {1}/d$, we claim that there is a ${\tilde{\rho }}\in \mathcal {R}$ that does not commute with $\sigma $ and thus $[{\tilde{\rho }},\ln \sigma ]\ne 0$.

In order to show this, note that $\mathcal {R}$ is a manifold of dimension $d^2-2$. The manifold of all density operators commuting with $\sigma \ne \mathbb {1}/d$, which we denote by $C_\sigma $, however, can be shown to have dimension at most $(d-1)^2$: since the commutant is a proper von Neumann subalgebra and since the largest such subalgebra is isomorphic to the matrix algebra ${\mathcal {M}}_{(d-1)}\oplus {\mathcal {M}}_1$ (cf. Theorem 5.6 in [16]), the dimension of the submanifold of density matrices that are contained in this subalgebra is at most $(d-1)^2$. This implies that for all $d\ge 3$ we have $\mathrm{dim}(C_\sigma )<\mathrm{dim}(\mathcal {R})$ so that $\mathcal {R}$ must contain elements outside the commutant of $\sigma $. Let ${\tilde{\rho }} $ be one such element.

We restrict the focus to a neighborhood of ${\tilde{\rho }}$ in which no element commutes with $\sigma $. If $\mathcal {R}$ were semialgebraic in that neighborhood, then $\mathcal {R}$ intersected with the affine space of all $\rho $ for which $\mathrm {tr}\left[ (\rho -{\tilde{\rho }})\ln \sigma \right] =0$ would be semialgebraic as well. However, this intersection is covered by Theorem 4 with $h(\rho ):=\mathrm {tr}\left[ \rho \ln \sigma \right] $ so that $\nabla h({\tilde{\rho }})=\ln \sigma $ and therefore $[{\tilde{\rho }},\nabla h({\tilde{\rho }})]\ne 0$. $\square $

7 Outlook

We expect variants of the presented arguments to also apply to many other entropic quantities that appear in classical and quantum information theory. Since the obtained results then rule out algebraic one-shot characterizations with finite ancilla, it may be interesting to investigate known characterizations of this type, such as the ones in [17,18,19,20], from this angle.

Notes

Since the composition of smooth maps is smooth, and the composition of semialgebraic maps is semialgebraic (Proposition 2.2.6, [1]), it follows that the composition of Nash maps is a Nash map.
Strictly speaking, this requires that the manifolds intersect transversally, but as one of the tangent spaces is the entire space, the intersection is trivially transversal.

References

Bochnak, J., Coste, M., Roy, M.-F.: Real Algebraic Geometry. Springer (1998)
Boes, P., Eisert, J., Gallego, R., Müller, M.P., Wilming, H.: Von Neumann entropy from unitarity. Phys. Rev. Lett. 122, 210402 (2019)
Article ADS Google Scholar
Wilming, H.: Entropy and reversible catalysis (2020)
Fannes, M., Nachtergaele, B., Werner, R.F.: Finitely correlated states on quantum spin chains. Commun. Math. Phys. 144, 443–490 (1992)
Article ADS MathSciNet MATH Google Scholar
Gomez, A., Mejia Moreno, C., Montoya, J.: Defining the almost-entropic regions by algebraic inequalities. Int. J. Inf. Coding Theory 4, 1 (2017)
MathSciNet MATH Google Scholar
Coste, M.: An introduction to semialgebraic geometry (2002)
Fernando, J., Gamboa, J., Ruiz, J.: Finiteness problems on Nash manifolds and Nash sets. J. Eur. Math. Soc. 16, 537–570 (2014)
Article MathSciNet MATH Google Scholar
Ambrosio, L., Soner, H.M.: Level set approach to mean curvature flow in arbitrary codimension. J. Differ. Geom. 43(4), 693–737 (1996)
MathSciNet MATH Google Scholar
Nixon, J.: Theory of algebraic functions on the Riemann Sphere. Mathematica Aeterna 3(2), 83–101 (2013)
MathSciNet MATH Google Scholar
Ahlfors, L.: Complex Analysis: An Introduction to The Theory of Analytic Functions of One Complex Variable, 3rd edn. MacGraw-Hill (1979)
Griffiths, P.A.: Introduction to Algebraic Curves. American Mathematical Society (1989)
Baker, A.: Linear forms in the logarithms of algebraic numbers (iii). Mathematika 14, 220–228 (1967)
Article MATH Google Scholar
Avni, N., Breuer, J., Simon, B.: Periodic Jacobi matrices on trees. Adv. Math. 370, 107241 (2020)
Article MathSciNet MATH Google Scholar
Guan, K., Lei, J.: Notes on algebraic functions. Int. J. Math. Math. Sci. 2003, 02 (2003)
MathSciNet Google Scholar
Hanson, E.P., Datta, N.: Maximum and minimum entropy states yielding local continuity bounds. J. Math. Phys. 59(4), 042204 (2018)
Article ADS MathSciNet MATH Google Scholar
Farenick, D.R.: Algebras of Linear Transformations. Springer (2001)
Kondra, T.V., Datta, C., Streltsov, A.: Catalytic transformations of pure entangled states. Phys. Rev. Lett. 127, 150503 (2021)
Article ADS MathSciNet Google Scholar
Lipka-Bartosik, P., Skrzypczyk, P.: Catalytic quantum teleportation. Phys. Rev. Lett. 127, 080502 (2021)
Article ADS MathSciNet Google Scholar
Rubboli, R., Tomamichel, M.: Fundamental Limits on Correlated Catalytic State Transformations (2021)
Rethinasamy, S., Wilde, M.M.: Relative entropy and catalytic relative majorization. Phys. Rev. Res. 2, 033455 (2020)
Article Google Scholar
Stewart, G., Sun, J.-G.: Matrix Perturbation Theory. Academic Press, INC. (1990)

Download references

Acknowledgements

MMW acknowledges funding by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy—EXC-2111—390814868. VB acknowledges support by the International Max Planck Research School for Quantum Science and Technology at the Max-Planck-Institute of Quantum Optics.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Department of Mathematics, Technical University of Munich, Garching, Germany
Vjosa Blakaj & Michael M. Wolf
Munich Center for Quantum Science and Technology (MCQST), Munich, Germany
Vjosa Blakaj & Michael M. Wolf

Authors

Vjosa Blakaj
View author publications
You can also search for this author in PubMed Google Scholar
Michael M. Wolf
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vjosa Blakaj.

Additional information

Communicated by Matthias Christandl.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A: Diagonalizing Algebraic Diffeomorphisms

In this appendix, we show that the diagonalization employed in the proof of Theorem 3 can be done by means of an algebraic diffeomorphism.

Lemma 3

For any $\rho \in P_d$ there is an open neighborhood $U\subseteq P_d$ and a map $\Psi :U\rightarrow \mathbb {R}^{d^2}$ such that (i) $\Psi $ is algebraic over $\mathbb {Q}$, (ii) $\Psi $ is a diffeomorphism onto its range, and (iii) $\{\Psi _i(X)\}_{i=1}^d=\mathrm{spec}(X)$ for any $X\in U$.

Proof

As demanded by (iii), we define $\Psi _i(X)$ to be the i’th eigenvalue of X for $i=1,\ldots , d$. This leads to algebraic functions over $\mathbb {Q}$ due to the polynomial relation $\mathrm{det}\big (\Psi _i(X)\mathbb {1}-X\big )=0$.

From eigenvalue perturbation theory of Hermitian matrices (an adapted version of Theorem 2.3 in [21] where the left and right eigenvectors coincide) we know that $\Psi _i(\rho +t A)=\Psi _i(\rho )+\mathrm {tr}\left[ A |\varphi _i\rangle \langle \varphi _i|\right] t +{\mathcal {O}}(t^2)$ holds in the limit $t\rightarrow 0$, where $|\varphi _i\rangle $ is the normalized eigenvector of $\rho $ corresponding to $\Psi _i(\rho )$. This implies that the derivative $\Psi _i'(\rho ):H_d \rightarrow \mathbb {R}$ acts as $A\mapsto \mathrm {tr}\left[ A|\varphi _i\rangle \langle \varphi _i|\right] $ and since the $\varphi _i$’s form an orthonormal basis, the d derivatives $\Psi _1'(\rho ),\ldots , \Psi _d'(\rho )$ are linearly independent.

To construct the remaining component functions $\Psi _i$ with $i>d$, consider the map $\tilde{\Psi }:\mathbb {R}^{d^2}\rightarrow \mathbb {R}^d \times \mathbb {R}^{d^2}$,

$$\begin{aligned} \tilde{\Psi }(x):=\Big (\Psi _1\big (\nu (x)\big ),\ldots ,\Psi _d\big (\nu (x)\big ),x \Big ), \end{aligned}$$

where $\nu $ is the parametrization of Hermitian matrices as defined in Sect. 2. The Jacobi matrix $\tilde{J}$ that represents the derivative of $\tilde{\Psi }$ at $\nu ^{-1}(\rho )$ has the form $\tilde{J}={{C}\atopwithdelims (){\mathbb {1}}}$ and is of dimension $(d+d^2) \times d^2$, where C is a $d\times d^2$ matrix that represents the derivatives of the d eigenvalues. As we have seen above, those are linearly independent, so that C has rank d. Hence, we can find d rows within the $\mathbb {1}$-block of $\tilde{J}$ that can be erased so that the resulting square matrix is non-singular. Denote this square matrix by J and the map that selects the rows by $s:\mathbb {R}^d \times \mathbb {R}^{d^2}\rightarrow \mathbb {R}^{d^2}$ so that $J=s\tilde{J}$. The derivative of the map $\Psi :=s\circ \tilde{\Psi }\circ \nu ^{-1}$ at $\rho $ then has full rank so that by the inverse function theorem there exists an open neighborhood $U\ni \rho $ such that $\Psi :U\rightarrow \Psi (U)$ is a diffeomorphism. $\square $

Based on this lemma we can now show an analogous result that incorporates the constraint $\mathrm {tr}\left[ \rho \right] =1$.

Lemma 4

For any $\rho \in D_d$ there is an open neighborhood $V \subseteq D_d$ and a map $\Phi : V \longrightarrow \mathbb {R}^{d-1} \times \mathbb {R}^{d^2-d}$, such that $\Phi $ is (i) algebraic over $\mathbb {Q}$, (ii) a diffeomorphism onto its range and (iii) $\{\Phi (X)\}_{i=1}^{d-1}$ are distinct eigenvalues of X for any $X\in V$.

Proof

To construct such a map we compose the algebraic diffeomorphism $\Psi $ from Lemma 3 with the following maps: $\iota : D_d \longrightarrow P_d$ denotes the inclusion map of the set $D_d$ into the larger set $P_d$ and $\pi : \mathbb {R}^d \times \mathbb {R}^{d^2-d} \longrightarrow \mathbb {R}^{d-1} \times \mathbb {R}^{d^2-d}$ the projection map that discards the d-th component of the input. By $\Phi $ we denote the composition $\Phi := \pi \circ \Psi \circ \iota : V \longrightarrow \mathbb {R}^{d-1} \times \mathbb {R}^{d^2-d}$. Being the composition of three smooth and algebraic maps, $\Phi $ is smooth and algebraic. It is also bijective onto its range with inverse $\Phi ^{-1}= \Psi ^{-1} \circ \widehat{\pi }$, where

$$\widehat{\pi }: (\lambda _1,..., \lambda _{d-1}, x) \mapsto \Big (\lambda _1,..., \lambda _{d-1}, 1-\sum _{j=1}^{d-1} \lambda _j, x\Big )\quad \text {with }x\in \mathbb {R}^{d^2-d}.$$

As a composition of smooth maps $\Phi ^{-1}$ is smooth. Since a bijective smooth map with smooth inverse is a diffeomorphism, this concludes the proof. $\square $

Appendix B: Technical Lemmas

Lemma 5

Let $a, b \in \mathbb {R}\setminus \{0\}$ be such that $\frac{a}{b}\not \in \mathbb {Q}$, $k \in \mathbb {N}$ and $m : \mathbb {Z} \longrightarrow \mathbb {Z}$. Then

$$\begin{aligned} \Big | \Big \{\frac{a + 2\pi ikn}{b + 2\pi i m(n)}\Big \}_{n \in \mathbb {Z}} \Big | = \infty . \end{aligned}$$

Proof

Choose $n, {\hat{n}} \in \mathbb {Z}$ such that $n \ne {\hat{n}}$. Let $m := m(n)$ and ${\hat{m}} := m({\hat{n}})$, and suppose that these give rise to the same value, i.e.,

$$\begin{aligned} \frac{a+2\pi i k n}{b + 2 \pi i m} = \frac{a + 2\pi i k {\hat{n}}}{b + 2\pi i {\hat{m}}}. \end{aligned}$$

(9)

By bringing together the imaginary parts we obtain from Eq.(9)

$$\begin{aligned} a ({\hat{m}}-m) + b k (n - {\hat{n}}) = 0. \end{aligned}$$

Since $n-{\hat{n}} \ne 0$ we see that this is only possible if $\frac{a}{b} \in \mathbb {Q}$. In other words, if $\frac{a}{b}\not \in \mathbb {Q}$, then different $n\in {\mathbb {Z}}$ lead to different values.$\square $

For the following lemma and its proof we use the notation of the proof of Theorem 3.

Lemma 6

The function $f:\mathbb {R}\supset I\rightarrow \mathbb {R}$ defined in Eq.(5) satisfies $f'(\xi _1)\ne 0$.

Proof

With $C(\lambda _1,\lambda _2):=F(\lambda _1,\lambda _2,\xi _3,\ldots ,\xi _n)$ the function $\lambda _2(\lambda _1)$ is implicitly defined as solution to $C\big (\lambda _1,\lambda _2(\lambda _1)\big )=c$. The implicit function theorem provides us with the derivative $\lambda _2'(\lambda _1)=-f(\lambda _1)$. Hence, $f'(\xi _1)=-\lambda _2''(\xi _1)$. We once again invoke implicit differentiation in the form

(10)

with $v:=\big (1 \,\,\, \lambda _2'(\lambda _1)\big )^T$ and the Hessian $H_{ij}:=\partial _i\partial _j C\big (\lambda _1,\lambda _2(\lambda _1)\big )$. The latter can be computed to $H_{ij}=-\delta _{ij}\lambda _i^{-1}-(w-\lambda _1-\lambda _2)^{-1}$ and is thus negative definite, which reflects the strict concavity of the entropy function. Therefore $v^T H v<0$ so that Eq.(10) implies $\lambda _2''(\lambda _1)\ne 0$ for any $\lambda _1$ in a neighborhood of $\xi _1$. $\square $

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Blakaj, V., Wolf, M.M. Transcendental Properties of Entropy-Constrained Sets. Ann. Henri Poincaré 24, 349–362 (2023). https://doi.org/10.1007/s00023-022-01227-4

Download citation

Received: 20 December 2021
Accepted: 31 July 2022
Published: 27 August 2022
Issue Date: January 2023
DOI: https://doi.org/10.1007/s00023-022-01227-4

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Transcendental Properties of Entropy-Constrained Sets

Abstract

Similar content being viewed by others

Relative entropy optimization and its applications

Entropy Measures and Views of Information

A functional equation related to generalized entropies and the modular group

1 Introduction

2 Preliminaries

3 Properties of Semialgebraic Sets

Lemma 1

Proof

Theorem 1

Proof

4 Transcendental Entropy Values

Lemma 2

Theorem 2

Proof

5 Entropy-Surfaces are Nowhere Semialgebraic

Theorem 3

Proof

6 Additional Constraints and Relative Entropy

Theorem 4

Proof

Corollary 1

Proof

7 Outlook

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendices

Appendix A: Diagonalizing Algebraic Diffeomorphisms

Lemma 3

Proof

Lemma 4

Proof

Appendix B: Technical Lemmas

Lemma 5

Proof

Lemma 6

Proof

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation